New
Senior Technical Program Manager (AI-Driven Reliability & Product Innovation)
![]() | |
![]() United States, Washington, Redmond | |
![]() | |
OverviewWe're looking for a product-minded Senior Technical Program Manager (AI-Driven Reliability & Product Innovation) to shape how AI and Reliability come together for one of Microsoft's most mission-critical cloud services.This is not a traditional TPM role-it blends product strategy, engineering leadership, and technical execution. You'll lead multiple AI-powered reliability initiatives across Office Engineering Direct and M365, partnering with talented engineers to make our services faster, smarter, and more resilient.The ideal candidate brings a strong technical background and product mindset, with the ability to communicate across systems, collaborate with customers, and articulate a clear vision.If you thrive on solving complex reliability challenges using data, design, and empathy, this is your opportunity to redefine excellence at global scale. Why this role mattersOffice Engineering Direct (OED) is transforming how enterprise customers experience reliability and partnership with Microsoft. The customer-facing reliability portal you'll lead-alongside other AI-powered diagnostics and automation initiatives-is becoming a key differentiator for enterprise customers investing in OED and Microsoft 365's premium reliability offerings.By evolving engineering reliability into intelligent, automated systems, you'll help enterprises experience the impact of Microsoft reliability daily, while driving efficiency, reducing manual effort, and enabling sustainable growth. About the teamYou'll join the Office Engineering Direct (OED) Enterprise+ Cloud SRE organization, a globally distributed team across the U.S., Europe, and Asia.We're responsible for both live-site excellence and forward-looking engineering investments that raise the reliability bar for Microsoft 365.You'll collaborate closely with SRE Engineering leaders to define vision, set direction, and deliver measurable outcomes.Your success depends on deep partnership with engineers-understanding their challenges, celebrating progress, and translating technical achievements into customer and business value.
ResponsibilitiesDrive the engineering vision & roadmap executionOwn semester-based OKRs end-to-end, from planning and prioritization to progress visibility and retrospectives.Translate strategy into actionable, measurable engineering deliverables.Anticipate resource and execution risks; help the team stay focused on high-impact work.Represent the team's direction and outcomes to leadership with clarity and confidence.Take end-to-end ownership of established reliability products such as the OED customer-facing portal and internal AI-powered diagnostics and automation platform.Monitor engagement and adoption metrics to ensure features resonate with customers and internal partners; develop strategies to improve reach and satisfaction.Own AI-driven reliability product innovationLead multiple core code areas that enable predictive reliability, diagnostics, and automation, including customer-facing and internal platforms.Partner with engineers, PMs, and data scientists to define feature direction, evaluate ML use-cases, and align on measurable reliability outcomes (TTR, TTI, mis-route reduction, automation coverage).Drive integration of telemetry, analytics, and experimentation frameworks that make insights actionable.Guide the evolution of mature reliability platforms, balancing innovation with operational excellence.Translate customer and stakeholder input into prioritized roadmaps that deliver intelligence, transparency, and reliability at scale.Lead secure, compliant product deliveryInitiate and govern Security and Privacy Reviews for every new feature or major change.Ensure compliance with Microsoft's trust and regulatory frameworks.Maintain audit-ready documentation, validate RBAC and threat models, and champion privacy-by-design across releases.Connect engineering outcomes to customer valueBridge customer and engineering perspectives to ensure every investment advances service quality and user trust.Use telemetry, usage data, and feedback loops to guide prioritization and measure impact.Craft end-of-semester communications that tell the story of engineering impact, outcomes, learnings, and next-semester focus.Engage directly with customers to demo new and upcoming features, gather insights through surveys, and use data to guide roadmap prioritization.Partner with OED Program Owners to align product updates with customer communications and CAB agendas.Collaborate across Exchange, SharePoint, OneDrive, Teams, and Windows engineering teams to coordinate release readiness and ensure cohesive reliability experiences.Lead through technical depth and influenceParticipate in architecture reviews, demos, and design sessions with confidence and curiosity.Ask thoughtful questions at the code and system level to balance ambition with feasibility.Build alignment across partner teams in Exchange, Security, and M365 Engineering to deliver unified reliability experiences.Lead with vision. Think in systems. Build what matters. Redefine reliability for the AI-powered cloud. |