
Senior Lead Incident Manager - Site Reliability Engineer - CTJ - Poly

Microsoft
United States, Washington, Redmond
Oct 22, 2025
Overview
The Azure Senior Incident Manager - Site Reliability Engineer is responsible for driving the resolution of complex, multi-service outages across Azure's global infrastructure in our Air Gap Clouds. This role provides operational leadership during high-severity incidents, ensuring timely mitigation, clear stakeholder communication, and adherence to compliance and privacy standards. The position requires technical breadth, demonstrated leadership under pressure, and the ability to coordinate across engineering, operations, and customer-facing teams.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities
Command & Control: Act as the primary incident commander for major Azure outages, ensuring forward progress and clarity throughout the incident lifecycle.
Incident Leadership: Lead cross-functional teams (engineering, support, operations) to restore services quickly and minimize customer impact.
Provide timely, accurate updates to executives, internal stakeholders, and customer-facing teams.
Process Governance: Ensure adherence to incident management protocols, including legal, privacy, and compliance requirements.
Continuous Improvement: Conduct Post-Incident Reviews (PIRs), identify systemic issues, and drive platform improvements.
Tooling & Automation: Leverage and enhance incident management tools such as Outage Hub and IcM for real-time visibility and coordination.
Mentorship: Guide and coach other incident managers and engineers on best practices for incident response.
Rhythm of Business: Ensure our Executive Leaders receive regular updates, critical signals, and progress reports on cloud-wide initiatives.
Embody our culture and values.