We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Principal Software Engineer - Azure

Microsoft
United States, Washington, Redmond
Oct 05, 2024
OverviewAzure Edge + Platform brings together Edge platforms, devices, and services to deliver Edge solutions, operating systems, and engineering systems. Driven by its customers' needs, Azure Edge + Platform seeks to accelerate growth for Azure, E&D, and Microsoft's customers worldwide. The organization's portfolio spans the Cloud Edge Stack, Azure Engineering Systems, Azure Media Services - for end-to-end media workflow and analytics - and Microsoft's Operating Systems including the Azure Host OS and Windows. This portfolio impressively powers the world with more than one billion monthly active devices. ServiceDEEP is the latest addition to Azure's Health and Standards suite of diagnostic tools. It is an ML-based automated diagnosis platform designed to help engineers quickly identify and resolve service issues through actionable Root Cause Analysis (RCA) insights, delivered in near real-time. By minimizing troubleshooting time and enhancing the DRI (Designated Responsible Individual) health index, ServiceDEEP improves overall Azure service health. The platform's Auto-RCA ML model leverages rich health data from various diagnostic sources and refines it using domain-specific knowledge. We are seeking a motivated, experienced Principal Software Engineer to collaborate with data scientists, product managers, and customers in developing this diagnostic solution. In this role, you will work on end-to-end diagnostic scenarios, focusing on both correlation and causality analysis. This position offers the opportunity to learn about cutting-edge interconnect services and machine learning pipelines.
ResponsibilitiesDesigning new frameworks and leveraging existing ones to support ML-based diagnostic scenarios.Developing tools and scripts to optimize infrastructure execution and integrate large language models (LLMs) into the diagnostics space.Processing result data and collaborating with data scientists to refine algorithms and machine learning models.Continuously improving and gaining deeper insights into the performance, reliability, and scalability of Azure services.Engaging with both internal and external Azure partners.Mentoring and supporting the growth of junior developers.
Applied = 0

(web-5fdf5b7fb4-dtl8q)