We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Observability Engineer (3448) - Tampa/Dallas

Dexian DISYS
United States, Texas, Coppell
Oct 01, 2025
Observability Engineer (3448) - Tampa/Dallas
Job details
Posted

11 September 2025
Location

Coppell, TX
Job type

Permanent
Reference

977884
Job description

Observability & AIOps Engineer

Location: Dallas or Tampa | Hybrid: 3 days onsite
Contract: 6-month contract-to-hire

We are seeking a senior-level Observability & AIOps Engineer with hands-on experience in Java and Python to enhance enterprise IT observability, resilience, and reliability. This role blends hands-on engineering with architectural guidance to optimize monitoring, performance, and reliability across IT systems.

Key Responsibilities



  • Design, prototype, test, and document observability and reliability solutions.
  • Publish technology strategies, observability standards, and best practices.
  • Translate business goals into technical solutions that meet non-functional requirements.
  • Create Observability Driven Development procedures and promote adoption of open-standard frameworks (OTel, MELTS).
  • Implement AI-augmented testing strategies for federated execution and enterprise governance.
  • Collaborate with SREs and production support teams to improve distributed tracing, trade processing reliability, and chaos testing.
  • Design and implement full-stack applications for operational predictability and prescriptive disruption response.
  • Establish monitoring and alerting standards for performance, scalability, availability, and reliability.


Experience & Qualifications



  • Distributed Applications: 10+ years designing and implementing distributed systems.
  • Networking & Infrastructure: 5+ years in networking, middleware, infrastructure, and database architecture.
  • Highly Available Architecture: 5+ years implementing highly available solutions.
  • Disaster Recovery: 5+ years with disaster recovery methodologies and patterns.
  • Hands-On Development: Senior-level expertise in Java and Python for observability and reliability engineering.


Knowledge & Skills



  • Strong problem-solving and independent work capabilities.
  • Familiarity with public cloud environments (AWS, Azure) is a plus.
  • Performance analysis, tuning, and engineering experience is desirable.
  • Knowledge of monitoring/observability tools: Dynatrace, Splunk, Grafana, Prometheus, OpenTelemetry, CloudWatch, CloudTrail.
  • Ability to design solutions that improve resilience, reliability, and operational efficiency.


Dexian is a leading provider of staffing, IT, and workforce solutions with over 12,000 employees and 70 locations worldwide. As one of the largest IT staffing companies and the 2nd largest minority-owned staffing company in the U.S., Dexian was formed in 2023 through the merger of DISYS and Signature Consultants. Combining the best elements of its core companies, Dexian's platform connects talent, technology, and organizations to produce game-changing results that help everyone achieve their ambitions and goals.

Dexian's brands include Dexian DISYS, Dexian Signature Consultants, Dexian Government Solutions, Dexian Talent Development and Dexian IT Solutions. Visit https://dexian.com/ to learn more.

Dexian is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.


Applied = 0

(web-759df7d4f5-7gbf2)