Job posting has expired

Site Reliability Engineer (Hybrid)

TENASKA
United States, Texas, Irving
300 East John W Carpenter Freeway
February 04, 2023
Description

Tenaska is one of the largest privately held companies in the United States, an organization that's adept in natural gas and power marketing, power management, development and acquisition of generation assets, operation of power plants and more.

Job Summary:

This position is responsible for developing and utilizing tools to monitor key metrics of our data systems, tracking the reliability and recoverability of our systems, and reporting outcomes of regular testing and monitoring activities. Additionally, this position is responsible for enabling the high availability of our systems through understanding our failover methods, implementing changes to our methods and systems to minimize downtime, and facilitating failover procedure testing. The position is also responsible for recommending better reliability patterns to the developers for our existing and newly created systems.

Essential Job Functions:



  • Utilize existing tools to create telemetry streams from each system that DevOps maintains.
  • Track trends of key metrics to build a repeatable snapshot of the current state of all systems within DevOps and predict failures.
  • Correlate data from disparate systems to determine the underlying causes of issues that may be occurring in seemingly unrelated parts of the enterprise.
  • Monitor existing logging and monitoring systems and reduce unnecessary logging or improperly tuned monitor probes.
  • Develop a suite of dashboards and tools that enable the SRE to track all incoming metrics and surface the most pressing issues.
  • Continually improve these dashboards to make their information more useful in real time as well as for after-the-fact analysis.
  • Generate "Post Mortem" reports for unplanned outages or system failures.
  • Prepare "Scope of Impact" reports for upcoming planned outages or system changes.
  • Work with the other members of DevOps and the Infrastructure team to ensure that underlying resources are ready for failover and to help plan for future growth.
  • Maintain failover documentation and S.O.P.s.
  • Perform regularly scheduled failover testing in conjunction with the rest of the DevOps team, Infrastructure, and our Business teams.
  • Continually seek to improve our failover procedures.


Education/Experience/Skills

Basic Requirements:



  • A bachelor's degree in Computer Science, Data Science, Computer Information Systems, or a related field is preferred, but commensurate experience is acceptable in lieu of such a degree.
  • A basic understanding of computer programming and experience working with code, databases, and operating systems is needed.
  • At least two years of experience working with data systems.
  • The SRE is the "Control Tower" of DevOps. As such, they need to be familiar with how our data systems work and interact with one another. The candidate should have a basic understanding of computer programming and data systems architecture.
  • Ability to interact with various groups within the business to inform them of the basic details of upcoming changes or to communicate the current state of system failures or outages.
  • Ability to interact with other developers and management to help define, implement, and enforce patterns for proper metric telemetry from systems, proper logging, and resilient failover patterns.
  • Should always be seeking to improve our system telemetry, uptime, and recoverability. The candidate must therefore be aware of the different technologies available in the industry and help determine whether they are a fit for our environment.


At Tenaska, we care about the wellbeing of our employees and their families. That's why we offer our employees a comprehensive benefits package, which includes:



  • Health, dental, vision, disability, and life insurance
  • Excellent 401(k) plan
  • Incentive-based, competitive salary packages
  • Health/dependent care flex accounts
  • Tuition assistance
  • Long-term disability coverage
  • Adoption benefits
  • Employee assistance program
  • Paid vacations and holidays
  • Generous sick leave
  • Charitable giving program
  • Paid maternity/paternity leave
  • Wellness programs


Tenaska is an equal opportunity employer.
