Why PlayStation? PlayStation isn't just the Best Place to Play - it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment, producing the PlayStation family of products and services including PlayStation 5, PlayStation 4, PlayStation VR, PlayStation Plus, acclaimed PlayStation software titles from PlayStation Studios, and more. PlayStation also strives to create an inclusive environment that empowers employees and embraces diversity. We welcome and encourage everyone who has a passion and curiosity for innovation, technology, and play to explore our open positions and join our growing global team. The PlayStation brand falls under Sony Interactive Entertainment, a wholly-owned subsidiary of Sony Group Corporation.
Staff Site Reliability Engineer
San Diego, CA
Responsibilities:
Your responsibilities will include hands-on application management of over 100 commerce and payment-related services within an AWS cloud environment, ensuring availability, resiliency, scalability, and performance. You will work closely with our service development teams to develop, automate, and ensure the production readiness of all new services and features introduced.
- Operate as a Staff SRE with strong Java/C++ development expertise, capable of navigating complex application code, partnering closely with engineering teams, and influencing the technical direction of services through hands-on contributions to design, debugging, and production-grade code.
- Identify areas for operational process improvement and automation to enhance efficiency. Drive the hands-on development of scripts and tools to automate these processes within our environment.
- Increase observability on our platform by implementing robust monitoring and alerting patterns across our services. Develop rich, informational dashboards/reports on our services that provide valuable insight, and develop essential alerting patterns to drive down the MTTD and MTTR on platform incidents.
- Collaborate and partner with other SRE teams that specialize in areas such as data services, data platform, and platform hosting to inspire changes and ensure optimal application performance and resiliency across all back-end services within PlayStation.
- Iteratively lead performance and capacity validation analysis for our commerce platform services. Apply AWS patterns and technologies, such as spot instances, dynamic auto-scaling, and EKS, to efficiently optimize our AWS spend.
- Review service flows and architecture to influence resiliency, availability, and scalability for all services within our platform.
- Provide rotational on-call support, where you'll detect, triage, respond to, and resolve production incidents on the commerce and payments platform.
- Conduct root cause analyses, then document and present the findings to share incident insights with our broader engineering organization.
Qualifications:
- BS degree or equivalent experience in Computer Science, Engineering, or a related technical subject area.
- 10+ years of hands-on software or systems engineering experience working with Java and/or C++ services, with strong proficiency in building, deploying, and debugging Java-based services in production environments.
- 7+ years of relevant experience in high-volume, production-critical software environments, including experience integrating, developing, and managing applications in AWS cloud infrastructure.
- 7+ years of experience with building automation into daily operational processes through one or more programming languages (preferably Python, Node.js, or Go).
- Strong experience in configuring, tuning, and automating operational responsibilities for AWS managed data services, including RDS, DynamoDB, and ElastiCache.
- Experience with monitoring and log management tools (Datadog, CloudWatch, Splunk).
- Experience with container technologies and orchestration (Docker, Kubernetes, EKS).
- Ability to create reusable, secure, and compliant IaC using Terraform/CloudFormation stacks supporting multi-account/multi-region AWS architecture.
- Solid understanding of AWS networking systems and protocols (e.g., ALB, Route 53, API Gateway, TCP/IP, HTTP/HTTPS, DNS).
- Experience with developing or supporting Continuous Integration and Continuous Delivery/Deployment (CI/CD) pipelines.
- Strong experience in incident response, blameless postmortem culture, and production readiness reviews with a reliability-first approach.
Please refer to our Candidate Privacy Notice for more information about how we process your personal information, and your data protection rights.
At SIE, we consider several factors when setting each role's base pay range, including the competitive benchmarking data for the market and geographic location.
Please note that the base pay range may vary in line with our hybrid working policy and individual base pay will be determined based on job-related factors which may include knowledge, skills, experience, and location.
In addition, this role is eligible for SIE's top-tier benefits package that includes medical, dental, vision, matching 401(k), paid time off, wellness program, and coveted employee discounts for Sony products. This role also may be eligible for a bonus package. Click here to learn more.
The estimated base pay range for this role is listed below.
$199,400 to $299,200 USD
Equal Opportunity Statement: Sony is an Equal Opportunity Employer. All persons will receive consideration for employment without regard to gender (including gender identity, gender expression and gender reassignment), race (including colour, nationality, ethnic or national origin), religion or belief, marital or civil partnership status, disability, age, sexual orientation, pregnancy, maternity or parental status, trade union membership or membership in any other legally protected category. We strive to create an inclusive environment, empower employees and embrace diversity. We encourage everyone to respond. PlayStation is a Fair Chance employer and qualified applicants with arrest and conviction records will be considered for employment.