At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all.
The opportunity
As a Site Reliability Engineer, you will be responsible for the availability, performance, monitoring, and incident response, among other things, of the platforms and services the company provides. You will work closely with software engineers to support and maintain scalable and reliable infrastructure and to improve automation and tooling.
Your key responsibilities
Support, maintain & scale automated tools for deployment, monitoring, and operations of the company's systems. Troubleshoot and resolve issues in our dev, test, and production environments. Enhance the company's infrastructure and application monitoring and alerting systems. Drive incident management process and support a blameless post-mortem culture. Partner with software engineers to improve services through rigorous testing and release procedures. Participate in system design consulting, platform management, and capacity planning. Create sustainable systems and services through automation and uplifts. Balance feature development speed and reliability with well-defined service level objectives.Skills and attributes for success
Experience with distributed systems and microservices architecture. Prior involvement with high-scale, high-availability systems. Familiarity with database management and SQL/NoSQL databases. Certifications in cloud technologies or SRE methodologies.To qualify for the role, you must have
Bachelor’s degree in computer science, Engineering, or related field, or equivalent experience. Strong experience with Linux/Unix systems and a good understanding of system performance areas. Proficiency in one or more of the following: Go, Python, Ruby, Shell scripting. Experience with cloud services (e.g., AWS, GCP, Azure) and cloud infrastructure automation tools (e.g., Terraform, Ansible). Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes). Familiarity with continuous integration and deployment methodologies (CI/CD). Experience with monitoring and log aggregating frameworks like ELK, Prometheus, Grafana, and Splunk. Understanding of networking concepts (e.g., DNS, HTTP, TCP/IP) and load balancing. Strong problem-solving skills and ability to work under pressure.
What we offer
EY Global Delivery Services (GDS) is a dynamic and truly global delivery network. We work across six locations – Argentina, China, India, the Philippines, Poland and the UK – and with teams from all EY service lines, geographies and sectors, playing a vital role in the delivery of the EY growth strategy. From accountants to coders to advisory consultants, we offer a wide variety of fulfilling career opportunities that span all business disciplines. In GDS, you will collaborate with EY teams on exciting projects and work with well-known brands from across the globe. We’ll introduce you to an ever-expanding ecosystem of people, learning, skills and insights that will stay with you throughout your career.
EY | Building a better working world
EY exists to build a better working world, helping to create long-term value for clients, people and society and build trust in the capital markets.
Enabled by data and technology, diverse EY teams in over 150 countries provide trust through assurance and help clients grow, transform and operate.
Working across assurance, consulting, law, strategy, tax and transactions, EY teams ask better questions to find new answers for the complex issues facing our world today.