Bring your heart to CVS Health. Every one of us at CVS Health shares a single, clear purpose: Bringing our heart to every moment of your health. This purpose guides our commitment to deliver enhanced human-centric health care for a rapidly changing world. Anchored in our brand — with heart at its center — our purpose sends a personal message that how we deliver our services is just as important as what we deliver.
Our Heart At Work Behaviors™ support this purpose. We want everyone who works at CVS Health to feel empowered by the role they play in transforming our culture and accelerating our ability to innovate and deliver solutions to make health care more personal, convenient and affordable.
Position Summary
As a Senior Manager of Site Reliability Engineering (SRE) at CVS Health, you will lead a team of SREs responsible for ensuring the reliability, availability, and performance of our critical systems and services. The team supports a high-performing integration platform that processes about 6 billion transactions every month. You will collaborate with cross-functional teams to design, implement, and maintain scalable and resilient infrastructure solutions that support our business objectives. Your leadership will drive the adoption of best practices in site reliability, incident management, and continuous improvement.
As a Senior Manager of Site Reliability Engineering (SRE), you will:
Lead and mentor a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and continuous learning.
Ensure the availability, reliability, and performance of critical services through proactive monitoring, capacity planning, and performance tuning.
Design, implement, and maintain observability solutions using tools such as AppDynamics, Splunk, Prometheus, Grafana, or OpenTelemetry.
Collaborate with software engineering, operations, and product teams to design and deploy scalable and resilient systems.
Oversee incident management processes, ensuring timely resolution of incidents and minimizing downtime.
Establish and monitor key performance indicators (KPIs) to measure system reliability and performance.
Conduct post-incident reviews and implement lessons learned to prevent future occurrences.
Stay current with industry trends and emerging technologies to continuously improve SRE practices.
Manage budgets and resources effectively to support SRE initiatives and projects.
Incident Management: Lead incident response efforts, perform root cause analysis (RCA), and drive post-mortem processes to improve system reliability.
Automation & Infrastructure as Code (IaC): Develop automation to reduce manual operational tasks using Terraform, Ansible, or Kubernetes.
CI/CD & Deployment Pipelines: Work closely with development teams to enhance deployment strategies and improve continuous integration/continuous deployment (CI/CD) workflows.
Cloud & Kubernetes Operations: Manage and optimize cloud infrastructure (AWS, Azure, or GCP) and container orchestration platforms (Kubernetes, Docker).
Security & Compliance: Implement best practices for security, compliance, and cost optimization in cloud environments.
Required Qualifications
7+ years of experience in site reliability engineering, DevOps, or a related field.
5+ years of experience with cloud computing platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes, Docker).
3+ years of experience in a leadership or management role, with a proven track record of managing high-performing teams.
3+ years of experience in scripting and programming languages (e.g., Python, Go, Java).
3+ years of experience with monitoring and observability tools (e.g., Prometheus, Grafana, Splunk).
Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI).
Excellent communication and interpersonal skills, with the ability to collaborate effectively across teams.
Strong problem-solving skills and a proactive approach to identifying and addressing issues.
Ability to thrive in a fast-paced, dynamic environment and manage multiple priorities.
Experience with Agile methodologies and DevOps practices.
Preferred Qualifications
Ability to multitask and rapidly switch context between applications, programs, and architecture initiatives.
Ability to assess the impact of architecture changes on the business, application relationships, and information flow.
Strong understanding of the SDLC, gained by participating in many projects through complete lifecycles (requirements, design, development, testing, launch).
Strong ability to facilitate collaboration among senior technical team members and senior business leaders.
Strong organizational, leadership, and consensus-building skills; ability to motivate and lead teams in a matrix organization.
Excellent interpersonal and communication skills to work with all levels.
A strong base of experience in many disciplines of information technology, including operating systems, systems management and development tools, application program interfaces (APIs), database management systems, development methodologies, transaction processing monitors, messaging software, security, directory services, hardware, telecommunications, interoperability techniques and standards, and services monitoring and alerting.
Experience with multiple technologies in the stack (DataPower, IIB, Splunk) is a plus.
Healthcare or big-box retail experience is a significant plus and will be given utmost consideration.
ITCAM/Splunk experience is a plus.
Experience delivering projects using Agile methodology in addition to waterfall is desirable.
Agile/PM certifications are a plus.
Experience with large-scale distributed systems using message queues, TPMs, or other related technologies in a mobile/portal environment.
Experience in data architecture concepts and governance, including acting as a design authority for information and data within projects and programs, is critical for this role.
Mastery of design considerations for high-volume transaction systems.
Education
Bachelor’s degree in computer science, engineering, or a related field; Master’s degree preferred.
Pay Range
The typical pay range for this role is:
$118,450.00 - $260,590.00
This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program.
In addition to your compensation, enjoy the rewards of an organization that puts our heart into caring for our colleagues and our communities. The Company offers a full range of medical, dental, and vision benefits. Eligible employees may enroll in the Company’s 401(k) retirement savings plan, and an Employee Stock Purchase Plan is also available for eligible employees. The Company provides a fully paid term life insurance plan to eligible employees, and short-term and long-term disability benefits. CVS Health also offers numerous well-being programs, education assistance, free development courses, a CVS store discount, and discount programs with participating partners. As for time off, Company employees enjoy Paid Time Off (“PTO”) or vacation pay, as well as paid holidays throughout the calendar year. The number of paid holidays, sick time, and other time off is provided consistent with relevant state law and Company policies.
For more detailed information on available benefits, please visit the Benefits page on the CVS Health website.
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.