Lead Site Reliability Engineer
Kforce
Kforce has a client that is seeking a Lead Site Reliability Engineer in San Diego, CA.
Responsibilities:
* Lead, mentor, and develop a team of Site Reliability engineers, fostering a collaborative and innovative work environment
* Lead Site Reliability Engineer will oversee an SRE team and drive the reliability strategy for the organization
* Conduct resiliency design reviews and lead complex problem-solving efforts
* Design, implement, and maintain monitoring systems to track the performance, availability, and reliability of services
* Respond to incidents promptly, investigate root causes, and coordinate efforts to mitigate and resolve them
* Analyze performance data, and plan for scalability and capacity requirements
* As a Lead Site Reliability Engineer, you will identify and optimize performance bottlenecks, both at the infrastructure and application levels
* Automate repetitive tasks and processes to improve efficiency and reduce manual intervention
* Implement and enforce change management practices to ensure safe and controlled changes to the production environment
* Design and implement fault-tolerant systems and practices to minimize downtime and ensure service availability
* Lead Site Reliability Engineer will collaborate with the GRC team on developing and maintaining disaster recovery plans and procedures relevant to the software supported to minimize the impact of catastrophic failures
* Collaborate with stakeholders to define RPO/RTO for Company's system footprint
Confirm your E-mail: Send Email
All Jobs from Kforce