DESCRIPTION:
Duties: Implement SRE frameworks to support globally multi-cloud environments. Failure analysis/root cause analysis. Develop technical engineering documentation. Drive software development lifecycle maturity. Quality control. Technical consultation. Perform deployment, administration, management, configuration, testing, and integration. Develop new cloud engineering strategies and implementations. Champion an automated DevOps model. Coaching and mentoring junior team members. Write operation documentation and knowledge base of known issues with solutions. Perform 24x7 SRE on-call rotations and escalation workflows.
QUALIFICATIONS:
Minimum education and experience required: Master’s degree in Software Engineering, Information Technology, or related field of study plus 3 years of experience in the job offered or as a Site Reliability Engineer, Member of Technical Staff, Automation Engineer, System Architect, or related occupation. The employer will alternatively accept a Bachelor's degree in Software Engineering, Information Technology, or related field of study plus 5 years of experience in the job offered or as a Site Reliability Engineer, Member of Technical Staff, Automation Engineer, System Architect, or related occupation.
Skills Required: Requires experience in the following: Git; Prometheus; Shell Scripting; Infrastructure as Code; AWS Cloud Computing; Jenkins; Grafana; Linux; Python; Networking; HTTPS; TCP; and UDP.
Job Location: 3223 Hanover St, Palo Alto, CA 94304.Telecommuting permitted up to 40% of the week.
Full-Time. Salary: $226,158 - $226,158 per year.