Bangalore
18 days ago
Site Reliability Engineer – Platform

Who we are

We're a leading, global security authority that's disrupting our own category.  Our encryption is trusted by the major ecommerce brands, the world's largest companies, the major cloud providers, entire country financial systems, entire internets of things and even down to the little things like surgically embedded pacemakers.  We help companies put trust - an abstract idea - to work. That's digital trust for the real world.

Job Summary

The SRE team in Engineering Organization is looking for a proven SRE with specialization in Platform to join our team.  This position will have primary responsibility over our Production and Engineering QA Lab environments.  You will be supporting our applications responsible for the SSL and PKI products within Digicert. This position will require solid experience with many technologies including CICD pipelines and automation using SALT, You will be involved with design, implementation, and day-to-day maintenance of these systems, including being available for incident related escalations. This role will require an ability to manage many diverse projects and be able to interface with various groups and teams including Architecture, SRE, Production Operation, Networking, and Engineering.

 

What you will do

Design, implement, and manage CI/CD pipelines for automated testing, deployment, and management of our services and applications, driving continuous integration and deployment practices. Collaborate with development teams to ensure that architectural solutions are aligned with SRE principles, advocating for reliability and scalability Monitor systems to identify performance bottlenecks and implement solutions to achieve optimal performance and reliability across all services. Develop comprehensive disaster recovery plans, conduct regular tests to validate data integrity, and update protocols as necessary for system availability. Share on-call responsibilities, promptly addressing and resolving production issues to uphold our SLA commitments. Create and update documentation related to system architecture, operational procedures, and best practices, including detailed runbooks for incident response. Continuously evaluate emerging technologies and tools to enhance the scalability, security, and efficiency of our platform, recommending implementations that can drive significant improvements. Automate process of resolving issues identified with monitoring tools like New Relic. Enhance the deployment and release process to obtain and maintain 99.99% uptime availability of our critical customer facing applications.

 

What you will have

Demonstrated expertise of more than 3-10 years with cloud platforms (e.g., AWS, GCP, Azure), containerization technologies (e.g., Docker, Kubernetes), and infrastructure as code (e.g., Terraform, CloudFormation). Proficiency in Linux Administration with more than 2 years’ experience. Proficiency in developing and maintaining CI/CD pipelines using tools such as Jenkins, GitLab CI, Circle CI, or similar technologies with more than 2 years exp.  Strong scripting skills in languages like Python or Bash to automate routine tasks and deployments with more than 2 years exp. A deep understanding of networking principles, security best practices, and database management. Excellent problem-solving abilities, capable of working under pressure to resolve incidents and ensure system stability. Effective communication skills, with a knack for working collaboratively across multidisciplinary teams.

 

Nice to have

Experience with monitoring tools (e.g., new relic, Prometheus, Grafana, ELK stack) and an ability to utilize logging and tracing to diagnose and resolve issues efficiently. Experience with service mesh implementations (e.g., Istio, Linkerd).

 

Benefits

Generous time off policies Top shelf benefits Education, wellness and lifestyle support

 

DigiCert offers a competitive benefits package for all our full-time employees.  If you want to know more about them, please reach out to us at TA@digicert.com.

 

#LI-SD1

Confirm your E-mail: Send Email