DevOps Site Reliability Engineer
UST Global Inc
Key Skills: DevOps, SRE, any cloud (AWS, Azure, GCP), IaC, Kubernetes
Experience: 7-11 years
Location: Noida / Pune / Bangalore / Hyderabad / Chennai / Kochi / Trivandrum
Job Summary:
The Site Reliability Engineer (SRE) ensures the reliability, availability, and performance of critical systems and services. This role bridges the gap between development and operations teams, leveraging automation, monitoring, and best practices to enhance system scalability, reduce downtime, and improve overall efficiency.
Key Responsibilities:
Reliability Engineering:
Design, implement, and maintain high-availability systems. Create and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs). Conduct root cause analysis for system failures and implement post-mortem processes to prevent recurrence.
DevOps Automation:
Automate infrastructure provisioning, deployment pipelines, and operational processes. Build and maintain CI/CD pipelines using tools like Jenkins, GitHub Actions, or GitLab CI/CD. Develop Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, or Ansible.
Monitoring and Incident Management:
Implement robust monitoring, logging, and reporting solutions using tools like Datadog or Splunk. Establish proactive incident response processes and manage on-call rotations. Ensure effective documentation for incident handling and resolution.
Performance and Scalability:
Optimize system performance through capacity planning and resource management. Enable horizontal scaling of services to handle increasing loads. Collaborate with development teams to improve application resilience and performance.
Security and Compliance:
Enforce security best practices in infrastructure and application development. Conduct vulnerability assessments and implement remediation measures. Ensure compliance with organizational and industry standards.
Collaboration and Culture:
Act as a bridge between development and operations teams to foster a DevOps culture. Coach teams on best practices in reliability, automation, and DevOps. Advocate for a culture of ownership and continuous improvement.
Key Skills and Competencies:
Technical Skills:
Expertise in cloud platforms like AWS, Azure, or GCP. Proficiency in Linux system administration and networking concepts. Strong programming/scripting skills (e.g., Python, Go, Bash). Familiarity with Terraform and Infrastructure as Code (IaC). Knowledge of containerization and orchestration tools like Docker and Kubernetes. Understanding of database management (SQL and NoSQL).