Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.
Job Responsibilities :You will be engaging with multiple developer and project teams to implement, operate and evangelise modern interpretations of Platform, automation, SRE, observability and DevOps. This role is foundational and impactful to the organization and will help shape Razer's cloud software and microservices development culture for years to come.Essential Roles and Responsibilities
· Build excellent design, deployment and management of container-native DevOps platform based on Kubernetes to support Razer's microservice, data, and AI development teams.
· Design and Build proof-of-concepts to validate architecture and solutions leveraging open-sourced CNCF and/or Kubernetes-native projects to meet platform team goals.
· Build and constantly improve our automation and CI/CD pipeline based on Argo Workflows and ArgoCD to achieve smooth and easy-to-operate GitOps workflow.
· Initiate and drive new automation, process, or platform enhancements that improves developers’ or AI engineers’ productivity and efficiency in deploying to and managing their cloud workloads.
· Define and operate tools that tracks site reliability metrics and SLAs, maintain service uptime and performance goals as required by the various microservices.
· Assist in incident management, escalation and post-mortem processes to promote smooth operations.
· Collaborate with cloud software and AI development teams to ensure services, AI models, and data pipelines are reliable, scalable, and efficient.
· Participate in on-call rotation to support the team's 24x7 coverage of critical systems.
· Other duties related to the reliability, observability, security, and cost-efficiency of Razer ID and AI services as assigned.
Pre-Requisites :Requirements
· Bachelor's degree in Electrical/Computer Engineering, Computer Science, or related field.
· 3+ years of experience in site-reliability engineering or DevOps role supporting container or cloud-native services.
· Strong understanding of system design, networking, web technologies, and high-traffic systems
· Demonstrates innovative methods of declaratively automating cloud-based IaaS/PaaS deployments and applications using modern DevOps techniques and technologies
· Deep exposure to public cloud (GCP, AWS or Azure), Docker and Kubernetes
· Experience building pipelines and supporting services for mainstream programming language such as node.js, Go, Python.
· Broad experience with modern CI/CD pipelines (GitOps, Jenkins, Argo etc)
· Proficient in maintaining and troubleshooting Linux systems
· A fervent enthusiasm for declarative languages, particularly infrastructure-as-code
· Monitoring tooling such as CloudWatch, Datadog, Prometheus, Grafana
· Good at communication and documenting technical concepts
Are you game?