Site Reliability Engineer
IBM
As a Virtualization Platform Engineer, you will be part of the Cirrus Hybrid Cloud virtualization team responsible for ensuring the architectural integrity and successful delivery of a scalable virtualization platform for the IBM CIO Organization.
In this role you will focus on the management of virtualization platform for Cirrus Hybrid Cloud. This entails working on all aspects designing, engineering, implementing, and maintenance of various virtualization solutions.
You will help solve intriguing problems while partnering with other team members, customers, and vendors. For success in this role, you will have a strong Python or Ruby programming language background and a passion for learning and continuous improvement.
What you will do:Design, Management, maintenance, and support of various virtualization solutions especially the RedHat OpenShift Virtualization (OSV) and VMware.Create infrastructure using any from: Ansible, Terraform, Argo, OpenShift IPI, UPI, ZTP Zero Touch ProvisioningOperate in an agile manner and under strict change controlMaintain the environment according to the Policy Compliance Management requirements.Troubleshoot and resolve Hypervisor/Operating System-based issues from Performance to ConfigurationBacking up and protect virtual environments using platform-specific toolsPerform daily system checks, review, and respond to events reflected in various management tools, perform server patch management.Conduct system audit reviews and perform maintenance functions as required to ensure system health.Troubleshoot and resolve problems for all applications.Support, implement and maintain new applications coming into the environment.Present status information on issues and problems at the weekly team meetings.Document software changes.Document problem resolution steps.Assure best-practices and standards are implemented and adhered to for software systemsProvide on-call support and implementation after-hours on a rotating basisThink and act like a Site Reliability Engineer (SRE) as the environment relates to virtualization
In this role you will focus on the management of virtualization platform for Cirrus Hybrid Cloud. This entails working on all aspects designing, engineering, implementing, and maintenance of various virtualization solutions.
You will help solve intriguing problems while partnering with other team members, customers, and vendors. For success in this role, you will have a strong Python or Ruby programming language background and a passion for learning and continuous improvement.
What you will do:Design, Management, maintenance, and support of various virtualization solutions especially the RedHat OpenShift Virtualization (OSV) and VMware.Create infrastructure using any from: Ansible, Terraform, Argo, OpenShift IPI, UPI, ZTP Zero Touch ProvisioningOperate in an agile manner and under strict change controlMaintain the environment according to the Policy Compliance Management requirements.Troubleshoot and resolve Hypervisor/Operating System-based issues from Performance to ConfigurationBacking up and protect virtual environments using platform-specific toolsPerform daily system checks, review, and respond to events reflected in various management tools, perform server patch management.Conduct system audit reviews and perform maintenance functions as required to ensure system health.Troubleshoot and resolve problems for all applications.Support, implement and maintain new applications coming into the environment.Present status information on issues and problems at the weekly team meetings.Document software changes.Document problem resolution steps.Assure best-practices and standards are implemented and adhered to for software systemsProvide on-call support and implementation after-hours on a rotating basisThink and act like a Site Reliability Engineer (SRE) as the environment relates to virtualization
Confirm your E-mail: Send Email
All Jobs from IBM