ZAPOPAN, JALISCO, Mexico
8 days ago
Senior Site Reliability Engineer / DevOps Engineer

Project DescriptionOracle Store is an eCommerce platform for selling Oracle’s products and services to its customers and partners. It is a one stop place for the consumers to create, view and manage various transactions such as purchase SW, HWand Cloud based services as well as track orders, subscribe memberships in various Oracle Programs, renew support, manage cloud subscriptions and so on. These are modern, intuitive and mobile-friendly/responsive web applications, modern Tech stacks and utilize many cutting edge Oracle technologies like Oracle Fusion and others.We are looking for a senior SRE engineer to join their eCommerce Systems team to manage and build world class solutions for Oracle Customers. As a Site Reliability engineer (SRE) at Oracle you will be responsible for ensuring the reliability, scalability, and performance of our ecommerce Applications and services . You will work closely with our development team to design and implement processes and systems that ensure the stability and availability of our service.

Responsibilities:

Identify technical and process gaps to implement improvements that increase operational reliability and operational efficiency, as well as promote stability through automation Support build and configuration of Kubernetes clusters, setting up monitoring framework Help teams perform post-incident reviews to eliminate the possibility of reoccurrence Develop Dashboards for alerting and monitoring to ensure application systems service reliability and availability Help to meet performance and stability requirements by working with the team to implement load tests, tracing, monitoring, etc. Manage and maintain the release pipelines, help with manual and automated deployments.  Perform daily system monitoring, verifying the integrity and availability of all  hardware, server resources, systems and key processes, reviewing system  and application logs, and verifying completion of scheduled jobs such as backups. Perform regular security monitoring to identify any possible intrusions  Perform/Validate scheduled backup operations, ensuring all required file systems and system data are successfully backed up to the appropriate media. Create/Manage (Change and Delete) user accounts as needed. Repair and recover from hardware or software failures as needed. Coordinate and communicate with impacted constituencies. Provide System Production support per request from various lines of   business. Investigate and troubleshoot issues.  Investigate Seed data, Translations and other issues to provide appropriate \ support.

Technical Skills:

 Bachelor’s Degree in Computer Science/Engineering or related field.  5+ years of experience in relevant area A foundational knowledge of CI/CD, like Jenkins An DevOps/Engineering Technical Practices Focus Excellent knowledge of scripting knowledge such as Python, ansible, linux, etc.. Microservices with Kubernetes Supported applications and technologies that are large in scale Experience with documentation (Nice to have) Knowledge of Shephard, Terraform, etc Excellent knowledge of version control system

Other skills: 

Strong collaboration and teamwork skills necessary.

 Ability to effectively share technical information, contribute towards design and    development  Self starter and motivated with ability to work in a fast paced Rapid    Application Development environment.  Excellent verbal, written and communication skills.  Proven problem solving skills from problem assessment to solution selection    and implementation.  Ability to demonstrate critical thinking  Capacity to embrace changes, adapt to new technologies  Ability to handle multiple tasks and/or assignments simultaneously  Experience in leading independent tasks/projects

 

Career Level - IC3

Confirm your E-mail: Send Email