Mumbai, Maharashtra, India
22 hours ago
Site Reliability Engineer II

Play a key role in ensuring system reliability at one of the world’s most iconic and largest financial institutions.

As a Site Reliability Engineer II at JPMorgan Chase within the Commercial & Investment Bank, you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you’ll also have the opportunity to collaborate with cross functional teams to continually improve your level of knowledge about JPMorgan Chase’s business and relevant technologies.

Job responsibilities

 

Independently manage small to medium-sized projects with initial guidance, progressing to designing and delivering projects autonomously.Utilize technology to address business challenges by developing high-quality, maintainable, and robust code in line with software engineering best practices.Engage in triaging, analyzing, diagnosing, and resolving incidents, collaborating with others to address root causes.Identify repetitive tasks within your role and proactively work to eliminate them through appropriate channels.Comprehend observability patterns and strive to implement and enhance service level indicators, objectives, monitoring, and alerting solutions for optimal transparency and analysis. Design, code, test, and deliver software solutions to automate manual operational tasks.Troubleshoot high-priority incidents, facilitate blameless post-mortems, and ensure the permanent resolution of incidents.Identify application patterns and analytics to support improved service level objectives. Implement necessary telemetry and observability to monitor and measure service quality in real-time against established SLOs.Maintain a strong focus on automation and processes, designing, implementing, improving, and utilizing key monitoring tools. Collaborate with SRE, Operations, and Development teams to balance manual operational work with engineering efforts.Possess a strong understanding of Incident, Problem, and Change Management processes and tools. Participate in Support Rota coverage as needed. Effectively escalate issues and risks across the support framework when necessary.

 

Required qualifications, capabilities, and skills

 

Formal training or certification on software engineering concepts and 2+ years applied experienceAbility to code in at least one programming language. Experience maintaining a Cloud-base infrastructureProficiency in one or more technology domains, with the ability to solve complex and mission-critical problems within a business or across the firm. Excellent debugging and troubleshooting skills.Proficient in coding with at least one programming language and open to learning modern technologies, such as Python, Java, etc.Extensive expertise in the instrumentation, customization, and use of modern monitoring tools like Dynatrace, Grafana, Splunk, AWS, Kubernetes, Geneos, Kafka, MQ, etc.Hands-on experience with modern cloud technologies such as AWS, Gaia, etc. Expertise in at least one relational database (e.g., SQL Server, Oracle, DB2).Skilled in performance monitoring and capacity management of large systems using various tools. Comfortable working in an Agile environment and proficient in Continuous Integration and Continuous Delivery practices.Strong attention to detail and time-management skills. Proficient in Site Reliability Engineering (SRE) concepts, principles, and practices. Proficient with containers or common server operating systems such as Linux and Windows.

 

Preferred qualifications, capabilities, and skills

G Certification in programming languages and/or cloud technologies.Experience in Custody, Securities, or Trading domains, including areas such as FX Cross Currency, High and Low Value Payments, SWIFT, Real-Time Payments, Trading, Corporate Actions, etc.General knowledge of the financial services industry.
Confirm your E-mail: Send Email