Pennington, New Jersey
17 hours ago
Senior Site Reliability Engineer

Job Description:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day.

One of the keys to driving Responsible Growth is being a great place to work for our teammates around the world. We’re devoted to being a diverse and inclusive workplace for everyone. We hire individuals with a broad range of backgrounds and experiences and invest heavily in our teammates and their families by offering competitive benefits to support their physical, emotional, and financial well-being.

Bank of America believes both in the importance of working together and offering flexibility to our employees. We use a multi-faceted approach for flexibility, depending on the various roles in our organization.

Working at Bank of America will give you a great career with opportunities to learn, grow and make an impact, along with the power to make a difference. Join us!

The AWS Senior Site Reliability Engineer will be responsible for working closely with the Cloud Engineering and development teams to create new features, stabilize production systems and introduce efficiencies in the maintenance of critical Cloud provided Identity systems.  The role involves using automation tools to monitor and observe reliability in production environment. They are also experienced in finding problems within the operation and suggesting solutions to resolve them. Candidate should have in depth knowledge about AWS, Azure Cloud environment. With specific knowledge about Identity Services.

Responsibilities:

Serves as a consultant on a broad range of technologies, platforms, and vendor offerings to drive targeted business outcomesDevelops solutions to address manual and repeatable work or inefficient processes and contributes to the technology strategy for end-to-end operations solutions and provides feedback to the architect and engineering teamsTranslates business requirements into technical definitions, reference models, blueprints, and playbooks for deployment in compliance with architecture standards and policiesCreates SRE process for the entire operations team and are on hand to support escalation issuesProvides documented procedures to support team(s) to help them effectively deal with issuesMentors and guides team members to ensure operations process are optimizedCreates an inclusive and healthy working environment and help to resolve blockers that could impact OperationsAssist in improving application and service lifecycle by holding post-incident reviews, documents all software problems and respective solutions in a shared knowledge baseEngage as a subject matter expert (SME) in major incident triage efforts, failure scenario modelling and work with the Problem Manager to diagnose root causes for complex/high impact major incident / problem management investigationsEmergency incident response/ Change management/ IT Infrastructure managementProcess Improvement

Required Qualifications:

7+ years of Cloud experienceOverarching broad and deep technical knowledge of various Cloud related Identity systemsStrong knowledge of KMS and Secrets Vault in the CloudHand-on experience implementing security services on AWS, Azure and/or GCPExtensive experience and advanced knowledge implementing Identity solutions in cloudExtensive knowledge of Identity best-practices, latest security threats/trends and mitigation thereofAdvanced scripting experience and capabilities using python, Perl, java and/or PowerShellDeep, in-depth working knowledge of Kerberos and NTLM authentication, MFA, SSO and federation technologiesExperience and confidence to be the subject matter expert (SME) in an environment of this size and scale to coordinate technical efforts and resolve issues across multiple teamsWorking knowledge of Certificate/CA/PKI infrastructureExcellent communication skills, including proven experience effectively communicating technical challenges and solutions to peers, customers, and senior managementGeneral understanding of Certificate Authorities and PKIAuthentication tools and servicesSecurity event and incident management systems and/or incident reporting systems and networksExperience developing scripts using python, python, java, shell scripts on Linux systemsExtensive experience using various technologies such as REST, WebApi, SQL, ORM, IoC, Unit Testing, Integration Testing, CI/CDExperience with LDAP queries

Skills:

CollaborationInnovative ThinkingResult OrientationSolution DesignAdaptabilityAnalytical Thinking

Shift:

1st shift (United States of America)

Hours Per Week: 

40

Pay Transparency details

US - NJ - Jersey City - 101 Hudson St - 101 Hudson (NJ2101)

Pay and benefits information

Pay range

$149,800.00 - $188,900.00 annualized salary, offers to be determined based on experience, education and skill set.

Discretionary incentive eligible

This role is eligible to participate in the annual discretionary plan. Employees are eligible for an annual discretionary award based on their overall individual performance results and behaviors, the performance and contributions of their line of business and/or group; and the overall success of the Company.

Benefits

This role is currently benefits eligible. We provide industry-leading benefits, access to paid time off, resources and support to our employees so they can make a genuine impact and contribute to the sustainable growth of our business and the communities we serve.
Confirm your E-mail: Send Email