Bucharest, Romania
6 days ago
Site Reliability Engineer @ING Bank

Discover ING Bank Romania

ING believes in a world where everyone has the right to grow and progress in their own way. We express this in our global tagline, “do your thing”. Perhaps more than in any other large company, we extend our belief in the power of autonomy to our own people. But there’s a catch. In return for great freedom, we expect people to do great things for our customers, our stakeholders, and ING at large.

To work here is to be surrounded by people who are energetic, ambitious, friendly and respectful: talented specialists who take the responsibility and autonomy to make great things happen. We stay curious, thrive on change, and seek new and better ways to make it happen. Active in Romania for 30 years, ING Bank pioneered and challenged the local banking industry. Technology and innovation are at the core of what we do, making our products relevant for our customers’ lives and businesses.

ING Bank Romania is the only bank with an organic growth within the top 10 local banks by assets, without acquisitions of client portfolios or other banks. ING Bank Romania is an universal bank with more than 1.8 million customers from three business segments: individuals (retail), SME and Mid-Corporate companies and Wholesale Banking.

Join us!

Mission

The SRE team is responsible to roll-out the SRE (Site Reliability Engineering) practices to improve the reliability of Critical Business Services for ING Bank Romania. The SRE team is responsible for defining, introducing, and promoting SRE processes and practices like Observability, Incident & Problem Management, Capacity & Performance Management, IT Service Continuity, Well-Architected Review Framework, Operational Resilience & Reliability Testing, Release Procedures & Change Management, Reliability reporting & error budgeting, etc.

As part of the SRE team you will:

Develop, innovate, mature & implement the operational SRE practices and related IT processes across ING, in close cooperation with the Global SRE Team, having as main purpose to improve reliability of our Critical Business Services.

Adopt the global standards for reliability practices and ensure proper documentation, training material and knowledge is created and is available for our engineers within ING Bank Romania.

Act as reliability expert for key operational activities related to Critical Business Services and incidents affecting multiple entities; this includes an active role in Critical Business Service reviews to identify weaknesses to be solved, supporting global incidents (as expert within your area), providing input to ensure high quality root-cause analysis and ensuring follow-up of structural findings with all Tech domains.

Your Day to Day

The initial focus of the SRE team will be to lead the implementation of Global SRE processes and practices within ING Bank Romania using the global platforms & tooling and in alignment with global standards and strategies. The rest of the activities include:

Perform gap assessments and analyse outcome against defined maturity stages for the designated IT Process in SRE ownership. Document findings, agree priority of their resolution and ensure backlog items for improvement are created. Follow-up on the resolution of findings;

Create awareness regarding SRE best practices and follow-up with Tech organization to ensure their adoption and proper execution in the day-to-day activity . Ensure sufficient and accurate documentation for practices exists;

Contribute to the increase of resilience by SRE performing the activities specific to the process in your ownership and ensure reliability is appropriately prioritized at QBR;

Provide support in solving complex reliability issues;

Define local metrics for process in your ownership together with SRE Reporting & Knowledge and ensure their implementation;

Ensure accurate reporting related to process in your ownership in both local and global dashboards and provide input & feedback, when required;

Identify improvements related to process and practices via direct observation and drive their implementation;

Overall focus on system monitoring and process automation;

Attend Postmortem sessions and provide feedback when the root cause of P1/Major Incident is related to process in your ownership;

Attend global process guild;

Create Entity SRE Roadmap for process in ownership based on Global SRE Roadmap;

Collaborate with Global SRE Team and contribute to improve existing processes, practices and knowledge.

What you bring to the team

We’re looking for you if the below looks like your description

Education: Bachelor's or Master's degree in computer science, information systems, or a related discipline

Experience: 10+ Years in software engineering/IT operations and/or IT architect roles

Technical skills:

Knowledgeable about technology in all levels in the technology stack (from infrastructure to front-end, from CI/CD to observability tooling) with expert knowledge & hands-on experience on one or more levels (e.g. infrastructure & back-end development and/or observability & CI/CD tooling);

In-depth knowledge of system design and experience with scalable and reliable infrastructure;

Understanding of network protocols, security best practices, and ability to implement secure and robust solutions

Competence in using Cloud services;

Tools:  ING Private Cloud or Public Cloud (Azure or Google Cloud) and related VM/container stacks & tooling; application-level technologies & tooling heavily in use at ING e.g. spring boot, ING’s API SDK, Azure DevOps, Prometheus/ELK stack/Tracing or ING’s specific implementations (e.g. RTK2, Log4All, MDPL).

Proven experience or interest in the Site Reliability Engineering (SRE) methodology, IT security and compliance. Familiarity with DevOps culture and practices;

Proven experience with ITIL processes and ITSM tools (ServiceNow, Azure DevOps, etc.);

Strong analytical and problem-solving skills;

High accuracy in performing duties;

Ability to efficiently promote in the organization the SRE concepts and frameworks;

Effective communication, both written and verbal, to convey complex technical concepts in a clear and understandable manner;

Strong stakeholder management abilities.

Nice to have

Skilled in Configuration Management, automation tools (Ansible, Azure Pipeline) and scripting languages (PowerShell, Bash, Python);

Familiar with observability, logging and monitoring tools such as ELK stack, Kibana, Prometheus and Grafana;

Experienced with Project Management.

What we offer

Impactful work in a fun and collaborative environment.

Open-concept offices designed for both team work and relaxation.

Corporate events and social gatherings.

Hybrid way of working with flexible working schedule and short week options.

Monthly budget on Benefit platform.

Extra annual leave days depending on the total length of working experience.

Growth opportunities through upskilling/ reskilling programs and a variety of learning and development platforms: ING Learning Centre, Udemy, Bookster, as well as through trainings and certifications.

Possibility to access Internal roles, International Short-Term Assignments or Long-Term Assignments.

Context to make an impact through Sustainability and Corporate Social Responsibility projects.

Confirm your E-mail: Send Email
All Jobs from ING Group