Others
35 days ago
Site Reliability Engineer

Site Reliability Engineer

Lead I - Software Engineering

Who We Are:

Born digital, UST transforms lives through the power of technology. We walk alongside our clients and partners, embedding innovation and agility into everything they do. We help them create transformative experiences and human-centered solutions for a better world.

UST is a mission-driven group of 29,000+ practical problem solvers and creative thinkers in more than 30 countries. Our entrepreneurial teams are empowered to innovate, act nimbly, and create a lasting and sustainable impact for our clients, their customers, and the communities in which we live.

With us, you’ll create a boundless impact that transforms your career—and the lives of people across the world.

Visit us at UST.com.

 

You Are:

We are seeking a skilled and experienced Site Reliability Engineer (SRE) to deliver ongoing support for critical member-facing Application Management (AM) and Hosting services. As a key member of our team, the candidate will be responsible for ensuring the reliability, availability, security, and scalability of our systems while driving continuous improvement through automation and data analysis.

The opportunity:

·       Ensure Site Reliability: Provide concrete recommendations for internal and external partner teams, including troubleshooting, CI/CD automations, and tool development to support the DevOps model.

·       System Health Monitoring: Monitor all possible metrics within the platform to always maintain a precise understanding of system health.

·       Advanced System Monitoring: Implement advanced end-to-end system monitoring, including smart monitoring of integrated and coupled partner and vendor systems.

·       Process Documentation: Document processes that cannot be automated, offering engineering teams expertise in availability, performance, and scalability.

·       Data Analysis: Conduct comprehensive data analysis to identify service trends and drive improvements.

·       Platform Documentation: Build end-to-end documentation and instrumentation of our platform to ensure visibility, automation, self-healing, and resiliency throughout the stack.

·       Compliance: Ensure compliance with general requirements for all deployments to production, including diagrams, dependencies, monitoring and logging plans, backups, and high availability setups.

·       Change Management: Support Change Management processes and functions, ensuring adherence to requirements for new services and deployments.

·       Incident Response: Lead incident response efforts, including root cause analysis and post-mortem reviews to prevent future occurrences.

·       Capacity Planning: Conduct capacity planning to ensure systems can handle future growth and demand.

·       Security Best Practices: Implement and enforce security best practices to protect systems and data.

·       Scalability and Cloud Migration: Design and implement scalable solutions to meet changing business needs, including migrating systems to cloud environments. Ensure seamless integration and operation of cloud-based services.

·       High Traffic Load Management: Develop strategies and implement solutions to scale systems efficiently to handle high traffic loads, ensuring optimal performance and reliability.

·       API Management: Support and manage API management tools, ensuring secure and efficient API integrations and operations.

·       Adaptability to Change: Continuously adapt to changing technologies, business needs, and industry trends. Implement upgrades, patches, and migrations to keep systems up-to-date and secure.

 

This position description identifies the responsibilities and tasks typically associated with the performance of the position. Other relevant essential functions may be required.

 

What you need:

·       Extensive experience as a Site Reliability Engineer or in a similar role, with a deep understanding of industry-standard technical skill sets, Agile methodologies, and capacity planning.

·       Proven experience in designing and implementing scalable systems to handle high traffic loads.

·       Experience with API management tools and practices to ensure secure and efficient API integrations.

·       Ability to manage and implement system upgrades, patches, and migrations to keep systems current and secure

·       Experience with cloud platforms (e.g., AWS, Azure, Google Cloud) and cloud-native technologies

·       Strong analytical and problem-solving skills, with the ability to troubleshoot complex issues and recommend proactive solutions.

·       Knowledge of change management processes and functions, including compliance requirements for deployments.

·       Excellent communication and collaboration skills, with a demonstrated ability to work effectively with cross-functional teams and external partners.

·       Proven ability to thrive in a fast-paced environment and adapt to changing priorities.

·       Programming Languages: SQL, Unix Scripting, Perl, Python, Java.

·       CI/CD Tools: Ansible, Jenkins, Artifactory, SonarQube, etc.

·       Monitoring Tools: AppDynamics, ThousandEyes, Dynatrace.

·       SDLC Tools: SharePoint, Jira.

·       Operating Systems: Linux, Windows.

·       Database: SQL Server.

·       Source Code Control: GitHub.

·       Service Management: ServiceNow and ITIL.

·       Web Servers: Apache, IIS, Web and Mobile Portal/App.

·       API Management: Apigee Gateway.

·       Other Tools (Nice to Have): AppViewX Certificate Management Tool, Splunk, GnuPG/PGP.

 

Compensation can differ depending on factors including but not limited to the specific office location, role, skill set, education, and level of experience. UST provides a reasonable range of compensation for roles that may be hired in various U.S. markets as set forth below.

Role Location: Remote

Compensation Range: $73,000-$109,000

 

 

Benefits

Full-time, regular employees accrue a minimum of 10 days of paid vacation per year, receive 6 days of paid sick leave each year (pro-rated for new hires throughout the year), 10 paid holidays, and are eligible for paid bereavement leave and jury duty. They are eligible to participate in the Company’s 401(k) Retirement Plan with employer matching. They and their dependents residing in the US are eligible for medical, dental, and vision insurance, as well as the following Company-paid Employee Only benefits: basic life insurance, accidental death and disability insurance, and short- and long-term disability benefits. Regular employees may purchase additional voluntary short-term disability benefits, and participate in a Health Savings Account (HSA) as well as a Flexible Spending Account (FSA) for healthcare, dependent child care, and/or commuting expenses as allowable under IRS guidelines. Benefits offerings vary in Puerto Rico.

Part-time employees receive 6 days of paid sick leave each year (pro-rated for new hires throughout the year) and are eligible to participate in the Company’s 401(k) Retirement Plan with employer matching.

Full-time temporary employees receive 6 days of paid sick leave each year (pro-rated for new hires throughout the year) and are eligible to participate in the Company’s 401(k) program with employer matching. They and their dependents residing in the US are eligible for medical, dental, and vision insurance.

Part-time temporary employees receive 6 days of paid sick leave each year (pro-rated for new hires throughout the year).

All US employees who work in a state or locality with more generous paid sick leave benefits than specified here will receive the benefit of those sick leave laws.

 

What we believe:

We proudly embrace the values that have shaped UST since day one. We build our culture of Humility, Humanity, and Integrity. These values inspire us to nurture a people-first, human centric culture that fosters diversity, prioritizes sustainable solutions, and keeps our people and clients at the forefront of all decisions.

 

Humility:

We will listen, learn, be empathetic and help selflessly in our interactions with everyone.

Humanity:

Through business, we will better the lives of those less fortunate than ourselves.

Integrity:

We honor our commitments and act with responsibility in all our relationships.

 

Equal Employment Opportunity Statement


UST is an Equal Opportunity Employer.

 

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other applicable characteristics protected by law. We will consider qualified applicants with arrest or conviction records in accordance with state and local laws and “fair chance” ordinances.

UST reserves the right to periodically redefine your roles and responsibilities based on the requirements of the organization and/or your performance.

 

 

#UST

#CB

#LI-IS1

#LI-Remote

Confirm your E-mail: Send Email