Lowell, MA
17 days ago
Director - Site Reliability Engineering

 

Company Overview 

 

With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And we’re only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on.  

 

 

 

At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR technology built for all. 

 

 

 

Here, we know that you’re more than your work. That’s why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose — a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If you’re passionate about our purpose — people —then we can’t wait to support whatever gives you purpose. We’re united by purpose, inspired by you.   

 

About the Team:

As a Director of Site Reliability Engineering, you would combine software and systems engineering to build and run robust, fault-tolerant, distributed systems to operate at scale. A pivotal role with transformational leadership qualities creates a vision for SRE and drives the “Automation and AI” first culture. Responsible for End-to-end observability, availability, performance, and uptime of mission-critical services.

About the Role:
Responsibilities:
• This pivotal role will lead the SRE team and be responsible for maintaining reliability, performance, availability, and scalability.
• Help drive change across the company, working towards a standard methodology based around Site Reliability Engineering and Solid System Engineering practices.
• Lead the team in driving further adoption of Site Reliability practices such as, Chaos engineering, SLOs, Error Budgets, release safety, load testing, and disaster recovery strategies.
• Build teams through hiring and people growth while balancing your ownership workload through delegation. Define and review individual and team goals, fostering a culture of continuous improvement and innovation.
• Stay current with emerging technologies and industry trends, advocating for their adoption where appropriate to drive innovation and productivity enhancement within the team (e.g., AIOps, CoPilot)
• Collaborate cross-organization to complete successful delivery with the broader functions, including but not limited to Security, Architecture, Operations, and Product Managers.
• Responsible for guiding and encouraging the personal and technical development, engagement, and growth of your direct reports
• Coach the organization on the principles of SREs, including incident response, automation, observability improvements, toil reduction, self-healing, and root cause analysis.
• Manage on-call rotations across the globe and implement the follow-the-sun model.

About You:
Minimum Qualifications:
• Bachelor's or master's degree in engineering or a related technical field.
• 10+ years of experience in DevOps, SRE, and Cloud Infrastructure engineering/Operations, with at least 5+ years of management experience.
• Experience and Knowledge of Public Cloud Infrastructure like Google Cloud, Cloud-based Applications, Containerization, and microservices architecture.
• Experience with Observability and incident management tools like Prometheus, Datadog, Splunk, Prometheus, and PagerDuty. etc
• Experience in running the CI/CD pipelines infrastructure as Code (Terraform), Config Management (Ansible), GitHub Actions, and Jenkins. etc
• Strong leadership, problem-solving skills, attention to detail, delivering high-quality solutions, excellent communication, and interpersonal skills, with the ability to influence and drive technical decisions across the organization

 

Where we’re going 

 

UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet it’s our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow!   

 

 

 

Equal Opportunity Employer    

 

Ultimate Kronos Group is proud to be an equal opportunity employer and is committed to maintaining a diverse and inclusive work environment. All qualified applicants will receive considerations for employment without regard to race, color, religion, sex, age, disability, marital status, familial status, sexual orientation, pregnancy, genetic information, gender identity, gender expression, national origin, ancestry, citizenship status, veteran status, and any other legally protected status under federal, state, or local anti-discrimination laws.      

 

View The EEO Know Your Rights poster and its supplement.      

 

View the Pay Transparency Nondiscrimination Provision     

 

UKG participates in E-Verify. View the E-Verify posters here.   

 

 

 

Disability Accommodation 

 

For individuals with disabilities that need additional assistance at any point in the application and interview process, please email UKGCareers@ukg.com.  

 

 The pay range for this position is $179,800 to $258,500, however, base pay offered may vary depending on skills, experience, job-related knowledge and location. This position is also eligible for a short-term incentive and a long-term incentive as part of total compensation. Information about UKG’s comprehensive benefits can be reviewed on our careers site at https://www.ukg.com/careers   

 

Confirm your E-mail: Send Email