Pittsburgh, PA, 15222, USA
4 days ago
Manager, Data Engineering
Join Xylem in the global mission to #LetsSolveWater! As a leading water technology company with 23,000 employees operating in over 150 countries, Xylem is at the forefront of addressing the world's most critical water challenges. We invite passionate individuals to join our team, dedicated to exceeding customer expectations through innovative and sustainable solutions. **THE ROLE:** As the Manager of Data Engineering, you will be responsible for architecting, developing, and maintaining scalable data pipelines that empower business decisions through advanced analytics. This role requires a strategic leader capable of driving data initiatives across the organization while collaborating with cross-functional teams. **CORE RESPONSIBILITIES:** + **Lead and manage the Data Engineering team** to design, develop, and implement robust data pipelines that facilitate data collection, transformation, and analysis. + **Architect and optimize scalable data solutions** using Azure services, including Azure Data Factory, Databricks, and Azure SAP Data Sphere. + **Integrate SAP ERP systems** with modern cloud-based data platforms, ensuring smooth data flow across the enterprise which includes legacy ERP systems and SAP S4. + **Drive innovation** by leveraging Azure’s advanced data tools to optimize data processes and workflows. + **Collaborate with stakeholders** in IT, Sourcing, and other departments to understand data requirements, deliver insights, and ensure data integrity and security. + **Implement data governance and best practices** to ensure the accuracy, availability, and performance of data assets. + Oversee the **end-to-end data lifecycle management** , including ingestion, storage, processing, and retrieval of data for advanced analytics and business reporting. + **Mentor and develop the skills of the data engineering team** , ensuring continuous improvement and knowledge growth in cloud data platforms. + **Work with analytics teams** to support machine learning, data science, and reporting initiatives by providing well-structured and timely data. + Ensure compliance with data privacy regulations and **security best practices** in cloud data environments. **QUALIFICATIONS:** + **Bachelor’s degree** in Computer Science, Data Engineering, Information Systems, or a related field. Master’s degree preferred. + **5+ years of experience** in data engineering, with **at least 2 years in a management or leadership role.** + Extensive experience with **Azure technologies** , including Azure Data Factory, Databricks, and Azure SAP Data Sphere. + Strong knowledge of **SAP data integration** with cloud platforms. + Experience integrating legacy systems + Proficiency in **data modeling, ETL processes, and building large-scale data pipelines** . + Expertise in **SQL, Python, or Spark** for data engineering tasks. + Proven track record of managing and delivering **complex data projects** in a collaborative environment. + Strong understanding of **data governance, data quality, and compliance** in enterprise environments. + Excellent communication and leadership skills with the ability to influence stakeholders at all levels. **Technical Knowledge:** + Deep understanding of Azure cloud infrastructure and services, particularly those related to data management (e.g., Azure Data Lake, Azure Blob Storage, Azure SQL Database). + Experience with Azure Data Factory (ADF) for orchestrating ETL pipelines and automating data workflows. + Familiarity with Azure Databricks for big data processing, machine learning, and collaborative analytics. + Expertise in **Apache Spark** for distributed data processing and large-scale analytics. + Familiarity with **Databricks** , including managing clusters and optimizing performance for big data workloads. + Understanding of Databricks Bronze, Silver, and Gold Model. + Understanding of **distributed file systems** like HDFS and cloud-based equivalents like **Azure Data Lake** . + Proficiency in **SQL** and **NoSQL databases** , including designing schemas, query optimization, and managing large datasets. + Experience with **data warehousing** solutions like Databricks, **Azure Synapse Analytics** or **Snowflake** . + Familiarity with connecting data Lakehouse’s with Power BI. + Understanding of **OLAP** (Online Analytical Processing) and **OLTP** (Online Transaction Processing) systems. + Strong grasp of **data modeling techniques** , including conceptual, logical, and physical data models. + Experience with **star schema, snowflake schema, and normalization** for designing scalable, performant databases. + Knowledge of **data architecture best practices** , ensuring efficient data flow, storage, and retrieval. + Knowledge of **CI/CD pipelines** for automating the deployment of data pipelines, databases, and infrastructure. + Experience with **infrastructure as code** tools like **Terraform** or **Azure Resource Manager** to manage cloud resources. + Familiarity with **version control systems** like Git for managing codebases and collaborative work. + Expertise in managing **data lakes** , ensuring proper storage, governance, and retrieval of structured and unstructured data. + Familiarity with **data Lakehouse** architecture, integrating the best features of data lakes and data warehouses. + Experience with **partitioning strategies** to optimize data access in cloud environments. + Proficiency in **workflow orchestration** tools like **Apache Airflow** or **Azure Data Factory** , automating repetitive tasks and ensuring efficient pipeline operations. + Automating data ingestion and transformation workflows using cloud-native or third-party tools. + Experience with **monitoring tools** like Azure Monitor, Datadog, or Grafana for tracking the performance and health of data systems. + Proficiency in **debugging and troubleshooting** issues related to data pipelines, databases, and cloud infrastructure. **Preferred Qualifications:** + Experience with **machine learning, data science tools, and big data processing frameworks** . + Knowledge of **DevOps practices** related to CI/CD pipelines in data engineering. + Certifications in **Azure Data Engineering** or other relevant data platforms. Salary range: $91,500.00 - $164,500.00 Join the global Xylem team to be a part of innovative technology solutions transforming water usage, conservation, and re-use. Our products impact public utilities, industrial sectors, residential areas, and commercial buildings, with a commitment to providing smart metering, network technologies, and advanced analytics for water, electric, and gas utilities. Partner with us in creating a world where water challenges are met with ingenuity and dedication; where we recognize the power of diversity, equity and inclusion in driving innovation and allowing us to compete more effectively around the world. At Xylem, you'll not only contribute to solving water issues but also have the chance to make a difference through our paid Volunteer Program, Xylem Watermark. We embrace diversity and prioritize our employees' well-being through our DE&I initiatives and Employee Resource Groups (ERG). Proud to be an Equal Employment Opportunity (including disability and veterans) and Affirmative Action workplace, Xylem fosters an inclusive environment free from discrimination or harassment.   Please note that the information in this job description outlines the general nature of the position and is not an exhaustive list of duties. Xylem is dedicated to providing reasonable accommodations to enable all employees to perform their essential job functions. We reserve the right to modify this job description and assign additional duties as needed. Embrace the opportunity to be part of Xylem's transformative journey in shaping the future of water technology! #XylemCareers #GlobalImpact #WaterInnovation
Confirm your E-mail: Send Email