Austin, TX, 78703, USA
28 days ago
Senior Data Engineer
**Senior Data Engineer** Remote - US

Cleo is a cloud integration technology company focused on business outcomes. Every day, we ensure that each one of our 4,000+ customers' potential is realized by delivering solutions that make it easy to discover and create value through the connections and integration of enterprise applications supporting critical workflows. By providing the industry’s most complete and flexible integration offerings, we are helping our clients build trusted relationships across their partner ecosystems today, while providing all the control and visibility they need to advance their business tomorrow. In a nutshell, Cleo is a rapidly growing category leader in ecosystem integration software and we have experienced tremendous growth over recent years.

The **Senior Data Engineer** is a hands-on leader responsible for designing, developing, and maintaining data pipelines and infrastructure at Cleo. This role involves setting the strategy for data systems, collaborating closely with cross-functional teams, and ensuring the scalability, reliability, and efficiency of data solutions. The Senior Data Engineer will focus on data infrastructure needs for **AI/ML models**, overseeing the creation of a **data warehouse** and associated systems from scratch, and ensuring data is properly transformed and optimized for machine learning and artificial intelligence applications. This role is integral to building the processes that support **data transformation**, **data structures**, **metadata management**, **data quality controls**, and **workload management**.

**What You Will Be Doing**

+ **Lead the Design and Build of Data Pipelines**: Develop and maintain scalable, reliable, and efficient data pipelines that collect, process, and store large datasets. Ensure these systems are optimized for **AI/ML** model training and inference.
+ **Set Data Infrastructure Strategy**: Define and execute the strategy for building and maintaining **data warehouses**, **data lakes**, and other data storage systems that support both operational and analytical needs. Ensure systems are optimized for AI/ML model workflows.
+ **Hands-On Data Transformation for AI/ML Models**: Design and implement data transformation processes, including **feature engineering**, **data preprocessing**, and **data augmentation**, to ensure data is in the right format for machine learning models.
+ **Build Data Structures and Metadata Management**: Build and manage the data structures, metadata repositories, and related systems that support data transformation, ensuring the organization has well-documented, accurate, and accessible data for AI/ML workflows.
+ **Data Quality Controls and Risk Management**: Establish and implement data quality controls to ensure that data used in machine learning models is clean, consistent, and accurate. Identify and raise risks at all stages of the data engineering process, including data ingestion, transformation, and storage.
+ **Collaborate with Cross-Functional Teams**: Work closely with **data scientists**, **ML engineers**, **product managers**, and business leaders to understand data requirements and ensure data systems meet the needs of AI/ML initiatives. Provide hands-on leadership to guide teams in transforming data for model training.
+ **ETL Development and Optimization for AI/ML**: Lead the development and optimization of ETL (Extract, Transform, Load) processes for ML/AI data. Focus on efficiently moving and transforming large datasets, ensuring data quality and readiness for model training and deployment.
+ **Optimize Data for Model Training and Inference**: Ensure data pipelines are designed to support the needs of AI/ML model training, such as handling large volumes of data, managing data quality, and enabling fast model iteration.
+ **Data Governance for AI/ML**: Define and implement data governance practices to ensure the secure, compliant, and ethical use of data in AI/ML workflows.
+ **Stay Current**: Keep up with emerging trends in data engineering, particularly those related to **AI/ML** model development, and implement best practices to optimize data systems for machine learning.

**Your Qualifications**

+ **Experience**: 5-7+ years of experience in data engineering, with a focus on **transforming data for AI/ML models** and optimizing data systems to support machine learning and artificial intelligence workflows.
+ **Hands-On Expertise**: Proven experience in hands-on **data transformation** and building **data pipelines** for AI/ML, including data preprocessing, feature engineering, and model-specific data handling.
+ **Leadership**: Experience leading or mentoring data engineering teams, providing hands-on guidance for AI/ML-related projects and collaborating with cross-functional teams.
+ **Cloud and Big Data**: Strong experience with **cloud platforms** and **big data technologies**, particularly in the context of **AI/ML model development**.

**A few things we have to offer:**

+ Competitive compensation
+ Great Healthcare + Dental + Vision
+ Flexible PTO
+ Culture of support, encouraging Life-Work balance
+ 401k match
+ FSA and HSA options
+ Employee Assistance Program
+ Paid Parental Leave
+ Representing a company with 4,000+ clients and a 99% retention rate
+ Accelerated title and salary growth potential
+ A fun and energetic work environment that makes you excited to go to work every day

_Cleo Communications, LLC is an equal opportunity/affirmative action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability status, protected veteran status or any other characteristic protected by law._