Reston, VA, US
11 hours ago
Lead Data Engineer- Cloudera BDA

The Lead Data Engineer is responsible for orchestrating, deploying, maintaining and scaling cloud OR on-premises infrastructure targeting big data and platform data management (Relational and NoSQL, distributed and converged) with emphasis on reliability, automation and performance. This role will focus on leading the development of solutions and helping transform the company's platforms deliver data-driven, meaningful insights and value to company.

ESSENTIAL FUNCTIONS:

• Lead the team to design, configure, implement, monitor, and manage all aspects of Data Integration Framework. Defines and develop the Data Integration best practices for the data management environment of optimal performance and reliability.

• Develops and maintains infrastructure systems (e.g., data warehouses, data lakes) including data access APIs. Prepares and manipulates data using Hadoop or equivalent MapReduce platform.

• Provides detailed guidance and performs work related to Modeling Data Warehouse solutions in the cloud OR on-premise. Understands Dimensional Modeling, De-normalized Data Structures, OLAP, and Data Warehousing concepts.

• Oversees the delivery of engineering data initiatives and projects. Supports long term data initiatives as well as Ad-Hoc analysis and ELT/ETL activities. Creates data collection frameworks for structured and unstructured data. Applies data extraction, transformation and loading techniques in order to connect large data sets from a variety of sources.

• Enforces the implementation of best practices for data auditing, scalability, reliability and application performance. Develop and apply data extraction, transformation and loading techniques in order to connect large data sets from a variety of sources.

• Interprets data, analyzes results using statistical techniques, and provides ongoing reports.  Executes quantitative analyses that translate data into actionable insights.  Provides analytical and data-driven decision-making support for key projects.  Designs, manages, and conducts quality control procedures for data sets using data from multiple systems.

• Improves data delivery engineering job knowledge by attending educational workshops; reviewing professional publications; establishing personal networks; benchmarking state-of-the-art practices; participating in professional societies.

SUPERVISORY RESPONSIBILITY:

Position does not have direct reports but is expected to assist in guiding and mentoring less experienced staff. May lead a team of matrixed resources.

QUALIFICATIONS:

Education Level: Bachelor's Degree in Computer Science, Information Technology or Engineering or related field OR in lieu of a Bachelor's degree, an additional 4 years of relevant work experience is required in addition to the required work experience.

Experience: 8 years Experience in leading data engineering and cross functional team to implement scalable and fine tuned ETL/ELT solutions for optimal performance. Experience developing and updating ETL/ELT scripts. Hands-on experience with application development, relational database layout, development, data modeling.

Preferred Qualifications:

• Advanced (expert preferred) level experience in administrating and engineering relational databases (ex. MySQL, PostgreSQL), Big Data systems (ex. Cloudera Data Platform Public Cloud), Apache Solr as SME, ETL (ex. Ab Initio), BI (ex. MicroStrategy), automation tools (ex. Ansible, Terraform, Bit Bucket) and experience working cloud solutions (specifically data products on AWS) are necessary.

• At least 8 years or more of Experienced with all the tasks involved in administration of big data and Meta Data Hub such as Cloudera.

• Experience with Ab Initio, EMR, S3, Dynamo DB, Mongo DB, Athena, ProgreSQL, Redshift, RDS, DB2 is a Plus.

• DevOps (CI/CD Pipeline) is a Plus.

• Experience with Advance knowledge of UNIX and SQL

• Experience with manage metadata hub-MDH, Operational Console and troubleshoot environmental issues which affect these components

• Require prior experience with migration from on-premise to AWS Cloud.

• Experience with Cloudera CDP public cloud; Solr, NiFi SME.

• Strong technical and analytical and problem solving skills to troubleshoot to solve a variety of problems.

• Represents team in all architectural and design discussions. Knowledgeable in the end-to-end process and able to act as an SME providing credible feedback and input in all impacted areas. Require tracking and monitoring projects and tasks as the lead.

Knowledge Skills and Abilities (KSAs)

• Knowledge and understanding of database design and implementation concepts.

• Knowledge and understanding of data exchange formats.

• Knowledge and understanding of data movement concepts.

• Strong technical and analytical and problem-solving skills to troubleshoot to solve a variety of problems.

• Requires strong organizational and communication skills, written and verbal, with the ability to handle multiple priorities.

• Able to effectively provide direction to and lead technical teams.

Confirm your E-mail: Send Email