Sunnyvale, CA, US
8 days ago
Senior Data Engineer, Items and Offers Platform
Amazon is a fast paced innovative company that is developing software that no one has attempted before. If you are a data engineer who is passionate about writing code and loves to build large scale data pipelines which are , scalable, high throughput, fault tolerant and always available, then get in touch with us.

The Item and Offers team is responsible for a variety of services that form a core part of the Amazon eCommerce platform. We are primarily responsible for developing the services that process all of the Item information from millions of merchants who want to sell through the Amazon family of websites. Our expertise lies in managing billions of products in the catalog and developing large scale distributed systems that process hundreds of millions of changes to the catalog every day in real time. The team offers a unique blend of hard computer science problems and an opportunity to help the businesses model their new ideas.

Are you passionate about working with large datasets and code? Do you want to build and manage data engineering solutions that process a broad range of data schemas? Do you want to continuously improve the data pipelines that operate at Amazon’s catalog scale while ensuring our customer’s trust? If yes, then come join the Catalog Data Works (CDW) team with the charter to provide useful, fresh and historical catalog data that teams at Amazon can analyze and leverage for their business use cases. The published by this team are a critical component of building a catalog that earns our customers’ trust.

As a Sr. Data Engineer in the CDW team, you will own complex big data pipelines and data solutions to provide highly availability datasets. You will work with large data sets (in petabytes) and transformations involving multiple data sources to enable downstream analytics for our stakeholders. You will build and manage large datasets to help teams drive data-driven decisions through analytical and business metrics dashboards.

The Data Engineer will play a crucial role in designing, developing, and maintaining efficient and scalable data pipelines, data models, and data warehousing solutions. This position will be responsible for ensuring data integrity, quality, and availability across the organization, enabling data-driven decision-making and supporting business analytics and insight initiatives.

Key job responsibilities
• Define and optimize data models for rapid analytics on catalog product data, improving freshness and LLM consumption while reducing costs and undifferentiated work.

• Automate metrics generation to support S-team goals, including pack hierarchy scaling and standard KPIs, while leading strategy for scaling self-serve analysis and dashboards.

• Mentor engineers, establish best practices in data engineering and operational excellence, and stay current with latest technologies to recommend innovations.

• Conduct comprehensive data discovery, profiling, and performance analysis for various sources, designing effective models for Page0, entitlement, propensity, and other relevant data.

• Collaborate with stakeholders to translate requirements into optimized data structures, while establishing and enforcing data governance policies to maintain quality, consistency, and security.

• Take a long-term view of data solutions, proactively addressing architecture deficiencies and making appropriate trade-offs for usability, security, maintainability, scalability, and extensibility.

• Resolve root causes of endemic problems, unblocking innovation for related teams, and build consensus with stakeholders to influence and determine the best path forward.

About the team

Confirm your E-mail: Send Email