Data Quality Engineer
Do you want to be part of an inclusive team that works to develop innovative therapies for patients? Every day, we are driven to develop and deliver innovative and effective new medicines to patients and physicians. If you want to be part of this exciting work, you belong at Astellas!
Astellas Pharma Inc. is a pharmaceutical company conducting business in more than 70 countries around the world. We are committed to turning innovative science into medical solutions that bring value and hope to patients and their families. Keeping our focus on addressing unmet medical needs and conducting our business with ethics and integrity enables us to improve the health of people throughout the world. For more information on Astellas, please visit our website at www.astellas.com.
This is a remote position and is based in India. Remote work from certain states may be permitted in accordance with Astellas’ Responsible Flexibility Guidelines. Candidates interested in remote work are encouraged to apply.
Purpose and Scope:
The Data Quality (DQ) Engineer role is responsible for ensuring the accuracy, completeness, consistency, and reliability of data within key systems. This role is critical for maintaining the integrity of clinical, portfolio management, and product submission data, enforcing data quality principles, and bolstering regulatory compliance efforts. The ideal candidate should possess strong knowledge of the role data quality plays in the data lifecycle and ensures that data inputs and outputs meet all specified Data Quality (DQ) criteria.
Through close collaboration with study leads, regulatory experts, and DigitalX professionals, this role establishes and upholds robust Data Quality (DQ) standards that align with business objectives, regulatory mandates, and industry standards. Expertise in Data Quality, Data Governance, and adeptly managing governance processes is essential for maintaining the integrity of data assets. This role holds a pivotal position in advancing the life sciences company's mission by using data to drive the facilitation of data-driven decision-making processes, expediting drug discovery initiatives, and bolstering regulatory compliance efforts.
Essential Job Responsibilities:
Responsible for Data Quality (DQ) business rules implementation, DQ Analysis, and DQ Remediation activities.
DQ Checks Implementation: Participate as needed in the creation of data quality checks and standards for multiple source systems. Implement business data quality checks on the SaaS platform (E.g. Databricks), working in conjunction with business teams, and data lake teams. Team-up with Data Lake teams in orchestration of DQ checks code across the various environments, leading to a successful Production deployment. Experience with unit testing and test-driven development (TDD), Test Scripts Execution and Approval in Lifecycle Management. Perform required validation and testing, to ensure DQ output matches the rules’ criteria, ensuring data accuracy, integrity and consistency across systems and applications. Collaborate with external vendors and partners to optimize data quality services. DQ Support and Remediation:Identify, document and track DQ violations and defects, work with development, data lake teams to resolve issues. Define, monitor, and report on Data Quality metrics. Collaborate internally in the creation of repeatable processes and continuous improvements. Lead DQ investigations to analyse issues and determine business impact. Establish ongoing data quality monitoring to ensure DQ issues are quickly identified and resolved. Provide post-release support related to Data Quality problems, workflows, and DQ batch job runs.Other Responsibilities:
Assist and coordinate for data operations with data lake teams on creating, automating, and monitoring the data quality processes, ensuring continuous improvement. Assist in enforcing data governance and information management policies, SOPs, and standards to manage clinical and product data in a secure and compliant manner. Improving the delivery of complex DQ projects by implementing standard processes, tools, and templates that will be used consistently across the Division. Participate in process improvements initiatives involving business areas. Identify areas for improvement with current DQ tool sets and propose future state to support DigitalX use cases, analytics best practices, and regulations. Provide short range, tactical planning and execution of actions required to deliver DQ strategies and objectives. Implement as needed metadata standards, data quality and testing procedures for the data lake/warehouse consistent with Astellas SLC standards for GxP/non-GxP sources. Work closely with Business Users and Pod/Value Team leads in developing data health insights on Tableau, Qlik or equivalent dashboards. Monitoring the integration of data sources, such as clinical trials data, operational data, portfolio management data, and product submission data into the centralized data Lakehouse. Mentors analysts to perform a variety of ad-hoc analysis and reports as requested around data quality and integrity.