LAC Federal is seeking an Entry-Level Data Scientist to work at a major federal library in the Washington, DC area. The Data Scientist will work with a larger team to develop the information architecture and framework for an Open Access data repository containing scientific data from federally funded research. Under the direct of senior staff, the Data Scientist will be responsible for supporting the management, analysis, and utilization of scientific data within federal agency repositories. This role involves working closely with a team of librarians, information specialists, senior data scientists, data managers, and IT professionals to ensure the effective organization, accessibility, and integrity of scientific datasets. The incumbent will employ various data science techniques and tools to extract insights, support research initiatives, and enhance decision-making processes. This is a hybrid position with remote and onsite required.
RESPONSIBILITIES:
Data Management:
Collaborate with data managers to ensure the proper organization, documentation, and storage of scientific datasets. Implement data quality control measures to maintain the accuracy, consistency, and completeness of repository contents. Develop and maintain data pipelines for the efficient extraction, transformation, and loading (ETL) of data from diverse sources.Data Analysis:
Utilize statistical and machine learning techniques to analyze scientific data and extract meaningful insights. Conduct exploratory data analysis (EDA) to identify patterns, trends, and anomalies within large datasets. Develop predictive models to support forecasting, risk assessment, and decision-making processes.Data Visualization:
Create clear and compelling visualizations to communicate findings and insights to stakeholders. Design interactive dashboards and reports to facilitate data exploration and interpretation. Ensure that visualizations adhere to best practices for data presentation and accessibility.Research Support:
Collaborate with scientists and researchers to understand their data needs and provide analytical support for research projects. Assist in the design and execution of experiments and studies, including data collection and analysis. Contribute to the development of data-driven research strategies and methodologies.Technical Support:
Provide technical assistance and training to users of scientific data repositories. Troubleshoot issues related to data access, analysis, and interpretation. Stay abreast of emerging technologies and best practices in data science, informatics, and related fields.