2025 Summer Intern - gRED Computational Catalyst
Department Summary
At Genentech Research & Early Development (gRED), the Computational Catalysts group within the Computational Sciences (gCS) department is a diverse, curious, and action-driven team at the intersection of computation, engineering, and science. We focus on partnering with the informatics and scientific communities to create a computational and data ecosystem that powers scientific discovery and accelerates decision-making. By leveraging modern computational approaches, we aim to revolutionize how targets and therapeutics are discovered and developed, ultimately enabling novel treatments for patients across the world.
Within Computational Catalysts, our team specializes in the practical applications of AI to develop the next generation of tools for scientists and optimize operations. We work on diverse, impactful projects such as streamlining the authoring and review of regulatory documents, enhancing data exploration through chat-driven search, and integrating AI into visualization tools to interpret data and provide contextual insights. Our mission is to deliver scalable, innovative solutions that empower scientists and drive meaningful advancements in medicine.
The Opportunity
We are seeking a motivated and skilled intern to join our team and contribute to advancing our AI-powered tools for life sciences. During the internship, you will gain experience working on key projects that apply machine learning and natural language processing (NLP) to accelerate scientific research:
Conducting research and experiments to explore innovative ways of integrating AI into scientific discovery processes.
Developing and optimizing tools for life sciences applications, focusing on natural language processing (NLP) and vector database techniques.
Developing robust benchmarking and evaluation frameworks for AI-powered applications
Designing and implementing features that improve the usability and effectiveness of large language models (LLMs) within our ecosystem.
Collaborating with interdisciplinary and cross-functional teams including biologists, engineers, data scientists, and other stakeholders.
Program Highlights
Intensive 12-weeks, full-time (40 hours per week) paid internship.
Program start dates are in May/June (Summer)
A stipend, based on location, will be provided to help alleviate costs associated with the internship.
Ownership of challenging and impactful business-critical projects.
Work with some of the most talented people in the biotechnology industry.
Who You Are
Required Education:
Must be pursuing a Bachelor's Degree (enrolled student).
Must have attained a Bachelor's Degree (not currently enrolled in a graduate program).
Must be pursuing a Master's Degree (enrolled student).
Must have attained a Master's Degree.
Must be pursuing a PhD (enrolled student).
Must have attained a PhD.
Required Majors: Computer Science, Bioinformatics, Data Science, Computational Biology, or a related area
Required Skills:
Proficiency in Python
Familiarity with large language models (LLMs) and vector databases.
Basic knowledge of natural language processing (NLP) techniques and concepts.
Strong problem-solving abilities and the ability to work independently and collaboratively.
Interest in contributing to cutting-edge scientific research.
Preferred Knowledge, Skills, and Qualifications
Understanding of the life sciences or healthcare industries.
Experience with cloud technologies for hosting scalable AI models.
Excellent communication, collaboration, and interpersonal skills.
Relocation benefits are not available for this job posting.
The expected salary range for this position based on the primary location of STATE is $45-$50 hour. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. This position also qualifies for paid holiday time off benefits.
#GNE-R&D-Interns-2025
Genentech is an equal opportunity employer, and we embrace the increasingly diverse world around us. Genentech prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin or ancestry, age, disability, marital status and veteran status.