Santa Clara, CA, USA
65 days ago
PhD Research Intern, Large Language Models - 2025

By submitting your resume, you’re expressing interest in one of our 2025 internships focused on Large Language Models. We’ll review resumes on an ongoing basis, and a recruiter may reach out if your experience fits one of our many internship opportunities.

NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society — from gaming to robotics, self-driving cars to life-saving healthcare, climate change to virtual worlds where we can all connect and create. We are passionate about research that pushes boundaries but also has impact in the real world. You will be part of an amazing collaborative research team that consistently publishes at the top venues in machine learning and systems. 

  

Our internships offer an excellent opportunity to expand your career and get hands-on experience with one of our industry-leading Large Language Models Research teams. We’re seeking strategic, ambitious, hard-working, and creative individuals who are passionate about helping us tackle challenges no one else can solve.

 

What you'll be doing: 

Investigate novel approaches to infuse theory-of-mind reasoning into the post- or pre-training phases of large language models 

Collaborate with other team members, teams, and/or external researchers. 

Transfer your research to product groups to enable new products or types of products. 

Opportunity to publish original research. 

What we need to see: 

Currently pursuing a PhD degree in Computer Science/Engineering or Electrical Engineering.

Research experience in at least one of the following areas: 

Large Language Models – training, alignment, and evaluation 

Foundation Models 

Multimodal Models/Agents 

Vision-Language Models 

Deep Learning, Model Compression, and Acceleration Techniques: Pruning, Quantization, Neural Architecture Search (NAS), Efficient Backbone Architecture, Distillation

Strong research track record and publication record at top-tier conferences. 

Excellent communication skills. 

Excellent programming skills in a rapid prototyping environment such as Python; C++ and parallel programming (e.g., CUDA) are a plus.

Hands-on experience with large-scale model training is a plus. 

Knowledge of common machine learning frameworks, such as PyTorch 

 

NVIDIA is widely considered one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world. Are you a creative and collaborative researcher with a real passion for large language models? If so, we want to hear from you!

The hourly rate for our interns is 30 USD - 90 USD. Internship hourly rates are standard and are determined by the position and your location, year in school, degree, and experience.

You will also be eligible for Intern benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
