Remote, CA, United States of America
13 days ago
Full Stack Developer, AI and LLM

NVIDIA's Silicon Solutions Group is seeking a full-stack developer with AI/LLM expertise to help integrate AI into its data analysis and automation infrastructure. The solutions developed will support multiple critical large-scale automation initiatives. In this role, you will lead strategies and design of AI solutions to improve the efficiency of our existing and new automation workflows. The ideal candidate will combine technical expertise with hands-on experience to drive all AI planning, design, and implementation aspects. At NVIDIA, we strive for excellence, encourage innovation, and provide opportunities to explore new ways to succeed!

What You'll Be Doing:

Study and develop groundbreaking techniques in deep learning, graphs, machine learning, and data analytics, and perform in-depth analysis.

Collaborate with developers and cross-functional teams to identify current and emerging challenges.

Design and implement end-to-end generative AI solutions, specializing in Large Language Model (LLM) training, efficient deployment strategies, and sophisticated Retrieval-Augmented Generation (RAG) workflows.

What We Need to See:

MS (or equivalent experience) with 6+ years of software development; 2+ years relevant work experience in developing and deploying AI solutions

Proven full-stack development experience with a focus on improving application performance and user experience

Proficiency in Python, C++ programming, and Deep Learning frameworks

Ability to work independently and as part of a team

Motivated self-starter with strong analytical and debug skills

Ability to balance multiple simultaneous projects

Excellent verbal and written communication skills

Ways to Standout from the crowd:

Experience with CUDA programming and benchmarking and analyzing performance AI Agentic systems

Expertise in training, fine-tuning, and evaluating LLMs using popular frameworks such as TensorFlow or PyTorch

Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms

Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms ยท

NVIDIA is widely considered to be one of the world's most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Confirm your E-mail: Send Email