Senior AI Engineer, NeMo Retriever - Model Optimization and MLOps

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars and robotics to co-pilots and more. Join us at the forefront of technological advancement in intelligent assistants and information retrieval.

NVIDIA NIM provides containers to self-host GPU-accelerated inferencing microservices for pre-trained and customized AI models across clouds, data centers, RTX™ AI PCs, and workstations. NIM microservices expose industry-standard APIs for simple integration into AI applications, development frameworks, and workflows. Built on pre-optimized inference engines from NVIDIA and the community, including NVIDIA TensorRT and TensorRT-LLM, NIM microservices optimize response latency and throughput for each combination of foundation model and GPU.


NVIDIA NeMo Retriever is a collection of NIMs for building multimodal extraction, re-ranking, and embedding pipelines with high accuracy and maximum data privacy. It delivers quick, context-aware responses for AI applications like advanced retrieval-augmented generation (RAG) and agentic AI workflows.

The NeMo Retriever team is looking for an AI Engineer to work at the intersection of machine learning development, performance optimization, and MLOps. This role requires a unique blend of technical expertise in ML model development, system optimization, and operational excellence. We are looking for someone with a passion for working on the world's most complicated problems in the Generative AI, LLM, MLLM, and RAG spaces using our innovative hardware and software platforms. You will leverage and augment existing tools that enable building NIMs, which power flexible, multi-modal retrievers and agents. If you're creative and passionate about solving real-world conversational AI problems, come join us.

What You'll Be Doing:

  • Develop and maintain NIMs that containerize optimized models in line with OpenAPI standards, using Python or an equivalent performant language.

  • Work closely with partner teams to understand requirements, build and evaluate POCs, and develop roadmaps for production-level tools.

  • Enable development of integrated systems (AI Blueprints) that provide a unified, turnkey experience.

  • Help build and maintain our Continuous Delivery pipeline, with the goal of moving changes to production more quickly and safely while upholding key operational standards.

  • Provide peer reviews to other specialists, including feedback on performance, scalability, and correctness.

What We Need To See:

  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field (or equivalent experience).

  • 8+ years of demonstrated experience in a similar or related role.

  • Python programming expertise with Deep Learning (DL) frameworks such as PyTorch.

  • Experience delivering software in a cloud context and familiarity with the patterns and processes of managing cloud infrastructure.

  • Knowledge of MLOps technologies such as Docker Compose, containers, Kubernetes, Helm, and data center deployments.

  • Familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.

  • Excellent, in-depth, hands-on understanding of NLP, LLM, MLLM, Generative AI, and RAG workflows.

  • Self-starter with a passion for growth, enthusiasm for continuous learning, and a habit of sharing findings across the team.

  • Extremely motivated, highly passionate, and curious about new technologies.

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Due to unprecedented growth, our exclusive engineering teams are rapidly growing.

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

BADGES
Changemaker
Diversity Champion
Family Friendly
Global Citizen
Work & Life Balance
CULTURE VALUES
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
BENEFITS & PERKS
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
May 9, 2025