At NVIDIA, we are at the forefront of the constantly evolving field of large language models, and their application in agentic and reasoning use cases. As the scale and complexity of these LLM systems continues to increase, we are seeking outstanding engineers to join our team and help shape the future of LLM inference. Our team is dedicated to pushing the boundaries of what's possible with LLMs by improving the algorithmic performance and efficiency of systems that represent them. We constantly reflect on how to improve these systems, developing new inference algorithms and protocols, improving existing models, and seamlessly integrating improvements to ensure NVIDIA's solutions can efficiently handle large-scale, sophisticated tasks.
What you'll be doing:
Research and Development: Explore and incorporate contemporary research on generative AI, agents, and inference systems into the NVIDIA LLM software stack.
Workload Analysis and Optimization: Conduct in-depth analysis, profiling, and optimization of agentic LLM workloads to significantly reduce request latency and increase request throughput while maintaining workflow fidelity.
System Design and Implementation: Design and implement scalable systems to accelerate agentic workflows and efficiently handle sophisticated datacenter-scale use cases.
Collaboration and Communication: Advise future iterations of NVIDIA software, hardware, and system by engaging with a diverse set of teams at NVIDIA and external partners and formalizing the strategic requirements presented by their workloads.
What we need to see:
BS, MS, PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).
Experience in deep learning and deep learning systems design.
Proficiency in Python and C++ programming
Strong understanding of computer architecture, and GPU/parallel datacenter computing fundamentals.
Proven interest in analyzing, modeling, and tuning application performance.
Ways to stand out from the crowd:
Experience in building large-scale LLM inference systems, especially those involving compound AI.
Experience with processor and system-level performance modeling.
GPU programming experience with CUDA or OpenCL.
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent.
The base salary range is 120,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lead product marketing and software engineering efforts to create technical content that empowers developers on NVIDIA's AI platform software stack.
Seeking an experienced Senior Compiler Engineer to develop and optimize deep learning accelerator compilers at NVIDIA.
Samsara seeks an experienced Senior Machine Learning Engineer to develop and optimize AI models for edge devices, driving innovation in IoT-powered physical operations.
Experienced Senior AI Engineer needed for a part-time remote role to build and lead AI-driven data platform projects with creativity and ownership.
Ingredion seeks a skilled Reliability Maintenance Engineer to enhance operational efficiency and asset reliability at their Indianapolis facility.
TrueML is looking for a seasoned Staff Engineer to spearhead their omni-channel communications platform, driving technical strategy and delivery in a remote role.
Experienced Mechanical Designer needed at Sargent & Lundy's Government Services to develop detailed mechanical designs for specialized equipment supporting DOE nuclear projects.
ZEISS is seeking a Sr. Mechanical Design Engineer to lead mechanical design efforts for innovative medical devices in a collaborative, fast-paced environment.
Experienced Civil CAD Designers proficient in AutoCAD Civil 3D are sought to join KPFF Consulting Engineers in San Francisco to support civil design and site development projects.
Process Engineer needed to enhance logistics automation and operational efficiency through data-driven improvements and innovative strategies within a dynamic company.
Lead the strategic design and scaling of cutting-edge AI platforms at Palo Alto Networks to drive enterprise-wide AI innovation and impact.
Exciting opportunity for a recent graduate engineer to contribute to submarine acoustic testing and performance optimization in a dynamic and collaborative environment.
Experienced Systems Integrator with active TS/SCI Full Scope Poly clearance needed to manage and coordinate critical optical and network systems for a leading government contractor.
SpaceX seeks a Design for Manufacturing Engineer - PCBA to enhance the manufacturing quality and yield of Starlink satellite electronics.
Renesas Electronics is looking for a Senior Staff Digital Design Engineer to innovate and lead advanced digital power management IC design projects.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
493 jobsSubscribe to Rise newsletter