We are now looking for a Senior High-Performance LLM Training Engineer!
NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIA’s high-performance LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.
What you will be doing:
Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.
Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.
Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.
Build and support NVIDIA submissions to the MLPerf Training benchmark suite.
Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.
Build tools to automate workload analysis, workload optimization, and other critical workflows.
What we want to see:
PhD in Computer Science, Electrical Engineering or Computer Engineering and 5+ years; or MS (or equivalent experience) and 8+ years of meaningful work experience.
Strong background in deep learning and neural networks, in particular training.
A deep background in computer architecture and familiarity with the fundamentals of GPU architecture.
Proven experience analyzing and tuning application performance & processor and system-level performance modelling.
Programming skills in C++, Python, and CUDA.
GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.
Widely considered to be one of tech's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. If you're excited to work across the full hardware & software stack—from GPU architecture to application code—to achieve optimal performance, we want to hear from you!
#LI-Hybrid
The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA is on the lookout for a Senior High-Performance LLM Training Engineer in beautiful Santa Clara, CA! If you're passionate about pushing the limits of AI and deep learning, this could be the perfect opportunity for you. In this role, you'll dive headfirst into optimizing NVIDIA's extensive LLM software stack, particularly within frameworks like PyTorch and JAX. The goal? To ensure that training workloads run efficiently on thousands of GPUs, driving the AI revolution forward. You'll get to analyze and profile AI training workloads, tackling performance issues for various state-of-the-art neural networks while implementing high-quality production software across multiple layers of NVIDIA's deep learning platform. Additionally, you'll have the unique chance to contribute to NVIDIA's submissions for the MLPerf Training benchmark suite and work with processor and system simulators to lay the groundwork for groundbreaking architectural studies. To thrive in this role, you should possess a PhD in a related field or equivalent experience, alongside a solid background in deep learning and GPU architecture. Programming skills in C++, Python, and CUDA will be essential tools in your toolkit. At NVIDIA, you'll collaborate with some of the brightest minds in tech, all while enjoying a creative, autonomous work environment and a competitive salary along with excellent benefits. If you're ready to revolutionize the field of AI and leave your mark on the future of technology, we invite you to join us!
Join NVIDIA as a Senior ASIC Design Engineer to work on innovative DFT solutions for complex semiconductor chips.
NVIDIA is looking for a Senior VLSI CAD R&D Engineer to enhance and innovate algorithms for advanced gate-level analysis tools.
Lead AECOM’s Civil and Structural Engineering Design Team for nuclear projects, shaping the industry's future through innovative solutions.
Join Medtronic as a Manufacturing Engineer II, where you'll enhance manufacturing processes for critical healthcare equipment in a collaborative environment.
As a Senior Manager, Engineering at Beacon Biosignals, you will lead a team of engineers in a mission-driven company transforming brain treatment through innovative technology.
Join Boeing as a Senior Level Ground Hardware Architect to lead innovative developments in Ground Segment hardware architecture.
Join Charm Industrial as a Pyrolyzer Operator, where you'll operate innovative machinery to contribute to impactful climate solutions.
A dynamic Solution Architect opportunity awaits with a leading global food and services company, focusing on comprehensive solution design.
Seeking a skilled Commissioning Engineering Manager to lead commissioning processes in data center construction projects in San Antonio.
Become a part of NVIDIA's innovative team as a Senior VLSI CAD Engineer and help shape the future of AI hardware design.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
359 jobsSubscribe to Rise newsletter