NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. Today, we lead in artificial intelligence, driving advances in natural language processing, computer vision, autonomous systems, and scientific research. We are looking for a forward-thinking HPC and AI Inference Software Architect to help shape the future of scalable AI infrastructure—focusing on distributed training, real-time inference, and communication optimization across large-scale systems.
Join our world-class team of researchers and engineers building next-generation software and hardware systems that power the most demanding AI workloads on the planet.
What you will be doing:
Design and prototype scalable software systems that optimize distributed AI training and inference—focusing on throughput, latency, and memory efficiency.
Develop and evaluate enhancements to communication libraries such as NCCL, UCX, and UCC, tailored to the unique demands of deep learning workloads.
Collaborate with AI framework teams (e.g., TensorFlow, PyTorch, JAX) to improve integration, performance, and reliability of communication backends.
Co-design hardware features (e.g., in GPUs, DPUs, or interconnects) that accelerate data movement and enable new capabilities for inference and model serving.
Contribute to the evolution of runtime systems, communication libraries, and AI-specific protocol layers.
Collaborate with customers to understand their needs and provide innovative solutions for them.
What we need to see:
Ph.D, Masters, or Bachelors in computer science, computer engineering, electrical engineering or a closely related field.
5+ years of experience in DNNs, Scaling of DNNs, Parallelism of DNN frameworks, or deep learning training workloads.
Deep understanding of Inference and Training workloads and optimizations, like Prefill/Decode, data parallelism, Tensor parallelism, FDSP, etc...
Experience with AI network parallelism using collective libraries and RDMA/RoCE.
Background in algorithm design, system programming, and computer architecture.
Strong programming and software development skills.
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment.
Ways to stand out from the crowd:
Deep understanding of technology and passion for what you do.
Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment.
Background with designing communication middleware for high-performance computing systems, including RoCE and DPUs.
Background with CUDA programming and NVIDIA GPUs and programming models for emerging architectures.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#LI-Hybrid
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
NVIDIA is looking for an experienced Senior Commercial Litigation Counsel to join their legal team, providing expert guidance on general non-patent civil litigation matters.
Experienced Customer Program Manager needed at NVIDIA to drive autonomous driving automotive programs and manage multinational project teams.
Experienced Senior Software Engineer skilled in Java and AWS technologies needed to enhance Experian's innovative data platform in a hybrid work setting.
Northrop Grumman seeks experienced Principal/Sr. Principal Software Engineers skilled in full stack and DevOps to advance cutting-edge defense technologies in Dayton, OH.
A Salesforce Technical Lead role at TreviPay leading a dynamic team to drive innovative automation and AI-enhanced solutions within the Salesforce ecosystem.
Develop cutting-edge frontend UIs for defense autonomy applications at Applied Intuition's Mission Control team in Mountain View.
Innovate AI-driven HR and Finance solutions as a Senior Full Stack Engineer at Workday, a company renowned for its employee-focused culture and cutting-edge technology.
The Washington Post seeks a Senior Full Stack Software Engineer to develop and maintain innovative content delivery applications and systems.
Vestwell seeks a skilled Backend Software Engineer to develop and maintain Python-based REST APIs for seamless integration with partners in a hybrid work environment.
Lead performance engineering initiatives for Palo Alto Networks’ Cortex Cloud, driving optimization and scalability of cloud-native distributed systems.
A fast-growing trampoline park company in Ogden seeks a Full Stack Developer to create and enhance innovative web applications using PHP, JavaScript, and MySQL.
Experienced Staff Software Engineer needed at NBCUniversal to lead cloud-native software development and manage enterprise IP rights systems.
PointClickCare seeks a detail-oriented Software Architect to design scalable healthcare software solutions and drive innovation within a leading health tech company.
Lead Riot Games' Accounts engineering teams to build and scale secure authentication platforms while mentoring managers and engineers.
Lead and mentor a world-class AI infrastructure team at NVIDIA, driving innovation in large-scale distributed systems and LLM-based solutions.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.
485 jobsSubscribe to Rise newsletter