Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
High Performance Computing Software Engineer - Supercomputing image - Rise Careers
Job details

High Performance Computing Software Engineer - Supercomputing

About the Institute of Foundation Models

We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.


As part of our team, you’ll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.




The Role


IFM is building the foundational compute infrastructure that will power tomorrow’s breakthroughs in AI and computational science. We’re looking for a High Performance Computing Software Engineer to help us design, develop, and operate the software systems that run our large-scale AI workloads.


In this role, you’ll work at the intersection of high-performance computing and machine learning. You’ll be part of a team responsible for crafting the software stack that enables training of cutting-edge ML models—spanning 1000+ GPUs—and ensuring our infrastructure is robust, performant, and developer-friendly.


Job Responsibilities
  • Design and implement high-performance, distributed software solutions for large-scale AI/ML training.
  • Optimize low-level system components including Linux kernel, GPU/accelerator kernels, and interconnects.
  • Develop and tune communication libraries such as NCCL, MPI, UCX, RCCL, and RDMA-based systems.
  • Partner with ML researchers and engineers to support frameworks like PyTorch, MegatronLM, and DeepSpeed in large-scale production environments.
  • Contribute to our scheduling, orchestration, and job management systems, including Slurm and Kubernetes.
  • Debug and resolve complex issues across the stack—from kernel to container to model.
  • Work closely with hardware vendors, upstream open-source communities, and internal teams to drive performance and reliability improvements.


Skills & Experience
  • Proven experience developing and optimizing software for large-scale ML workloads (1000+ GPUs preferred).
  • Deep understanding of Linux kernel internals and accelerator (GPU) kernel development.
  • Proficiency with distributed communication libraries (e.g., NCCL, RCCL, MPI, UCX, SHARP, Libfabric).
  • Experience with ML frameworks like PyTorch, TensorFlow, JAX, or MegatronLM.
  • Strong knowledge of HPC job scheduling and orchestration tools (e.g., Slurm, Kubernetes, Pyxis).
  • Excellent debugging and systems performance tuning skills.
  • A collaborative mindset with a focus on shared success and technical excellence.


$200,000 - $400,000 a year

Salary Range & Description

The starting base pay for this position is as shown above. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future.


Visa Sponsorship

This position is eligible for visa sponsorship.


Benefits Include

*Comprehensive medical, dental, and vision benefits 

 *Bonus

*401K Plan

*Generous paid time off, sick leave and holidays

*Paid Parental Leave

*Employee Assistance Program

*Life insurance and disability




Average salary estimate

$300000 / YEARLY (est.)
min
max
$200000K
$400000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Experienced Full Stack Developer wanted to build and deploy advanced AI applications alongside world-class researchers at the Institute of Foundation Models.

Photo of the Rise User
Posted 3 days ago

Experienced Senior Software Developer needed at MasterBrand Cabinets to build and maintain business applications using .Net technologies in a hybrid onsite-remote model.

Photo of the Rise User

Experience building scalable backend systems and integrations at Supermove, a Series A startup transforming the moving industry through innovative technology.

Photo of the Rise User
Posted 13 days ago
Inclusive & Diverse
Empathetic
Collaboration over Competition
Growth & Learning
Transparent & Candid

Lead software engineering efforts at Mastercard to develop innovative, scalable digital payment solutions for small and medium enterprises.

Photo of the Rise User

An AI-focused Software Engineer opportunity at Atmosera to lead collaborative developer sprints and mentoring around GitHub Copilot and Amazon Q technologies.

Photo of the Rise User
Posted 6 days ago

Seeking an experienced Java Front End Developer to join Jobsbridge, Inc., specializing in advanced web and enterprise application development.

An opportunity for experienced Full-Stack Engineers to join Truelogic's remote team and develop impactful healthtech solutions using React, React Native, and Node.js.

Posted 11 days ago

Experienced Senior Drupal PHP Engineer needed to drive development of scalable Drupal applications for an innovative, remote-first digital agency.

Photo of the Rise User
Posted 2 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Dare to be Different
Reward & Recognition
Fast-Paced
Maternity Leave
Paternity Leave
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off
Learning & Development
Social Gatherings

Robinhood is looking for a skilled Android Engineer to help expand their mobile platform and deliver elevated experiences for their users.

Photo of the Rise User
Opendoor Hybrid United States-Remote
Posted 22 hours ago

Contribute as a senior fullstack Software Engineer at Opendoor, crafting innovative digital experiences for home buyers and sellers in a fast-paced, collaborative environment.

LPL Financial is hiring an AVP, Lead Software Data Engineer to drive innovative data engineering solutions and lead a motivated team to support firm-wide modernization initiatives.

Photo of the Rise User

iCapital is looking for a Full Stack Engineer to contribute to building sophisticated financial software platforms using Ruby on Rails and React in a hybrid work environment.

Photo of the Rise User
ServiceNow Hybrid 4810 Eastgate Mall, San Diego, California, United States
Posted yesterday
Inclusive & Diverse
Mission Driven
Rise from Within
Diversity of Opinions
Work/Life Harmony
Empathetic
Feedback Forward
Take Risks
Collaboration over Competition
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Conferences Stipend
Paid Time-Off
Maternity Leave
Equity

Lead the software engineering team at ServiceNow to innovate workflow automation through AI-enhanced solutions and collaborative product development.

Posted 5 days ago

Senior Frontend Developer role at Nevoya to design and build scalable, efficient interfaces for AI-powered electric trucking logistics.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
May 24, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY