Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Software Engineer, Inference –  GPU Enablement image - Rise Careers
Job details

Software Engineer, Inference – GPU Enablement

About the Team
OpenAI’s Inference team ensures that our most advanced models run efficiently, reliably, and at scale. We build and optimize the systems that power our production APIs, internal research tools, and experimental model deployments. As model architectures and hardware evolve, we’re expanding support for a broader set of compute platforms - for example AMD GPUs - to increase performance, flexibility, and resiliency across our infrastructure.

We are forming a team to generalize our inference stack - including kernels, communication libraries, and serving infrastructure - to alternative hardware architectures like AMD.

About the Role
We’re hiring engineers to scale and optimize OpenAI’s inference infrastructure across emerging GPU platforms. You’ll work across the stack - from low-level kernel performance to high-level distributed execution - and collaborate closely with research, infra, and performance teams to ensure our largest models run smoothly on new hardware.

This is a high-impact opportunity to shape OpenAI’s multi-platform inference capabilities from the ground up.

In this role, you will:

  • Design and optimize high-performance GPU kernels for AMD accelerators using HIP, Triton, or other performance-focused frameworks.

  • Build and tune collective communication libraries (e.g., RCCL) used to parallelize model execution across many GPUs.

  • Integrate internal model-serving infrastructure (e.g., vLLM, Triton) into AMD-backed systems.

  • Debug and optimize distributed inference workloads across memory, network, and compute layers.

  • Validate correctness, performance, and scalability of model execution on large AMD GPU clusters.

You can thrive in this role if you:

  • Have experience writing or porting GPU kernels using HIP, CUDA, or Triton, and care deeply about low-level performance.

  • Are familiar with communication libraries like NCCL/RCCL and understand their role in high-throughput model serving.

  • Have worked on distributed inference systems and are comfortable scaling models across fleets of accelerators.

  • Enjoy solving end-to-end performance challenges across hardware, system libraries, and orchestration layers.

  • Are excited to be part of a small, fast-moving team building new infrastructure from first principles.

Nice to Have:

  • Contributions to open-source libraries like RCCL, Triton, or vLLM.

  • Experience with GPU performance tools (Nsight, rocprof, perf) and memory/comms profiling.

  • Prior experience deploying inference on AMD or other non-NVIDIA GPU environments.

  • Knowledge of model/tensor parallelism, mixed precision, and serving 10B+ parameter models.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. 

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

OpenAI Glassdoor Company Review
4.2 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
OpenAI DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of OpenAI
OpenAI CEO photo
Sam Altman
Approve of CEO

Average salary estimate

$150000 / YEARLY (est.)
min
max
$120000K
$180000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Experienced Customer Success Manager needed to drive AI adoption and customer success within OpenAI's government sector clients in Washington, DC.

Photo of the Rise User
Posted 4 days ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Contribute to building advanced AI infrastructure as a Design Execution Manager at OpenAI’s Stargate team, overseeing strategic design and construction of mission-critical data centers.

Photo of the Rise User
Posted 6 days ago

Contribute expert software engineering skills at Anduril Industries to build cutting-edge defense technology solutions.

Photo of the Rise User

An innovative SaaS company is seeking a Senior Machine Learning Engineer to develop and deploy mission-critical ML models in a hybrid role based in King of Prussia, PA.

Photo of the Rise User

Experienced Full Stack Developer needed at Brandes Associates Inc. to architect and maintain critical DoD systems while mentoring junior staff.

JPMC Hybrid Jersey City, New Jersey, United States
Posted 5 days ago

A Principal Software Engineer role at JPMorgan Chase focused on leading data product development and strategic allocation transformation within a major financial institution.

Photo of the Rise User
Ajna Infotech Hybrid Charlotte, Charlotte, North Carolina, United States
Posted 12 days ago

Experienced SAP ABAP Developer needed to lead development and optimization initiatives in ECC, S/4HANA, and RAP applications at MSRcosmos.

Photo of the Rise User
Palo Alto Networks Hybrid Santa Clara, California, United States
Posted 24 hours ago

Innovate and enhance cybersecurity testing as a Senior Software Engineer in Test at Palo Alto Networks, driving quality in cloud-delivered security services.

Photo of the Rise User
DigitalOcean Hybrid Boston, Massachusetts, United States
Posted 12 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Customer-Centric
Rapid Growth
Social Impact Driven
Maternity Leave
Paternity Leave
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Paid Holidays

Senior Full Stack Engineer role at DigitalOcean to develop scalable security products and infrastructure in a remote, dynamic setting.

Photo of the Rise User
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Vision Insurance
Family Medical Leave
Paid Holidays

Innovative Graduate Engineer wanted at Anomali to develop and enhance groundbreaking cybersecurity software solutions in a hybrid role based in Redwood City, CA.

Posted 7 days ago

TetraScience is seeking an experienced Senior AI Infrastructure Engineer to design and maintain scalable AI/ML cloud infrastructure and enable advanced AI capabilities.

Photo of the Rise User

Domino seeks a Senior/Staff Software Engineer to advance their Compute team and drive scalable architecture for AI-driven data science solutions.

Photo of the Rise User
Posted 7 days ago

Lead the development of large-scale distributed systems as a Principal Backend Java Engineer at Rackspace Technology, a leader in multicloud solutions.

Photo of the Rise User

Senior MS Dynamics Developer / Team Lead needed to lead technical development and team collaboration on government IT transformation projects in a hybrid Washington D.C. setting.

Photo of the Rise User
Adonis Market Hybrid New York, New York, United States
Posted 6 days ago

Software Engineer needed to lead healthcare system integrations at Adonis, a cutting-edge AI orchestration startup based in New York City.

OpenAI is a US based, private research laboratory that aims to develop and direct AI. It is one of the leading Artifical Intellgence organizations and has developed several large AI language models including ChatGPT.

777 jobs
MATCH
VIEW MATCH
BADGES
Badge ChangemakerBadge Future MakerBadge InnovatorBadge Future UnicornBadge Rapid Growth
CULTURE VALUES
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning
FUNDING
SENIORITY LEVEL REQUIREMENT
INDUSTRY
TEAM SIZE
No info
EMPLOYMENT TYPE
Full-time, onsite
DATE POSTED
May 9, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!