Job details

Senior Machine Learning Engineer - Hardware Abstractions & Performance Optimization

Luma’s mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will come from vision. So, we are working on training and scaling up multimodal foundation models for systems that can see and understand, show and explain, and eventually interact with our world to effect change.

We are looking for engineers with significant experience designing and maintaining highly efficient systems and code that can be optimized to run on multiple hardware platforms, bringing our state-of-the-art models to as many people as possible at the best performance per dollar.

Responsibilities

Ensure efficient implementation of models & systems with a focus on designing, maintaining, and writing abstractions that scale beyond NVIDIA/CUDA hardware.
Identify and remedy efficiency bottlenecks (memory, speed, utilization, communication) by profiling and implementing high-performance PyTorch code, deferring to Triton or similar kernel-level languages as necessary.
Benchmark our products across a variety of hardware & software to help the product team understand the optimal tradeoffs between latency, throughput and cost at various degrees of parallelism.
Work together with our partners to help them identify bottlenecks and push forward new iterations of hardware and software.
Work closely together with the rest of the research team to ensure systems are planned to be as efficient as possible from start to finish and raise potential issues for hardware integration.

Must have experience

Experience optimizing for memory, latency and throughput in PyTorch.
- Bonus: experience with non-NVIDIA systems
Experience using torch.compile or PyTorch/XLA (torch_xla).
Experience benchmarking and profiling GPU & CPU code in PyTorch for optimal device utilization (examples: torch profiler, memory profilers, trace viewers, custom tooling).
Experience building tools & abstractions to ensure models run optimally on different hardware and software stacks.
Experience working with transformer models and attention implementations.
Experience with parallel inference, particularly tensor parallelism and pipeline parallelism.

Good to have experience

Experience with high-performance Triton/CUDA and writing custom PyTorch kernels and ops. Top candidates will be able to write fused kernels for common hot paths, understand when to make use of lower-level features like tensor cores or warp intrinsics, and will understand where these tools can be most impactful.
Experience writing high-performance parallel C++. Bonus if done within an ML context with PyTorch, such as data loading, data processing, or inference code.
Experience building inference / demo prototype code (incl. Gradio, Docker, etc.).

Machine Learning, Performance Optimization, PyTorch, CUDA, Benchmarking, Tensor Parallelism, Transformer Models

Luma AI Glassdoor Company Review

4.4

Luma AI DE&I Review

4.3

Average salary estimate

$150,000 / yearly (est.)

Min: $120,000

Max: $180,000

Luma AI

64 jobs

FUNDING

Series A

DEPARTMENTS

Software Engineering

SENIORITY LEVEL REQUIREMENT

Senior

TEAM SIZE