Job details

AI engineer - Distributed Systems

Responsibilities

  • Work closely with the rest of the research team on experiment tracking and tooling to ensure large-scale training runs can be logged and analyzed with low overhead

  • Automate & evolve the handling of Python environments using tools such as Docker and uv, and handle compilation of custom packages, to ensure experiments and training runs can be reproduced

  • Set up & maintain CI/CD pipelines to automatically test large codebases, optimizing for test coverage

  • Work on documentation & type correctness

  • Define test suites to automatically test cluster stability & performance for distributed ML workloads

  • Debug and resolve systems issues, ensuring that they are triaged & handled in a timely manner

Must have experience

  • Excellent software engineering skills, particularly with experience in maintaining & working on typed & tested PyTorch code bases

  • Experience with PyTorch

  • Experience with Slurm

  • Experience with GitHub CI/CD

  • Experience with Docker

At Luma AI, we believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will come from vision. So, we are working on training and scaling up multimodal foundation models for systems that can see and understand, show and explain, and eventually interact with our world to effect change.

We will deploy these systems to make a new kind of intelligent creative partner that can imagine with us. Free and away from the pressure of being creative. It's for all of us whose imaginations have been constrained, who've had to channel vivid dreams through broken words, hoping others will see what we see in our mind's eye. A partner that can help us show — not just tell.

Dream Machine is an early step to building that.

Why you should join us:

  • Luma is bringing together the best team in the world to achieve our goal, from researchers to engineers and designers to growth operators

  • Luma is not just a lab - we are deeply product focused, and our vision of merging AI models and delightful products is unique in the industry

  • We build. We ship. Our early products have been wildly successful

Luma AI Glassdoor Company Review: 4.4
Luma AI DE&I Review: 4.3

Average salary estimate

$135,000 / year (est.)
Min: $120,000
Max: $150,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About AI engineer - Distributed Systems, Luma AI

At Luma AI, we're on the lookout for an enthusiastic AI Engineer specializing in Distributed Systems to join our innovative team in San Francisco. In this role, you'll collaborate closely with our research team, focusing on experiment tracking and tooling so that our large-scale training runs can be logged and analyzed with low overhead. Automating and evolving the handling of Python environments is a key part of the position, using tools like Docker and uv to keep our experiments reproducible. You'll also set up and maintain CI/CD pipelines that automatically test large codebases, optimizing for test coverage. With your software engineering skills, particularly in maintaining typed and tested PyTorch codebases, and your experience with Slurm and GitHub CI/CD, you'll play a crucial role in debugging systems issues and improving cluster stability for our distributed ML workloads. Our mission at Luma AI is to push the boundaries of what AI can achieve by developing multimodal foundation models that can interpret and interact with the world. Join us as we empower creativity through technology, making it accessible for everyone. If you're ready to be part of a tightly-knit team that's focused not just on research but on building delightful products, we want to hear from you!

Frequently Asked Questions (FAQs) for AI engineer - Distributed Systems Role at Luma AI
What are the responsibilities of the AI Engineer - Distributed Systems at Luma AI?

As an AI Engineer specializing in Distributed Systems at Luma AI, your responsibilities will include working closely with the research team on experiment tracking and tooling, automating Python environments, maintaining CI/CD pipelines, and defining test suites for cluster stability. You will be instrumental in ensuring our training runs are efficient and reproducible while troubleshooting system issues.

What qualifications does one need for the AI Engineer - Distributed Systems role at Luma AI?

To qualify for the AI Engineer - Distributed Systems role at Luma AI, you should have excellent software engineering skills, particularly in maintaining and working with typed and tested PyTorch codebases. Additionally, experience with Slurm, GitHub CI/CD, and Docker is essential. Strong problem-solving skills and the ability to work collaboratively with a team are also important.

What technology stack is used by the AI Engineer - Distributed Systems at Luma AI?

The technology stack for the AI Engineer - Distributed Systems at Luma AI primarily involves Python, PyTorch, Docker, and Slurm, among other tools. Experience with GitHub CI/CD is also critical for setting up and maintaining automated testing and deployment pipelines.

What is the company culture like for AI Engineers at Luma AI?

At Luma AI, the culture is collaborative and product-focused. Engineers work alongside researchers and designers to create innovative solutions, making it a great place for those who thrive in a team-oriented environment. The company values creativity and encourages team members to contribute to building and shipping products that make a difference.

What growth opportunities exist for the AI Engineer - Distributed Systems at Luma AI?

As an AI Engineer - Distributed Systems at Luma AI, you will have numerous opportunities for growth. You’ll be part of a pioneering team that’s at the forefront of AI technology. The learning environment is dynamic, and your contributions will directly influence the development of multimodal AI systems, which can pave the way for advancement in your career.

Common Interview Questions for AI engineer - Distributed Systems
Can you explain your experience with PyTorch and how it relates to the AI Engineer - Distributed Systems role at Luma AI?

When answering this question, focus on specific projects or tasks where you've applied PyTorch. Highlight your understanding of how to manage typed and tested codebases, and how your experience can contribute to Luma AI's objectives in developing systems that are not only effective but innovative.

How do you approach debugging issues in distributed systems?

A good approach is to explain your systematic method for identifying the root cause of issues, such as using logging tools, monitoring system performance, and testing different components in isolation. This showcases your problem-solving skills crucial for the role.

Describe your experience with CI/CD pipelines. How would you implement them in the context of Luma AI?

Discuss your familiarity with CI/CD principles and specific tools you've used, explaining how you would tailor these implementations to suit Luma AI's distributed ML workloads. Mention the importance of automated testing to ensure codebase stability.

What strategies do you use for optimizing the performance of machine learning models?

Share strategies such as hyperparameter tuning, model architecture selection, and leveraging distributed computing resources. Emphasize adaptability based on the specific requirements of projects you'll be working on at Luma AI.

How do you ensure that your code is maintainable and scalable?

Explain your practices for writing clean, modular code, such as adhering to coding standards, using version control, and writing unit tests. Mention any tools you utilize to maintain code quality over time.

Can you give an example of a challenging project you worked on involving distributed systems?

Share a specific scenario that emphasizes your problem-solving skills, teamwork, and technical expertise. Focus on your contributions and what you learned from the experience.

How do you stay current with developments in AI and machine learning technology?

Discuss the resources you rely on for continuous learning, such as research papers, online courses, and professional networks. Emphasize your dedication to staying updated on trends that will impact your work at Luma AI.

What is your experience with using Docker in a machine learning context?

Detail projects where you’ve used Docker, focusing on how it facilitated creating reproducible environments for experiments and how it can help standardize deployments in Luma AI's distributed systems.

How would you handle a situation where an experiment did not yield the expected results?

Discuss your process for analyzing results, examining potential flaws in methodology, and executing necessary adjustments. Highlight your analytical mindset and resilience in research settings.

What excites you most about working at Luma AI as an AI Engineer in Distributed Systems?

Share your passion for AI and how Luma AI's vision of merging AI models with delightful products aligns with your career goals. Speak about the excitement of being part of a team working on groundbreaking technology.

Employment type: Full-time, on-site
Date posted: February 18, 2025
