Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Member of Technical Staff - ML Performance image - Rise Careers
Job details

Member of Technical Staff - ML Performance

About Us:

At Modal, we build foundational technology, including an optimized container runtime, a GPU-aware scheduler, and a distributed file system.

We're a small team based out of New York, Stockholm and San Francisco, and have raised over $23M. Our team includes creators of popular open-source projects (e.g., SeabornLuigi), academic researchers, international olympiad medalists, and experienced engineering and product leaders with decades of experience.

The Role

We are looking for strong engineers with experience in making ML systems performant at scale. If you are interested in contributing to open-source projects and Modal’s container runtime to push language and diffusion models towards higher throughput and lower latency, we’d love to hear from you!

Details

  • Work in-person, in our NYC, San Francisco or Stockholm office

  • Full medical, dental, vision insurance

  • Competitive salary and equity

Requirements

  • 5+ years of experience writing high-quality production code.

  • Experience working with torch, huggingface libraries, modern inference engines (vLLM or TensorRT).

  • Familiarity with Nvidia GPU architecture and CUDA.

  • Familiarity with low-level operating system foundations (Linux kernel, file systems, containers, etc.)

  • Experience with ML performance engineering (tell us a story of when you pushed GPU utilization higher!)

Modal Glassdoor Company Review
3.4 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
Modal DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Modal
Modal CEO photo
Sameer Bhalla
Approve of CEO

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Member of Technical Staff - ML Performance, Modal

At Modal, we're on a mission to revolutionize machine learning performance, and we're excited to invite you to join us as a Member of Technical Staff focused on ML Performance in our vibrant New York office. Here, you'll be part of a dynamic team that is passionate about creating foundational technology, such as our optimized container runtime and GPU-aware scheduler. We believe in the power of collaboration and innovation, with team members who are creators of popular open-source projects and accomplished researchers. In this role, you'll leverage your 5+ years of experience in writing high-quality production code to enhance the performance of our ML systems, ensuring they operate at scale with maximum throughput and minimum latency. If you have expertise in working with torch, huggingface libraries, and modern inference engines like vLLM or TensorRT, and if you're familiar with Nvidia GPU architecture and CUDA, we want to hear your story about pushing GPU utilization higher! Enjoy the perks of full medical, dental, and vision insurance along with a competitive salary and equity options. If you thrive in an environment that supports professional growth and encourages contributions to cutting-edge open-source projects, Modal is the place for you. Come help us reshape the future of machine learning!

Frequently Asked Questions (FAQs) for Member of Technical Staff - ML Performance Role at Modal
What responsibilities does a Member of Technical Staff - ML Performance at Modal have?

As a Member of Technical Staff - ML Performance at Modal, you'll be responsible for optimizing and enhancing the performance of machine learning systems. This involves working on our GPU-aware scheduler and optimized container runtime to push the boundaries of language and diffusion models. You'll be tasked with writing high-quality production code, collaborating closely with fellow engineers, and contributing to open-source projects. Your expertise in performance engineering will be fundamental in increasing throughput and reducing latency in our ML frameworks.

Join Rise to see the full answer
What qualifications are needed to apply for the Member of Technical Staff - ML Performance role at Modal?

To be eligible for the Member of Technical Staff - ML Performance position at Modal, candidates should have at least 5 years of experience in writing high-quality production code. Familiarity with modern tools and libraries such as torch, huggingface, and inference engines like vLLM or TensorRT is crucial. Additionally, a solid understanding of Nvidia GPU architecture and CUDA, along with experience working with low-level operating systems like Linux, are required to excel in this role.

Join Rise to see the full answer
What can I expect in terms of company culture as a Member of Technical Staff - ML Performance at Modal?

At Modal, you can expect a culture that values innovation, collaboration, and intellectual curiosity. As a Member of Technical Staff - ML Performance, you'll be part of a small yet impactful team that prioritizes open collaboration and supports contributions to open-source projects. We foster an environment where creativity thrives, and your insights are valued in shaping the future of our technologies. Our diverse team includes experienced professionals, medal-winning researchers, and dedicated engineers, making for a collaborative and inspiring workplace.

Join Rise to see the full answer
What benefits does Modal offer to Members of Technical Staff - ML Performance?

Modal offers a comprehensive benefits package for Members of Technical Staff - ML Performance including full medical, dental, and vision insurance. In addition to competitive salary options, employees have access to equity in the company, reflecting our commitment to long-term growth and success together. We believe in supporting our team members both professionally and personally to ensure a balanced and rewarding experience.

Join Rise to see the full answer
How does the role of Member of Technical Staff - ML Performance impact Modal's products?

The role of Member of Technical Staff - ML Performance is crucial in enhancing Modal's product offerings, particularly in optimizing machine learning systems. By ensuring that our ML frameworks are performant at scale, you will directly contribute to the efficiency and speed of our technologies, impacting how users experience our solutions. Your work will be instrumental in pushing the boundaries of what's possible with language and diffusion models, ultimately shaping the next generation of ML applications.

Join Rise to see the full answer
Common Interview Questions for Member of Technical Staff - ML Performance
Can you describe your experience with ML performance engineering?

In answering this question, highlight specific projects or experiences where you successfully improved machine learning system performance. Discuss the techniques and tools you used to increase GPU utilization, reduce latency, or enhance throughput. Provide quantitative results, if possible, to demonstrate your impact.

Join Rise to see the full answer
What strategies do you use to optimize ML models for GPU performance?

Be prepared to discuss various optimization techniques, such as model quantization, pruning, and using efficient inference engines like TensorRT. Explain how you analyze model performance and identify bottlenecks, emphasizing any tools or frameworks you've successfully applied in past projects.

Join Rise to see the full answer
How do you approach troubleshooting performance issues in ML systems?

Discuss your systematic approach to troubleshooting, including identifying the root cause of performance issues, using profiling tools, and analyzing system logs. Describe instances where you resolved performance bottlenecks, and highlight any tools or metrics that you found particularly effective.

Join Rise to see the full answer
What is your experience working with CUDA and Nvidia GPU architecture?

Share specific projects where you utilized CUDA for programming GPUs. Highlight your understanding of Nvidia architecture, including memory management and optimization techniques. Mention any hands-on experience you have with performance tuning of applications on GPUs.

Join Rise to see the full answer
Can you explain the trade-offs you consider when optimizing ML models?

Discuss trade-offs between model accuracy and performance, such as compression techniques against potential accuracy loss. Explain how you evaluate these trade-offs based on project requirements and user needs, ensuring that the final product meets both performance and functional standards.

Join Rise to see the full answer
Describe your experience with open-source contributions.

Detail your past contributions to open-source projects, including the specific projects you've worked on, the nature of your contributions, and the impact they had. Emphasize your collaboration with other developers and how this experience has improved your skills and technical understanding.

Join Rise to see the full answer
What is your approach to writing high-quality production code?

Talk about your coding practices that ensure code quality, including code reviews, testing, and continuous integration techniques. Explain how you document your code and make it maintainable, and provide examples of how you've implemented best practices in past roles.

Join Rise to see the full answer
How do you stay up to date with the latest trends in machine learning and performance engineering?

Share your strategies for professional development, such as attending conferences, participating in webinars, or following industry leaders on platforms like GitHub or Twitter. Discuss any specific resources you find invaluable for keeping your knowledge current in the field.

Join Rise to see the full answer
How would you handle disagreements with team members regarding performance optimization strategies?

Explain how you would approach discussions respectfully and constructively, valuing data and rationale over personal opinions. Provide examples of how you’ve successfully navigated similar situations in the past, focusing on collaboration and mutual respect.

Join Rise to see the full answer
What tools do you use for monitoring and benchmarking ML system performance?

Mention specific tools you have experience with, such as TensorBoard, nvidia-smi, or custom Python scripts for performance analysis. Discuss how you leverage these tools to gather insights and make informed decisions to optimize performance in machine learning projects.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 9 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Dare to be Different
Reward & Recognition
Fast-Paced
Maternity Leave
Paternity Leave
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Paid Holidays
Paid Sick Days
Paid Time-Off
Learning & Development
Social Gatherings
Photo of the Rise User
Posted 9 days ago
Photo of the Rise User
PayPal Remote Austin, TX
Posted 6 days ago
Photo of the Rise User
Posted 3 days ago

At Modal, we build the future of auto commerce for the world’s largest auto brands and retailers. We take the moving parts of an auto purchase transaction and assemble them into a simple, digital transaction flow that seamlessly fits any webpage a...

2 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
December 19, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!