Job details

ML Engineer – Inference/Training Optimization

We're looking for a Machine Learning Engineer to help us optimize the inference and training of our AI models.

What you'll do:

  • Write custom CUDA kernels to speed up multi-node inference on image and video models.

  • Work on various caching and dynamic compilation techniques to optimize the loading and unloading of the variety of AI models we serve at Krea.

  • Improve the speed and efficiency of training runs across our GPU clusters.

We'd like you to have:

  • Proficiency in CUDA or parallel programming.

  • Python/C++ programming experience.

  • Experience in optimizing diffusion/transformer models for performance and scalability.

  • High agency and resourcefulness.

You will collaborate closely with our AI research and infrastructure teams to integrate optimizations seamlessly.

At Krea, we believe that AI is a new medium that allows us to express ourselves through various formats—text, images, video, sound, and even 3D. We're building better, smarter, and more controllable tools to harness this medium. If you're passionate about pushing the boundaries of AI and empowering human creativity, we'd love to hear from you.

Average salary estimate

$115,000 / YEARLY (est.)
min: $90,000
max: $140,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About ML Engineer – Inference/Training Optimization, Krea

At Krea, we're on the lookout for an exceptional Machine Learning Engineer specializing in Inference and Training Optimization. Located in the vibrant city of San Francisco, you'll play a crucial role in enhancing the performance of our AI models. Your primary responsibility will involve writing custom CUDA Kernels aimed at accelerating multi-node inference specifically for our image and video models. As part of an innovative team, you will also explore various caching and dynamic compilation techniques to improve the loading and unloading processes of the diverse AI models we serve. We want someone who is not only adept at optimizing diffusion and transformer models for peak performance but also brings a high level of resourcefulness and autonomy to the table. Your collaboration with our AI research and infrastructure teams will be vital as you integrate optimization strategies seamlessly into our existing frameworks. At Krea, we believe in the limitless potential of AI as a new medium for creativity, whether it's through text, images, or sound. If you're passionate about revolutionizing AI and empowering creativity with smarter tools, we can't wait to hear from you and welcome you to our team!

Frequently Asked Questions (FAQs) for ML Engineer – Inference/Training Optimization Role at Krea
What are the responsibilities of a Machine Learning Engineer at Krea?

As a Machine Learning Engineer at Krea, your key responsibilities include writing custom CUDA Kernels to improve the speed of multi-node inference for image and video models, while also working on caching mechanisms and dynamic compilation techniques. You'll be tasked with optimizing the efficiency of training runs across our powerful GPU clusters, making your role essential in enhancing the performance of our AI offerings.

What qualifications are required for the Machine Learning Engineer position at Krea?

To be considered for the Machine Learning Engineer position at Krea, candidates should possess expertise in CUDA or parallel programming and have a strong background in Python or C++. Experience in optimizing diffusion and transformer models for both performance and scalability is critical. Additionally, candidates should exhibit high agency and resourcefulness, indicating a strong ability to solve complex problems independently.

How does a Machine Learning Engineer contribute to AI model improvements at Krea?

In the role of Machine Learning Engineer at Krea, you will directly contribute to the enhancement of AI models by implementing optimization techniques. This includes writing efficient CUDA Kernels and exploring innovative caching and dynamic compilation strategies. Your work will ensure faster inference and more efficient training processes, significantly elevating the capabilities of the models we develop.

What kind of projects will a Machine Learning Engineer work on at Krea?

A Machine Learning Engineer at Krea will engage in exciting projects that revolve around optimizing models for various formats such as text, image, and video. You will work closely with AI researchers and infrastructure teams to integrate optimizations into existing frameworks, effectively working on cutting-edge AI applications that push the boundaries of creativity.

What is the work environment like for a Machine Learning Engineer at Krea?

At Krea, the work environment for a Machine Learning Engineer is collaborative and innovative, located in the tech hub of San Francisco. You will be part of a team that values creativity and the power of AI, encouraging you to experiment, share ideas, and contribute to meaningful projects. The culture fosters resourcefulness and independence, making it an ideal setting for passionate individuals.

Common Interview Questions for ML Engineer – Inference/Training Optimization
Can you explain your experience with CUDA programming?

When addressing your experience with CUDA programming, be sure to highlight specific projects where you developed custom kernels. Discuss the impact your work had on performance and any challenges you faced and overcame. Sharing a clear example that demonstrates your skills can significantly strengthen your response.

What techniques do you use for optimizing AI model performance?

In your answer, categorize the techniques you typically use, such as reducing model size, employing quantization, or implementing caching strategies. Also, discuss your experience with transformer models and how you’ve implemented performance optimizations in previous projects. Clear, specific examples will showcase your expertise.

How do you approach debugging GPU-accelerated applications?

Discuss your systematic approach to debugging GPU-accelerated applications, mentioning tools like NVIDIA Nsight or CUDA-GDB. Highlight how you identify bottlenecks and optimize code iteratively. Providing a past experience where you diagnosed and solved a major issue can offer valuable context.

What is your experience with multi-node training of AI models?

Talk about the frameworks and methods you've used for multi-node training, such as distributed training with TensorFlow or PyTorch. Include specific details on how you ensured synchronization and data sharing across nodes, showcasing your understanding of distributed systems.

Can you describe a project where you optimized inference time?

Provide a detailed description of a specific project where your optimizations led to measurable improvements. Include the techniques you used (like caching or algorithmic changes) and the results in terms of latency reduction, emphasizing your contributions to the project's success.

What strategies do you use for managing large datasets?

Share your strategies for data management, such as versioning, preprocessing steps, and data augmentation techniques. Discuss how these strategies support efficient loading and processing, particularly in training and inference contexts, demonstrating a comprehensive understanding of data handling.

How do you keep up with the latest optimizations in AI?

Mention how you actively engage with the AI community through research publications, forums, blogs, and conferences. Discuss any specific influential papers or technologies you've learned from, showcasing a commitment to continuous learning in a rapidly evolving field.

What experience do you have collaborating with research teams?

In your response, detail your collaborative experiences, emphasizing communication, brainstorming innovative solutions, and implementing research findings into practical applications. Highlight any successful projects where your contributions made a tangible impact.

How do you prioritize tasks when working on multiple projects?

Discuss your strategies for prioritization, such as using project management tools, understanding project timelines, and defining key milestones. Demonstrating your organizational skills and ability to adapt can convey your aptitude for thriving in a dynamic work environment.

What excites you most about the potential of AI?

Express enthusiasm for the transformative power of AI and its ability to enhance human creativity across different mediums. Highlight your vision of future applications and the importance of ethical considerations in AI development, reflecting your passion and alignment with Krea's mission.

TEAM SIZE: No info
HQ LOCATION: No info
EMPLOYMENT TYPE: Full-time, on-site
DATE POSTED: April 8, 2025
