We're looking for a Machine Learning Engineer to help us optimize the inference and training of our AI models.
What you'll do:
Write custom CUDA Kernels to speed up multi-node inference on image and video models.
Work on various caching and dynamic compilation techniques to optimize the loading and unloading of the variety of AI models we serve at Krea.
Speed up and efficiency of training runs across our GPU clusters.
We'd like you to have:
Proficiency in CUDA or parallel programming.
Python/C++ programming experience.
Experience in optimizing diffusion/transformer models for performance and scalability.
High agency and resourcefulness.
You will collaborate closely with our AI research and infrastructure teams to integrate optimizations seamlessly.
At Krea, we believe that AI is a new medium that allows us to express ourselves through various formats—text, images, video, sound, and even 3D. We're building better, smarter, and more controllable tools to harness this medium. If you're passionate about pushing the boundaries of AI and empowering human creativity, we'd love to hear from you.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
At Krea, we're on the lookout for an exceptional Machine Learning Engineer specializing in Inference and Training Optimization. Located in the vibrant city of San Francisco, you'll play a crucial role in enhancing the performance of our AI models. Your primary responsibility will involve writing custom CUDA Kernels aimed at accelerating multi-node inference specifically for our image and video models. As part of an innovative team, you will also explore various caching and dynamic compilation techniques to improve the loading and unloading processes of the diverse AI models we serve. We want someone who is not only adept at optimizing diffusion and transformer models for peak performance but also brings a high level of resourcefulness and autonomy to the table. Your collaboration with our AI research and infrastructure teams will be vital as you integrate optimization strategies seamlessly into our existing frameworks. At Krea, we believe in the limitless potential of AI as a new medium for creativity, whether it's through text, images, or sound. If you're passionate about revolutionizing AI and empowering creativity with smarter tools, we can't wait to hear from you and welcome you to our team!
Join VISA's VAS Product Development team as a Software Engineer Associate to create cutting-edge solutions in the payments industry.
Join Visa as a Senior Software Engineer focused on DevOps, where your expertise will enhance our systems' reliability and security in a hybrid work environment.
Step into the role of Software Program Owner at Aptiv and drive the future of mobility through innovative software solutions.
A leading tech firm, ManTech, is on the lookout for a skilled CNO Java Software Engineer to contribute significantly to innovative software solutions.
Join Top Hat's Core Frontend Team as an Intermediate Frontend Developer to help shape the future of higher education with innovative technology.
Visa seeks a Chief Software Engineer to spearhead innovative payment processing initiatives with a focus on technical leadership and high scalability.
Looking for a Sr. MEAN Developer to join Halo’s fully remote team, specializing in interactive media strategy and development.
Join Acuity, Inc. as a Senior Full-Stack Developer to enhance government technology projects with your expertise in React and Java.
Subscribe to Rise newsletter