Job details

AI/ML Inference Engineer

Work on our inference systems. Example tasks include writing custom CUDA kernels (or tools to generate/review them) to speed up multi-node inference on models like Hunyuan, profiling and optimizing GPU code, or speeding up large-scale training runs.

Example tacit skills we're looking for

  • CUDA or parallel programming

  • Python/C++ programming

  • Provisioning, optimization, and monitoring around multi-node inference and large-scale training

Your work will directly speed up our infrastructure for millions of users and all of our research team.

Average salary estimate

$135,000 / year (est.)
min: $120,000
max: $150,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About AI/ML Inference Engineer, Krea

If you're an innovative AI/ML Inference Engineer looking for an exciting opportunity in San Francisco, this role at our dynamic company is just for you! You'll immerse yourself in developing and optimizing our state-of-the-art inference systems, focusing on enhancing performance using custom CUDA kernels and tools. As part of our talented team, you'll have the chance to speed up multi-node inference, particularly on advanced models like Hunyuan. Your expertise in GPU profiling and code optimization will play a pivotal role in streamlining large-scale training runs that benefit millions of users. We value programming skills in Python and C++, and your proficiency in parallel programming will be crucial in driving our mission forward. Beyond technical capabilities, we're looking for someone who thrives in a collaborative environment and is passionate about using their skills to make a real impact on our infrastructure and research initiatives. We hope you're excited to contribute to cutting-edge projects that redefine the AI landscape while working alongside some of the brightest minds in the industry!

Frequently Asked Questions (FAQs) for AI/ML Inference Engineer Role at Krea
What are the primary responsibilities of an AI/ML Inference Engineer at this company?

An AI/ML Inference Engineer at our company is primarily responsible for enhancing and optimizing inference systems. This includes writing custom CUDA kernels, profiling GPU code, and working on multi-node inference optimization. Your role will directly contribute to improving the speed and efficiency of large-scale training runs, thus supporting the broader research team and impacting millions of users.

What qualifications are necessary for the AI/ML Inference Engineer position?

To qualify for the AI/ML Inference Engineer position, it's essential to have a solid foundation in CUDA or parallel programming, along with proficiency in Python or C++. Candidates should also demonstrate experience in provisioning, optimizing, and monitoring GPU performance for multi-node inference systems, ideally with a background in large-scale AI models.

What programming languages should an AI/ML Inference Engineer be proficient in?

An AI/ML Inference Engineer should be proficient in Python and C++. Additionally, expertise in CUDA programming for developing and optimizing performance-critical applications is a must. Familiarity with parallel programming concepts will also significantly benefit candidates in this role.

How does the role of an AI/ML Inference Engineer impact user experience?

The role of an AI/ML Inference Engineer fundamentally enhances user experience by ensuring that our AI models operate faster and more efficiently. By optimizing inference systems and improving training processes, you will help deliver smoother and faster interactions for millions of users relying on our technology.

What tools do AI/ML Inference Engineers use for optimization at this company?

AI/ML Inference Engineers at our company utilize a variety of tools for optimization, including profiling tools to analyze GPU performance, code optimization frameworks for CUDA, and monitoring tools to manage multi-node inference setups. These tools are essential to ensure the efficient operation of our AI models.

Common Interview Questions for AI/ML Inference Engineer
Can you explain your experience with CUDA programming?

When answering this question, highlight specific projects where you've developed custom CUDA kernels. Discuss the challenges you faced, the impact your solutions had on performance, and what you learned from the experience. Be prepared to also mention any tools you used for profiling and optimizing your CUDA code.

How do you approach optimizing GPU code?

Explain your optimization process. This could include initial profiling to identify bottlenecks, iterating on the code to enhance performance, and testing with various data sizes and workloads. Sharing specific examples of GPU optimization projects can reinforce your answer.
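
One way to make the "test with various data sizes" step concrete is a small harness like the sketch below. It is only an illustration: the function names (`best_time`, `compare_across_sizes`, `sum_squares_loop`, `sum_squares_builtin`) are hypothetical, and the workload is a CPU-only stand-in for a real GPU kernel. The harness checks that a candidate implementation returns the same result as the baseline before timing either one, then reports the speedup at each input size.

```python
import time
from typing import Callable, Dict, List, Sequence


def best_time(fn: Callable[[List[int]], int], data: List[int], repeats: int = 5) -> float:
    """Return the fastest of several timed runs, which reduces scheduling noise."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(data)
        best = min(best, time.perf_counter() - start)
    return best


def compare_across_sizes(
    baseline: Callable[[List[int]], int],
    candidate: Callable[[List[int]], int],
    sizes: Sequence[int],
    repeats: int = 5,
) -> List[Dict[str, float]]:
    """Verify the candidate matches the baseline, then time both at each size."""
    rows = []
    for n in sizes:
        data = list(range(n))
        # Correctness first: a faster but wrong implementation is not an optimization.
        if baseline(data) != candidate(data):
            raise ValueError(f"results differ at size {n}")
        base_s = best_time(baseline, data, repeats)
        cand_s = best_time(candidate, data, repeats)
        speedup = base_s / cand_s if cand_s > 0 else float("inf")
        rows.append({"size": n, "baseline_s": base_s, "candidate_s": cand_s, "speedup": speedup})
    return rows


# Hypothetical stand-in workloads (assumption: real code would call GPU kernels).
def sum_squares_loop(values: List[int]) -> int:
    total = 0
    for v in values:
        total += v * v
    return total


def sum_squares_builtin(values: List[int]) -> int:
    return sum(v * v for v in values)


if __name__ == "__main__":
    for row in compare_across_sizes(sum_squares_loop, sum_squares_builtin, [1_000, 100_000]):
        print(row)
```

Reporting results per size matters because an optimization that helps on large inputs can be neutral or slower on small ones.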

What methods do you use for multi-node inference optimization?

Discuss techniques such as load balancing, efficient data distribution, and minimizing communication overhead. Highlight any experiences where these methods improved performance in previous projects, outlining the quantitative impacts where applicable.

Describe a challenging problem you've solved in AI/ML.

Choose a specific example that showcases your technical skills while also demonstrating your problem-solving abilities. Emphasize the situation, the approach you took to tackle the problem, and the outcome—particularly focusing on value for users or the research team.

What role does Python play in your workflow as an AI/ML Inference Engineer?

Explain how Python is utilized for data manipulation, script automation, or interfacing with models. Share specific libraries you've used, such as NumPy or TensorFlow, and how they enhance your efficiency in managing and optimizing inference systems.

Can you discuss your experience with large-scale training runs?

Detail your hands-on experience with designing or managing large-scale training processes. Discuss how you ensure system reliability and efficiency, and any techniques you implemented to monitor training performance across multiple nodes.

What types of performance metrics do you analyze for inference systems?

Mention specific metrics such as throughput, latency, and resource utilization. Explain the significance of these metrics in assessing system performance, and describe your experience with monitoring tools and the decisions your findings informed.
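
As a rough illustration of how latency and throughput relate, the sketch below times repeated calls to a batch function and summarizes the results. The names `measure_inference` and `run_batch` are hypothetical, and the example assumes `run_batch` blocks until the work is finished. For GPU workloads that assumption matters: kernel launches are asynchronous, so the callable would need to synchronize with the device before returning.

```python
import statistics
import time
from typing import Callable, Dict


def measure_inference(
    run_batch: Callable[[], None],
    batch_size: int,
    iterations: int = 50,
    warmup: int = 5,
) -> Dict[str, float]:
    """Time repeated calls to run_batch and summarize latency and throughput."""
    # Warm-up runs are excluded so one-time costs (caches, lazy init) do not skew results.
    for _ in range(warmup):
        run_batch()

    latencies = []
    for _ in range(iterations):
        start = time.perf_counter()
        run_batch()  # assumption: returns only after the batch is fully processed
        latencies.append(time.perf_counter() - start)

    latencies.sort()
    total_s = sum(latencies)
    # Simple nearest-rank style index for the 95th percentile.
    p95_index = min(len(latencies) - 1, int(0.95 * len(latencies)))
    return {
        "mean_latency_s": statistics.mean(latencies),
        "p50_latency_s": statistics.median(latencies),
        "p95_latency_s": latencies[p95_index],
        "throughput_items_per_s": batch_size * iterations / total_s if total_s > 0 else float("inf"),
    }


if __name__ == "__main__":
    # Hypothetical stand-in for a model call: sleep for about 2 ms per batch.
    stats = measure_inference(lambda: time.sleep(0.002), batch_size=8, iterations=20)
    for name, value in stats.items():
        print(f"{name}: {value:.6f}")
```

Reporting a tail percentile alongside the mean is a common choice because a few slow requests can dominate perceived responsiveness even when average latency looks healthy.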

How do you stay updated with advances in AI and machine learning technologies?

Discuss the resources you leverage for continuous learning—these might include attending conferences, reading scholarly articles, participating in webinars, or engaging with online communities. Demonstrating a commitment to staying on the cutting edge can set you apart.

What strategies do you employ to work effectively in a team?

Highlight your communication methods, such as regular updates, collaborative problem-solving sessions, and using project management tools. Providing examples of successful team projects can showcase your collaborative abilities, crucial for an AI/ML Inference Engineer.

Why do you believe optimization is critical in AI/ML systems?

Articulate the relevance of optimization in ensuring the scalability and efficiency of AI/ML systems. Frame your answer around user impact, highlighting how performance improvements can lead to better experiences and more effective research outcomes.

TEAM SIZE: No info
HQ LOCATION: No info
EMPLOYMENT TYPE: Full-time, on-site
DATE POSTED: April 7, 2025
