
Machine Learning Engineer

About Etched

Etched is building AI chips that are specialized for individual model architectures. Our first product (Sohu) only supports Transformers, but has an order of magnitude more throughput and lower latency than a B200. With Sohu, you can build products that are infeasible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Etched Labs is the organization within Etched whose mission is to democratize generative AI, pushing the boundaries of what will be possible in a post-Sohu world. 

Key responsibilities

  • Implement high-performance software components for the Kayak inference engine in Rust

  • Translate core mathematical operations from transformer models into optimized operation sequences for Sohu

  • Develop components of our model assembler and execution pipeline

  • Work on performance optimization for the Sohu programming interface and runtime

  • Collaborate with hardware engineers to maximize chip utilization and minimize latency

  • Implement efficient batching strategies and execution plans for inference workloads

  • Contribute to the evolution of our system architecture and programming model

  • Design and implement cutting-edge inference-time compute scaling methods

Representative projects

  • Optimize operation sequences to make full use of Sohu's computational resources for specific transformer architectures such as Stable Diffusion

  • Implement efficient memory management for KV cache sharing and prefix optimization

  • Build infrastructure for continuous batching and batch interleaving to improve throughput

  • Create components for the Sohu runtime that optimize for utilization, latency, and/or throughput

  • Implement model-specific inference-time acceleration techniques, such as speculative decoding, tree search, KV cache sharing, and priority scheduling, by interacting with the rest of the inference serving stack

  • Implement structured decoding and novel sampling algorithms for reasoning models

  • Develop testing and benchmarking tools to measure and improve performance

You may be a good fit if you have

  • Strong software engineering skills with systems programming experience

  • Experience with Rust programming language

  • Familiarity with transformer model architectures and/or inference serving stacks (vLLM, SGLang, etc.)

  • Strong mathematical skills, esp. in linear algebra

  • Understanding of computational graphs, tensor sharding operations, and ML workloads

  • Ability to reason about performance bottlenecks and optimization opportunities

  • Experience working cross-functionally in diverse software and hardware organizations

Strong candidates may also have

  • Experience with hardware accelerators, ASICs, or FPGAs

  • Deep expertise in ML systems engineering and hardware/software co-design with demonstrated impact (contributions to open-source projects or published papers)

  • Experience with low-level memory management and synchronization primitives

  • Track record of optimizing large-scale inference systems

Benefits

  • Full medical, dental, and vision packages, with 100% of premium covered

  • Housing subsidy of $2,000/month for those living within walking distance of the office

  • Daily lunch and dinner in our office

  • Relocation support for those moving to Cupertino

How we’re different

Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

Average salary estimate

$135,000 / year (est.)
Min: $120,000
Max: $150,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About the Machine Learning Engineer Role at Etched

Join Etched as a Machine Learning Engineer in Cupertino and work at the forefront of AI hardware. Etched builds specialized AI chips, starting with Sohu. Sohu supports only Transformers, but in exchange it offers an order of magnitude more throughput and lower latency than a B200, which makes products such as real-time video generation and deep chain-of-thought reasoning agents feasible. As a Machine Learning Engineer, you'll implement high-performance software for the Kayak inference engine in Rust. You'll translate mathematical operations from transformer models into optimized operation sequences for Sohu, collaborate with hardware engineers, and develop components of the model assembler and execution pipeline so that the chip's computational resources are fully used. The work ranges from memory management for the KV cache to inference-time acceleration methods. If you're passionate about machine learning and systems programming, Etched also offers fully covered medical, dental, and vision premiums and relocation support for those moving to Cupertino.

Frequently Asked Questions (FAQs) for Machine Learning Engineer Role at Etched
What are the primary responsibilities of a Machine Learning Engineer at Etched?

As a Machine Learning Engineer at Etched, your primary responsibilities include implementing high-performance software components for our Kayak inference engine using Rust. You'll be translating mathematical operations from transformer models into optimized sequences tailored for our Sohu chip, developing components that enhance our model assembler and execution pipeline, and implementing efficient batching strategies to improve performance. Additionally, collaboration with hardware engineers and contributions to system architecture evolution are key aspects of this role. You'll also play a part in developing benchmarking tools to track and enhance performance.

What qualifications do I need to be a successful Machine Learning Engineer at Etched?

To thrive as a Machine Learning Engineer at Etched, you should possess strong software engineering skills, particularly in systems programming with experience in Rust. Familiarity with transformer model architectures and inference-serving stacks is essential, as are strong mathematical skills, especially in linear algebra. A solid understanding of computational graphs and machine learning workloads is also important. The ability to reason about performance bottlenecks and experience working cross-functionally across software and hardware organizations will make you a standout candidate.

What kind of projects will I work on as a Machine Learning Engineer at Etched?

As a Machine Learning Engineer at Etched, you'll work on a variety of projects. You might optimize operation sequences for specific transformer architectures such as Stable Diffusion or implement efficient memory management for KV cache sharing. You'll also build infrastructure for continuous batching and implement novel sampling algorithms for reasoning models. Building Sohu runtime components that optimize for utilization, latency, and throughput will let you directly shape our generative AI capabilities.

What perks does Etched offer to Machine Learning Engineers?

Etched offers a robust benefits package to its Machine Learning Engineers. This includes full medical, dental, and vision packages, with 100% of the premium covered. We also provide a housing subsidy of $2,000/month for team members living within walking distance of our Cupertino office. Lunch and dinner are provided daily at the office, and those moving to Cupertino receive relocation support.

How does Etched support professional growth for Machine Learning Engineers?

At Etched, we are dedicated to nurturing professional growth for our Machine Learning Engineers. Our collaborative environment encourages cross-functional interactions between engineering and research, allowing you to contribute to both areas. We support participation in open-source projects and actively encourage publishing research papers. This holistic approach helps you expand your skill set and stay at the forefront of AI technology advancements. Our team values innovating and experimenting, making it a thriving place for career development.

Common Interview Questions for Machine Learning Engineer
Can you explain your experience with the Rust programming language?

When answering this question, highlight your practical experience with Rust, particularly in relation to performance-critical applications. Discuss specific projects where you utilized Rust and the outcomes. Emphasize any optimization techniques or system-level programming tasks you completed, and be ready to demonstrate your understanding of Rust’s unique features, such as memory safety and concurrency.

What strategies do you use to optimize machine learning models?

In your response, outline various strategies you’ve employed to optimize machine learning models. This might include techniques like reducing model complexity, employing quantization, or using efficient batching methods. Provide examples of how these strategies led to performance improvements and any tools or libraries you have experience with for model optimization.

How familiar are you with transformer architectures?

Discuss your familiarity with transformer architectures by mentioning specific models you’ve worked with, such as BERT or GPT. Explain how you used these architectures in past projects and any insights you gained about their advantages and limitations. You can refer to your technical expertise in handling tasks like training, fine-tuning, or deploying these models.

How do you approach debugging complex machine learning systems?

When discussing your debugging approach, highlight your systematic method for diagnosing issues. Describe how you use various tools for logging and monitoring, and potentially how you analyze error rates and performance metrics. Provide examples where your debugging efforts led to significant breakthroughs or improvements in model performance.

What do you understand by performance bottlenecks in machine learning?

Tailor your response to demonstrate your understanding of performance bottlenecks, perhaps by referring to common causes like inefficient data handling, slow model inference, or inadequate resource utilization. Share examples of identifying and resolving bottlenecks in previous projects and how those resolutions benefited model efficiency and speed.

Can you discuss your experience with hardware accelerators?

Your answer should reflect any direct experience you’ve had with hardware accelerators such as ASICs or FPGAs. Focus on specific projects where you’ve optimized machine learning workflows using these accelerators. Explain how they impacted model performance and efficiency in your work and any challenges you faced while integrating them.

How do you handle collaboration with cross-functional teams?

In your response, discuss the importance of clear communication and mutual respect when working with cross-functional teams. Highlight specific experiences where you collaborated successfully with hardware engineers or product managers, focusing on how shared goals facilitate better outcomes. Be sure to illustrate any strategies you use to overcome potential communication barriers.

What steps do you take to ensure the scalability of ML systems?

Discuss scalability in terms of both data and model size. Explain the methodologies you’ve applied to design scalable machine learning systems, such as distributed processing frameworks, model partitioning, or elastic serving infrastructures. Provide specific outcomes or projects where you successfully scaled an ML system, showcasing the impact of your work.

Could you share an instance of when you improved an ML model's inference speed?

Provide a concise yet detailed account of a project where you successfully improved inference speed. Discuss the techniques you implemented, such as pruning, quantization, or using more efficient algorithms, and mention the resulting performance metrics. This demonstrates your ability to practically apply optimization techniques effectively.

How do you stay updated with the latest advancements in AI and machine learning?

Share your strategies for staying current, such as following relevant journals, attending conferences, participating in online communities, or regularly engaging with popular AI research blogs. Mention any specific resources or thought leaders you follow that contribute to your ongoing education and inspiration in the field of machine learning.


By burning the transformer architecture into our chips, we’re creating the world’s most powerful servers for transformer inference.

Employment type: Full-time, on-site
Date posted: March 21, 2025
