Job details

Member of Technical Staff - Edge AI Inference Engineer

Liquid AI, an MIT spin-off, is a foundation model company headquartered in Boston, Massachusetts. Our mission is to build capable and efficient general-purpose AI systems at every scale.


Our goal at Liquid is to build the most capable AI systems to solve problems at every scale, so that users can build, access, and control their AI solutions. This ensures that AI is integrated meaningfully, reliably, and efficiently across enterprises. Long term, Liquid will create and deploy frontier-AI-powered solutions that are available to everyone.


What this role actually is:


As we prepare to deploy our models across various edge device types, including CPUs, embedded GPUs, and NPUs, we seek an expert to optimize inference stacks tailored to each platform. We're looking for someone who can take our models, dive deep into the task, and return with a highly optimized inference stack, leveraging existing frameworks like llama.cpp, ExecuTorch, and TensorRT to deliver exceptional throughput and low latency.


The ideal candidate is a highly skilled engineer with extensive experience in inference on embedded hardware and a deep understanding of CPU, NPU, and GPU architectures. They should be self-motivated, capable of working independently, and driven by a passion for optimizing performance across diverse edge hardware platforms.


Proficiency in building and enhancing edge inference stacks is essential. Additionally, experience with mobile development and expertise in cache-aware algorithms will be highly valued.


Requirements & Responsibilities
  • Strong ML Experience: Proficiency in Python and PyTorch to interface with the ML team at a deeply technical level.
  • Hardware Awareness: Understanding of modern hardware architecture, including cache hierarchies and memory access patterns, and their impact on performance.
  • Proficient in Coding: Expertise in Python, C++, or Rust for AI-driven real-time embedded systems.
  • Optimization of Low-Level Primitives: Responsible for optimizing core primitives to ensure efficient model execution.
  • Self-Guided and Ownership: Ability to independently take a PyTorch model and inference requirements and deliver a fully optimized edge inference stack with minimal guidance.


Average salary estimate

$140,000 / year (est.)
min: $120,000
max: $160,000

If an employer mentions a salary or salary range in their job posting, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Member of Technical Staff - Edge AI Inference Engineer, Liquid AI

At Liquid AI, we're on a mission to revolutionize the way artificial intelligence is integrated into everyday enterprises, and we're excited to announce that we're looking for a talented Member of Technical Staff - Edge AI Inference Engineer to join our dynamic team. Based in Boston, but with flexibility in location, this role is all about optimizing inference stacks for a variety of edge devices like CPUs, embedded GPUs, and NPUs.

If you're a passionate engineer who is deeply familiar with hardware architectures and has hands-on experience with frameworks such as llama.cpp, ExecuTorch, and TensorRT, this is the job for you! You'll dive deep into technical challenges, crafting high-performance inference solutions that push the limits of what's possible with edge hardware. We're seeking self-motivated individuals who thrive on ownership and aim to enhance model throughput and reduce latency. With a strong foundation in Python and PyTorch, you will collaborate closely with the machine learning team to ensure seamless integration and execution of AI models. Fluency in C++, Rust, or similar languages will also be beneficial!

If you enjoy the thrill of optimizing low-level machine primitives and you have experience with mobile development, we would love to hear from you. At Liquid AI, we believe in empowering our team members to take initiative and deliver exceptional solutions that help shape the future of AI. Join us on this incredible journey and let's create the next generation of AI together!

Frequently Asked Questions (FAQs) for Member of Technical Staff - Edge AI Inference Engineer Role at Liquid AI
What are the key responsibilities of a Member of Technical Staff - Edge AI Inference Engineer at Liquid AI?

The Member of Technical Staff - Edge AI Inference Engineer at Liquid AI is responsible for optimizing inference stacks for various edge hardware platforms. This involves understanding modern hardware architectures and performance optimization, effectively working with frameworks such as llama.cpp, ExecuTorch, and TensorRT, and delivering optimized solutions for AI models. You'll work closely with the machine learning team to enhance model execution and ensure low latency in real-time AI applications.

What qualifications are required for the Edge AI Inference Engineer role at Liquid AI?

Applicants for the Edge AI Inference Engineer position at Liquid AI should have extensive experience with inference on embedded hardware, proficiency in Python and PyTorch, and a strong understanding of CPU, NPU, and GPU architectures. Additionally, expertise in C++, Rust, or similar languages is beneficial, as is familiarity with cache-aware algorithms and mobile development.

How does an Edge AI Inference Engineer contribute to product development at Liquid AI?

An Edge AI Inference Engineer plays a critical role in product development at Liquid AI by optimizing machine learning model performance on embedded devices. By ensuring efficient execution of AI models tailored to various hardware architectures, they enable our products to achieve exceptional throughput and low latency, thereby enhancing the overall user experience with AI-powered solutions.

What skills are essential for success as a Member of Technical Staff - Edge AI Inference Engineer?

To succeed as a Member of Technical Staff - Edge AI Inference Engineer at Liquid AI, you should have a strong background in machine learning, proficiency in relevant programming languages, and the ability to work independently. Understanding hardware implications on performance is crucial, as is experience with optimizing low-level primitives. Additionally, you should have a passion for exploring new challenges in edge AI systems.

What kind of work environment can I expect as an Edge AI Inference Engineer at Liquid AI?

At Liquid AI, you can expect a collaborative and innovative work environment where creativity and technical expertise are highly valued. We emphasize team ownership and encourage self-motivated individuals to take initiative. Our culture fosters continuous learning and exploration, with opportunities to contribute to meaningful projects that advance the future of AI technology.

Common Interview Questions for Member of Technical Staff - Edge AI Inference Engineer
How would you optimize an AI model for edge devices?

To optimize an AI model for edge devices, I'd start by profiling the model to identify bottlenecks, followed by streamlining the computation by leveraging optimized inference frameworks like TensorRT. I would ensure that the model memory usage is efficient and reduce its complexity without sacrificing performance, potentially exploring quantization techniques.
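The quantization step mentioned above can be sketched in a few lines. This is a hypothetical, pure-Python illustration of symmetric per-tensor int8 quantization (the function names and values are illustrative, not from any specific framework); production stacks would use the quantization tooling built into frameworks like TensorRT or PyTorch.

```python
# Hypothetical sketch of post-training int8 quantization, a common
# technique for shrinking models before edge deployment.

def quantize_int8(weights):
    """Symmetric per-tensor quantization: float32 -> int8 values + scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from int8 values."""
    return [qi * scale for qi in q]

weights = [0.82, -1.27, 0.003, 0.51, -0.99]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
# Round-trip error is bounded by half a quantization step (scale / 2).
max_err = max(abs(a - b) for a, b in zip(weights, restored))
assert max_err <= scale / 2 + 1e-9
```

The appeal on edge hardware is that int8 weights take 4x less memory bandwidth than float32 and map onto fast integer SIMD/NPU paths, at the cost of the bounded error shown above.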

What experience do you have with optimizing low-level primitives in AI?

In my previous role, I worked extensively on optimizing low-level primitives for real-time AI applications, focusing on enhancing performance through memory-efficient algorithms and cache optimization. I'm proficient in tools like PyTorch for implementing these optimizations and have a good grasp of hardware architectures.
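One concrete shape such a primitive optimization often takes is accepting a caller-provided output buffer, so a hot inference loop reuses memory instead of allocating on every call. A minimal sketch, assuming a hypothetical `vec_add` primitive (the pattern is what matters; real kernels would be C/C++ or Rust):

```python
# Hypothetical sketch: a core primitive that writes into a preallocated
# output buffer, avoiding per-call allocation in a hot inference loop.

def vec_add(a, b, out):
    """Elementwise add of a and b into the preallocated buffer out."""
    for i in range(len(a)):
        out[i] = a[i] + b[i]
    return out

# The buffer is allocated once, outside the loop.
out = [0.0] * 4
a, b = [1.0, 2.0, 3.0, 4.0], [10.0, 20.0, 30.0, 40.0]
for _ in range(3):  # e.g. repeated inference steps reusing the same buffer
    vec_add(a, b, out)
assert out == [11.0, 22.0, 33.0, 44.0]
```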

Can you explain your understanding of cache hierarchies and their impact on model execution?

Cache hierarchies are critical for improving data access speeds during model execution. I understand how different levels of cache (L1, L2, L3) can be leveraged to keep frequently accessed data readily available, minimizing latency. My approach involves optimizing data structures to fit within these caches effectively.
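The "fit within these caches" idea is usually realized with loop tiling (cache blocking): traversing data in small tiles whose working set fits in L1/L2. A hypothetical pure-Python sketch (Python can't demonstrate the actual speedup; the traversal order is the point, and `tiled_sum` and its tile size are illustrative):

```python
# Hypothetical sketch of loop tiling (cache blocking): visit a matrix
# in small tiles so each tile's working set stays cache-resident.

def tiled_sum(matrix, tile=2):
    """Sum all elements, traversing in tile-by-tile order."""
    n = len(matrix)
    total = 0.0
    for bi in range(0, n, tile):          # tile row start
        for bj in range(0, n, tile):      # tile column start
            for i in range(bi, min(bi + tile, n)):
                for j in range(bj, min(bj + tile, n)):
                    total += matrix[i][j]
    return total

m = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
# Tiled traversal visits every element exactly once, so the result
# matches a straightforward row-by-row sum.
assert tiled_sum(m) == sum(sum(row) for row in m)
```

In a compiled kernel the same reordering keeps each tile in L1 while it is reused, which is where the latency win comes from.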

What frameworks are you familiar with for optimizing inference, and how have you applied them?

I have experience with several optimization frameworks including llama.cpp, ExecuTorch, and TensorRT. I've applied these frameworks to reduce model inference time considerably by implementing layer fusion and weight pruning, ultimately leading to smooth execution on edge devices.
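Layer fusion, mentioned above, can be illustrated with the classic case of folding a per-output scale and shift (e.g. a batch-norm-style op) into the preceding linear layer. A hypothetical sketch (function names and values are illustrative; frameworks like TensorRT perform such fusions automatically):

```python
# Hypothetical sketch of layer fusion: fold a per-output scale/shift
# into the preceding linear layer so inference runs one op, not two.

def linear(x, w, b):
    """y[i] = dot(w[i], x) + b[i]"""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi
            for row, bi in zip(w, b)]

def fuse(w, b, scale, shift):
    """Fused weights/bias such that linear(x, fw, fb) == scale*linear(x,w,b)+shift."""
    fw = [[s * wi for wi in row] for row, s in zip(w, scale)]
    fb = [s * bi + sh for bi, s, sh in zip(b, scale, shift)]
    return fw, fb

w, b = [[1.0, 2.0], [3.0, 4.0]], [0.5, -0.5]
scale, shift = [2.0, 0.5], [1.0, -1.0]
x = [1.0, 1.0]
# Reference: linear layer followed by the separate scale/shift op.
y_ref = [s * yi + sh for yi, s, sh in zip(linear(x, w, b), scale, shift)]
fw, fb = fuse(w, b, scale, shift)
# The single fused layer reproduces the two-op pipeline exactly.
assert linear(x, fw, fb) == y_ref
```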

Describe your coding experience in Python and how it relates to AI.

I have extensive coding experience in Python, particularly in relation to machine learning libraries such as PyTorch. This expertise has empowered me to interface effectively with ML teams, develop prototypes quickly, and analyze model performance, directly translating to impactful AI solutions.

How do you ensure code quality in your projects?

I ensure code quality by adhering to best practices, utilizing code reviews, and implementing unit tests during the development process. This attention to detail not only helps in maintaining efficiency but also ensures that optimal performance is consistently achieved during model execution.

Describe a challenging problem you faced with hardware optimization and how you resolved it.

One challenging problem I faced was a processing-speed bottleneck on an embedded GPU. I profiled the model and identified inefficient memory access patterns. By reorganizing the data flow and optimizing the model structure, I dramatically improved processing speed and overall performance.

What experience do you have with mobile development in the context of AI?

I have foundational experience in mobile development, particularly in deploying AI models on Android platforms. This involved optimizing models for lower computational power, ensuring they run efficiently without compromising user experience or model accuracy.

How would you approach integrating a new AI model into an existing edge inference stack?

I would begin by analyzing the architecture of the existing inference stack, followed by assessing the requirements and constraints of the new AI model. After ensuring compatibility, I would proceed with performance tuning and integration testing to ensure seamless operation and optimal performance.
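The integration-testing step described above often boils down to comparing the new path's outputs against the reference implementation within a numeric tolerance before swapping it in. A minimal hypothetical sketch (`outputs_match` and its tolerance are illustrative, not from any specific test harness):

```python
# Hypothetical sketch: verify an optimized model matches the reference
# implementation within a tolerance before it joins the inference stack.

def outputs_match(reference, optimized, inputs, atol=1e-3):
    """Return True if every output element agrees within atol on all inputs."""
    for x in inputs:
        ref, opt = reference(x), optimized(x)
        if any(abs(r - o) > atol for r, o in zip(ref, opt)):
            return False
    return True

reference = lambda x: [2.0 * v + 1.0 for v in x]
optimized = lambda x: [2.0 * v + 1.0000001 for v in x]  # tiny numeric drift
assert outputs_match(reference, optimized, [[0.0, 1.0], [2.5, -3.0]])
```

Running such a check over a held-out input set catches accuracy regressions introduced by quantization or kernel changes before they reach production.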

How do you stay updated with the latest developments in AI and hardware optimization?

I stay updated by actively engaging with the AI community through forums, attending relevant workshops, and following leading researchers. I also regularly read scientific papers and blogs on emerging trends in AI and hardware optimization to ensure my skills are aligned with industry advancements.

Similar Jobs

Join Liquid AI to contribute to groundbreaking AI systems as a Member of Technical Staff specializing in foundational model data.

Join Liquid AI as a Member of Technical Staff to advance Vision-Language models and innovate in AI solutions.


Join Jobgether as a Senior Software Engineer and help build secure and scalable software solutions.


Join Fortanix, a leader in data-centric cybersecurity, as a Scrum Master to drive our agile practices within hybrid multicloud solutions.


Be part of Flock Safety's mission to eliminate crime as a Software Engineer II by building advanced search and sharing systems.


HighLevel is looking for a Software Development Engineer II to enhance their Calendar Integrations and optimize systems while promoting a culture of continuous learning.


Lead innovative technology strategies as a Chief Software Engineer at Visa, focusing on AI-driven risk and fraud management solutions.

EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
April 4, 2025
