
LLM/ML Engineer (Inference)

About the role

About Us

The vast majority of enterprise data lives in files like PDFs and spreadsheets, including everything from financial statements to medical records. Reducto helps AI teams turn these complex documents into LLM-ready inputs with exceptional accuracy, so they can build more reliable products while saving engineering time.

Our Traction

In less than a year we've scaled to 7 figures in ARR, serving customers from ambitious startups to Fortune 10 enterprises. We're now processing tens of millions of pages monthly.

The core work will include:

  • Architecting and implementing robust, scalable inference systems for serving state-of-the-art AI models

  • Optimizing model serving infrastructure for high throughput and low latency at scale

  • Developing and integrating advanced inference optimization techniques

  • Working closely with our research team to bring cutting-edge capabilities into production

  • Building developer tools and infrastructure to support rapid experimentation and deployment

We would love to meet you if this describes you:

  • Philosophy: You are your own worst critic. You have a high bar for quality and don’t rest until the job is done right—no settling for 90%. We want someone who ships fast, with high agency, and who doesn't just voice problems but actively jumps in to fix them.

  • Experience: You have deep expertise in Python and PyTorch, with a strong foundation in low-level operating systems concepts including multi-threading, memory management, networking, storage, performance, and scale. You're experienced with modern inference systems like TGI, vLLM, TensorRT-LLM, and Optimum, and comfortable creating custom tooling for testing and optimization.

  • Approach: You combine technical expertise with practical problem-solving. You're methodical in debugging complex systems and can rapidly prototype and validate solutions.

Bonus points if you:

  • Have experience with low-level systems programming (CUDA, Triton) and compiler optimization

  • Are passionate about open-source contributions and staying current with ML infrastructure developments

  • Bring practical experience with high-performance computing and distributed systems

  • Have worked in early-stage environments where you helped shape technical direction

  • Are energized by solving complex technical challenges in a collaborative environment

This is an in-person role at our office in SF. We’re an early-stage company, which means that the role requires working hard and moving quickly. Please only apply if that excites you.

About Reducto

Nearly 80% of enterprise data is in unstructured formats like PDFs

PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience—it’s a critical bottleneck that leads to dozens of wasted hours every week.

Traditional approaches fail at reliably extracting information from complex PDFs

OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns is jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.

Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you:

  • Accurately extract text and tables even with nonstandard layouts

  • Automatically convert graphs to tabular data and summarize images in documents

  • Extract important fields from complex forms with simple, natural language instructions

  • Build powerful retrieval pipelines using Reducto’s document metadata

  • Intelligently chunk information using the document’s layout data


Reducto is an Equal Opportunity Employer committed to diversity and inclusion in the workplace. All qualified applicants will receive consideration for employment without regard to sex, race, color, age, national origin, religion, physical and mental disability, genetic information, marital status, sexual orientation, gender identity/assignment, citizenship, pregnancy or maternity, protected veteran status, or any other status prohibited by applicable national, federal, state or local law.

Average salary estimate

$125,000 / year (est.)
min: $100,000
max: $150,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Employment type: Full-time, on-site
Date posted: January 10, 2025
