Job details

Machine Learning Engineer

About the role

About Us

The vast majority of enterprise data is in files like PDFs and spreadsheets. That includes everything from financial statements to medical records. Reducto helps AI teams turn those really complex documents into LLM-ready inputs with exceptional accuracy. This means they can build more reliable products while saving engineering time.

Our Traction

Hundreds of companies have signed up to use Reducto since our launch, and we're now processing tens of millions of pages every month for teams ranging from startups to Fortune 10 enterprises. We’re hiring founding software engineers to help us continue to serve our customers as we build the ingestion layer that connects human data with LLMs.

The Opportunity

As a member of our founding team you’ll work on our core API and on prem deployments. That means you’ll have a hand in everything that our customers need.

We would love to meet you if you:

Philosophy: You are your own worst critic. You have a high bar for quality and don’t rest until the job is done right—no settling for 90%. We want someone who ships fast, with high agency, and who doesn't just voice problems but actively jumps in to fix them.
Experience: You have 2 to 5 years of experience with training, fine tuning, and evaluating ML models used in production systems
Language/Skills: You’re exceptional at Python or similar, and are well versed with both traditional computer vision and VLMs
Tools: Build your own tools as needed—like a quick Streamlit app to test hypotheses or create a dataset.
Approach: A quantitative approach to building products. Ability to debug, experiment, and iterate fast. You should be comfortable getting hands-on with the full development lifecycle, from ideation to shipping to users.

The core work will include:

Training and deploying new state of the art models for parsing and interpreting unstructured data
Experimenting with novel techniques to improve LLM accuracy
Build data pipelines, evaluate model performance, and integrate models into the product
Working directly with the founders and customers to shape the product direction and engineering strategy

Bonus points if you:

Have prior experience founding a company or building products at early stages
Are ambitious and driven, and care a lot about doing great work with great people
Keep up with the latest developments in ML/AI

This is an in person role at our office in SF. We’re an early stage company which means that the role requires working hard and moving quickly. Please only apply if that excites you.

About Reducto

Nearly 80% of enterprise data is in unstructured formats like PDFs

PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience—it’s a critical bottleneck that leads to dozens of wasted hours every week.

Traditional approaches fail at reliably extracting information in complex PDFs

OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns are jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.

Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you:

Accurately extract text and tables even with nonstandard layouts
Automatically convert graphs to tabular data and summarize images in documents
Extract important fields from complex forms with simple, natural language instructions
Build powerful retrieval pipelines using Reducto’s document metadata
Intelligently chunk information using the document’s layout data

Average salary estimate

$115000 / YEARLY (est.)

min

max

$100000K

$130000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Machine Learning Engineer, Reducto

At Reducto, we're on a mission to revolutionize how enterprise data is handled, and we’re looking for a talented Machine Learning Engineer to join our San Francisco-based team. In this role, you'll be at the forefront of transforming complex documents like PDFs and spreadsheets into high-quality inputs for machine learning models. Your main tasks will involve training and deploying cutting-edge models that can parse unstructured data accurately, using a quantitative approach that allows for rapid debugging and iteration. You’ll collaborate closely with founders and customers to shape our API and product direction, ensuring that we meet the highest quality standards while shipping features at an impressive pace. With 2 to 5 years of experience in ML, you’ll leverage your expertise in Python and traditional computer vision to make a significant impact. At Reducto, you will also have the creativity to build your own tools, like Streamlit apps for testing ideas or data visualization. If you are excited about working in a dynamic, fast-paced startup environment where you can make a difference, you’ll thrive here. Join us in tackling the challenges of document processing that impact enterprises, and help us develop innovative solutions that save time and enhance productivity!

Frequently Asked Questions (FAQs) for Machine Learning Engineer Role at Reducto

What are the responsibilities of a Machine Learning Engineer at Reducto?

As a Machine Learning Engineer at Reducto, you will be tasked with training and deploying state-of-the-art machine learning models for parsing complex documents. You'll experiment with new techniques to enhance LLM accuracy, build data pipelines, evaluate model performance, and integrate models into our product. Collaborating with our founders and customers will also play a crucial role in shaping the direction of our engineering strategy.