Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Machine Learning Engineer image - Rise Careers
Job details

Machine Learning Engineer

About the role

About Us

The vast majority of enterprise data is in files like PDFs and spreadsheets. That includes everything from financial statements to medical records. Reducto helps AI teams turn those really complex documents into LLM-ready inputs with exceptional accuracy. This means they can build more reliable products while saving engineering time.

Our Traction

Hundreds of companies have signed up to use Reducto since our launch, and we're now processing tens of millions of pages every month for teams ranging from startups to Fortune 10 enterprises. We’re hiring founding software engineers to help us continue to serve our customers as we build the ingestion layer that connects human data with LLMs.

The Opportunity

As a member of our founding team you’ll work on our core API and on prem deployments. That means you’ll have a hand in everything that our customers need.

We would love to meet you if you:

  • Philosophy: You are your own worst critic. You have a high bar for quality and don’t rest until the job is done right—no settling for 90%. We want someone who ships fast, with high agency, and who doesn't just voice problems but actively jumps in to fix them.

  • Experience: You have 2 to 5 years of experience with training, fine tuning, and evaluating ML models used in production systems

  • Language/Skills: You’re exceptional at Python or similar, and are well versed with both traditional computer vision and VLMs

  • Tools: Build your own tools as needed—like a quick Streamlit app to test hypotheses or create a dataset.

  • Approach: A quantitative approach to building products. Ability to debug, experiment, and iterate fast. You should be comfortable getting hands-on with the full development lifecycle, from ideation to shipping to users.

The core work will include:

  • Training and deploying new state of the art models for parsing and interpreting unstructured data

  • Experimenting with novel techniques to improve LLM accuracy

  • Build data pipelines, evaluate model performance, and integrate models into the product

  • Working directly with the founders and customers to shape the product direction and engineering strategy

Bonus points if you:

  • Have prior experience founding a company or building products at early stages

  • Are ambitious and driven, and care a lot about doing great work with great people

  • Keep up with the latest developments in ML/AI

This is an in person role at our office in SF. We’re an early stage company which means that the role requires working hard and moving quickly. Please only apply if that excites you.

About Reducto

Nearly 80% of enterprise data is in unstructured formats like PDFs

PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience—it’s a critical bottleneck that leads to dozens of wasted hours every week.

Traditional approaches fail at reliably extracting information in complex PDFs

OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns are jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.

Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you:

  • Accurately extract text and tables even with nonstandard layouts

  • Automatically convert graphs to tabular data and summarize images in documents

  • Extract important fields from complex forms with simple, natural language instructions

  • Build powerful retrieval pipelines using Reducto’s document metadata

  • Intelligently chunk information using the document’s layout data

Average salary estimate

$115000 / YEARLY (est.)
min
max
$100000K
$130000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Machine Learning Engineer, Reducto

At Reducto, we're on a mission to revolutionize how enterprise data is handled, and we’re looking for a talented Machine Learning Engineer to join our San Francisco-based team. In this role, you'll be at the forefront of transforming complex documents like PDFs and spreadsheets into high-quality inputs for machine learning models. Your main tasks will involve training and deploying cutting-edge models that can parse unstructured data accurately, using a quantitative approach that allows for rapid debugging and iteration. You’ll collaborate closely with founders and customers to shape our API and product direction, ensuring that we meet the highest quality standards while shipping features at an impressive pace. With 2 to 5 years of experience in ML, you’ll leverage your expertise in Python and traditional computer vision to make a significant impact. At Reducto, you will also have the creativity to build your own tools, like Streamlit apps for testing ideas or data visualization. If you are excited about working in a dynamic, fast-paced startup environment where you can make a difference, you’ll thrive here. Join us in tackling the challenges of document processing that impact enterprises, and help us develop innovative solutions that save time and enhance productivity!

Frequently Asked Questions (FAQs) for Machine Learning Engineer Role at Reducto
What are the responsibilities of a Machine Learning Engineer at Reducto?

As a Machine Learning Engineer at Reducto, you will be tasked with training and deploying state-of-the-art machine learning models for parsing complex documents. You'll experiment with new techniques to enhance LLM accuracy, build data pipelines, evaluate model performance, and integrate models into our product. Collaborating with our founders and customers will also play a crucial role in shaping the direction of our engineering strategy.

Join Rise to see the full answer
What qualifications are needed to apply for the Machine Learning Engineer position at Reducto?

To apply for the Machine Learning Engineer role at Reducto, you should have 2 to 5 years of experience in training, fine-tuning, and evaluating ML models used in production. Proficiency in Python and familiarity with traditional computer vision and vision-language models (VLMs) are essential skills. Additionally, a proactive approach to problem-solving and the ability to iterate quickly would be beneficial.

Join Rise to see the full answer
What tools and technologies will I be using as a Machine Learning Engineer at Reducto?

In the Machine Learning Engineer position at Reducto, you'll utilize a variety of tools aligned with machine learning, such as Python for model development. You may also build additional tools, such as Streamlit applications for hypothesis testing and data visualization. Familiarity with LLMs, traditional ML techniques, and data pipeline tools will also be crucial for success in this role.

Join Rise to see the full answer
Is the Machine Learning Engineer position at Reducto an in-person role?

Yes, the Machine Learning Engineer role at Reducto is an in-person position based in our San Francisco office. This setup allows for better collaboration with the founding team and quicker iterations on projects, which is vital for an early-stage company where working hard and moving fast is essential.

Join Rise to see the full answer
What is the company culture like at Reducto for a Machine Learning Engineer?

The company culture at Reducto is one that values high standards and rapid execution. As a Machine Learning Engineer, you'll be part of an innovative team where appreciation for quality work and proactive problem-solving is paramount. You'll have the freedom to take initiative, and your contributions will directly shape product development as we all work together toward a common goal.

Join Rise to see the full answer
Common Interview Questions for Machine Learning Engineer
How do you approach training machine learning models in production?

To train machine learning models in production, I start with understanding the business problem and gather a well-defined dataset. I prioritize data preprocessing and cleaning to ensure quality inputs. Subsequently, I select appropriate algorithms and split data into training and testing sets. Continuous evaluation of model performance using metrics and testing on real-world scenarios is essential for effectiveness before deployment.

Join Rise to see the full answer
What is your experience with fine-tuning LLMs?

My experience with fine-tuning LLMs involves customizing pre-trained models to specific tasks. I analyze the domain-specific data, adjust hyperparameters, and use techniques like transfer learning to improve the model's performance. I focus on monitoring model performance throughout the fine-tuning process, making adjustments as necessary based on feedback and validation results.

Join Rise to see the full answer
Can you explain a time when you successfully deployed a machine learning model?

In a previous project, I successfully deployed a machine learning model by thoroughly testing its performance in a staging environment. I closely monitored resource usage and response times after deployment, making optimizations based on real-world data. Collaboration with cross-functional teams ensured that the deployment was smooth, and I used tools for continuous monitoring to quickly identify and address any issues.

Join Rise to see the full answer
What tools do you use for building data pipelines?

I frequently use tools like Apache Airflow and Luigi for orchestrating data pipelines. In addition, I leverage Python libraries for data manipulation, such as Pandas and NumPy, and integrate with platforms like AWS or Azure for storage and computational capabilities. It’s all about designing scalable and reliable pipelines to ensure seamless data flow.

Join Rise to see the full answer
How do you evaluate the performance of a machine learning model?

I evaluate model performance by using metrics like accuracy, precision, recall, and F1 score, depending on the context of the problem. I also consider creating confusion matrices to visualize performance across various classes. Additionally, I implement A/B testing when feasible to compare model variants in production, helping ensure we choose the best performing model.

Join Rise to see the full answer
What is your experience with handling unstructured data?

Handling unstructured data requires a different strategy compared to structured data. My experience includes leveraging NLP techniques for text extraction and using computer vision for processing images. I focus on building custom preprocessing steps that help convert unstructured data into a more usable format. I also believe in using heuristics to design pipelines that effectively manage this complexity.

Join Rise to see the full answer
How do you stay updated with the latest developments in AI and ML?

I stay updated with AI and ML developments by following influential research papers, attending webinars, and participating in community discussions. I also subscribe to newsletters and online courses to continuously learn new techniques and tools. Networking with industry peers helps me exchange insights and best practices, which in turn fuels innovation in my work.

Join Rise to see the full answer
Can you describe a project where you had to debug a model-related issue?

In a recent project, I encountered an issue where the model's predictions were inconsistent. I systematically debugged by analyzing the input data, checking for anomalies, and ensuring that preprocessing steps matched the training phase. I also revised the training process to identify if any hyperparameters needed adjustment, leading to improved model accuracy post-debugging.

Join Rise to see the full answer
What role do you think collaboration plays in a machine learning project?

Collaboration is critical in machine learning projects, as complex problems often require insights from diverse perspectives. Interaction with data scientists, software engineers, and product managers ensures alignment of goals and innovative solutions. Furthermore, collective problem-solving can lead to faster iteration and improved product outcomes, as everyone brings unique skills to the table.

Join Rise to see the full answer
Why do you want to work as a Machine Learning Engineer at Reducto?

I am passionate about using AI to solve real-world problems, and Reducto’s mission to process unstructured data aligns perfectly with my interests. I admire the innovation in your approach and I’m excited about the opportunity to collaborate closely with a founding team and contribute to building impactful products that save time and enhance productivity for enterprises.

Join Rise to see the full answer
Similar Jobs
Posted 23 hours ago
Photo of the Rise User
Veeva Systems Remote California - San Luis Obispo
Posted 4 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Family Medical Leave
Maternity Leave
Paternity Leave
Lactation Facilities
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Paid Time-Off
Paid Volunteer Time
Photo of the Rise User
Posted 6 days ago
Photo of the Rise User
Nagarro Remote Remote, China
Posted 7 days ago
Photo of the Rise User
Posted 14 hours ago
Photo of the Rise User
Posted 14 days ago
Photo of the Rise User
Persona Hybrid Persona - San Francisco
Posted 14 days ago
Photo of the Rise User
Altium Hybrid Frisco, Texas, United States
Posted 2 days ago
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
LOCATION
No info
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
January 9, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!