Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
LLM Performance Researcher image - Rise Careers
Job details

LLM Performance Researcher

Full-time • San Francisco or NYC

At Endeavor, we’re rebuilding ERP from first principles for $1B+ manufacturing and distribution companies. These companies run on PDFs, spreadsheets, and semi-structured chaos — and we’re building LLM-powered systems to parse, match, and reason through all of it with human-level reliability.

We’re looking for a researcher with deep experience in LLM performance on document tasks — especially extraction, entity linking, and record matching. You’ve likely published papers on it. You’ve probably run head-to-head evals on OpenAI, Claude, and open-source models. You’re fluent in both academic benchmarks and in the weird, grimy failure modes that only show up in production.

Your work will directly improve the core performance of our agentic ERP. You’ll prototype new techniques, run structured evals, improve few-shot + tool-augmented performance, and help shape how LLMs interface with structured business systems.

What You’ll Do

  • Design and run experiments to improve extraction, normalization, and matching across real-world documents

  • Evaluate LLM performance on noisy, multi-format inputs like scanned PDFs, OCR output, and Excel sheets

  • Improve model accuracy and reliability in the face of rare formats, abbreviations, bad formatting, and domain-specific vocab

  • Build and own our eval infrastructure for matching, linking, extraction, and schema alignment tasks

  • Work with the Applied AI Researcher and Backend Engineers to deploy improvements into production

  • Contribute to long-term strategy around fine-tuning, retrieval augmentation, tool use, or structured memory (if and when needed)

You Might Be a Fit If You

  • Have deep experience with document understanding and information extraction using LLMs

  • Have worked on schema alignment, record linking, or entity resolution at scale

  • Have published papers on LLM performance (e.g. extraction, evals, few-shot prompting, matching)

  • Understand both academic benchmarks and real-world weirdness

  • Know how to make evals meaningful, tight, and fast to iterate on

  • Want to work in a setting where research turns into production code fast

  • Have a PhD or equivalent research background in NLP, ML, or similar (but we care more about what you’ve done than what your title says)

Bonus Points

  • Experience with post-OCR workflows or noisy doc normalization

  • Deep intuition for failure modes in enterprise-scale matching/linking systems

  • Obsession with eval quality and reproducibility

  • Comfort implementing papers and benchmarking models at scale

  • Past work in procurement, invoicing, logistics, or any doc-heavy vertical

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About LLM Performance Researcher, Endeavor

At Endeavor, we're on a mission to overhaul ERP systems from the ground up for billion-dollar manufacturing and distribution companies. As an LLM Performance Researcher based in San Francisco, you'll be at the forefront of this innovative transformation. Imagine working with languages and systems that typically run on PDFs and spreadsheets, and using your expertise in large language models to revolutionize how businesses interact with their data. If you have deep experience in LLM performance on document tasks like extraction and entity linking, this is the perfect fit for you! We're looking for someone who's not only published research on LLM performance but also understands the gritty details of what can go wrong in real-world applications. You will design experiments that enhance data normalization and matching across various document formats, tackling challenges from noisy multi-format inputs to unusual abbreviation issues. Collaborating closely with our Applied AI Researchers and Backend Engineers, your efforts will directly contribute to our core agentic ERP's performance. We value practical experience—if you've worked on schema alignment or record linking at scale and have the passion to see your research impact production quickly, we want to hear from you. Join us at Endeavor and help shape the future of ERP systems with your cutting-edge research and enthusiasm for document understanding!

Frequently Asked Questions (FAQs) for LLM Performance Researcher Role at Endeavor
What are the key responsibilities of an LLM Performance Researcher at Endeavor?

As an LLM Performance Researcher at Endeavor, your primary responsibility will involve designing and conducting experiments to enhance the extraction, normalization, and matching of various real-world documents. You'll also critically evaluate the performance of LLMs on challenging inputs like scanned PDFs and OCR outputs. Another key duty is to improve model accuracy, specifically dealing with rare document formats and domain-specific vocabulary, ultimately contributing to the efficiency of our agentic ERP.

Join Rise to see the full answer
What qualifications are needed for the LLM Performance Researcher position at Endeavor?

To excel as an LLM Performance Researcher at Endeavor, candidates typically need a PhD or an equivalent research background in NLP, ML, or a related field. Nonetheless, hands-on experience is valued above formal qualifications, so if you have substantial experience in document understanding and information extraction using LLMs, you are encouraged to apply.

Join Rise to see the full answer
How does research translate into production code for an LLM Performance Researcher at Endeavor?

At Endeavor, we pride ourselves on a fast-paced environment where research is directly applied to production code. As an LLM Performance Researcher, you will collaborate with Applied AI Researchers and Backend Engineers to rapidly deploy improvements you've discovered through your experiments, which emphasizes our commitment to bridging the gap between theoretical studies and real-time applications.

Join Rise to see the full answer
What kind of work environment can an LLM Performance Researcher expect at Endeavor?

The work at Endeavor is dynamic and collaborative, embracing a culture where innovation thrives. As an LLM Performance Researcher, you will operate in a setting where your ideas are honored and valued, and your input can significantly impact project directions. Additionally, our focus on the practical implementation of research findings ensures that you'll see your contributions come to life.

Join Rise to see the full answer
What can an LLM Performance Researcher expect in terms of development and learning opportunities at Endeavor?

Endeavor is committed to continuous growth for our LLM Performance Researchers. You'll have numerous opportunities for professional development, whether it's through participating in cutting-edge projects, collaborating with experienced peers, or accessing ongoing training resources. We encourage innovation and exploration, allowing you to expand your skills and knowledge while tackling exciting challenges.

Join Rise to see the full answer
Common Interview Questions for LLM Performance Researcher
Can you explain your experience with document understanding and LLMs?

In answering this question, highlight specific projects where you successfully utilized LLMs for document understanding. Discuss how you approached challenges related to extraction, normalization, or entity linking. Demonstrating a clear understanding of both academic benchmarks and real-world applications will set you apart.

Join Rise to see the full answer
What strategies do you use to evaluate LLM performance on noisy inputs?

When responding, outline a strategy that encompasses systematic testing of various noisy inputs, including using benchmarks and allowing for real-world failure modes. Show your knowledge of specific tools and methodologies that have been effective in your past projects, reinforcing your experience.

Join Rise to see the full answer
Describe your approach to improving model accuracy in complex document formats.

Your answer should focus on techniques you employ to enhance model accuracy, such as fine-tuning LLMs to better handle domain-specific terminology or utilizing augmentations for noisy data. Mention any experience with relevant tools or frameworks that assist in refining accuracy.

Join Rise to see the full answer
How do you stay updated with the latest advancements in LLM research?

Share how you regularly engage with academic literature, attend conferences, or collaborate with peers in the field. Emphasize your enthusiasm for continuous learning and how you apply new insights to your projects at Endeavor.

Join Rise to see the full answer
Can you discuss a time when a research finding directly influenced production output?

Provide a specific example from your past experience, detailing the research you conducted, the findings obtained, and how these influenced your team’s production efforts. Highlight any measurable impacts this had on performance or efficiency.

Join Rise to see the full answer
What challenges have you faced when implementing LLM models and how did you overcome them?

Real-world challenges may include dealing with model failures or unexpected behaviors. Discuss specific instances, focusing on your problem-solving process and the resolutions you developed. This shows your capability in navigating typical hurdles in the field.

Join Rise to see the full answer
What methods do you employ for schema alignment in data processing?

Your response should outline the systematic approach you take to schema alignment, including the tools and frameworks used. Discuss how you ensure accuracy and efficiency in transferring data between aligned schemas.

Join Rise to see the full answer
How do you validate the quality and reproducibility of your evaluations?

Discuss the importance of evaluation quality and reproducibility in your work, and detail any protocols you follow to ensure consistency in your findings. Mention tools or practices employed to maintain high evaluation standards.

Join Rise to see the full answer
What role do communication and teamwork play in your research process?

Explain how you effectively communicate complex ideas with team members and stakeholders. Share examples of successful collaborations and how teamwork contributed to achieving research goals, emphasizing the importance of diverse perspectives.

Join Rise to see the full answer
What are your thoughts on the future of LLMs in enterprise systems?

This is an opportunity to express your vision and insights on potential developments in LLMs, particularly in enterprise settings. Show your awareness of current trends and how they could evolve, and convey how you’d like to contribute to that future at Endeavor.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 5 days ago

As the VP of Football for IMG in the AMERICAS, you will drive growth and strategy for our football-related business while partnering with top stakeholders in the industry.

Photo of the Rise User
Posted 5 days ago

We are looking for a Marketing Cloud Engineer to innovate our marketing strategies and enhance customer experiences for premium sports and entertainment events.

Photo of the Rise User
Arkema Hybrid King of Prussia, PA
Posted 2 days ago

Join Arkema as a Senior Staff Technician where you will leverage your expertise in chemical processes and safety initiatives in a collaborative R&D environment.

Sanofi EU Hybrid US, Morris County, NJ; New Jersey, Morristown, NJ
Posted 2 days ago

Join Sanofi as a Global Medical Director and lead innovative real-world evidence strategies to enhance patient care in Immunology.

Photo of the Rise User

Join Arizona Liver Health as a temporary Research Advanced Practice Provider focused on innovative patient care in liver disease.

Photo of the Rise User
Posted 4 days ago

Become a key player in Lonza's manufacturing team as a Process Expert specializing in endotoxin testing, driving process optimization and compliance.

Photo of the Rise User
Intel Hybrid US, Arizona, Phoenix
Posted yesterday
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Customer-Centric
Snacks
Onsite Gym
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Learning & Development
Paid Time-Off
401K Matching
Maternity Leave
Paternity Leave

Embark on an exciting internship at Intel Foundry, focusing on cutting-edge AI research to enhance semiconductor classification techniques.

Photo of the Rise User
Posted 5 days ago

Lead and innovate in purification processes at Eurofins, contributing to high-impact research in life sciences.

Join Iambic Therapeutics as a Fall Graduate Research Intern and work on cutting-edge machine learning for protein structure prediction.

PSU Hybrid Penn State University Park
Posted 14 days ago

Join Dr. Song Tan's lab at Penn State University as a Part-Time Lab Assistant, aiding in groundbreaking protein research.

Endeavor, formerly WME | IMG, is a global leader in sports, entertainment and fashion operating in more than 30 countries. Named one of Fortune’s 25 Most Important Private Companies, Endeavor is the parent of a number of subsidiaries with leadersh...

6 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
April 11, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Mount Orab just viewed Backend Developer at G2i Inc.
Photo of the Rise User
Someone from OH, Cincinnati just viewed Executive Assistant, Tax at Netflix
Photo of the Rise User
Someone from OH, Cincinnati just viewed Product Marketing Manager at Cast & Crew
Photo of the Rise User
Someone from OH, Cincinnati just viewed Marketing Manager at Cast & Crew
o
Someone from OH, Cincinnati just viewed Administrative Assistant at osu
A
Someone from OH, Cincinnati just viewed Data Entry Clerk at Alphabe Insight Inc
Photo of the Rise User
Someone from OH, Cincinnati just viewed Machine Learning Engineer at Allstate
Photo of the Rise User
Someone from OH, Twinsburg just viewed Data Analyst/Power BI Developer at Datadog
Photo of the Rise User
Someone from OH, Cuyahoga Falls just viewed Small Fleet Underwriter at HDVI
Photo of the Rise User
Someone from OH, Dublin just viewed Product Designer, Entry Level at Govini
Photo of the Rise User
Someone from OH, Columbus just viewed Support Associate-7 at Tory Burch
Photo of the Rise User
Someone from OH, Columbus just viewed Project Manager at Treering
Photo of the Rise User
Someone from OH, Columbus just viewed Product Manager, Assessment Student Experience at Ellevation