Full-time • San Francisco or NYC
At Endeavor, we’re rebuilding ERP from first principles for $1B+ manufacturing and distribution companies. These companies run on PDFs, spreadsheets, and semi-structured chaos — and we’re building LLM-powered systems to parse, match, and reason through all of it with human-level reliability.
We’re looking for a researcher with deep experience in LLM performance on document tasks — especially extraction, entity linking, and record matching. You’ve likely published papers on it. You’ve probably run head-to-head evals on OpenAI, Claude, and open-source models. You’re fluent in both academic benchmarks and in the weird, grimy failure modes that only show up in production.
Your work will directly improve the core performance of our agentic ERP. You’ll prototype new techniques, run structured evals, improve few-shot + tool-augmented performance, and help shape how LLMs interface with structured business systems.
Design and run experiments to improve extraction, normalization, and matching across real-world documents
Evaluate LLM performance on noisy, multi-format inputs like scanned PDFs, OCR output, and Excel sheets
Improve model accuracy and reliability in the face of rare formats, abbreviations, bad formatting, and domain-specific vocab
Build and own our eval infrastructure for matching, linking, extraction, and schema alignment tasks
Work with the Applied AI Researcher and Backend Engineers to deploy improvements into production
Contribute to long-term strategy around fine-tuning, retrieval augmentation, tool use, or structured memory (if and when needed)
Have deep experience with document understanding and information extraction using LLMs
Have worked on schema alignment, record linking, or entity resolution at scale
Have published papers on LLM performance (e.g. extraction, evals, few-shot prompting, matching)
Understand both academic benchmarks and real-world weirdness
Know how to make evals meaningful, tight, and fast to iterate on
Want to work in a setting where research turns into production code fast
Have a PhD or equivalent research background in NLP, ML, or similar (but we care more about what you’ve done than what your title says)
Experience with post-OCR workflows or noisy doc normalization
Deep intuition for failure modes in enterprise-scale matching/linking systems
Obsession with eval quality and reproducibility
Comfort implementing papers and benchmarking models at scale
Past work in procurement, invoicing, logistics, or any doc-heavy vertical
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
At Endeavor, we're on a mission to overhaul ERP systems from the ground up for billion-dollar manufacturing and distribution companies. As an LLM Performance Researcher based in San Francisco, you'll be at the forefront of this innovative transformation. Imagine working with languages and systems that typically run on PDFs and spreadsheets, and using your expertise in large language models to revolutionize how businesses interact with their data. If you have deep experience in LLM performance on document tasks like extraction and entity linking, this is the perfect fit for you! We're looking for someone who's not only published research on LLM performance but also understands the gritty details of what can go wrong in real-world applications. You will design experiments that enhance data normalization and matching across various document formats, tackling challenges from noisy multi-format inputs to unusual abbreviation issues. Collaborating closely with our Applied AI Researchers and Backend Engineers, your efforts will directly contribute to our core agentic ERP's performance. We value practical experience—if you've worked on schema alignment or record linking at scale and have the passion to see your research impact production quickly, we want to hear from you. Join us at Endeavor and help shape the future of ERP systems with your cutting-edge research and enthusiasm for document understanding!
As the VP of Football for IMG in the AMERICAS, you will drive growth and strategy for our football-related business while partnering with top stakeholders in the industry.
We are looking for a Marketing Cloud Engineer to innovate our marketing strategies and enhance customer experiences for premium sports and entertainment events.
Join Arkema as a Senior Staff Technician where you will leverage your expertise in chemical processes and safety initiatives in a collaborative R&D environment.
Join Sanofi as a Global Medical Director and lead innovative real-world evidence strategies to enhance patient care in Immunology.
Join Arizona Liver Health as a temporary Research Advanced Practice Provider focused on innovative patient care in liver disease.
Become a key player in Lonza's manufacturing team as a Process Expert specializing in endotoxin testing, driving process optimization and compliance.
Embark on an exciting internship at Intel Foundry, focusing on cutting-edge AI research to enhance semiconductor classification techniques.
Lead and innovate in purification processes at Eurofins, contributing to high-impact research in life sciences.
Join Iambic Therapeutics as a Fall Graduate Research Intern and work on cutting-edge machine learning for protein structure prediction.
Join Dr. Song Tan's lab at Penn State University as a Part-Time Lab Assistant, aiding in groundbreaking protein research.
Endeavor, formerly WME | IMG, is a global leader in sports, entertainment and fashion operating in more than 30 countries. Named one of Fortune’s 25 Most Important Private Companies, Endeavor is the parent of a number of subsidiaries with leadersh...
6 jobsSubscribe to Rise newsletter