Braintrust is building the modern platform for evaluating and deploying AI systems. Our mission is to help enterprises build trust in their AI by making it easy to test, monitor, and improve models using real-world evaluation frameworks. We work with cutting-edge customers in finance, healthcare, and tech who are building production-grade AI systems.
As a Forward Deployed Engineer (FDE) on our Professional Services team, you'll work directly with customers to get them up and running with Braintrust’s platform. You’ll lead hands-on technical engagements, helping teams define, build, and operationalize AI evaluation frameworks that reflect their real-world goals.
This is a hybrid role that blends software engineering, customer collaboration, and delivery excellence. You’ll partner closely with customer ML, data, and engineering teams to implement evaluation pipelines, define metrics, and build custom logic that plugs into Braintrust’s platform.
You’ll also act as a feedback loop for our product and engineering teams, helping shape the future of the platform based on what you see in the field.
Deliver professional services engagements for new and existing customers—from kickoff to deployment.
Lead the implementation of evaluation frameworks for LLM and other model types, tailored to each customer’s use case.
Build integrations with customer infrastructure, data pipelines, and model endpoints using Python or TypeScript.
Write custom evaluators, metrics, and logic to test models in real-world conditions.
Guide customers on best practices for model evaluation, testing, and monitoring using Braintrust.
Collaborate with internal product and engineering teams to influence roadmap priorities and improve customer workflows.
Help document reusable patterns, templates, and tools for future engagements.
2–5 years of experience in software engineering, ML engineering, or technical consulting.
Experience working directly with external clients or stakeholders in a delivery or services capacity.
Proficiency in Python or TypeScript —you’re comfortable building scripts, tools, and integrations with modern software practices.
Familiarity with machine learning workflows, model evaluation, and LLM development (e.g., prompt engineering, RAG pipelines, quality metrics).
Strong communication and project management skills—you know how to translate between technical and business needs.
A self-starter mindset—you enjoy figuring things out in real time and aren’t afraid to roll up your sleeves.
Bonus: Background in professional services, solutions engineering, or customer delivery at an AI or developer tools company.
Be on the ground floor of a fast-growing startup defining the future of AI evaluation.
Work with top-tier enterprise customers solving real, high-impact problems with LLMs.
Collaborate with a tight-knit, mission-driven team of engineers, researchers, and operators.
Influence our services playbook and help scale how we deliver value to customers.
Medical, dental, and vision insurance
401k plan
Daily lunch, snacks, and beverages
Flexible time off
Competitive salary and equity
AI Stipend
Braintrust is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
At Braintrust, we're redefining how AI systems are evaluated and deployed, and we're looking for a Forward Deployed Engineer to join our Professional Services team! In this role, you'll work closely with our clients, guiding them in leveraging Braintrust’s innovative platform to achieve their AI goals. Whether you're leading technical engagements or collaborating with customer teams, your expertise will help them operationalize AI evaluation frameworks that really make a difference. Expect to be hands-on, crafting integrations with customer infrastructures and implementing evaluation frameworks that align with their specific use cases. You'll write custom logic using Python or TypeScript and provide best practice guidance for model evaluation and monitoring. Not only will you contribute to customers' success, but you'll also have a voice in influencing our product development by providing crucial feedback from your fieldwork. If you're a self-starter with a blend of technical prowess and exceptional communication skills, your impactful journey starts here at Braintrust, where we’re passionate about driving the future of AI!
Subscribe to Rise newsletter