Platform Engineer - Reliability & Scale

About LangChain

At LangChain, our mission is to make intelligent agents ubiquitous. We help developers build mission-critical AI applications across the entire agent development lifecycle. Our open source frameworks, LangChain and LangGraph, see over 70 million downloads per month. Developers rely on LangChain for composable integrations and LangGraph for controllable agent orchestration. Our commercial agent platform, consisting of LangSmith and LangGraph Platform, enables teams to build, test, run, and manage agents at scale across their organization.

Founded in 2023, LangChain powers top engineering teams at companies like Replit, Lovable, Clay, Klarna, LinkedIn, and more.

About the role

In person 5 days/week in San Francisco, CA or New York, NY

Join our platform engineering team as we scale LangSmith and LangGraph Platform products. You'll architect and operate the critical systems that power our customers' AI observability and LangGraph app deployments, working directly with cutting-edge technologies at the intersection of AI and distributed systems.

  • Scale critical systems: Design and implement high-throughput, data-intensive systems supporting our flagship SaaS products (LangSmith and LangGraph Platform)

  • Drive reliability: Build monitoring, alerting, and automated recovery systems that maintain high uptime

  • Solve complex problems: Debug performance bottlenecks, optimize database queries, and architect solutions for distributed system challenges

  • Shape platform strategy: Influence technical decisions around infrastructure, tooling, and operational practices as we grow from startup to enterprise scale

  • Respond to incidents: Participate in the on-call rotation with a focus on post-incident learning, automation, and prevention

How to be successful in this role

  • Experience: 5+ years building and operating production systems at scale

  • Infrastructure expertise: Deep knowledge of Kubernetes, containerized infrastructure, and cloud platforms (e.g. GCP)

  • Database expertise: Production experience with OSS datastores (PostgreSQL, Redis, Kafka)

  • Observability mastery: Hands-on experience with observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry or similar)

  • Programming proficiency: Strong hands-on software engineering skills (Python, Go, Rust)

  • Operational mindset: "You build it, you run it, you own it" philosophy with a focus on sustainable practices

Nice to Have

  • Proficiency with analytical databases (e.g. ClickHouse)

  • Background in high-growth startups

  • Previous experience in AI/ML infrastructure

Compensation & Benefits

  • Competitive salary and equity stake for role and stage of company. Commensurate with experience.

  • Annual salary range: $145,000-$195,000 USD for Senior Engineers

LangChain’s flexible abstractions and extensive toolkit enable developers to build context-aware, reasoning LLM applications.

Employment type: Full-time, onsite
Date posted: July 8, 2025