Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Member of Technical Staff, Model Serving  image - Rise Careers
Job details

Member of Technical Staff, Model Serving

Who are we?

Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future!

Why this role?

Are you energized by building high-performance, scalable and reliable machine learning systems? Do you want to help define and build the next generation of AI platforms powering advanced NLP applications?  We are looking for Members of Technical Staff to join the Model Serving team at Cohere. The team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints. In this role, you will work closely with many teams to serve optimized LLM models to production in low latency, high throughput, and high availability environments. You will also get the opportunity to interface with customers and create customized deployments to meet their specific needs.

You may be a good fit if you have:

  • Experience with serving ML models in production

  • Experience designing, implementing, and maintaining a production service at scale

  • Strong intuition for system behavior and resource estimation under different workloads

  • Familiarity with inference characteristics of deep learning models, specifically, Transformer based architectures

  • Familiarity with computational characteristics of accelerators (GPUs, TPUs, and/or Inferentia), especially how they improve inference latency and throughput

  • Strong understanding or working experience with distributed systems

  • Experience in performance benchmarking, profiling, and optimization

  • Experience with cloud infrastructure (e.g. AWS, GCP)

  • Experience in Golang (or, other languages designed for high-performance scalable servers)

If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates that want the same thing, Cohere is the place for you.

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees at Cohere enjoy these Perks:

🤝 An open and inclusive culture and work environment 

🧑‍💻 Work closely with a team on the cutting edge of AI research 

🍽 Weekly lunch stipend, in-office lunches & snacks

🦷 Full health and dental benefits, including a separate budget to take care of your mental health 

🐣 100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK

🎨 Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement

🏙 Remote-flexible, offices in Toronto, New York, San Francisco and London and co-working stipend

✈️ 6 weeks of vacation

Note: This post is co-authored by both Cohere humans and Cohere technology.

Cohere Glassdoor Company Review
3.8 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
Cohere DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Cohere
Cohere CEO photo
Unknown name
Approve of CEO

Average salary estimate

$140000 / YEARLY (est.)
min
max
$120000K
$160000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Member of Technical Staff, Model Serving , Cohere

At Cohere, we’re on a mission to scale intelligence to serve humanity, and we’re seeking a passionate Member of Technical Staff for our Model Serving team based in San Francisco. If you love building high-performance, scalable machine learning systems and want to help shape the next generation of AI platforms, this role is for you! As a member of this dynamic team, you’ll develop, deploy, and operate our groundbreaking AI platform that delivers large language models through user-friendly API endpoints. You’ll collaborate closely with various teams to optimize LLM models for production environments that prioritize low latency, high throughput, and maximum availability. Moreover, your insights will directly impact customer experiences, as you’ll create customized deployments tailored to their unique needs. We believe in the power of diverse perspectives, and we want to hear from you if you’re excited about crafting solutions that enhance the capabilities of our models. Ideal candidates will have hands-on experience serving ML models in production settings, designing scalable services, and possess a solid understanding of distributed systems. Familiarity with deep learning model inference, particularly Transformer architectures, is beneficial. If you’re someone who thrives in a fast-paced environment and relishes the challenge of building impactful AI solutions, consider joining us at Cohere and helping shape the future of technology!

Frequently Asked Questions (FAQs) for Member of Technical Staff, Model Serving Role at Cohere
What qualifications do I need to become a Member of Technical Staff at Cohere?

To become a Member of Technical Staff at Cohere, you should have experience serving machine learning models in production, designing scalable services, and ideally, familiarity with deep learning model inference, particularly Transformer architectures. Strong knowledge of distributed systems and cloud infrastructure, such as AWS or GCP, is also crucial. If you are proficient in Golang or similar high-performance languages and are passionate about AI, we encourage you to apply even if your experience doesn't perfectly match our criteria.

Join Rise to see the full answer
What will my daily responsibilities be in the Member of Technical Staff role at Cohere?

As a Member of Technical Staff at Cohere, your daily responsibilities will involve developing, deploying, and operating our AI platform. You’ll work on delivering optimized large language models through API endpoints while ensuring low latency and high availability. Additionally, you will collaborate with other teams, benchmark system performance, and engage with customers to provide tailored deployments that meet their needs.

Join Rise to see the full answer
What type of work environment can I expect at Cohere as a Member of Technical Staff?

At Cohere, you can expect an open and inclusive culture that fosters collaboration and innovation. We prioritize a flexible work environment and support our employees with weekly lunch stipends, comprehensive health benefits, and generous vacation policies. You'll be part of a diverse team of top professionals passionate about AI, working closely together to achieve our common mission.

Join Rise to see the full answer
How does Cohere support employee growth and well-being for Members of Technical Staff?

Cohere is committed to the growth and well-being of its employees. As a Member of Technical Staff, you will benefit from personal enrichment programs that support your acquisition of skills, arts and culture pursuits, fitness, and workspace improvement. We also provide a supportive environment for mental health, alongside generous parental leave policies.

Join Rise to see the full answer
How does Cohere ensure diversity and inclusivity in the workplace?

Cohere deeply values diversity and inclusivity and strives to create an environment where all voices are heard. We actively welcome applicants from a range of backgrounds and are committed to equal opportunity. We encourage everyone to apply and seek accommodations whenever needed during the recruitment process, ensuring a supportive experience for all potential candidates.

Join Rise to see the full answer
Common Interview Questions for Member of Technical Staff, Model Serving
Can you describe your experience with serving machine learning models in production?

In answering this question, provide specific projects where you’ve deployed ML models, detailing the types of models you worked with, the challenges you faced, and how you overcame them. Highlight what you learned during the process, such as scaling challenges or optimizing latency.

Join Rise to see the full answer
What strategies do you use for performance benchmarking and profiling?

Discuss the tools and methods you typically use for performance benchmarking, such as profiling tools and metrics you track. Share a specific example of how you’ve applied these strategies to improve model performance and the impact it had on the system.

Join Rise to see the full answer
How do you handle low latency requirements in model serving?

Explain your approach to maintaining low latency in model serving. This might include talking about architectural decisions, optimization techniques you've implemented, and balancing the trade-offs between performance and resource management.

Join Rise to see the full answer
What role do distributed systems play in your model-serving experience?

Your response should outline your understanding of distributed systems, including how they contribute to scalability and reliability in model serving. Share experiences where you've implemented distributed solutions and lessons learned from those implementations.

Join Rise to see the full answer
How familiar are you with cloud infrastructure for deploying models?

Discuss your experiences with cloud platforms such as AWS or GCP, including specific services you've utilized for deploying machine learning models. Share insights regarding cost management, scalability, and ease of deployment.

Join Rise to see the full answer
What challenges have you faced when integrating models with existing systems?

In your answer, provide examples of integration challenges, such as compatibility issues or data format discrepancies. Discuss how you collaborated with other teams to solve these challenges and how you ensured successful integration.

Join Rise to see the full answer
Can you explain your experience with high-performance programming languages?

This is the chance to highlight your proficiency in languages like Golang and discuss any relevant projects where such languages were key. Explain the improvements you noticed in performance and reliability of systems as a result of using these languages.

Join Rise to see the full answer
Describe a time when you had to customize a deployment for a specific customer.

Share a specific instance where you worked closely with a customer to understand their needs and tailor a model deployment. Highlight the steps you took, challenges faced, and the ultimate outcome of this customization.

Join Rise to see the full answer
How do you stay current with advancements in AI and model serving technologies?

Discuss the resources you use, such as blogs, forums, or online courses, and any conferences you attend. Share how your continuous learning has positively impacted your projects and enhanced your skills.

Join Rise to see the full answer
Why do you want to work as a Member of Technical Staff at Cohere?

Provide a personal touch in your response by discussing what excites you about Cohere's mission and culture. Talk about your passion for AI and how this role aligns with your career goals and interests in building scalable AI solutions.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Cohere Remote San Francisco
Posted 2 days ago
Startup Mindset
Collaboration over Competition
Growth & Learning
Inclusive & Diverse

Join Cohere as a Revenue Accountant and help us shape the future of AI with your expertise in financial accuracy.

Photo of the Rise User
Posted 5 days ago
Startup Mindset
Collaboration over Competition
Growth & Learning
Inclusive & Diverse

Join Cohere as a Solutions Architect and play a key role in shaping AI solutions for the Public Sector while working in a dynamic and collaborative environment.

Photo of the Rise User
Posted 4 days ago
Photo of the Rise User
Posted 2 days ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Join NVIDIA's team in advancing large language model systems with your expertise as a Senior Deep Learning Algorithm Engineer.

Photo of the Rise User
Ace Games Remote No location specified
Posted 2 days ago

Join Ace Games as a Game Developer and help create amazing gaming experiences for players around the world.

Photo of the Rise User
Adree Remote No location specified
Posted 12 days ago

Seeking a talented Flutter Developer to create high-performance mobile applications in a remote setting.

Photo of the Rise User
Posted 14 days ago

DMI is looking for a Mid-Level Power Platform Developer to enhance their digital services as part of a dynamic remote team.

Photo of the Rise User
Target Hybrid Tower 02, Manyata Embassy Business Park, Racenahali & Nagawara Villages. Outer Ring Rd, Bangalore 540065
Posted 12 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony

As a Sr Engineer at Target, you'll architect and develop cutting-edge generative AI applications to enhance our global retail capabilities.

Photo of the Rise User
Posted 6 days ago

Join Redhorse Corporation as a Software Developer, where you'll impact national security by developing cutting-edge data processing applications.

Photo of the Rise User

Join Bobtail as a Sr. Software Engineer to develop efficient backend solutions for our innovative supply chain platform.

Cohere, founded by AI pioneers, offers a leading enterprise AI platform that combines ease-of-use, data privacy, and unparalleled flexibility with its cloud-agnostic and API-accessible services,

162 jobs
MATCH
Calculating your matching score...
BADGES
Badge ChangemakerBadge Future MakerBadge Innovator
CULTURE VALUES
Startup Mindset
Collaboration over Competition
Growth & Learning
Inclusive & Diverse
FUNDING
SENIORITY LEVEL REQUIREMENT
INDUSTRY
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 9, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Columbus just viewed UX Researcher, Amazon Autos at Amazon
Photo of the Rise User
24 people applied to Front-end Developer at Venturenox
Photo of the Rise User
Someone from OH, Cincinnati just viewed AI training and enablement at Writer
Photo of the Rise User
Someone from OH, Cincinnati just viewed Data Analyst (Contact Center-Hybrid) at Dow Jones
Photo of the Rise User
7 people applied to SDE Intern (Summer) at Amazon
Photo of the Rise User
Someone from OH, Delaware just viewed Practice Group Manager at LifeStance Health
Photo of the Rise User
Someone from OH, Youngstown just viewed Event Services Human Resources Coordinator at Allied Universal
Photo of the Rise User
Someone from OH, Columbus just viewed IP Network Engineering Intern - Summer 2025 at Bandwidth
Photo of the Rise User
Someone from OH, Cleveland just viewed Director, Education Programs & Partnerships at Encoura
Photo of the Rise User
Someone from OH, Cleveland just viewed Operations Associate (Part-Time) - Pinecrest at Alo Yoga
Photo of the Rise User
Someone from OH, Dayton just viewed Medical Receptionist at LifeStance Health
Photo of the Rise User
Someone from OH, Coldwater just viewed Engineering Design Checker Jobs at Lockheed Martin
Photo of the Rise User
Someone from OH, Loveland just viewed SEO Admin & Business Support at Outliant
Photo of the Rise User
Someone from OH, Columbus just viewed Casting: Cedar Lake - Pilot Episode at Backstage
Photo of the Rise User
Someone from OH, Mount Orab just viewed Software Development Manager at Assured Guaranty
H
Someone from OH, Mansfield just viewed Medical Appointment Setter (Remote LatAm) at HireHawk
Photo of the Rise User
Someone from OH, Lewis Center just viewed Third Party Risk Analyst at Experian
Photo of the Rise User
Someone from OH, Columbus just viewed Lead Preschool Teacher at Guidepost Montessori