Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Head of ML Infrastructure  image - Rise Careers
Job details

Head of ML Infrastructure

About Us:

Hippocratic AI is developing the first safety-focused Large Language Model (LLM) for healthcare. Our mission is to dramatically improve healthcare accessibility and outcomes by bringing deep healthcare expertise to every person. No other technology has the potential for this level of global impact on health.

Why Join Our Team:

  • Innovative mission: We are creating a safe, healthcare-focused LLM that can transform health outcomes on a global scale.

  • Visionary leadership: Hippocratic AI was co-founded by CEO Munjal Shah alongside physicians, hospital administrators, healthcare professionals, and AI researchers from top institutions including Johns Hopkins, Stanford, Google, Meta, Microsoft and NVIDIA.

  • Strategic investors: Raised $137 million from top investors including General Catalyst, Andreessen Horowitz, Premji Invest, SV Angel, NVentures (Nvidia Venture Capital), and Greycroft.

  • Team and expertise: We are working with top experts in healthcare and artificial intelligence to ensure the safety and efficacy of our technology.

For more information, visit www.HippocraticAI.com.

We value in-person teamwork and believe the best ideas happen together. Our team is expected to be in the office five days a week in Palo Alto, CA unless explicitly noted otherwise in the job description

Position Overview:


We are seeking a highly skilled and innovative Head of ML Infrastructure to lead the design, development, and operation of our orchestration platform for a heterogeneous constellation of Large Language Models (LLMs). The ideal candidate will have deep expertise in infrastructure orchestration, multi-cloud environments, and tools such as Kubernetes and Terraform. This role is critical to ensuring that our AI systems are scalable, reliable, and seamlessly integrated into our broader technology ecosystem.

Key Responsibilities:

Orchestration Platform Development:


• Architect and implement an advanced orchestration platform to manage a diverse set of LLMs efficiently.
• Design solutions to optimize performance, scalability, and availability across various deployment environments.


Infrastructure Management:


• Utilize Kubernetes, Terraform, and other Infrastructure as Code (IAC) tools to automate and manage ML infrastructure.
• Collaborate with DevOps and cloud engineering teams to ensure seamless integration with CI/CD pipelines.
• Establish robust monitoring, logging, and alerting systems for ML infrastructure.


Multi-Cloud Strategy:

• Design and execute strategies to leverage multiple cloud providers for cost optimization, redundancy, and compliance.
• Manage cloud-native services to support model deployment and orchestration at scale.

Performance Optimization:


• Work closely with ML engineers to fine-tune model deployment strategies, focusing on latency, throughput, and fault tolerance.
• Conduct capacity planning and develop tools for model lifecycle management.

Leadership & Collaboration:

• Lead a team of infrastructure engineers, fostering a culture of innovation, collaboration, and excellence.
• Act as a bridge between ML research, engineering, and operations teams to align infrastructure capabilities with business needs.
• Stay abreast of emerging technologies and methodologies in ML infrastructure and orchestration.

Qualifications:

Technical Skills:

• Proven experience in building and managing ML infrastructure platforms, particularly for LLMs or other advanced AI systems.
• Expertise in Kubernetes, Terraform, and other IAC tools.
• Deep understanding of multi-cloud architectures (e.g., AWS, Azure, Google Cloud) and hybrid cloud solutions.
• Strong programming skills in Python, Go, or a similar language, with experience in building automation and orchestration tools.
• Familiarity with modern ML frameworks and tools (e.g., TensorFlow, PyTorch, Hugging Face).

Leadership & Communication:

  • Demonstrated success in leading infrastructure teams and managing large-scale projects

  • Excellent problem-solving and decision-making skills.

Strong communication skills, with the ability to convey complex technical ideas to non-technical stakeholders.

Education & Experience:

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field (or equivalent work experience).

  • 8+ years of experience in infrastructure engineering, with at least 3 years in a leadership

Hippocratic AI Glassdoor Company Review
4.8 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
Hippocratic AI DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Hippocratic AI
Hippocratic AI CEO photo
Munjal Shah
Approve of CEO

Average salary estimate

$175000 / YEARLY (est.)
min
max
$150000K
$200000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Head of ML Infrastructure , Hippocratic AI

At Hippocratic AI, we're on a transformative journey as we develop the first safety-focused Large Language Model (LLM) tailored for healthcare. This is where you come in! We’re seeking a Head of ML Infrastructure to spearhead the design and operation of our advanced orchestration platform in Palo Alto. As part of a visionary team co-founded by industry leaders from prestigious institutions like Johns Hopkins and Stanford, you'll be at the forefront of innovation. Your role will involve managing and optimizing our infrastructure to ensure our LLMs are scalable, reliable, and seamlessly integrated. You'll utilize tools like Kubernetes and Terraform to automate our ML infrastructure, collaborating with DevOps to maintain seamless CI/CD pipelines. By leveraging multi-cloud strategies, you'll not only enhance performance but also ensure compliance and optimization. Leadership is key in this position; you’ll guide a talented team of engineers while bridging the gap between ML research and operational needs. This is an opportunity to leave a mark in the healthcare sector by ensuring our technologies are safe and effective. If you are passionate about AI and healthcare and have a track record in building robust ML infrastructure, we’d love to hear from you!

Frequently Asked Questions (FAQs) for Head of ML Infrastructure Role at Hippocratic AI
What are the responsibilities of the Head of ML Infrastructure at Hippocratic AI?

The Head of ML Infrastructure at Hippocratic AI is responsible for architecting and implementing an orchestration platform for diverse Large Language Models (LLMs). This includes optimizing performance, managing ML infrastructure with tools like Kubernetes and Terraform, and leading a team of engineers while ensuring integration with CI/CD pipelines.

Join Rise to see the full answer
What qualifications do I need to apply for the Head of ML Infrastructure position at Hippocratic AI?

To apply for the Head of ML Infrastructure at Hippocratic AI, candidates should possess a Bachelor's or Master's degree in Computer Science or a related field, along with 8+ years of infrastructure engineering experience. A strong background in Kubernetes, Terraform, and multi-cloud architectures is essential.

Join Rise to see the full answer
How does the Head of ML Infrastructure ensure scalability at Hippocratic AI?

At Hippocratic AI, the Head of ML Infrastructure ensures scalability by designing solutions that optimize performance and availability across cloud environments. They work closely with ML engineers to develop deployment strategies that enhance latency and throughput.

Join Rise to see the full answer
What kind of team will I be leading as Head of ML Infrastructure at Hippocratic AI?

As Head of ML Infrastructure at Hippocratic AI, you will lead a team of skilled infrastructure engineers, fostering a culture of collaboration and innovation. You’ll be responsible for aligning the team's capabilities with business needs while mentoring and supporting their professional growth.

Join Rise to see the full answer
What tools and technologies are emphasized for the Head of ML Infrastructure role at Hippocratic AI?

The Head of ML Infrastructure role at Hippocratic AI emphasizes expertise in Kubernetes, Terraform, and various Infrastructure as Code (IAC) tools. Additionally, knowledge of cloud environments such as AWS, Azure, and Google Cloud, as well as familiarity with ML frameworks, are critical for success in this position.

Join Rise to see the full answer
Common Interview Questions for Head of ML Infrastructure
Can you describe your experience with Kubernetes as it relates to ML infrastructure?

When answering this question, focus on specific projects where you implemented Kubernetes for ML infrastructure. Highlight how it enabled automation and scaling of machine learning workloads effectively.

Join Rise to see the full answer
How would you design an orchestration platform for handling multiple LLMs?

In your response, emphasize scalability and reliability. Discuss architectural considerations, performance optimizations, and the integration of monitoring systems to ensure smooth operations.

Join Rise to see the full answer
What are the key factors to consider when implementing a multi-cloud strategy?

Key factors include cost optimization, compliance, service redundancy, and ease of integration. Detail your experience with managing these aspects within your previous roles and how you use various cloud services effectively.

Join Rise to see the full answer
How do you approach team leadership in an engineering environment?

Discuss your leadership style and the importance of fostering innovation and collaboration. Provide examples of how you have mentored team members and connected engineering efforts to business goals.

Join Rise to see the full answer
What programming languages are you proficient in for building automation tools?

Mention your proficiency in languages such as Python or Go, and provide examples of specific automation tools or scripts you’ve developed to enhance ML infrastructure.

Join Rise to see the full answer
Can you explain how you monitor and optimize ML infrastructure?

Explain the metrics you track, tools you utilize for monitoring, and your strategies for identifying bottlenecks and optimizing solution performance. Include your experience with setting up health checks and alerting systems.

Join Rise to see the full answer
What experience do you have with CI/CD pipelines in relation to ML projects?

Highlight your experience in integrating CI/CD practices into machine learning projects, focusing on how you streamlined processes to enhance deployment frequency and reliability.

Join Rise to see the full answer
Can you discuss a challenging issue you’ve encountered in ML infrastructure and how you resolved it?

Identify a specific challenge, explain the impact it had, and describe the steps you took to resolve it, as well as any lessons learned that could benefit the team moving forward.

Join Rise to see the full answer
How do you keep up with emerging technologies in ML infrastructure?

Share your methods for staying current, such as attending conferences, participating in online communities, or reading industry publications, and explain how you incorporate new trends into your team's practices.

Join Rise to see the full answer
Why are you interested in the Head of ML Infrastructure position at Hippocratic AI?

This is your opportunity to express your passion for healthcare and AI's potential impact. Reflect on Hippocratic AI’s mission and how your skills align with their goals, emphasizing your desire to contribute to their innovative projects.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Solvd Remote No location specified
Posted 7 days ago
Photo of the Rise User
Posted 10 hours ago
Photo of the Rise User
Foth Remote No location specified
Posted 13 days ago
Photo of the Rise User
Posted 3 days ago
Photo of the Rise User
E.L.F. BEAUTY Remote Ahmedabad, Gujarat
Posted 12 days ago
Photo of the Rise User
Posted 13 days ago

Hippocratic AI is building a safety-focused large language model (LLM) for the healthcare industry. We believe that generative AI has the potential to massively increase healthcare access the world over but has to be built and tested responsibly. ...

45 jobs
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
January 6, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!