Manager, AI System Infrastructure and MLOps Engineering
The Chan Zuckerberg Initiative aims to tackle society’s challenges through innovative technology. We seek a hands-on Engineering Manager for AI System Infrastructure and MLOps to lead a team focused on enabling groundbreaking research in biomedical sciences.
Skills
Hands-on AI/ML platform operations experience
MLOps experience with GPU clusters in Kubernetes
Strong coding skills in systems languages
Proficiency with cloud services (AWS, GCP, Azure)
Knowledge of Linux systems optimization
Responsibilities
Build and lead the MLOps and Systems Infrastructure Engineering team
Drive MLOps processes and ensure the stability of GPU cloud computing systems
Own on-call efforts and build alerting and monitoring for the AI platform
Manage a variety of AI/ML development infrastructure projects
Mentor and coach team members
Education
BS, MS, or PhD in Computer Science or a related field
Benefits
100% match on 401(k) contributions
Annual funding for personal use
Paid time off for volunteer activities
Funding for family-forming benefits
Relocation support for Bay Area moves