Deepgram is the leading voice AI platform for developers building speech-to-text (STT), text-to-speech (TTS) and full speech-to-speech (STS) offerings. 200,000+ developers build with Deepgram’s voice-native foundational models – accessed through APIs or as self-managed software – due to our unmatched accuracy, latency and pricing. Customers include software companies building voice products, co-sell partners working with large enterprises, and enterprises solving internal voice AI use cases. The company ended 2024 cash-flow positive with 400+ enterprise customers, 3.3x annual usage growth across the past 4 years, over 50,000 years of audio processed and over 1 trillion words transcribed. There is no organization in the world that understands voice better than Deepgram
Opportunity:
We are seeking an experienced Senior Systems Engineer – AI/ML Infrastructure to design, implement, and maintain our large-scale distributed systems infrastructure. You'll be responsible for building and optimizing our network architecture, storage solutions, and compute platforms that power our AI/ML workloads. This role combines expertise in network engineering, storage systems, and modern container orchestration platforms, with a focus on reliability, scalability, and cost-effectiveness.
What You’ll Do
Build and maintain bare-metal GPU compute clusters for AI training and inference workloads
Implement monitoring, alerting, and automation solutions for infrastructure management
Manage large-scale deployments using modern orchestration platforms like Kubernetes and Slurm
Design and implement reliable, high-performance network architectures for distributed systems
Architect and maintain large-scale storage solutions, including backup systems, distributed caching, and object storage
You’ll Love This Role If You
Are passionate about building reliable, scalable infrastructure systems
Enjoy optimizing complex distributed systems for performance and cost
Love solving challenging problems in networking and storage at scale
Are excited about working with cutting-edge GPU infrastructure
Want to work at the intersection of infrastructure and AI/ML systems
It’s Important To Us That You Have
5+ years of experience in infrastructure engineering or similar roles
Strong background in network engineering and design for reliability
Experience with large-scale storage systems (distributed file systems, caching solutions)
Proven track record of managing bare-metal infrastructure
Expertise in container orchestration platforms (Kubernetes, Slurm)
Experience with GPU infrastructure management and optimization
Strong automation and scripting skills
It Would Be Great if You Had
Experience with software-defined networking
Experience with infrastructure cost management and capacity planning
Familiarity with AI/ML workloads and their infrastructure requirements
Experience with multi-region infrastructure deployment
Background in performance optimization for distributed systems
Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!
Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.
We are happy to provide accommodations for applicants who need them.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
If you're looking for an exciting opportunity with Deepgram as a Senior Systems Engineer – AI/ML Infrastructure, then you may have just found your next big adventure! In this fully remote role, you'll dive into the heart of our voice AI platform, working with a vibrant team that supports over 200,000 developers harnessing the power of speech technology. Your key responsibilities will involve designing, implementing, and enhancing our large-scale distributed systems infrastructure, which is the backbone of our innovative AI/ML workloads. You'll get to build and maintain bare-metal GPU compute clusters, ensuring they are optimized for both reliability and performance. But wait, there's more! You’ll also have the chance to shape our network architecture, paving the way for scalable and efficient storage solutions that are vital for our success. If you're passionate about solving complex challenges in network engineering and storage, and if managing modern orchestration platforms like Kubernetes excites you, this may just be the perfect fit! Deepgram thrives on collaboration and creativity, and we are committed to helping you grow both personally and professionally within the ever-evolving AI landscape. Come join us and make an impact in the world of voice technology!
Our mission is to unlock the power of voice data to fuel the world’s big ideas and we need people who aren’t afraid to challenge how it’s always been done. Are you in?
18 jobsSubscribe to Rise newsletter