Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI platform, including models for speech-to-text, text-to-speech, and spoken language understanding, with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.
The Opportunity
At Deepgram, we believe that data is the key to unlocking the future of voice-enabled experiences. But building with audio data is hard -- audio poses incredibly rich scientific, engineering, and infrastructure challenges that are orders of magnitude harder than working with text. As a Data Scientist at Deepgram, you will tackle conversational audio at scale, establishing automated data streams that will power the next generation of Voice AI foundation models. The models we build will go beyond basic transcription and comprehension to capture nuanced meanings in complex conversations, adapt robustly to diverse speech patterns, and generate empathic responses with human-like, contextualized speech. Domain-specific expertise in speech or language AI is not required. Rather, we’re looking for seasoned scientists who have a track record of solving hard data problems while exploring research frontiers. Our start-up environment offers a stunning growth trajectory for adventure-seeking individuals, providing a level of project ownership and on-the-ground connection with end customers that larger research labs simply cannot provide.
What You’ll Do
Build high-performance data acquisition, preparation and synthesis pipelines and drive them to generate data for training foundational voice models across modalities and tasks
Develop advanced characterizations of complex conversational audio utilizing a diverse toolkit of signal processing techniques and deep learning methods
Collaborate with DataOps and Engineering to create automated systems that scale the ability of human annotators to label high-value data and provide feedback on model outputs
Build advanced benchmarking methodologies for evaluating interactive, conversational agent systems
It’s Important To Us That You Have
Experience building data processing pipelines from a blank page and owning the entire data stack including acquisition, characterization, cleaning, serving and transformation
Experience applying statistical methods and deep learning models to understand complex data
Ability to design and carry out research programs independently and with minimal oversight
Strong software engineering skills with particular emphasis on developing clean, modular code in Python and working with PyTorch
Strong communication skills and the ability to translate complex concepts into simple terms, depending on the target audience
Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!
Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.
We are happy to provide accommodations for applicants who need them.
Our mission is to unlock the power of voice data to fuel the world’s big ideas and we need people who aren’t afraid to challenge how it’s always been done. Are you in?