At Coco, our mission is to revolutionize urban logistics by empowering cities, boosting local economies, and delivering delightful customer experiences. We connect people with local restaurants through our fleet of on-demand delivery robots, helping merchants reach their customers faster and more efficiently. By building innovative robotic systems that seamlessly navigate city sidewalks, Coco plays a key role in reshaping the future of last-mile delivery and enhancing local businesses.
To deliver on our mission, we are building an autonomy team to develop the AI technology that will enable our robot pilots to scale efficiently, sustainably, and safely. The involves building an autonomy stack ground-up based on our millions of miles of last-mile delivery routes, proprietary video streams, and LiDAR data.
What is the scope of this role?
As a Founding Data & ML Infrastructure Engineer, you will be responsible to stand up Coco’s autonomy stack alongside the CTO and fellow team members in the autonomy team. You will be responsible for developing and maintaining the infrastructure that supports the collection, processing, management, and training of large-scale datasets for our autonomous robots. The impact of this will be massive improvements to our robot-to-pilot ratio thereby allowing every person living in an urban area to benefit from last-mile delivery. In this role, you must accomplish the following:
Design and implement a high-performance data engine to mine and identify valuable data samples that enhance model training.
Build tools and pipelines for automatically extracting, cleaning, and curating data from various sources (sensors, logs, real-world interactions).
Enable seamless interaction with large-scale datasets, ensuring that the team can quickly retrieve and analyze data to drive insights.
Collaborate with the autonomy and AI engineers to develop the query layer and workflows for training and testing models
Build and maintain tools for dataset management, including data exploration, versioning, and interaction tools.
Architect and manage the infrastructure for model training and experimentation. This includes continuously optimizing data pipelines and infra for cost, scalability, and speed.
Create and maintain systems for dataset tracking and governance to ensure consistent and reproducible experiments.
Must have competencies:
3+ years of experience in software engineering, data engineering, or infrastructure engineering, with a focus on machine learning or AI systems.
Extremely well versed in building and managing cloud infrastructure for large-scale data processing and model training (AWS, GCP, Azure).
Excellent programming skills. Familiarity with ML frameworks i.e. TensorFlow, PyTorch.
Strong understanding of data pipelines, versioning, and data management best practices.
Experience working with containerization and orchestration tools (Docker, Kubernetes).
Strong experience with cloud platforms and infrastructure as code (Terraform, CloudFormation).
Familiarity with distributed systems, high-performance computing, and optimization for training large models.
Hands-on experience with tools for data management and interaction (e.g., DVC, Delta Lake, or similar tools).
Strong leadership and communication skills.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Join Coco as a Data/ML Infrastructure Engineer in the vibrant city of San Francisco and join us in our mission to revolutionize urban logistics! At Coco, we empower cities by connecting locals with their favorite restaurants through our innovative fleet of on-demand delivery robots. Your role will be pivotal; as a founding member of our autonomy team, you will help build the AI technology that drives our robot pilots. Your expertise will shine through designing and implementing a high-performance data engine crucial to enhancing our models' efficiency. You'll collaborate closely with our CTO and other talented engineers to develop robust tools and pipelines for data extraction, cleaning, and curation, making it super easy for our team to access actionable insights from large-scale datasets. Your knack for building and optimizing cloud infrastructure will play a key role in ensuring smooth operations of all data processes—whether you're diving into ML frameworks like TensorFlow or PyTorch, or championing best practices in data management. With your extensive experience in software and infrastructure engineering, you will help create systems that enable reproducible experiments, enhance model training, and track datasets effectively. Together, we will transform urban delivery, allowing every city dweller to experience the convenience of seamless last-mile logistics, making their lives easier and delighting them along the way. Come be a part of Coco's journey towards a smarter, more efficient future!
Subscribe to Rise newsletter