Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Data/ML Infrastructure Engineer image - Rise Careers
Job details

Data/ML Infrastructure Engineer

At Coco, our mission is to revolutionize urban logistics by empowering cities, boosting local economies, and delivering delightful customer experiences. We connect people with local restaurants through our fleet of on-demand delivery robots, helping merchants reach their customers faster and more efficiently. By building innovative robotic systems that seamlessly navigate city sidewalks, Coco plays a key role in reshaping the future of last-mile delivery and enhancing local businesses.

To deliver on our mission, we are building an autonomy team to develop the AI technology that will enable our robot pilots to scale efficiently, sustainably, and safely. The involves building an autonomy stack ground-up based on our millions of miles of last-mile delivery routes, proprietary video streams, and LiDAR data.

What is the scope of this role?

As a Founding Data & ML Infrastructure Engineer, you will be responsible to stand up Coco’s autonomy stack alongside the CTO and fellow team members in the autonomy team. You will be responsible for developing and maintaining the infrastructure that supports the collection, processing, management, and training of large-scale datasets for our autonomous robots. The impact of this will be massive improvements to our robot-to-pilot ratio thereby allowing every person living in an urban area to benefit from last-mile delivery. In this role, you must accomplish the following:

  • Design and implement a high-performance data engine to mine and identify valuable data samples that enhance model training.

  • Build tools and pipelines for automatically extracting, cleaning, and curating data from various sources (sensors, logs, real-world interactions).

  • Enable seamless interaction with large-scale datasets, ensuring that the team can quickly retrieve and analyze data to drive insights.

  • Collaborate with the autonomy and AI engineers to develop the query layer and workflows for training and testing models

  • Build and maintain tools for dataset management, including data exploration, versioning, and interaction tools.

  • Architect and manage the infrastructure for model training and experimentation. This includes continuously optimizing data pipelines and infra for cost, scalability, and speed.

  • Create and maintain systems for dataset tracking and governance to ensure consistent and reproducible experiments.

Must have competencies:

  • 3+ years of experience in software engineering, data engineering, or infrastructure engineering, with a focus on machine learning or AI systems.

  • Extremely well versed in building and managing cloud infrastructure for large-scale data processing and model training (AWS, GCP, Azure).

  • Excellent programming skills. Familiarity with ML frameworks i.e. TensorFlow, PyTorch.

  • Strong understanding of data pipelines, versioning, and data management best practices.

  • Experience working with containerization and orchestration tools (Docker, Kubernetes).

  • Strong experience with cloud platforms and infrastructure as code (Terraform, CloudFormation).

  • Familiarity with distributed systems, high-performance computing, and optimization for training large models.

  • Hands-on experience with tools for data management and interaction (e.g., DVC, Delta Lake, or similar tools).

  • Strong leadership and communication skills.

Average salary estimate

$110000 / YEARLY (est.)
min
max
$90000K
$130000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Data/ML Infrastructure Engineer, Coco

Join Coco as a Data/ML Infrastructure Engineer in the vibrant city of San Francisco and join us in our mission to revolutionize urban logistics! At Coco, we empower cities by connecting locals with their favorite restaurants through our innovative fleet of on-demand delivery robots. Your role will be pivotal; as a founding member of our autonomy team, you will help build the AI technology that drives our robot pilots. Your expertise will shine through designing and implementing a high-performance data engine crucial to enhancing our models' efficiency. You'll collaborate closely with our CTO and other talented engineers to develop robust tools and pipelines for data extraction, cleaning, and curation, making it super easy for our team to access actionable insights from large-scale datasets. Your knack for building and optimizing cloud infrastructure will play a key role in ensuring smooth operations of all data processes—whether you're diving into ML frameworks like TensorFlow or PyTorch, or championing best practices in data management. With your extensive experience in software and infrastructure engineering, you will help create systems that enable reproducible experiments, enhance model training, and track datasets effectively. Together, we will transform urban delivery, allowing every city dweller to experience the convenience of seamless last-mile logistics, making their lives easier and delighting them along the way. Come be a part of Coco's journey towards a smarter, more efficient future!

Frequently Asked Questions (FAQs) for Data/ML Infrastructure Engineer Role at Coco
What responsibilities does a Data/ML Infrastructure Engineer at Coco have?

As a Data/ML Infrastructure Engineer at Coco, you will implement a high-performance data engine, build tools for data extraction, and ensure seamless interaction with large-scale datasets. Collaborating with AI engineers, you will also manage the infrastructure for model training, optimize data pipelines, and establish systems for dataset governance.

Join Rise to see the full answer
What qualifications are required for the Data/ML Infrastructure Engineer position at Coco?

Candidates for the Data/ML Infrastructure Engineer role at Coco should have 3+ years of experience in software or data engineering, with a focus on AI systems. Proficiency in cloud infrastructure management, programming, and knowledge of ML frameworks like TensorFlow and PyTorch are crucial for success in this role.

Join Rise to see the full answer
What programming languages and technologies are preferred for the Data/ML Infrastructure Engineer at Coco?

The ideal Data/ML Infrastructure Engineer at Coco should be well-versed in programming languages and technologies related to machine learning and data management. Familiarity with cloud platforms (AWS, GCP, Azure), containerization tools (Docker, Kubernetes), and infrastructure as code (Terraform) is also highly preferred.

Join Rise to see the full answer
How does the Data/ML Infrastructure Engineer contribute to Coco's mission?

The Data/ML Infrastructure Engineer at Coco plays a vital role by enhancing the performance of our autonomous delivery robots. By developing data pipelines and infrastructure that improve model training, this position greatly contributes to our mission of making last-mile delivery more efficient and accessible in urban areas.

Join Rise to see the full answer
What is the team structure like for a Data/ML Infrastructure Engineer at Coco?

As a Data/ML Infrastructure Engineer at Coco, you will work closely with the CTO and collaborate with a dedicated autonomy team. This collaborative environment will allow you to engage with both AI engineers and data specialists, fostering innovation and ensuring the success of our autonomy initiatives.

Join Rise to see the full answer
Common Interview Questions for Data/ML Infrastructure Engineer
What experience do you have with machine learning frameworks like TensorFlow or PyTorch?

When answering this question, focus on specific projects where you utilized TensorFlow or PyTorch. Discuss how you approached model development, the challenges you faced, and how you overcame them. Be prepared to share results or performance metrics that illustrate your success.

Join Rise to see the full answer
Can you describe your experience managing cloud infrastructure for data processing?

Be specific about the cloud platforms you've worked with, such as AWS, GCP, or Azure. Articulate how you've designed or maintained infrastructure for large-scale data processing, including any challenges faced and how you ensured cost efficiency and scalability during operations.

Join Rise to see the full answer
What strategies do you use for data pipeline optimization?

Discuss specific techniques and tools you’ve used to optimize data pipelines, such as batching, parallel processing, or buffering. Highlight experiences where these strategies led to significant improvements in data retrieval and processing speeds.

Join Rise to see the full answer
How do you ensure the quality and governance of datasets?

Talk about practices you implement for dataset tracking and governance, including version control systems or data integrity checks. Mention any tools (e.g., DVC, Delta Lake) you have used for managing data quality and reproducibility.

Join Rise to see the full answer
How do you handle collaboration with AI engineers when building the infrastructure?

Emphasize the importance of communication and shared goals in cross-functional teams. Share experiences from your past roles where you worked with AI engineers, discussing how you aligned infrastructure capabilities with their modeling needs to achieve common objectives.

Join Rise to see the full answer
What is your approach to managing large-scale datasets?

Explain your methods for handling and processing large datasets, such as using distributed systems or cloud storage solutions. Highlight any algorithms or frameworks you prefer that facilitate efficient data management and analysis.

Join Rise to see the full answer
Describe a challenging project involving data engineering that you worked on.

Choose a project that exemplifies your problem-solving skills in data engineering. Detail the problem, your approach to solution design, and the impact your work had on the project or organization.

Join Rise to see the full answer
What tools do you use for data extraction and cleaning, and why?

Discuss your preferred tools for data extraction and cleaning, such as Apache Kafka or Pandas. Describe situations in which you used these tools effectively, focusing on their advantages in achieving clean, structured data for analysis.

Join Rise to see the full answer
Can you explain your experience with containerization and orchestration tools?

Specifically mention your experience with tools like Docker or Kubernetes. Describe how you have used these tools to streamline deployment processes, improve scalability, and ensure the reliability of applications.

Join Rise to see the full answer
How do you stay updated with emerging technologies in data engineering and AI?

Share your strategies for continuous learning, whether through attending workshops, participating in online courses, following industry leaders on social media, or engaging in relevant communities. Highlight how staying informed helps you apply cutting-edge technologies in your work.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 4 days ago
Photo of the Rise User
Posted 2 days ago
Photo of the Rise User
Posted 2 days ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning
Photo of the Rise User
Posted 9 days ago
Photo of the Rise User
Posted 10 days ago
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Transparent & Candid
Growth & Learning
Fast-Paced
Collaboration over Competition
Take Risks
Friends Outside of Work
Passion for Exploration
Customer-Centric
Reward & Recognition
Feedback Forward
Rapid Growth
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Paternity Leave
Fully Distributed
Flex-Friendly
Some Meals Provided
Snacks
Social Gatherings
Pet Friendly
Company Retreats
Dental Insurance
Life insurance
Health Savings Account (HSA)
Photo of the Rise User
CoreWeave Remote Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA
Posted 14 days ago
Photo of the Rise User
Ushur Remote Bengaluru, KA
Posted 11 days ago
MATCH
Calculating your matching score...
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
December 13, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!