Technical Lead - Senior ML Infrastructure Software Engineer

Technical Lead - Senior Machine Learning Infrastructure Software Engineer

Location: Hybrid in New York City, or US remote.


About Flip.shop:

Welcome to Flip.shop, where innovation meets the social commerce revolution! Fresh off our Series C funding round, we've raised $144 million, propelling our valuation to an impressive $1.05 billion. We’re redefining the shopping experience by giving consumers a voice in a space dominated by tech giants. Join us on this exhilarating journey where your technical skills will play a pivotal role in shaping the future of social commerce!


Why Join Us?

At Flip.shop, you’ll be at the forefront of innovation in social commerce. This isn’t just a job; it’s a chance to build infrastructure that empowers our AI-driven platform to scale and deliver personalized shopping experiences. You will partner directly with, and learn from, engineers and scientists who joined us from some of the leading big-tech companies.

If you thrive in a fast-paced, collaborative environment where you can develop high-performance systems, we want to hear from you!


Role Overview:

We are seeking an experienced ML Infrastructure Lead to design, build, and optimize the infrastructure that powers our machine learning systems. You’ll drive the scalability, reliability, and performance of our recommendation and ads systems. This role involves leading the design, implementation, and optimization of our serving infrastructure to support high-throughput, low-latency workloads.

Furthermore, you'll ensure the efficient deployment, scaling, and monitoring of machine learning models, and will help streamline the development lifecycle. This role offers the opportunity to create scalable, production-level systems that support real-time recommendations and drive business growth.


You will work closely with our engineering and machine learning leaders to ensure our platform can scale efficiently and reliably as we grow.



Key Responsibilities:
  • Infrastructure Development: Design and implement scalable ML infrastructure for deploying, monitoring, and maintaining machine learning models in production environments. Ensure high availability, reliability, and performance of serving and infrastructure systems.
  • Tooling & Automation: Build tools to automate workflows for model training, testing, and deployment, ensuring that machine learning models can move quickly from development to production.
  • Cloud Infrastructure: Leverage cloud platforms to create efficient, scalable systems for large-scale machine learning workloads.
  • Performance Optimization: Ensure the infrastructure supports high-performance model inference at scale, with a focus on minimizing latency and maximizing throughput.
  • Collaboration: Work closely with data scientists, machine learning engineers, and DevOps teams to create seamless integration between development and production environments.
  • Monitoring & Maintenance: Build robust monitoring systems to track model performance and infrastructure health, ensuring reliability and uptime of machine learning services.
  • Security & Compliance: Implement best practices in infrastructure security, data privacy, and compliance, particularly when handling sensitive user data.


Requirements:
  • Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
  • Experience: 7+ years of experience in infrastructure engineering, DevOps, or similar domains, with a focus on supporting machine learning workflows in production.
  • Technical Skills: Strong proficiency in cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), CI/CD pipelines, and infrastructure-as-code tools (Terraform, Ansible). Experience with SageMaker is a bonus. 
  • ML Workflow Knowledge: Experience working with machine learning frameworks (TensorFlow, PyTorch, or similar) and expertise with MLOps practices.
  • Performance & Scalability: Proven track record of optimizing infrastructure for performance, scalability, and reliability in production environments.
  • Collaboration: Strong teamwork skills, with the ability to partner with ML engineers and data scientists to streamline workflows.
  • Communication: Ability to communicate complex infrastructure solutions to technical and non-technical stakeholders.
  • Problem-Solving: Passion for solving infrastructure challenges that support real-time machine learning at scale.


Preferred Qualifications:
  • Experience using Node.js for backend development.
  • Experience with AWS infrastructure and tooling.
  • Experience with message queues such as RabbitMQ.


Why You’ll Love Working Here:

At Flip.shop, you’ll have the opportunity to build the backbone of our AI-driven platform, working on cutting-edge infrastructure that powers personalized shopping experiences for millions of users. Your work will directly contribute to scaling our machine learning systems, ensuring they run efficiently in a high-performance production environment. This is your chance to have a lasting impact and help Flip.shop shape the future of social commerce.


Ready to Build the Future?

If you're passionate about building scalable infrastructure and driving innovation in machine learning at scale, join us at Flip.shop! Let’s redefine the future of online shopping together.


Compensation & Benefits:

Base salary and total compensation will vary based on factors including but not limited to location, experience, and performance. Please note the base salary is just one component of the company’s total rewards package for exempt employees. Other rewards may include equity, bonuses, long term incentives, a PTO policy, and other progressive benefits.

What You Should Know About Technical Lead - Senior ML Infrastructure Software Engineer, Flip

Looking to make a significant impact in the world of social commerce? Join Flip.shop as a Technical Lead - Senior Machine Learning Infrastructure Software Engineer! This exciting position offers the chance to work remotely or in a hybrid model from New York City. At Flip.shop, we are at the forefront of innovation, reshaping how consumers engage with shopping through our AI-powered platform. With recent funding that values us at over $1 billion, we're committed to empowering consumers and transforming their shopping experiences. As a part of our team, you will design and build scalable infrastructure for our machine learning systems, ensuring they are reliable, high-performing, and can handle real-time data processing. Your role will also involve collaborating with top engineers and data scientists, automating workflows, optimizing performance, and maintaining robust security measures. If you're a tech-savvy leader with a passion for infrastructure and machine learning, you’ll thrive here. Your contributions will not only influence our technology but also help raise the bar for online shopping experiences across the industry. Join us at Flip.shop and be part of a visionary team where your work directly shapes the future of social commerce. Let's create something remarkable together!

Frequently Asked Questions (FAQs) for Technical Lead - Senior ML Infrastructure Software Engineer Role at Flip
What are the main responsibilities of a Technical Lead - Senior Machine Learning Infrastructure Software Engineer at Flip.shop?

As a Technical Lead - Senior ML Infrastructure Software Engineer at Flip.shop, you will be responsible for designing and implementing scalable ML infrastructure, ensuring high availability and performance of our systems. You will automate workflows for model training and deployment, collaborate with data scientists, and maintain rigorous monitoring and security practices for our machine learning models.

What qualifications are required for the Technical Lead - Senior Machine Learning Infrastructure Software Engineer role at Flip.shop?

The Technical Lead position at Flip.shop requires a Bachelor’s or Master’s degree in Computer Science or a related field, along with over 7 years of experience in infrastructure engineering, DevOps, or similar roles. Strong expertise in cloud platforms, containerization, and machine learning frameworks is also essential to succeed in this position.

How does Flip.shop support career growth for Technical Lead - Senior Machine Learning Infrastructure Software Engineers?

At Flip.shop, we prioritize professional development and growth. As a Technical Lead - Senior ML Infrastructure Software Engineer, you will have opportunities to work alongside top talent, gain exposure to advanced technologies, and take on leadership roles, all fostering an environment where you can expand your skillset and advance your career.

What is the company culture like for a Technical Lead - Senior Machine Learning Infrastructure Software Engineer at Flip.shop?

Flip.shop has a collaborative and innovative culture, focusing on teamwork and creativity. As a Technical Lead - Senior Machine Learning Infrastructure Software Engineer, you'll work in a fast-paced environment with like-minded professionals who are passionate about pushing boundaries and making a significant impact in the world of social commerce.

What technologies should a candidate be familiar with for the Technical Lead - Senior Machine Learning Infrastructure Software Engineer position at Flip.shop?

Candidates applying for the Technical Lead - Senior ML Infrastructure Software Engineer role at Flip.shop should have a solid understanding of cloud technologies (AWS, GCP, or Azure), containerization (Docker, Kubernetes), and be experienced with CI/CD pipelines and infrastructure-as-code tools. Proficiency in ML workflows and frameworks such as TensorFlow or PyTorch is also crucial for success.

Common Interview Questions for Technical Lead - Senior ML Infrastructure Software Engineer
Can you explain your experience with cloud platforms relevant to the Technical Lead - Senior Machine Learning Infrastructure Software Engineer role?

When answering, highlight specific cloud platforms you have used, projects you've completed that utilized these platforms, and discuss how you've leveraged their services to enhance machine learning workflows. Mention any challenges faced and how you overcame them.

What strategies have you implemented to optimize machine learning infrastructure for performance?

Discuss concrete examples of optimizations you've made, such as reducing latency or improving throughput. Explain the methodology you used to identify bottlenecks and the tools or frameworks that aided you in this process.

How do you approach collaboration with data scientists and ML engineers?

Share your experience with cross-functional teams, emphasizing communication strategies, tools you’ve used for collaboration, and how you ensure alignment on project goals and deliverables. Your ability to bridge technical and non-technical communication will be crucial.

Describe a time you solved a challenging problem related to machine learning infrastructure.

Choose a relevant scenario that showcases your problem-solving skills. Detail the steps you took, the outcome, and what you learned during the process. Highlight your analytical skills and creativity in troubleshooting.

What experience do you have with MLOps practices?

Explain your understanding of MLOps and how you have successfully applied its principles in previous roles to streamline the development and deployment of machine learning models.

How do you ensure the security and compliance of infrastructure handling sensitive data?

Discuss the best practices you follow, including data encryption, regulatory compliance (such as GDPR and CCPA), and your experience implementing security measures in your infrastructure. Mention specific tools or strategies you’ve employed.

What tools do you use for automating ML workflows?

Provide examples of the automation tools you've used, such as Jenkins, Airflow, or similar technologies, and explain how they helped streamline processes. Discuss the impact of automation on model deployment and management.

How do you handle model performance monitoring post-deployment?

Share the monitoring tools and methodologies you utilize to track the performance of live machine learning models. Discuss how you respond to anomalies and ensure continuous model improvement.

What techniques do you apply for minimizing latency in ML projects?

Mention any specific algorithms, design patterns, or architectural choices you employ to enhance speed and performance in your machine learning systems. Discuss any relevant metrics you monitor.

Why are you interested in the Technical Lead - Senior Machine Learning Infrastructure Software Engineer role at Flip.shop?

Craft a compelling narrative about your passions in social commerce, your admiration for Flip.shop's mission, and how your skill set aligns with the company’s goals. Showcase enthusiasm and commitment to contributing to innovative projects.

Employment type: Full-time, remote
Date posted: April 6, 2025
