
Principal MLOps Engineer (Canada)

About the Role:


We are looking for a seasoned Principal MLOps Engineer to architect, build, and optimize our ML inference platform. The role demands significant expertise in machine learning engineering and infrastructure, with an emphasis on building ML inference systems. Proven experience building and scaling ML inference platforms in a production environment is crucial. This remote position calls for exceptional communication skills and the ability to tackle complex challenges independently with innovative solutions.


What you will be doing:
  • Architect and optimize our existing data infrastructure to support cutting-edge machine learning and deep learning models.
  • Collaborate closely with cross-functional teams to translate business objectives into robust engineering solutions.
  • Own the end-to-end development and operation of high-performance, cost-effective inference systems for a diverse range of models, including state-of-the-art LLMs.
  • Provide technical leadership and mentorship to foster a high-performing engineering team.


Requirements:
  • Proven track record in designing and implementing cost-effective and scalable ML inference systems. 
  • Hands-on experience with leading machine learning and deep learning frameworks such as TensorFlow, Keras, or Spark MLlib.
  • Solid foundation in machine learning algorithms, natural language processing, and statistical modeling. 
  • Strong grasp of fundamental computer science concepts including algorithms, distributed systems, data structures, and database management. 
  • Proficiency in Java, with recent hands-on experience (must have).
  • Ability to tackle complex challenges and devise effective solutions. Use critical thinking to approach problems from various angles and propose innovative solutions.
  • Proven ability to work effectively in a remote setting, with strong written and verbal communication skills. Ability to collaborate with team members and stakeholders to ensure a clear understanding of technical requirements and project goals.
  • Proven experience with the Apache Hadoop ecosystem (Oozie, Pig, Hive, MapReduce).
  • Expertise in public cloud services, particularly in GCP and Vertex AI.


Must have:
  • Proven expertise in applying model optimization techniques (distillation, quantization, hardware acceleration) to production environments.
  • Proficiency in Java, with recent hands-on experience.
  • In-depth understanding of LLM architectures, parameter scaling, and deployment trade-offs.
  • Technical degree: a Bachelor's degree in Computer Science with a minimum of 10 years of relevant industry experience, or
  • a Master's degree in Computer Science with at least 8 years of relevant industry experience.
  • A specialization in Machine Learning is preferred.





About Rackspace Technology

We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies, across applications, data and security, to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work year after year by Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

 

 

More on Rackspace Technology

Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

 

 


What You Should Know About Principal MLOps Engineer (Canada), Rackspace

Join the team at Rackspace Technology as a Principal MLOps Engineer and take the helm of architecting and optimizing our cutting-edge ML inference platform. In this role, you will leverage your extensive experience in Machine Learning engineering and infrastructure to build robust systems that deliver high-performance results for diverse models, including state-of-the-art LLMs. Your contributions will not only enhance our technical capabilities but also help shape a culture of innovation and excellence within the engineering team. Collaborating with cross-functional teams, you will translate business objectives into engineering solutions that are effective and cost-efficient. As a mentor and technical leader, you’ll help elevate your colleagues while fostering a high-performing environment. With a strong foundation in ML algorithms, deep learning frameworks like TensorFlow and Keras, and proficiency in Java, you will design scalable, production-ready ML inference systems. Your strong communication skills will be key in a remote setting, as you will continually interact with team members and stakeholders to ensure alignment on technical requirements and project goals. If you are passionate about tackling complex challenges, this position offers a unique opportunity to drive meaningful contributions in the fast-evolving field of machine learning.

Frequently Asked Questions (FAQs) for Principal MLOps Engineer (Canada) Role at Rackspace
What are the responsibilities of the Principal MLOps Engineer at Rackspace Technology?

As a Principal MLOps Engineer at Rackspace Technology, you will take charge of architecting and optimizing the ML inference platform while collaborating with cross-functional teams. You'll manage the end-to-end development of high-performance inference systems and provide technical leadership to guide a skilled engineering team. Building scalable models and using optimization techniques will also be key responsibilities.

What qualifications do I need to become a Principal MLOps Engineer at Rackspace Technology?

To qualify for the Principal MLOps Engineer position at Rackspace Technology, you should have a Bachelor's degree in Computer Science with at least 10 years of relevant experience, or a Master's degree with a minimum of 8 years. Expertise in machine learning, strong knowledge of deep learning frameworks like TensorFlow, and proficiency in Java are critical qualifications for this role.

What skills are required for the Principal MLOps Engineer position at Rackspace Technology?

Key skills for the Principal MLOps Engineer role at Rackspace Technology include a solid understanding of machine learning algorithms, natural language processing, and experience with Apache Hadoop. Moreover, your ability to optimize ML models and proficiency in cloud services, particularly in GCP and Vertex AI, will be essential in this role.

What can I expect in terms of collaboration and team dynamics as a Principal MLOps Engineer at Rackspace Technology?

In your role as a Principal MLOps Engineer at Rackspace Technology, you can anticipate a dynamic remote work environment that fosters collaboration. Your communication skills will be vital as you'll work closely with different teams to translate objectives into engineering solutions while mentoring colleagues and encouraging innovative problem-solving.

How does Rackspace Technology support diversity and inclusion for the Principal MLOps Engineer role?

Rackspace Technology is deeply committed to diversity and inclusion, ensuring equal employment opportunities regardless of various characteristics. As a Principal MLOps Engineer, you will be part of a culture that values unique perspectives, promoting innovation and enabling better service to customers and communities worldwide.

Common Interview Questions for Principal MLOps Engineer (Canada)
Can you explain how you would architect a scalable ML inference system?

When architecting a scalable ML inference system, I would first assess the specific requirements of the models we plan to deploy. Then, I would leverage distributed computing technologies, ensuring efficient data flow from storage to the compute platform, and consider using orchestration tools to manage model versions and real-time predictions effectively.

What deep learning frameworks are you most proficient in, and why?

I am most proficient in TensorFlow and PyTorch due to their extensive community support and flexibility. TensorFlow provides robust tools for production deployment, while PyTorch’s dynamic computational graph is beneficial for research and development, making them suitable for varied project needs.

How do you approach the optimization of ML models for production?

To optimize ML models for production, I generally assess their performance metrics and identify bottlenecks. Techniques such as model quantization, distillation, and the use of GPU acceleration help reduce latency and improve efficiency while ensuring robustness in predictions.

Describe your experience with Apache Hadoop and its ecosystem.

I have hands-on experience with the Apache Hadoop ecosystem, including tools like Hive for querying and Pig for data processing. I have used MapReduce for distributed data processing, which has enhanced my ability to work effectively with large datasets in various ML applications.

How do you ensure effective communication in a remote team setting?

In a remote team, I prioritize regular check-ins, utilize project management tools, and maintain open lines of communication through video calls and instant messaging. This ensures alignment on project goals and fosters a strong collaborative culture despite physical distances.

What methodologies do you use to mentor junior engineers effectively?

To mentor junior engineers effectively, I adopt a hands-on approach, encouraging them to work on real projects while providing guidance. I also stress the importance of feedback, facilitate knowledge-sharing sessions, and encourage them to explore new technologies relevant to their careers.

Can you discuss your experience with cloud services, specifically GCP and Vertex AI?

I have extensive experience with GCP, specifically using Vertex AI for deploying ML models. It allows seamless integration of various GCP services, such as data storage and orchestration, which streamlines the workflow from model training to production.

What challenges have you faced while implementing ML tools in a production environment?

One significant challenge was addressing the data pipeline's latency and ensuring real-time model predictions. By implementing efficient data streaming practices and optimizing model inference, I was able to improve latency significantly while maintaining accuracy.

How do you stay updated with the latest trends in ML and AI?

To stay updated with ML and AI trends, I regularly read research papers, follow industry leaders on social media, participate in webinars and conferences, and engage with communities on platforms like GitHub and Stack Overflow to learn from peers’ experiences.

What are your strategies for problem-solving in complex engineering tasks?

My strategies for problem-solving include breaking down complex challenges into manageable components, applying critical thinking to analyze each part, and brainstorming potential solutions. Collaborating with team members can provide additional perspectives that lead to innovative solutions.


Founded in 1998, Rackspace provides multi-cloud computing solutions and services. It advises customers based on their business challenges and designs, builds, and manages solutions. The company is headquartered in San Antonio, Texa...

Employment type: Full-time, remote
Date posted: March 27, 2025
