Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Senior Machine Learning Engineer – Cloud Observability - Visa AI as Services image - Rise Careers
Job details

Senior Machine Learning Engineer – Cloud Observability - Visa AI as Services - job 20 of 20

Ready to make a global impact by industrializing AI?

Visa AI as a Service (AIaS) operationalizes the delivery of AI and decision intelligence to ensure their ongoing business values. Built with composable AI capabilities, privacy-enhancing computation, and cloud native platforms, AIaS powers and automates industrialization of data, models, and applications for predictive and generative AI. Combined with strong governance, AIaS optimizes the performance, scalability, interpretability and reliability of AI models and services. If you want to be in the exciting payment and AI space, learn fast, and make big impacts, Visa AI as a Service is an ideal place for you!

This role is for a Sr. ML Engineer – Cloud Observability. We are seeking for a talented professional with a solid background in public cloud and AI/ML production systems. This role offers ample opportunities for learning and growth, and the chance to be part of delivering the next big thing for our AI as Services team.

Key Responsibilities:

  • Implement and Maintain Cloud Observability Solutions: Build and maintain monitoring, logging and tracing systems (E.g. Prometheus, Grafana, Druid, ELK Stack) for cloud-native AI services on AWS/Azure/GCP. Partner with data engineers and data scientists to embed observability into ML workflows and ensure real-time insights.

  • Collaborate on AI Model Monitoring: Work closely with data scientists and product owners to design and implement observability solutions for monitoring AI/ML model performance (e.g. accuracy, latency, data drift) in production. Develop dashboards and alerts to detect anomalies, model degradation, or bias, ensuring alignment with business SLAs.

  • Automate Devops Practices:  Develop tools for automated deployment, alerting and incident response using CI/CD pipelines like Jenkins and Github flows and infrastructure as code like Terraform.

  • Document & Reporting: Create and maintain clear documentation for observability processes and best practices. Generate reports to track system health and performance trends for business and technology stakeholders.

  • Incident Response: Assist in diagnosing and troubleshooting issues by analyzing metrics, logs and performance data and collaborate with cross functional teams to improve system level observability from the learning.

  • Stay Ahead of Trends: Explore emerging cloud and observability technologies to drive innovation.

If you are passionate about observability, cloud technology, AI, and machine learning, and are excited about making a significant impact, we would love to hear from you.

This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.

Average salary estimate

$140000 / YEARLY (est.)
min
max
$120000K
$160000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Senior Machine Learning Engineer – Cloud Observability - Visa AI as Services, Visa

Are you ready to make a global impact in the exciting realm of AI at Visa AI as a Service? We're searching for a Senior Machine Learning Engineer specializing in Cloud Observability to join our dynamic team in Austin. In this role, you will dive into the operationalization of AI and decision intelligence, ensuring that our innovative services deliver exceptional business value. You’ll utilize your expertise in public cloud and AI/ML production systems to implement and maintain robust observability solutions, leveraging tools like Prometheus and Grafana. By partnering with data engineers and data scientists, you'll help embed observability into machine learning workflows, providing real-time insights that enhance AI model performance. This position isn't just about monitoring; it’s about actively developing tools for automated deployments and incident responses using CI/CD pipelines and Terraform. You’ll also play a crucial role in documenting observability processes and generating reports that track system health, which is vital for our business and technical stakeholders. With an eye on the future, you’ll stay ahead of trends in cloud and observability technologies, driving innovation within the team. If you’re passionate about this field and eager to contribute to impactful projects, then the Senior Machine Learning Engineer position at Visa AI as a Service could be the perfect opportunity for you!

Frequently Asked Questions (FAQs) for Senior Machine Learning Engineer – Cloud Observability - Visa AI as Services Role at Visa
What are the responsibilities of a Senior Machine Learning Engineer – Cloud Observability at Visa AI as a Service?

As a Senior Machine Learning Engineer focused on Cloud Observability at Visa AI as a Service, your key responsibilities include implementing and maintaining monitoring solutions for cloud-native AI services, collaborating on the performance monitoring of AI/ML models, automating DevOps practices, and maintaining documentation for observability processes. Your work will ensure that the AI services are optimized for performance, scalability, and reliability.

Join Rise to see the full answer
What qualifications are required for the Senior Machine Learning Engineer – Cloud Observability position at Visa AI as a Service?

To be considered for the Senior Machine Learning Engineer – Cloud Observability role at Visa AI as a Service, candidates typically need a strong background in AI and machine learning, experience with public cloud platforms such as AWS, Azure, or GCP, and familiarity with observability tools like Prometheus and Grafana. Knowledge of CI/CD practices and infrastructure as code tools such as Terraform is also highly beneficial.

Join Rise to see the full answer
How does the Senior Machine Learning Engineer – Cloud Observability role contribute to AI at Visa AI as a Service?

The Senior Machine Learning Engineer – Cloud Observability role is pivotal at Visa AI as a Service, as it ensures the seamless operationalization of AI technologies. By developing and implementing observability solutions, this position helps in effectively monitoring AI model performance and facilitates real-time insights, which align directly with business objectives and enhance decision-making.

Join Rise to see the full answer
What tools and technologies will the Senior Machine Learning Engineer – Cloud Observability work with at Visa AI as a Service?

In the role of Senior Machine Learning Engineer – Cloud Observability at Visa AI as a Service, you will work with a variety of advanced tools and technologies, including Prometheus, Grafana, ELK Stack, and cloud services from AWS, Azure, or GCP. You’ll also utilize CI/CD tools like Jenkins and GitHub, along with Infrastructure as Code tools such as Terraform to automate deployments and enhance cloud observability.

Join Rise to see the full answer
Is the Senior Machine Learning Engineer – Cloud Observability position at Visa AI as a Service remote-friendly?

The Senior Machine Learning Engineer – Cloud Observability position at Visa AI as a Service is a hybrid role, which means while there are provisions for remote work, the specific expectation for days in the office will be confirmed by your hiring manager. This flexibility allows you to balance your work environment according to your needs while staying engaged with the team.

Join Rise to see the full answer
Common Interview Questions for Senior Machine Learning Engineer – Cloud Observability - Visa AI as Services
Can you explain your experience with cloud observability tools relevant to this Senior Machine Learning Engineer role?

When addressing your experience with cloud observability tools in the interview, highlight specific tools you've worked with, like Grafana or Prometheus. Discuss how you used them to improve performance monitoring and what challenges you faced. Providing concrete examples from your previous roles will showcase your practical knowledge and problem-solving skills.

Join Rise to see the full answer
Describe a project where you had to monitor an AI/ML model's performance in production.

In your response, detail a specific project where you successfully monitored an AI/ML model, explaining the metrics you tracked (like accuracy and latency), the tools you used, and the impact of your monitoring on the project's success. Emphasizing your analytical skills and ability to collaborate with cross-functional teams will demonstrate your capability for this role.

Join Rise to see the full answer
How do you ensure the reliability and scalability of machine learning models?

Discuss your strategies for testing and validating models thoroughly during development and the importance of setting up robust monitoring systems post-deployment. Highlight how you utilize automated tools for scaling and ensuring the models can handle varying loads without a dip in performance.

Join Rise to see the full answer
What best practices do you follow when documenting observability processes?

Talk about the importance of clear and concise documentation in observability processes, including how you organize information for ease of use and maintenance. Include examples of tools or templates you prefer and how they have helped your teams understand and implement observability best practices effectively.

Join Rise to see the full answer
Explain a time when you had to troubleshoot an incident with a machine learning model.

In your answer, outline the situation, focusing on your systematic approach to diagnostics. Describe the metrics and logs you analyzed, the conclusions you drew, and the effective changes you made, illustrating your problem-solving skills and ability to work under pressure.

Join Rise to see the full answer
Why do you believe observability is critical in cloud-native AI services?

Define observability and explain why it is crucial for monitoring the performance and health of AI services. Discuss the potential impact on business outcomes if observability is neglected, emphasizing how proactive monitoring can lead to quicker issue resolution and enhance user satisfaction.

Join Rise to see the full answer
How would you integrate observability practices into existing ML workflows?

Explain the steps you would take to seamlessly embed observability tools into existing ML workflows. Mention collaboration with data scientists and engineers, the type of metrics you would encourage them to monitor, and how this integration can lead to improved insights and model performance.

Join Rise to see the full answer
What strategies do you use to keep updated with emerging technologies in cloud and observability?

Outline your approach to staying informed about new technologies, such as attending industry conferences, participating in online forums, and following influential thought leaders. Mention any certifications or courses you've pursued that align with cloud observability trends and innovations.

Join Rise to see the full answer
Discuss your experience working in a hybrid work environment.

Share your experiences in a hybrid work setting, touching on the benefits and challenges you've encountered. Highlight tools and communication strategies that helped you remain productive and engaged with your team, emphasizing adaptability in your work style.

Join Rise to see the full answer
What do you find most exciting about the role of a Senior Machine Learning Engineer – Cloud Observability at Visa AI as a Service?

Share your passion for the convergence of AI, machine learning, and cloud technology. Discuss how working at Visa AI as a Service aligns with your professional goals and why you are excited to contribute to innovative projects and solutions that can reshape the industry.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
AECOM Hybrid New York, NY
Posted 4 days ago

Join AECOM as a Resident Engineer and contribute to transformative infrastructure projects across New York City.

Stellar Solutions seeks an experienced Mission Integration Engineering Lead to drive the technical design of tactical communications systems in a dynamic space environment.

Photo of the Rise User
Qualdoc Hybrid Petersburg, VA
Posted 8 days ago

Join a dedicated team as a Mechanic/Millwright, ensuring the optimal performance of our plant equipment.

Photo of the Rise User
Earnin Hybrid Mountain View, California, United States
Posted 7 days ago
Dental Insurance
Vision Insurance
Flexible Spending Account (FSA)
Family Medical Leave
Paid Holidays

Join EarnIn as a Technical Lead Manager to guide engineering teams in developing innovative financial solutions.

Photo of the Rise User

Join Jobgether as a Senior Engineering Manager to enhance the core product experience for millions of users in a fully remote setting.

Butterball Hybrid US, Hoke County, NC; North Carolina, Raeford, NC
Posted 7 days ago

Seeking a General Mechanical Engineer at Butterball, LLC to maintain and repair machinery in a turkey processing facility.

Argus Labs seeks a talented Senior Site Reliability Engineer to bolster the robustness of their innovative gaming platform in beautiful San Francisco.

Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

9499 jobs
MATCH
VIEW MATCH
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 2, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
M
Someone from OH, Tallmadge just viewed General Merchandise IC at Meijer
B
Someone from OH, Cleveland just viewed Resource & Scheduling Specialist at Brightspeed
Q
Someone from OH, Parma just viewed Advanced Microsoft Office Trainer at QS4QS
Photo of the Rise User
Someone from OH, Pickerington just viewed Sr. Client Project Manager at Forge Biologics
Photo of the Rise User
Someone from OH, Columbus just viewed Warehouse People Ops Coordinator at Babylist
Photo of the Rise User
9 people applied to Pega Engineer at Proxymity
Photo of the Rise User
Someone from OH, Toledo just viewed Field Recruiter (MI) at Wonderschool
d
Someone from OH, Columbus just viewed Reconciliation & Payments Specialist at dopay
Photo of the Rise User
Someone from OH, Cuyahoga Falls just viewed VP of Customer Operations at OXIO Corporation
Photo of the Rise User
Someone from OH, Springfield just viewed IT helpdesk Team Leader at Optimiza
Photo of the Rise User
Someone from OH, Akron just viewed Director of Revenue Cycle Management at Gather Health
Photo of the Rise User
Someone from OH, Dayton just viewed Data Entry Clerk at Hireframe
Photo of the Rise User
Someone from OH, Cincinnati just viewed Customer Success Manager - Illinois at Alma Technologies (OR)
Photo of the Rise User
Someone from OH, Cleveland just viewed Client Services Manager at Vitesse PSP
Photo of the Rise User
Someone from OH, Fairborn just viewed IOS Developer at Advansys
Z
Someone from OH, Reynoldsburg just viewed Educator Onboarding Associate at Zen Educate
Photo of the Rise User
Someone from OH, Canton just viewed SEASONER at Shearer's Foods