Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Staff Site Reliability Engineer - Cloud Engineering image - Rise Careers
Job details

Staff Site Reliability Engineer - Cloud Engineering - job 16 of 20

Visa’s Technology Organization is a community of problem solvers and innovators reshaping the future of commerce. We operate the world’s most sophisticated processing networks capable of handling more than 65k secure transactions a second across 80M merchants, 15k Financial Institutions, and billions of everyday people. While working with us you’ll get to work on complex distributed systems and solve massive scale problems centered on new payment flows, business and data solutions, cyber security, and B2C platforms.

 

The Opportunity:

As a Staff Site Reliability Engineer in Product Reliability Engineering, you will be part of a team that maintains and supports Visa's Data Platform and provides support for key cloud based Big data and Kafka Platforms. You will be responsible for driving innovation for our partners and clients, within Visa and globally. You will work on open-source Big Data and Kafka clusters focusing on Cloud, ensuring their availability, performance, reliability, and improving operational efficiency.

 

The Work itself:

Essential Functions:

· Design, build and manage Big Data and Kafka infrastructure on AWS, GCP and Azure.

· Manage and optimize Apache Big Data and Kafka clusters for high performance, reliability, and scalability.

· Develop tools and processes to monitor and analyze system performance and to identify potential issues.

· Collaborate with other teams to design and implement Solutions to improve reliability and efficiency of the Big data cloud platforms.

· Ensure security and compliance of the platforms within organizational guidelines.

· Other responsibilities include effective root cause analysis of major production incidents and the development of learning documentation. The person will identify and implement high-availability solutions for services with a single point of failure.

· The role involves planning and performing capacity expansions and upgrades in a timely manner to avoid any scaling issues and bugs. This includes automating repetitive tasks to reduce manual effort and prevent human errors.

· The successful candidate will tune alerting and set up observability to proactively identify issues and performance problems. They will also work closely with Level 3 teams in reviewing new use cases and cluster hardening techniques to build robust and reliable platforms.

· The role involves creating standard operating procedure documents and guidelines on effectively managing and utilizing the platforms. The person will leverage DevOps tools, disciplines (Incident, problem, and change management), and standards in day-to-day operations.

· The individual will ensure that the platforms can effectively meet performance and service level agreement requirements. They will also perform security remediation, automation, and self-healing as per the requirement.

· The individual will concentrate on developing automations and reports to minimize manual effort. This can be achieved through various automation tools such as Shell scripting, Ansible, or Python scripting, or by using any other programming language.

 

The Skills You Bring:

· Energy and Experience: A growth mindset that is curious and passionate about technologies and enjoys challenging projects on a global scale.

·  Challenge the Status Quo: Comfort in pushing the boundaries, “hacking” beyond traditional solutions.

·  Language Expertise: Expertise in one or more general development languages (e.g., Java, python)

· Builder: Experience building and deploying distributed systems.

·  Learner: Constant drive to learn new technologies such as cloud technologies, Kubernetes, MLOPS.

· Partnership: Experience collaborating with Engineering, Application and Other functional teams.

 

**We do not expect that any single candidate would fulfill all these characteristics. For instance, we have awesome team members who are really focused on building scalable systems but didn’t work with payments technology or web applications before joining Visa.

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Staff Site Reliability Engineer - Cloud Engineering, Visa

Are you ready to join Visa's cutting-edge Technology Organization as a Staff Site Reliability Engineer in Cloud Engineering? Located in the vibrant city of Austin, you'll become an integral part of a dynamic team that's reshaping the future of commerce. In this role, you will be solving complex distributed system issues and tackling massive scaling challenges while ensuring the reliability and performance of our innovative cloud-based Big Data and Kafka platforms. Your responsibilities will include designing, building, and managing infrastructure on AWS, GCP, and Azure, while optimizing Apache clusters for peak performance. As a key player, you'll collaborate with cross-functional teams to enhance system efficiency and lead initiatives that drive operational excellence. You will also focus on automating processes to streamline operations and ensure the security and compliance of our platforms. With your passion for technology and a growth mindset, you'll constantly improve our partnerships and bring innovative solutions to the table. As a hybrid work position, you’ll enjoy the flexibility of working remotely some days while collaborating in the office. If you're excited to take on challenging projects on a global scale and are ready to push the boundaries of what's possible, this is the opportunity you've been waiting for!

Frequently Asked Questions (FAQs) for Staff Site Reliability Engineer - Cloud Engineering Role at Visa
What are the responsibilities of a Staff Site Reliability Engineer at Visa?

A Staff Site Reliability Engineer at Visa is responsible for designing and managing Big Data and Kafka infrastructure on leading cloud platforms like AWS, GCP, and Azure. This includes optimizing Apache clusters, developing monitoring tools to analyze system performance, collaborating with various teams to enhance reliability and efficiency, and ensuring security and compliance within the organization's guidelines.

Join Rise to see the full answer
What qualifications do I need to apply for the Staff Site Reliability Engineer position at Visa?

To apply for the Staff Site Reliability Engineer role at Visa, candidates should have experience with distributed systems, proficiency in general development languages such as Java or Python, and a keen interest in cloud technologies and tools. A solid understanding of DevOps practices is also crucial, along with an ability to collaborate effectively with engineering, application, and other functional teams.

Join Rise to see the full answer
What kind of projects will I work on as a Staff Site Reliability Engineer at Visa?

As a Staff Site Reliability Engineer at Visa, you will engage in projects that tackle complex problems around new payment flows and data solutions. You'll work on improving the performance of cloud-based platforms, automate repetitive tasks, and conduct root cause analysis of production incidents to foster continuous improvement and innovation within the organization.

Join Rise to see the full answer
What skills are essential for success in the Staff Site Reliability Engineer role at Visa?

Essential skills for a Staff Site Reliability Engineer at Visa include a strong foundation in cloud technologies, experience in optimizing distributed systems, and a thorough understanding of incident, problem, and change management. A growth mindset, comfort in challenging traditional solutions, and the ability to collaborate within a team setting are also essential for driving innovation and operational efficiency.

Join Rise to see the full answer
Is the Staff Site Reliability Engineer position at Visa remote or in-office?

The Staff Site Reliability Engineer position at Visa is a hybrid role, meaning you can alternate between remote work and working in the office. Employees in hybrid roles are typically expected to be in the office 2-3 days a week based on business needs, allowing for flexibility while maintaining collaborative team interactions.

Join Rise to see the full answer
Common Interview Questions for Staff Site Reliability Engineer - Cloud Engineering
Can you describe your experience with managing and optimizing Big Data infrastructure?

In preparing to answer this question, highlight specific instances where you successfully managed Big Data infrastructures. Discuss the tools and technologies you utilized, any challenges you faced, and how you optimized performance. Mention any automated solutions or monitoring processes you put in place.

Join Rise to see the full answer
How do you ensure the reliability and scalability of services?

Discuss your approach to assessing system performance and reliability. Mention specific metrics you track and how you conduct capacity planning and monitoring. Highlight any past experiences where you increased system reliability and scalability.

Join Rise to see the full answer
What experience do you have with Apache Kafka?

Share your direct experience with Apache Kafka, detailing how you deployed or managed Kafka clusters. Talk about your familiarity with its architecture and how you've implemented it for data streaming solutions in previous projects.

Join Rise to see the full answer
How do you approach incident management?

When answering this question, outline your incident management process, including detection, response, and post-mortem analysis. Discuss specific tools you utilize for incident management and how you ensure continuous improvement in your approach.

Join Rise to see the full answer
What automation tools have you used, and how have they improved your work?

Mention any automation tools you’ve utilized, such as Ansible or Shell scripting. Explain how these tools have streamlined processes, reduced manual errors, or enhanced system monitoring in your previous roles.

Join Rise to see the full answer
How do you stay updated on the latest cloud technologies?

Discuss various resources you utilize to keep your knowledge up to date, such as online courses, tech blogs, or community forums. Mention any relevant certifications you may have pursued or relevant projects that showcase your continuous learning.

Join Rise to see the full answer
Describe a challenging technical problem you solved.

Share a specific instance of a technical challenge you encountered. Describe the problem, the steps you took to resolve it, and the outcome. Highlight any collaboration with other teams and the tools used in finding a solution.

Join Rise to see the full answer
How do you prioritize tasks when managing multiple projects?

Explain your methods for task prioritization, such as using project management frameworks or tools. Provide examples from past experiences where your prioritization led to successful project outcomes.

Join Rise to see the full answer
What role do you typically play in team collaborations?

Discuss your preferred collaborative style and how you contribute to team projects. Provide examples showcasing your ability to communicate effectively, share ideas, and support team members in achieving common goals.

Join Rise to see the full answer
How do you address security and compliance in your work?

Emphasize your understanding of security best practices and compliance standards relevant to cloud-based technologies. Discuss any past experiences where you implemented security measures or participated in compliance audits.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 10 days ago

Become a vital part of Visa's mission as a Sr. Consultant in Technical Solutions, championing customer engagement and innovation in a hybrid work model.

Photo of the Rise User
Posted 10 days ago

Join Visa as a Staff Software Engineer II to develop cutting-edge solutions for payment services and transaction platforms.

Photo of the Rise User

Become a key player in our Data & Algorithm team as a Site Reliability Engineer, ensuring system reliability and automation.

Photo of the Rise User
Intel Remote US, Arizona, Phoenix
Posted 2 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Customer-Centric
Snacks
Onsite Gym
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Learning & Development
Paid Time-Off
401K Matching
Maternity Leave
Paternity Leave

Intel is searching for a Senior Device Engineer to drive yield improvement and device performance in high-volume manufacturing settings.

Photo of the Rise User
Posted 3 days ago

Join AECOM as an Electrical Engineering Department Manager and lead a dynamic team dedicated to solving the world’s energy challenges.

Photo of the Rise User
JASARA PMC Remote No location specified
Posted 14 days ago

Become a vital part of JASARA PMC as a Design Low Voltage Engineer, focusing on compliance and project success in low voltage systems design.

Photo of the Rise User
Posted 8 days ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Customer-Centric
Snacks
Onsite Gym
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Learning & Development
Paid Time-Off
401K Matching
Maternity Leave
Paternity Leave

Lead a talented team at Intel's Advanced Design Library Technology, driving innovations in silicon design and validation processes.

Photo of the Rise User
Posted 3 days ago

As a Senior Platform Engineer at Autodesk, you'll lead the charge in enhancing data platforms to meet innovative business needs.

Photo of the Rise User
Herbert Rowland & Grubic Remote King of Prussia, Pennsylvania, United States
Posted 6 days ago

HRG is looking for a skilled Lead Process Engineer to spearhead the design of innovative water and wastewater treatment systems in a collaborative and employee-owned environment.

Photo of the Rise User
Redwood Materials Hybrid McCarran, Nevada, United States
Posted 11 days ago

Join Redwood Materials as a Construction EHS Engineer to lead safety efforts on innovative construction projects focused on sustainability.

Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

11734 jobs
MATCH
VIEW MATCH
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 3, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
9 people applied to Welder/Fabricator at Pyrotek
C
Someone from OH, Middletown just viewed Operations Analyst at Core Specialty Insurance
Photo of the Rise User
6 people applied to Technology Intern at SABIC
A
Someone from OH, Strongsville just viewed Graphic Design Intern at Anvil NorthWest
W
Someone from OH, Uhrichsville just viewed Director Operations at WVUMedicine
Photo of the Rise User
Someone from OH, Cincinnati just viewed Game Director, Scripps Sports at The E.W. Scripps Company
Photo of the Rise User
Someone from OH, Lorain just viewed 3D Modeler / Graphic Designer - Freelance at Twine
o
Someone from OH, Oxford just viewed Digital Media & Marketing Student Intern at osu
Photo of the Rise User
Someone from OH, Beachwood just viewed Dispensary Tech at Ayr Wellness
Photo of the Rise User
Someone from OH, Springfield just viewed Front Desk Clerk at Marriott International
L
Someone from OH, Akron just viewed Junior Graphic Designer at Little Spoon
Photo of the Rise User
Someone from OH, Columbus just viewed Licensing and Regulatory Compliance Analyst at Sportradar
Photo of the Rise User
Someone from OH, Mansfield just viewed US_EN_Operations_Warehouse Loader (Part Time) at Red Bull
Photo of the Rise User
Someone from OH, Dublin just viewed Salesforce Administrator at Multiverse
Photo of the Rise User
Someone from OH, Pickerington just viewed Salesforce Solution Analyst at GoodLeap
S
Someone from OH, Pickerington just viewed Salesforce Project Manager at Studio Science
Photo of the Rise User
Someone from OH, Dayton just viewed Medical Receptionist at LifeStance Health
C
Someone from OH, Massillon just viewed RN Ambulatory - Outpatient Infusion Therapy at CCF
Photo of the Rise User
Someone from OH, Columbus just viewed HR Business Partner (Maternity Cover) at Marshmallow
Photo of the Rise User
Someone from OH, Columbus just viewed Community Outreach Canvasser $24/Hr at Confidential
Photo of the Rise User
Someone from OH, Cincinnati just viewed Email Marketing Coordinator at Creative Circle
Photo of the Rise User
Someone from OH, Columbus just viewed UX Researcher, Amazon Autos at Amazon
Photo of the Rise User
Someone from OH, Cincinnati just viewed AI training and enablement at Writer