Visa’s Technology Organization is a community of problem solvers and innovators reshaping the future of commerce. We operate the world’s most sophisticated processing networks capable of handling more than 65k secure transactions a second across 80M merchants, 15k Financial Institutions, and billions of everyday people. While working with us you’ll get to work on complex distributed systems and solve massive scale problems centered on new payment flows, business and data solutions, cyber security, and B2C platforms.
The Opportunity:
As a Staff Site Reliability Engineer in Product Reliability Engineering, you will be part of a team that maintains and supports Visa's Data Platform and provides support for key cloud based Big data and Kafka Platforms. You will be responsible for driving innovation for our partners and clients, within Visa and globally. You will work on open-source Big Data and Kafka clusters focusing on Cloud, ensuring their availability, performance, reliability, and improving operational efficiency.
The Work itself:
Essential Functions:
· Design, build and manage Big Data and Kafka infrastructure on AWS, GCP and Azure.
· Manage and optimize Apache Big Data and Kafka clusters for high performance, reliability, and scalability.
· Develop tools and processes to monitor and analyze system performance and to identify potential issues.
· Collaborate with other teams to design and implement Solutions to improve reliability and efficiency of the Big data cloud platforms.
· Ensure security and compliance of the platforms within organizational guidelines.
· Other responsibilities include effective root cause analysis of major production incidents and the development of learning documentation. The person will identify and implement high-availability solutions for services with a single point of failure.
· The role involves planning and performing capacity expansions and upgrades in a timely manner to avoid any scaling issues and bugs. This includes automating repetitive tasks to reduce manual effort and prevent human errors.
· The successful candidate will tune alerting and set up observability to proactively identify issues and performance problems. They will also work closely with Level 3 teams in reviewing new use cases and cluster hardening techniques to build robust and reliable platforms.
· The role involves creating standard operating procedure documents and guidelines on effectively managing and utilizing the platforms. The person will leverage DevOps tools, disciplines (Incident, problem, and change management), and standards in day-to-day operations.
· The individual will ensure that the platforms can effectively meet performance and service level agreement requirements. They will also perform security remediation, automation, and self-healing as per the requirement.
· The individual will concentrate on developing automations and reports to minimize manual effort. This can be achieved through various automation tools such as Shell scripting, Ansible, or Python scripting, or by using any other programming language.
The Skills You Bring:
· Energy and Experience: A growth mindset that is curious and passionate about technologies and enjoys challenging projects on a global scale.
· Challenge the Status Quo: Comfort in pushing the boundaries, “hacking” beyond traditional solutions.
· Language Expertise: Expertise in one or more general development languages (e.g., Java, python)
· Builder: Experience building and deploying distributed systems.
· Learner: Constant drive to learn new technologies such as cloud technologies, Kubernetes, MLOPS.
· Partnership: Experience collaborating with Engineering, Application and Other functional teams.
**We do not expect that any single candidate would fulfill all these characteristics. For instance, we have awesome team members who are really focused on building scalable systems but didn’t work with payments technology or web applications before joining Visa.
This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Join Visa's Technology Organization as a Staff Site Reliability Engineer - Cloud Engineering in the vibrant city of Austin! Here at Visa, we’re a quirky team of problem solvers and innovators molding the future of commerce. Imagine working with some of the world’s most sophisticated processing networks that can handle over 65,000 transactions a second! In the role of Staff Site Reliability Engineer, you'll dive deep into Visa's Data Platform, working with cutting-edge cloud-based Big Data and Kafka platforms to ensure their availability and performance. You'll design, manage, and optimize these complex infrastructures across AWS, GCP, and Azure while collaborating with diverse teams to drive innovation and efficiency. Your expertise will shine as you monitor system performance, identify potential issues, and carry out effective root cause analyses for production incidents. But we don’t just want you to keep things running; we encourage a spirit of curiosity! Whether it's automating tasks or implementing solutions to improve reliability, your growth mindset will push the boundaries of traditional technology solutions. With opportunities to engage with diverse projects across the globe, you're in for a rewarding experience. If you're passionate about building distributed systems and eager to collaborate with other teams to make impactful changes, then you'll find a perfect home here at Visa!
Join Visa's Risk Authentication and Identity Solutions team as a Chief Software Engineer to lead cutting-edge developments in risk and fraud management solutions.
Take the lead in developing Visa's Commercial Flex Credential solutions as a Senior Product Manager in a hybrid role at Visa Commercial Solutions.
Enhance AI capabilities in software engineering as an AI Engineer at Sonar, a forward-thinking company dedicated to responsible code development.
Become a key player at Qualis LLC as a Sr Thermal/Fluids Engineer, guiding thermal design efforts for NASA's cutting-edge space initiatives.
Join AECOM’s renowned Dams team and contribute to impactful engineering projects while enjoying flexible work options.
Join Rackspace as a Lead Engineer to lead multicloud solutions and enhance customer experiences for a new UK Sovereign business unit.
Join Mortenson as an Engineer III focusing on Transmission Line engineering to drive impactful projects within high voltage substations.
Join L3Harris Technologies as an Associate Manager in Systems Engineering to drive key software testing initiatives in the defense sector.
Join Sargent & Lundy as a Lead Instrumentation & Controls Engineer and help drive the future of nuclear energy with cutting-edge technologies.
Join The Boeing Company as an Optical Sensors Engineer, where you'll drive innovative sensor system designs and enhancements in a dynamic development environment.
Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...
9778 jobsSubscribe to Rise newsletter