Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Staff Site Reliability Engineer - IT Disaster Recovery (ITDR) image - Rise Careers
Job details

Staff Site Reliability Engineer - IT Disaster Recovery (ITDR) - job 16 of 20

Visa’s Technology Organization is a community of problem solvers and innovators reshaping the future of commerce. We operate the world’s most sophisticated processing networks capable of handling more than 65k secure transactions a second across 80M merchants, 15k Financial Institutions, and billions of everyday people. While working with us you’ll get to work on complex distributed systems and solve massive scale problems centered on new payment flows, business and data solutions, cyber security, and B2C platforms.

The Opportunity:

We are currently seeking a Staff Site Reliability Engineer within the IT Disaster Recovery organization. The Staff Site Reliability Engineer will work to advance the development and execution of the Disaster Recovery Program within VISA. In this position the Staff Site Reliability Engineer will be expected to establish new standards for the design and technical implementation of disaster recovery capabilities for applications and infrastructure across core Visa platforms. The Staff Site Reliability Engineer will play a lead role on the VISA’s Disaster Recovery (DR) team and will work with systems and business teams to identify risks in the environment to create viable recovery solutions and mitigation plans.

The Work itself:

  • Collaborate with all Business and Technology units within VISA to identify critical, time-sensitive functions. Define associated recovery time objectives (RTOs) and recovery point objectives (RPOs).
  • Develop and implement a best-practices Disaster Recovery (DR) program to safeguard VISA’s information assets. Ensure appropriate information security measures and disaster recovery processes are in place.
  • Regularly validate and maintain the disaster recovery plan and program through rigorous testing, plan updates, and maintenance.
  • Lead annual DR testing activities to ensure VISA's readiness to maintain or restore systems online during a DR event.
  • Apply knowledge of network configurations, VPNs, firewalls, and other network components to investigate and provide guidance on potential network issues.
  • Utilize cloud-hosted services expertise in the implementation and management of disaster recovery processes.
  • Employ knowledge of storage devices and systems to define protective measures for data, including backups, replication, and encryption.
  • Utilize knowledge of databases like MySQL, Oracle, PostgreSQL, or MongoDB to provide expertise on data backup, restoration, replication, and database failover procedures.
  • Effectively collaborate with middleware teams, utilizing platforms such as Java Web Services, Apache Kafka/Tomcat, Hazelcast to integrate diverse and complex systems.
  • Develop and maintain necessary documentation, including impact analysis, risk assessment, and DR/resiliency standards for service architecture and technology domain patterns.
  • Lead governance, documentation, and coordination with various technology teams and stakeholders in executing Visa’s Offline Backup & Recovery program.

The Skills You Bring:

  • Lead Courageously: Act like an owner, Challenge the status quo.
  • Collaborate as OneVisa: Encourage constructive debate.
  • Execute with Excellence: Decide quickly and move fast, Learn from mistakes.

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

 

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Staff Site Reliability Engineer - IT Disaster Recovery (ITDR), Visa

Join Visa's dynamic Technology Organization as a Staff Site Reliability Engineer focusing on IT Disaster Recovery in Ashburn! In this exciting role, you will be at the forefront of enhancing our Disaster Recovery Program, working with some of the brightest minds in the industry to maintain and protect our expansive and sophisticated processing networks. As a Staff Site Reliability Engineer, you're not just ensuring the continuity of services; you're truly shaping the framework that supports our operations amid challenges. You'll dive deep into complex distributed systems to diagnose risks and establish robust recovery solutions that align with our critical business functions. You'll collaborate with cross-functional teams to set recovery time objectives (RTOs) and recovery point objectives (RPOs), while creating and validating a best-practice Disaster Recovery program that protects our vital information assets. With experience in cloud-hosted services, you will implement and manage disaster recovery processes, and your expertise in databases will come into play as you oversee backup and failover procedures. You'll lead annual DR testing activities and ensure that documentation is meticulously maintained. We're looking for someone who acts like an owner, thrives in teamwork, and is ready to challenge the status quo. So, if you're up for a rewarding challenge where you’ll make a meaningful impact, we invite you to apply for the Staff Site Reliability Engineer position at Visa.

Frequently Asked Questions (FAQs) for Staff Site Reliability Engineer - IT Disaster Recovery (ITDR) Role at Visa
What are the main responsibilities of a Staff Site Reliability Engineer at Visa?

As a Staff Site Reliability Engineer at Visa, your primary role is to advance our IT Disaster Recovery program by establishing new standards for disaster recovery capabilities. You'll collaborate across various teams to define recovery time objectives and create effective recovery solutions while leading rigorous testing and maintaining documentation related to disaster recovery processes.

Join Rise to see the full answer
What skills are required to be a successful Staff Site Reliability Engineer at Visa?

To excel as a Staff Site Reliability Engineer at Visa, candidates should possess a strong understanding of disaster recovery principles, network configurations, and cloud services. Excellent collaboration, documentation skills, and the ability to lead initiatives are also essential. Knowledge of databases and experience with middleware platforms are critical as well.

Join Rise to see the full answer
How does Visa ensure the effectiveness of its Disaster Recovery Plan?

Visa ensures the effectiveness of its Disaster Recovery Plan through regular validation and rigorous testing of all recovery processes. The Staff Site Reliability Engineer plays a crucial role in this by leading annual disaster recovery testing activities to guarantee that all systems can be efficiently restored during a disaster event.

Join Rise to see the full answer
What does the work environment look like for a Staff Site Reliability Engineer at Visa?

The work environment for a Staff Site Reliability Engineer at Visa is hybrid, allowing employees to alternate between remote and office work. It generally requires office presence for 2-3 days a week, fostering collaboration while also offering flexibility based on business needs.

Join Rise to see the full answer
What impact does a Staff Site Reliability Engineer have on Visa's operations?

A Staff Site Reliability Engineer plays a vital role in mitigating risks and ensuring business continuity at Visa. By developing effective disaster recovery plans and leading tests, they directly contribute to safeguarding Visa's infrastructure and ensuring that services remain available to 80 million merchants and billions of users worldwide.

Join Rise to see the full answer
Common Interview Questions for Staff Site Reliability Engineer - IT Disaster Recovery (ITDR)
How do you approach developing a disaster recovery plan?

When developing a disaster recovery plan, begin by understanding the critical business functions and associated risks. Collaborate with relevant teams to establish RTOs and RPOs, document all processes thoroughly, and ensure regular testing to validate the effectiveness of your plan.

Join Rise to see the full answer
What experience do you have with disaster recovery testing?

I have led comprehensive disaster recovery testing, which includes preparing the environment, coordinating stakeholders, executing tests, and analyzing results. It's essential to document findings and update recovery procedures appropriately based on test outcomes.

Join Rise to see the full answer
Can you explain the difference between RTO and RPO?

RTO, or Recovery Time Objective, is the maximum acceptable time that a system can be down after a disaster, while RPO, or Recovery Point Objective, defines the maximum acceptable amount of data loss measured in time. Both metrics are crucial for creating effective disaster recovery strategies.

Join Rise to see the full answer
How do you ensure collaboration between different teams when implementing a disaster recovery program?

Effective communication is key to ensuring collaboration across various teams. I promote regular meetings, encourage input from all stakeholders, and ensure that responsibilities are clearly defined. This also includes providing training and resources that foster cooperative efforts.

Join Rise to see the full answer
What tools and technologies do you utilize for disaster recovery?

I utilize a variety of tools including cloud services for backup and replication, as well as monitoring tools that notify us of system health. I also use infrastructure as code (IaC) to automate disaster recovery processes where feasible.

Join Rise to see the full answer
How do you stay updated on best practices in disaster recovery?

I stay updated on disaster recovery best practices through continuous learning, attending industry conferences, participating in webinars, and subscribing to relevant publications. Networking with other professionals in the field also provides valuable insights into emerging trends and practices.

Join Rise to see the full answer
Describe a challenging disaster recovery scenario you've managed.

In a previous role, we faced a severe service outage that affected multiple systems. I led the team in orchestrating a swift response by implementing our disaster recovery protocols, involving cross-functional teams, and communicating effectively at all levels. Post-incident, I ensured we updated our recovery documentation based on lessons learned.

Join Rise to see the full answer
What is your experience with cloud-based disaster recovery solutions?

I have extensive experience with cloud-based disaster recovery solutions, enabling rapid backups and recoveries. I am familiar with various cloud service providers and how to leverage their tools to create resilient and cost-effective disaster recovery strategies.

Join Rise to see the full answer
How do you define and track KPIs for disaster recovery processes?

I define KPIs based on critical recovery metrics such as time taken for recovery, data loss, and success rate of testing. These KPIs are tracked and reviewed regularly to ensure our disaster recovery processes meet the established objectives and to identify areas for improvement.

Join Rise to see the full answer
Why is documentation important in disaster recovery?

Documentation is crucial in disaster recovery as it outlines procedures, roles, and responsibilities, ensuring everyone knows what to do in the event of a disaster. It serves as a reference guide during stressful situations and is vital for training new team members to maintain readiness.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 12 days ago

We're seeking an Executive Administrator to enhance executive productivity through meticulous calendar and travel management in a hybrid work setting.

Photo of the Rise User

Join our Test Engineering team dedicated to ensuring software quality and fraud prevention technologies in a hybrid work environment.

Photo of the Rise User
SAIC Hybrid Vandenberg Space Force Base, CA
Posted 5 days ago
DMV IT Service Hybrid No location specified
Posted 9 days ago

We are on the lookout for an advanced Programmer Analyst 3 with a strong background in .NET and JAVA to enhance our client's IT infrastructure.

Photo of the Rise User
ManTech Hybrid US, Fairfax County, VA; Virginia, Herndon, VA
Posted 6 days ago

Be part of a team that protects the nation as a Cyber Network Defense Analyst with ManTech in Herndon, VA.

Photo of the Rise User

Join Peraton as a Cyber Intelligence Analyst and enhance national security through real-time cyber intelligence support.

Photo of the Rise User
Posted 14 days ago

Join Iterable as a Junior Systems Administrator, where you'll help maintain Salesforce and collaborate with a dynamic Operations team.

Photo of the Rise User
McKesson Hybrid US, Virginia, Richmond, VA
Posted 9 hours ago

McKesson is seeking a seasoned Lead Cloud Solution Architect to enhance their cloud services and drive business outcomes in healthcare.

Photo of the Rise User

Kickstart your career in Information Security with a paid internship at Randolph-Brooks Federal Credit Union, contributing to vital security operations.

Photo of the Rise User
Posted 3 days ago

Alimentiv is on the lookout for an IT Security & Compliance Analyst who excels in ensuring that IT systems adhere to industry regulations and standards.

Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

11755 jobs
MATCH
VIEW MATCH
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 3, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
X
Someone from OH, Cleveland just viewed Lead / Senior Analyst - SAP HCM at Xcellink Pte Ltd
Photo of the Rise User
57 people applied to Security Analyst Jr at DEUNA
Photo of the Rise User
Someone from OH, Akron just viewed Accounting Co-Op at VEGA Americas
R
Someone from OH, Cincinnati just viewed Director, Payroll Tax at Ryan
Photo of the Rise User
13 people applied to Intern/Co-op-4 at GE
P
Someone from OH, Columbus just viewed Data Science for Smart Agriculture- Part-Time at PSU
Photo of the Rise User
Someone from OH, Cincinnati just viewed Brand Management & Partnerships Assistant at LAIKA
Photo of the Rise User
Someone from OH, Athens just viewed Senior Multimedia Artist, Design & Creative at RepRisk AG
Photo of the Rise User
62 people applied to Cyber Crime Analyst at TEKsystems
H
Someone from OH, Rocky River just viewed Training Manager at Hotel Bardo Savannah
F
Someone from OH, Columbus just viewed VP of Communications at Freedom Together Foundation
Photo of the Rise User
Someone from OH, Columbus just viewed Chief Organizational Communication Officer at Providence
Photo of the Rise User
Someone from OH, Cuyahoga Falls just viewed SEASONER at Shearer's Foods
Photo of the Rise User
Someone from OH, Columbus just viewed Bilingual Care Manager, Telephonic RN at Humana
Photo of the Rise User
Someone from OH, Columbus just viewed Talent Business Partner at Red Bull
Photo of the Rise User
Someone from OH, Brunswick just viewed Sanitation Team Member at Shearer's Foods
Photo of the Rise User
Someone from OH, Columbus just viewed Talent Acquisition Specialist at Beghou Consulting
C
Someone from OH, Middletown just viewed Operations Analyst at Core Specialty Insurance
A
Someone from OH, Strongsville just viewed Graphic Design Intern at Anvil NorthWest