Job Summary:
As the Senior Director of Site Reliability Engineering (SRE), you will lead a team of SREs to ensure the highest level of performance and reliability of our services. You will be responsible for the end-to-end availability and performance of mission-critical services and building automation to prevent problem recurrence. The role requires a strategic leader who can create a vision for the SRE function and drive a culture of ‘automation first’ to improve the scalability and stability of our systems.
Essential Functions:
Lead and scale the SRE team, setting objectives and key results that align with the company’s strategic goals.
Develop and implement SRE policies, standards, and best practices for enterprise-wide systems.
Define standards for building reliable applications that are highly available and resilient.
Drive the adoption of a DevSecOps culture, fostering collaboration between development and operations teams.
Oversee the design and implementation of solutions for system monitoring, logging, alerting, and incident response.
Collaborate with product development teams to ensure reliability and scalability are considered at the design phase.
Manage on-call rotations, incident management processes, and post-mortem analyses to ensure continuous improvement.
Define Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets for all critical services.
Work closely with the security team to ensure compliance with industry standards and regulatory requirements.
Lead initiatives to improve CI/CD pipelines and automate infrastructure provisioning and deployment.
Provide technical leadership and mentorship to team members, encouraging professional growth and technical excellence.
This is a hybrid position. Hybrid employees alternate between working remotely and working in the office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.
Visa is not offering relocation assistance for this role.
As the Senior Director of Site Reliability Engineering at our dynamic Foster City location, you will be at the helm of a dedicated team of SREs, focused on delivering top-tier performance and unwavering reliability for our services. Your mission is to oversee the complete availability and performance of mission-critical systems while championing automation to preemptively tackle potential issues. This role is not just about managing; it's about vision. You'll craft the future of SRE within our company, promoting a vital 'automation first' culture that enhances the robust nature of our infrastructure. Your responsibilities include scaling and leading the SRE team, establishing impactful objectives that align with our strategic ambitions, and implementing industry-leading SRE policies and best practices. Collaboration will be key as you partner with product development teams to prioritize reliability from day one, while also modernizing our incident management processes. You'll define and oversee crucial metrics like Service Level Objectives (SLOs) and Error Budgets to guarantee our services meet the highest standards. Additionally, your technical acumen will foster growth within your team, ensuring that we not only achieve reliability but also innovate and excel. This hybrid position offers the flexibility to balance remote work with essential in-office collaboration, as you help steer our path toward excellence in SRE. If you are ready to make a significant impact and thrive in a culture of continuous improvement, we would love to hear from you!
Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...