Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Platform Site Reliability Engineering Senior Manager image - Rise Careers
Job details

Platform Site Reliability Engineering Senior Manager - job 1 of 2

About this role:Wells Fargo is seeking a Platform Site Reliability Engineering Senior Manager to help design durable and reliable services, automate wherever possible, drive observability, and provide coverage for incidents, change activity, business continuity, and other production related activities.In this role, you will:• Lead by example - focus on key aspects of SRE like Observability, Automation, Reliability, Resiliency, Scalability, Configuration Management & Actionable, Data Driven insights.• Act as a key transformation agent to help the team learn and develop SRE capabilities and advance the team through a defined SRE maturity model.• Attract, recruit, hire, and build top performing teams. Cultivate an engaged, diverse, inclusive and transparent culture• Ensure adherence to the Platform Architecture and meeting non-functional requirements for API management products and services.• Partner with, engage and influence architects and experienced engineers to incorporate Wells Fargo Technology technical strategies, while understanding next generation domain architecture and enable application migration paths to target architecture.• Function as the technical representative for the product during cross-team collaborative efforts and planning. Assess the availability of critical business flows, identify service level objectives and indicators, and conduct destructive and resiliency testing to reach 99.995% availability for the firm's critical products and services leading to improved customer experience and customer satisfaction.• Collaborate and influence Product Managers/Product Owners to drive user satisfaction, influence technology requirements and priorities in the product roadmap, promote innovative and intelligent solutions, generate corporate value and articulate technical strategy while being a solid advocate of agile and DevOps practices• Drive the buildout of automation to prevent problem recurrence, with the goal of automating response to all non-exceptional service conditions.• Introduce enterprise capabilities, tools, and innovation to improve availability in a multi-cloud ecosystem by evolving observability, monitoring, logging, CI/CD integration, continuous testing (performance, functional), continuous improvement, and standardization/automation of key SRE metrics and IT Service Operations processes.• Share support responsibilities for critical applications, to identify systemic issues, conduct blameless postmortems, root cause analysis, and introduce strategic solutions in code that solve the problem and eliminate repeat issues.• Apply technology background in software engineering and systems engineering to ensure the applications on-boarded to SRE are available, have full-stack observability, are integrated with CI/CD, and always-on by introducing continuous improvement through code and automation, continuous testing (performance, functional), and provide operational insight through analytics.• Troubleshoot, and analyze production job failures across the technology stack e.g., database, network file delivery, server, and application issues independently and provide solutions to recovery. Participate in root cause analysis and preventative actions to avoid recurring incidents.• Interact directly with third party vendors and technology service providers• Act as a key participant in developing standards and companywide best practices for engineering complex and large-scale technology solutions for technology engineering disciplines• Make decisions in developing standard and companywide best practices for engineering and technology solutions requiring understanding of industry best practices and new technologies, influencing and leading technology team to meet deliverables and drive new initiatives• Collaborate and consult with key technical experts, senior technology team, and external industry groups to resolve complex technical issues and achieve goals• Develop original and/or complex code, provide coding guidance/review, and create documentation• Manage and develop teams of individual contributors and managers in roles with moderate complexity and risk in Technology Operations• Manage the operational outcomes of key IT services delivered by network services and operations, database services, infrastructure services including server and storage services• Engage and influence stakeholders, internal partners and peers• Identify and recommend opportunities for technology operations process improvement and development• Leverage metrics to support infrastructure associated with applications that are highly automated, and latency sensitive, client facing and internal applications consumed by the Business• Drive key strategic initiatives associated with infrastructure availability• Manage backups, recovery and ensure recovery includes periodic tests to ensure business continuity• Work with IT risk management, compliance and all lines of defense, including Audit, to ensure platform risks are proactively managed• Institute controls in partnership with Operation Risks to ensure risk management is sustainable• Manage the costs, demand and resource capacity for the team resources, leveraging external resources as needed• Determine appropriate strategy and actions of technology operations team to meet deliverable• Interpret and develop policies and procedures• Collaborate with and influence all levels of professionals, including more experienced managers• Manage allocation of people and financial resources to ensure commitments are met and align with strategic objectives in technology operations• Develop and guide a culture of talent development to meet business objectives and strategyRequired Qualifications, US:• 6+ years of Systems Engineering and Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education• 3+ years of Management experience• 5+ years of Site Reliability Engineering experience• 5+ years of cloud technology experienceDesired Qualifications:• 3+ years managing Agile teams including use of tools such as Jira and Confluence• 5+ years’ experience with Agile Scrum (Daily Standup, Sprint Planning and Sprint Retrospective meetings)• 5+ years’ experience in two or more of the following tenets - Observability, Automation, Reliability, Resiliency, Scalability, Configuration Management & Actionable, Data Driven insights.• 5+ years’ experience troubleshooting and systems administration experience across multiple OS Platforms: Solaris, AIX, PKS, Kubernetes, OpenShift, Linux, Windows, VMware• 5+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education• 5+ years of software development experience with languages such as Perl, Python, Java, JavaScript, Ruby, JSON, Angular, NodeJS• 2+ years’ experience with Observability/Monitoring/Logging tools: AppDynamics, Grafana, Big Panda, MoogSoft, Splunk, Netcool, Sitescope, Elastic, Kibana, Kafka, Traffic Manager, Message Processor, Filebeat, Basemon, etc.• 2+ years’ experience with modern architectures – ex. private/public cloud -GCP/Azure, microservices, event-driven architecture, API Management and related technologies.• 2+ years’ experience with Automation Scripting: Bash, Shell, Ansible, Terraform, Azure DevOps• 2+ years’ experience with one or more CI/CD Pipeline (Github, Jenkins) and Automation tools: Gradle, Maven, Git, Ansible, Puppet• 2+ years Incident Management System experience• Experience with data center migrationsJob Expectations:• Ability to travel up to 10% of the time.• This position is not eligible for Visa Sponsorship.Pay RangeReflected is the base pay range offered for this position. Pay may vary depending on factors including but not limited to achievements, skills, experience, or work location. The range listed is just one component of the compensation package offered to candidates.$120,400.00 - $287,600.00BenefitsWells Fargo provides eligible employees with a comprehensive set of benefits, many of which are listed below. Visit Benefits - Wells Fargo Jobs for an overview of the following benefit plans and programs offered to employees.• Health benefits• 401(k) Plan• Paid time off• Disability benefits• Life insurance, critical illness insurance, and accident insurance• Parental leave• Critical caregiving leave• Discounts and savings• Commuter benefits• Tuition reimbursement• Scholarships for dependent children• Adoption reimbursementPosting End Date:2 Dec 2024• Job posting may come down early due to volume of applicants.We Value DiversityAt Wells Fargo, we believe in diversity, equity and inclusion in the workplace; accordingly, we welcome applications for employment from all qualified candidates, regardless of race, color, gender, national origin, religion, age, sexual orientation, gender identity, gender expression, genetic information, individuals with disabilities, pregnancy, marital status, status as a protected veteran or any other status protected by applicable law.Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.Candidates applying to job openings posted in US: All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.Applicants with DisabilitiesTo request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.Drug and Alcohol PolicyWells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.Wells Fargo Recruitment and Hiring Requirements:a. Third-Party recordings are prohibited unless authorized by Wells Fargo.b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.
Wells Fargo Glassdoor Company Review
3.6 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
Wells Fargo DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of Wells Fargo
Wells Fargo CEO photo
Charlie Scharf
Approve of CEO

Average salary estimate

Estimate provided by employer
$167147 / ANNUAL (est.)
min
max
$146K
$188K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Platform Site Reliability Engineering Senior Manager, Wells Fargo

Wells Fargo is on the lookout for a passionate and knowledgeable Platform Site Reliability Engineering Senior Manager to join our team in Charlotte, NC. In this vital role, you'll lead efforts to design sturdy and dependable services while pushing for automation and driving observability across our systems. Your experience will shine as you educate and guide teams in mastering Site Reliability Engineering (SRE) principles, helping us evolve our SRE maturity model. Recruiting and nurturing top talent is essential, as you'll foster an inclusive culture while steering the team to meet our high operational standards. You’ll collaborate with architects and engineers to ensure our API management products meet stringent performance requirements, and partner with product managers to elevate user satisfaction through innovative solutions. Your technical know-how will lead the charge in adopting enterprise capabilities to enrich our cloud ecosystem, ensuring that services are always resilient. You'll also oversee incident management and conduct thorough root cause analyses, driving continuous improvement to enhance our customer experience. If you're keen on utilizing your expertise to drive reliability and your leadership skills to cultivate a talented team, this role at Wells Fargo is perfect for you.

Frequently Asked Questions (FAQs) for Platform Site Reliability Engineering Senior Manager Role at Wells Fargo
What are the key responsibilities of a Platform Site Reliability Engineering Senior Manager at Wells Fargo?

As the Platform Site Reliability Engineering Senior Manager at Wells Fargo, you will lead initiatives focused on observability, automation, and service reliability. You’ll be responsible for enhancing our SRE capabilities through training, driving user satisfaction, and ensuring that our API management products meet necessary performance metrics. Your role will involve collaboration across teams, managing incidents, conducting root cause analyses, and continuously improving operational processes.

Join Rise to see the full answer
What qualifications are required for the Platform Site Reliability Engineering Senior Manager position at Wells Fargo?

Wells Fargo requires candidates for the Platform Site Reliability Engineering Senior Manager role to have over 6 years of Systems Engineering experience, with at least 5 years in Site Reliability Engineering and cloud technologies. Management experience is also essential, alongside a proven ability to lead Agile teams. Familiarity with observability, automation, and various operating systems will be crucial for success in this position.

Join Rise to see the full answer
How does Wells Fargo support diversity in hiring for the Platform Site Reliability Engineering Senior Manager role?

Wells Fargo is committed to diversity, equity, and inclusion in its hiring practices. In the Platform Site Reliability Engineering Senior Manager role, you will find a workplace that embraces individuals from various backgrounds, ensuring a broad range of perspectives and experiences that contribute to innovative solutions and a strong team dynamic.

Join Rise to see the full answer
What kind of projects will the Platform Site Reliability Engineering Senior Manager lead at Wells Fargo?

As the Platform Site Reliability Engineering Senior Manager, you will lead projects focused on automation and reliability enhancement in our multi-cloud environment. You will oversee the implementation of observability tools and drive team initiatives to achieve high service availability standards, collaborating with product teams and external vendors to resolve technical challenges effectively.

Join Rise to see the full answer
What benefits does Wells Fargo offer to the Platform Site Reliability Engineering Senior Manager position?

Wells Fargo offers a comprehensive suite of benefits including health insurance, a 401(k) plan with employer matching, competitive paid time off, disability benefits, life insurance, and programs for career development. Additionally, they provide employee discounts, tuition reimbursement, and parental leave, ensuring a robust work-life balance.

Join Rise to see the full answer
Common Interview Questions for Platform Site Reliability Engineering Senior Manager
Can you describe your experience with Site Reliability Engineering?

When answering this question, emphasize your hands-on experience in SRE, detailing specific projects where you've driven improvements in system reliability and scalability. Highlight your familiarity with automation tools, observability practices, and your role in cross-team collaborations.

Join Rise to see the full answer
How do you handle incident management and root cause analysis?

Detail your systematic approach to incident management. Explain how you document incidents, guide your team during resolution, analyze root causes, and implement measures to prevent future occurrences. Sharing a specific example can illustrate your effectiveness.

Join Rise to see the full answer
What automation tools have you implemented in your previous roles?

Discuss specific automation tools you've worked with and how they contributed to improved operational efficiency or service availability. Key tools could include Terraform, Ansible, or various CI/CD pipeline solutions.

Join Rise to see the full answer
Describe a time you successfully led a team through a significant change.

Focus on a specific change initiative, detailing your leadership style and the steps taken to ensure team engagement and smooth transition. Emphasize the positive outcomes and how you measured success.

Join Rise to see the full answer
How do you prioritize technical debt against new feature development?

Explain your framework for prioritization, balancing the need to address technical debt while still delivering new features. Use examples to illustrate how you've navigated this challenge in your past roles.

Join Rise to see the full answer
What metrics do you consider most important for evaluating system reliability?

Share your perspective on key reliability metrics such as service uptime, latency, and error rates. Discuss how you have utilized these metrics in previous positions to inform decision-making and improve system performance.

Join Rise to see the full answer
How would you approach fostering a culture of observability within your team?

Illustrate your strategies for promoting observability, such as training sessions and the introduction of monitoring tools. Detail how cultivating this culture can lead to proactive problem-solving and increased system reliability.

Join Rise to see the full answer
What challenges have you faced when implementing SRE practices?

Discuss specific challenges like resistance to change or limitations in current tooling. Explain how you overcame these obstacles and the strategies you employed to gain buy-in from your team and stakeholders.

Join Rise to see the full answer
How do you ensure quality in CI/CD processes?

Describe your role in establishing quality checks within CI/CD pipelines. This might include automated testing practices, code reviews, and performance monitoring to ensure a smooth and consistent deployment process.

Join Rise to see the full answer
What is your management style and how does it contribute to team success?

Share insights into your management approach, possibly focusing on coaching and continuous feedback. Discuss how this affects team collaboration, engagement, and ultimately, the success of projects.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
Posted 12 days ago
Photo of the Rise User
Posted 6 days ago
Photo of the Rise User
Posted 11 days ago
Photo of the Rise User
Inclusive & Diverse
Diversity of Opinions
Collaboration over Competition
Growth & Learning
Transparent & Candid
Medical Insurance
Mental Health Resources
Learning & Development
Flex-Friendly
Photo of the Rise User
CIMA+ Remote 2004 Columbia Ave, Rossland, BC V0G 1Y0, Canada
Posted 14 days ago
Posted 8 days ago
Photo of the Rise User
Smiths Group Hybrid 2202 Lakeside Blvd, Edgewood, MD 21040, USA
Posted 17 hours ago

Wells Fargo & Company (NYSE: WFC) is a leading financial services company that has approximately $1.9 trillion in assets, proudly serves one in three U.S. households and more than 10% of small businesses in the U.S. Wells Fargo is No. 47 on Fortu...

378 jobs
MATCH
Calculating your matching score...
BADGES
Badge Diversity ChampionBadge Future MakerBadge Global CitizenBadge InnovatorBadge Work&Life BalanceBadge Rapid Growth
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
December 13, 2024

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!