Job details

Platform Site Reliability Engineering Senior Manager

About this role:
Wells Fargo is seeking a Platform Site Reliability Engineering Senior Manager to help design durable and reliable services, automate wherever possible, drive observability, and provide coverage for incidents, change activity, business continuity, and other production related activities.

In this role, you will:
• Lead by example - focus on key aspects of SRE like Observability, Automation, Reliability, Resiliency, Scalability, Configuration Management & Actionable, Data Driven insights.
• Act as a key transformation agent to help the team learn and develop SRE capabilities and advance the team through a defined SRE maturity model.
• Attract, recruit, hire, and build top performing teams. Cultivate an engaged, diverse, inclusive and transparent culture
• Ensure adherence to the Platform Architecture and meeting non-functional requirements for API management products and services.
• Partner with, engage and influence architects and experienced engineers to incorporate Wells Fargo Technology technical strategies, while understanding next generation domain architecture and enable application migration paths to target architecture.
• Function as the technical representative for the product during cross-team collaborative efforts and planning. Assess the availability of critical business flows, identify service level objectives and indicators, and conduct destructive and resiliency testing to reach 99.995% availability for the firm's critical products and services leading to improved customer experience and customer satisfaction.
• Collaborate and influence Product Managers/Product Owners to drive user satisfaction, influence technology requirements and priorities in the product roadmap, promote innovative and intelligent solutions, generate corporate value and articulate technical strategy while being a solid advocate of agile and DevOps practices
• Drive the buildout of automation to prevent problem recurrence, with the goal of automating response to all non-exceptional service conditions.
• Introduce enterprise capabilities, tools, and innovation to improve availability in a multi-cloud ecosystem by evolving observability, monitoring, logging, CI/CD integration, continuous testing (performance, functional), continuous improvement, and standardization/automation of key SRE metrics and IT Service Operations processes.
• Share support responsibilities for critical applications, to identify systemic issues, conduct blameless postmortems, root cause analysis, and introduce strategic solutions in code that solve the problem and eliminate repeat issues.
• Apply technology background in software engineering and systems engineering to ensure the applications on-boarded to SRE are available, have full-stack observability, are integrated with CI/CD, and always-on by introducing continuous improvement through code and automation, continuous testing (performance, functional), and provide operational insight through analytics.
• Troubleshoot, and analyze production job failures across the technology stack e.g., database, network file delivery, server, and application issues independently and provide solutions to recovery. Participate in root cause analysis and preventative actions to avoid recurring incidents.
• Interact directly with third party vendors and technology service providers
• Act as a key participant in developing standards and companywide best practices for engineering complex and large-scale technology solutions for technology engineering disciplines
• Make decisions in developing standard and companywide best practices for engineering and technology solutions requiring understanding of industry best practices and new technologies, influencing and leading technology team to meet deliverables and drive new initiatives
• Collaborate and consult with key technical experts, senior technology team, and external industry groups to resolve complex technical issues and achieve goals
• Develop original and/or complex code, provide coding guidance/review, and create documentation
• Manage and develop teams of individual contributors and managers in roles with moderate complexity and risk in Technology Operations
• Manage the operational outcomes of key IT services delivered by network services and operations, database services, infrastructure services including server and storage services
• Engage and influence stakeholders, internal partners and peers
• Identify and recommend opportunities for technology operations process improvement and development
• Leverage metrics to support infrastructure associated with applications that are highly automated, and latency sensitive, client facing and internal applications consumed by the Business
• Drive key strategic initiatives associated with infrastructure availability
• Manage backups, recovery and ensure recovery includes periodic tests to ensure business continuity
• Work with IT risk management, compliance and all lines of defense, including Audit, to ensure platform risks are proactively managed
• Institute controls in partnership with Operation Risks to ensure risk management is sustainable
• Manage the costs, demand and resource capacity for the team resources, leveraging external resources as needed
• Determine appropriate strategy and actions of technology operations team to meet deliverable
• Interpret and develop policies and procedures
• Collaborate with and influence all levels of professionals, including more experienced managers
• Manage allocation of people and financial resources to ensure commitments are met and align with strategic objectives in technology operations
• Develop and guide a culture of talent development to meet business objectives and strategy

Required Qualifications, US:
• 6+ years of Systems Engineering and Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
• 3+ years of Management experience
• 5+ years of Site Reliability Engineering experience
• 5+ years of cloud technology experience

Desired Qualifications:
• 3+ years managing Agile teams including use of tools such as Jira and Confluence
• 5+ years’ experience with Agile Scrum (Daily Standup, Sprint Planning and Sprint Retrospective meetings)
• 5+ years’ experience in two or more of the following tenets - Observability, Automation, Reliability, Resiliency, Scalability, Configuration Management & Actionable, Data Driven insights.
• 5+ years’ experience troubleshooting and systems administration experience across multiple OS Platforms: Solaris, AIX, PKS, Kubernetes, OpenShift, Linux, Windows, VMware
• 5+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
• 5+ years of software development experience with languages such as Perl, Python, Java, JavaScript, Ruby, JSON, Angular, NodeJS
• 2+ years’ experience with Observability/Monitoring/Logging tools: AppDynamics, Grafana, Big Panda, MoogSoft, Splunk, Netcool, Sitescope, Elastic, Kibana, Kafka, Traffic Manager, Message Processor, Filebeat, Basemon, etc.
• 2+ years’ experience with modern architectures – ex. private/public cloud - GCP/Azure, microservices, event-driven architecture, API Management and related technologies.
• 2+ years’ experience with Automation Scripting: Bash, Shell, Ansible, Terraform, Azure DevOps
• 2+ years’ experience with one or more CI/CD Pipeline (Github, Jenkins) and Automation tools: Gradle, Maven, Git, Ansible, Puppet
• 2+ years Incident Management System experience
• Experience with data center migrations

Job Expectations:
• Ability to travel up to 10% of the time.
• This position is not eligible for Visa Sponsorship.

Pay Range
Reflected is the base pay range offered for this position. Pay may vary depending on factors including but not limited to achievements, skills, experience, or work location. The range listed is just one component of the compensation package offered to candidates.
$120,400.00 - $287,600.00

Benefits
Wells Fargo provides eligible employees with a comprehensive set of benefits, many of which are listed below. Visit Benefits - Wells Fargo Jobs for an overview of the following benefit plans and programs offered to employees.
• Health benefits
• 401(k) Plan
• Paid time off
• Disability benefits
• Life insurance, critical illness insurance, and accident insurance
• Parental leave
• Critical caregiving leave
• Discounts and savings
• Commuter benefits
• Tuition reimbursement
• Scholarships for dependent children
• Adoption reimbursement

Posting End Date:
2 Dec 2024
• Job posting may come down early due to volume of applicants.

We Value Diversity
At Wells Fargo, we believe in diversity, equity and inclusion in the workplace; accordingly, we welcome applications for employment from all qualified candidates, regardless of race, color, gender, national origin, religion, age, sexual orientation, gender identity, gender expression, genetic information, individuals with disabilities, pregnancy, marital status, status as a protected veteran or any other status protected by applicable law.

Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.

Candidates applying to job openings posted in US: All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

Applicants with Disabilities
To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.

Drug and Alcohol Policy
Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.

Wells Fargo Recruitment and Hiring Requirements:
a. Third-Party recordings are prohibited unless authorized by Wells Fargo.
b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.
Wells Fargo Glassdoor Company Review
3.6 out of 5 stars
Wells Fargo DE&I Review
No rating
CEO of Wells Fargo
Charlie Scharf

Average salary estimate

Estimate provided by employer
$167,147 / year (est.)
Min: $146K
Max: $188K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Platform Site Reliability Engineering Senior Manager, Wells Fargo

Wells Fargo is seeking a Platform Site Reliability Engineering Senior Manager to join our dynamic team in Charlotte, NC. In this pivotal role, you will lead the charge in designing reliable, durable services while prioritizing efficiency through automation. You will set the standard by focusing on key SRE principles like Observability, Reliability, and Scalability. Imagine collaborating with talented architects and engineers to design strategies that help teams evolve and reach their SRE maturity goals. Your leadership will attract top talent, fostering a diverse and inclusive workplace where everyone’s voice matters. You’ll ensure our products meet essential non-functional requirements, delivering stellar customer experiences. This role is not just about management; you’ll also dive into the technical aspects, troubleshooting production failures and conducting root cause analyses. The goal? Achieving an impressive 99.995% availability for our critical products. By introducing innovative solutions and enterprise capabilities, you’ll be instrumental in evolving our multi-cloud ecosystem. If you’re passionate about driving operational excellence, automating responses, and developing strategic initiatives that lead to continuous improvements, then this is the opportunity for you. Join us to leverage cutting-edge tools and technologies while cultivating a vibrant team culture that rewards creativity and readily embraces new challenges in the ever-evolving tech landscape.

Frequently Asked Questions (FAQs) for Platform Site Reliability Engineering Senior Manager Role at Wells Fargo
What are the main responsibilities of the Platform Site Reliability Engineering Senior Manager at Wells Fargo?

The Platform Site Reliability Engineering Senior Manager at Wells Fargo plays a crucial role in designing and managing reliable services. Key responsibilities include leading SRE initiatives, driving automation efforts, ensuring product reliability, and managing a talented team. You will also be responsible for developing technical strategies, assessing service levels, and collaborating with cross-functional teams to enhance user satisfaction and operational outcomes.

What qualifications are required for the Platform Site Reliability Engineering Senior Manager position at Wells Fargo?

Candidates applying for the Platform Site Reliability Engineering Senior Manager role at Wells Fargo should have at least 6 years of experience in Systems Engineering and Technology Architecture, at least 5 years each in Site Reliability Engineering and cloud technology, and at least 3 years of management experience. Proficiency in agile methodologies and hands-on experience in observability, automation, and various operating systems, such as Linux or Windows, are also highly desirable.

How does the Platform Site Reliability Engineering Senior Manager at Wells Fargo ensure service reliability?

The Platform Site Reliability Engineering Senior Manager at Wells Fargo ensures service reliability by setting service level objectives, conducting resiliency testing, and implementing proactive monitoring solutions. You will drive observability and automate responses to non-exceptional service conditions, which significantly reduces downtime and enhances user satisfaction. Your role also involves conducting root cause analyses and blameless postmortems to address systemic issues effectively.
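
To make the 99.995% availability target from the job description concrete, the short Python sketch below converts an availability percentage into the downtime it allows over a given window. It is only an illustration of the arithmetic: the function name `downtime_budget_minutes` and the 30-day and 365-day windows are assumptions chosen for this example, not anything Wells Fargo specifies.

```python
# Illustrative sketch: turn an availability target into an allowed-downtime budget.
# The function name and the chosen windows are assumptions made for this example.
MINUTES_PER_DAY = 24 * 60


def downtime_budget_minutes(availability_pct: float, window_days: int) -> float:
    """Return the minutes of downtime allowed in the window at the given target."""
    if not 0 <= availability_pct <= 100:
        raise ValueError("availability_pct must be between 0 and 100")
    if window_days <= 0:
        raise ValueError("window_days must be positive")
    # The unavailable fraction of the window is the downtime budget.
    unavailable_fraction = (100 - availability_pct) / 100
    return unavailable_fraction * window_days * MINUTES_PER_DAY


if __name__ == "__main__":
    for days in (30, 365):
        budget = downtime_budget_minutes(99.995, days)
        print(f"99.995% over {days} days allows about {budget:.2f} minutes of downtime")
```

Under these assumptions, a 99.995% target leaves roughly 26 minutes of downtime across a 365-day year, or a little over 2 minutes in a 30-day month.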

What is the company culture like for the Platform Site Reliability Engineering Senior Manager at Wells Fargo?

At Wells Fargo, the culture is centered around diversity, equity, and inclusion, making it an enriching environment for a Platform Site Reliability Engineering Senior Manager. The company encourages collaboration, innovative thinking, and professional growth. You will be part of a supportive team that values transparency, engagement, and a commitment to achieving strategic goals, all while ensuring that all voices within the team are valued and heard.

What technologies will the Platform Site Reliability Engineering Senior Manager use at Wells Fargo?

In the role of Platform Site Reliability Engineering Senior Manager at Wells Fargo, you will work with a variety of technologies, including cloud platforms like GCP and Azure, as well as automation tools such as Ansible and Terraform. Familiarity with CI/CD pipelines and observability tools like Splunk and Grafana will be crucial. You will leverage your software development skills to implement innovative solutions that enhance operational efficiency.

Common Interview Questions for Platform Site Reliability Engineering Senior Manager
Can you explain your experience with Site Reliability Engineering?

When answering this question, it's important to highlight the specific projects you've worked on that demonstrate your SRE capabilities. Discuss any strategies you've developed for automating processes or ensuring system reliability. Mention your experience with observability tools and your understanding of key SRE principles like resilience and scalability.

How do you approach incident management and root cause analysis?

In your response, emphasize the importance of a blameless culture while conducting root cause analyses. Discuss the tools you use for incident management and the steps you take to ensure thorough investigations are conducted. Share any specific examples where your analysis led to significant improvements in system reliability.

What strategies do you employ for building and managing high-performance teams in SRE?

Focus on your leadership style and how you cultivate team dynamics. Discuss your approach to mentorship, creating an inclusive environment, and how you align team goals with company objectives. Highlight any successful recruitment methods you’ve used to attract top technical talent.

Describe a challenging technical problem you've solved in your previous roles.

Choose a relevant example that illustrates your problem-solving skills. Describe the challenge, the strategies you implemented to address it, and the outcomes. Be sure to mention any collaboration with different teams and how you utilized your technical expertise in SRE practices.

How do you prioritize tasks and projects in a fast-paced environment?

Discuss your approach to prioritization, incorporating Agile methodologies if applicable. Explain how you balance urgent issues with long-term projects and how you ensure alignment with business objectives. Mention the tools you use for managing tasks and orchestrating team efforts.

What experience do you have with cloud technologies and how have you implemented them in a previous role?

Provide examples of cloud platforms you have worked with and specific projects where you implemented cloud solutions. Highlight your understanding of cloud architecture principles and any significant improvements you facilitated by leveraging cloud technologies in your previous roles.

How do you drive automation within your team?

Articulate your philosophies around automation, offering examples of processes you have automated in past positions. Explain how these efforts contributed to efficiency and reliability while reducing manual workloads. Mention any tools or frameworks you've utilized in the automation process.

Tell me about your experience with CI/CD pipelines.

Detail your practical experience with CI/CD tools and how you've implemented them to enhance deployment processes. Discuss the benefits you observed from CI/CD practices and how they improved development cycles and reliability.

How do you keep up-to-date with the latest trends and technologies in Site Reliability Engineering?

Discuss your proactive learning methods including attending workshops, participating in industry conferences, or being part of relevant online communities. Mention any specific resources or leaders in the field from whom you draw inspiration.

What is your approach to ensuring an inclusive and diverse team culture?

Highlight your commitment to diversity and inclusion by discussing specific initiatives you've led or participated in. Share how you foster a team environment that values different perspectives and promotes equal opportunities for every team member.

Wells Fargo & Company (NYSE: WFC) is a leading financial services company that has approximately $1.9 trillion in assets, proudly serves one in three U.S. households and more than 10% of small businesses in the U.S. Wells Fargo is No. 47 on Fortu...

EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
December 7, 2024
