Job details

Platform Site Reliability Engineering Senior Manager - job 1 of 2

About this role:Wells Fargo is seeking a Platform Site Reliability Engineering Senior Manager to help design durable and reliable services, automate wherever possible, drive observability, and provide coverage for incidents, change activity, business continuity, and other production related activities.In this role, you will:• Lead by example - focus on key aspects of SRE like Observability, Automation, Reliability, Resiliency, Scalability, Configuration Management & Actionable, Data Driven insights.• Act as a key transformation agent to help the team learn and develop SRE capabilities and advance the team through a defined SRE maturity model.• Attract, recruit, hire, and build top performing teams. Cultivate an engaged, diverse, inclusive and transparent culture• Ensure adherence to the Platform Architecture and meeting non-functional requirements for API management products and services.• Partner with, engage and influence architects and experienced engineers to incorporate Wells Fargo Technology technical strategies, while understanding next generation domain architecture and enable application migration paths to target architecture.• Function as the technical representative for the product during cross-team collaborative efforts and planning. Assess the availability of critical business flows, identify service level objectives and indicators, and conduct destructive and resiliency testing to reach 99.995% availability for the firm's critical products and services leading to improved customer experience and customer satisfaction.• Collaborate and influence Product Managers/Product Owners to drive user satisfaction, influence technology requirements and priorities in the product roadmap, promote innovative and intelligent solutions, generate corporate value and articulate technical strategy while being a solid advocate of agile and DevOps practices• Drive the buildout of automation to prevent problem recurrence, with the goal of automating response to all non-exceptional service conditions.• Introduce enterprise capabilities, tools, and innovation to improve availability in a multi-cloud ecosystem by evolving observability, monitoring, logging, CI/CD integration, continuous testing (performance, functional), continuous improvement, and standardization/automation of key SRE metrics and IT Service Operations processes.• Share support responsibilities for critical applications, to identify systemic issues, conduct blameless postmortems, root cause analysis, and introduce strategic solutions in code that solve the problem and eliminate repeat issues.• Apply technology background in software engineering and systems engineering to ensure the applications on-boarded to SRE are available, have full-stack observability, are integrated with CI/CD, and always-on by introducing continuous improvement through code and automation, continuous testing (performance, functional), and provide operational insight through analytics.• Troubleshoot, and analyze production job failures across the technology stack e.g., database, network file delivery, server, and application issues independently and provide solutions to recovery. Participate in root cause analysis and preventative actions to avoid recurring incidents.• Interact directly with third party vendors and technology service providers• Act as a key participant in developing standards and companywide best practices for engineering complex and large-scale technology solutions for technology engineering disciplines• Make decisions in developing standard and companywide best practices for engineering and technology solutions requiring understanding of industry best practices and new technologies, influencing and leading technology team to meet deliverables and drive new initiatives• Collaborate and consult with key technical experts, senior technology team, and external industry groups to resolve complex technical issues and achieve goals• Develop original and/or complex code, provide coding guidance/review, and create documentation• Manage and develop teams of individual contributors and managers in roles with moderate complexity and risk in Technology Operations• Manage the operational outcomes of key IT services delivered by network services and operations, database services, infrastructure services including server and storage services• Engage and influence stakeholders, internal partners and peers• Identify and recommend opportunities for technology operations process improvement and development• Leverage metrics to support infrastructure associated with applications that are highly automated, and latency sensitive, client facing and internal applications consumed by the Business• Drive key strategic initiatives associated with infrastructure availability• Manage backups, recovery and ensure recovery includes periodic tests to ensure business continuity• Work with IT risk management, compliance and all lines of defense, including Audit, to ensure platform risks are proactively managed• Institute controls in partnership with Operation Risks to ensure risk management is sustainable• Manage the costs, demand and resource capacity for the team resources, leveraging external resources as needed• Determine appropriate strategy and actions of technology operations team to meet deliverable• Interpret and develop policies and procedures• Collaborate with and influence all levels of professionals, including more experienced managers• Manage allocation of people and financial resources to ensure commitments are met and align with strategic objectives in technology operations• Develop and guide a culture of talent development to meet business objectives and strategyRequired Qualifications, US:• 6+ years of Systems Engineering and Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education• 3+ years of Management experience• 5+ years of Site Reliability Engineering experience• 5+ years of cloud technology experienceDesired Qualifications:• 3+ years managing Agile teams including use of tools such as Jira and Confluence• 5+ years’ experience with Agile Scrum (Daily Standup, Sprint Planning and Sprint Retrospective meetings)• 5+ years’ experience in two or more of the following tenets - Observability, Automation, Reliability, Resiliency, Scalability, Configuration Management & Actionable, Data Driven insights.• 5+ years’ experience troubleshooting and systems administration experience across multiple OS Platforms: Solaris, AIX, PKS, Kubernetes, OpenShift, Linux, Windows, VMware• 5+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education• 5+ years of software development experience with languages such as Perl, Python, Java, JavaScript, Ruby, JSON, Angular, NodeJS• 2+ years’ experience with Observability/Monitoring/Logging tools: AppDynamics, Grafana, Big Panda, MoogSoft, Splunk, Netcool, Sitescope, Elastic, Kibana, Kafka, Traffic Manager, Message Processor, Filebeat, Basemon, etc.• 2+ years’ experience with modern architectures – ex. private/public cloud -GCP/Azure, microservices, event-driven architecture, API Management and related technologies.• 2+ years’ experience with Automation Scripting: Bash, Shell, Ansible, Terraform, Azure DevOps• 2+ years’ experience with one or more CI/CD Pipeline (Github, Jenkins) and Automation tools: Gradle, Maven, Git, Ansible, Puppet• 2+ years Incident Management System experience• Experience with data center migrationsJob Expectations:• Ability to travel up to 10% of the time.• This position is not eligible for Visa Sponsorship.Pay RangeReflected is the base pay range offered for this position. Pay may vary depending on factors including but not limited to achievements, skills, experience, or work location. The range listed is just one component of the compensation package offered to candidates.$120,400.00 - $287,600.00BenefitsWells Fargo provides eligible employees with a comprehensive set of benefits, many of which are listed below. Visit Benefits - Wells Fargo Jobs for an overview of the following benefit plans and programs offered to employees.• Health benefits• 401(k) Plan• Paid time off• Disability benefits• Life insurance, critical illness insurance, and accident insurance• Parental leave• Critical caregiving leave• Discounts and savings• Commuter benefits• Tuition reimbursement• Scholarships for dependent children• Adoption reimbursementPosting End Date:2 Dec 2024• Job posting may come down early due to volume of applicants.We Value DiversityAt Wells Fargo, we believe in diversity, equity and inclusion in the workplace; accordingly, we welcome applications for employment from all qualified candidates, regardless of race, color, gender, national origin, religion, age, sexual orientation, gender identity, gender expression, genetic information, individuals with disabilities, pregnancy, marital status, status as a protected veteran or any other status protected by applicable law.Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.Candidates applying to job openings posted in US: All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.Applicants with DisabilitiesTo request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.Drug and Alcohol PolicyWells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.Wells Fargo Recruitment and Hiring Requirements:a. Third-Party recordings are prohibited unless authorized by Wells Fargo.b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.

Wells Fargo Glassdoor Company Review

3.6

Wells Fargo DE&I Review

No rating

CEO of Wells Fargo

Charlie Scharf

Approve of CEO

Average salary estimate

Estimate provided by employer

$167147 / ANNUAL (est.)

min

max

$146K

$188K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Platform Site Reliability Engineering Senior Manager, Wells Fargo

Wells Fargo is on the lookout for a passionate and knowledgeable Platform Site Reliability Engineering Senior Manager to join our team in Charlotte, NC. In this vital role, you'll lead efforts to design sturdy and dependable services while pushing for automation and driving observability across our systems. Your experience will shine as you educate and guide teams in mastering Site Reliability Engineering (SRE) principles, helping us evolve our SRE maturity model. Recruiting and nurturing top talent is essential, as you'll foster an inclusive culture while steering the team to meet our high operational standards. You’ll collaborate with architects and engineers to ensure our API management products meet stringent performance requirements, and partner with product managers to elevate user satisfaction through innovative solutions. Your technical know-how will lead the charge in adopting enterprise capabilities to enrich our cloud ecosystem, ensuring that services are always resilient. You'll also oversee incident management and conduct thorough root cause analyses, driving continuous improvement to enhance our customer experience. If you're keen on utilizing your expertise to drive reliability and your leadership skills to cultivate a talented team, this role at Wells Fargo is perfect for you.

Frequently Asked Questions (FAQs) for Platform Site Reliability Engineering Senior Manager Role at Wells Fargo

What are the key responsibilities of a Platform Site Reliability Engineering Senior Manager at Wells Fargo?

As the Platform Site Reliability Engineering Senior Manager at Wells Fargo, you will lead initiatives focused on observability, automation, and service reliability. You’ll be responsible for enhancing our SRE capabilities through training, driving user satisfaction, and ensuring that our API management products meet necessary performance metrics. Your role will involve collaboration across teams, managing incidents, conducting root cause analyses, and continuously improving operational processes.