Job details

Site Reliability Engineering - Sr. Consultant - PRE - job 13 of 20

Get a free resume review

Single window support: Leverage deep understanding of Hadoop and its related tools specially Hive, SPARK, HDFS and do complete RCA be it platform or user code/config related.
System configuration: Recommend necessary changes to the system to DAP platform engineering by checking system activity and user logs for triaging and troubleshooting.
Performance Tuning: Direct team members on crafting efficient queries, leveraging expertise in performance tuning and optimization strategies for big data technologies.
Issue resolution across Tech teams: Troubleshoot and resolve complex technical issues. Identify root causes, finding which Tech/Data platform team can fix it and coordinating among those teams.
Reliability engineering: Creating reports to define performance and resolution metrics for proactively identifying issues and generating alerts.
Office hours and liaising: Calls across regions in multiple time zones to ensure timely client delivery.
Knowledge cataloging and sharing: Share knowledge and cross-train peers across geographic regions using Wikis and communications. Provide comms around issues/outages affecting multiple users.
Develop Standards: The team would prepare standard configuration for a variety of VCA workloads to make the jobs run with optimal settings to maintain good cluster health while executing the jobs efficiently.
Continuous Learning of VCA workload: Continuously learn and stay updated with the changing nature of data science jobs to help improve Cluster utilization.

With active engagement, collaboration, effective communication, quality, integrity, and reliable delivery, develop and maintain a trusted and valued relationship with the team, customers, and business partners.

This is a SRE Role to provide support for Technical hadoop related issues impacting VCA data scientists users on day to day basis. it includes performance tuning, SPARK optimization, Data Availability, Issue triages and user communication.

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Average salary estimate

$135000 / YEARLY (est.)

min

max

$120000K

$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineering - Sr. Consultant - PRE, Visa

If you're looking for an exciting opportunity to become a Site Reliability Engineering - Sr. Consultant at PRE in the vibrant city of Austin, you're in the right place! In this role, you'll serve as a single-window support champion, leveraging your deep understanding of Hadoop and its suite of tools like Hive, SPARK, and HDFS to tackle root cause analyses both for platform issues and user configurations. You’ll play a pivotal role in system configuration by recommending essential changes that enhance our DAP platform engineering, analyzing system activity and user logs to troubleshoot effectively. Performance tuning will be one of your main focuses, guiding team members in crafting efficient queries and optimizing big data technologies. You'll also act as a crucial link across technology teams, resolving complex technical issues by identifying root causes and coordinating solutions. As someone passionate about reliability engineering, you'll create insightful reports to define performance metrics and proactively identify potential issues that may arise. With your exceptional communication skills, you will handle calls across different time zones to ensure timely deliveries for our clients. You'll also share your extensive knowledge with peers through wikis, maintaining a culture of learning and collaboration. You'll develop standards to ensure optimal VCA workload configurations, ultimately enhancing cluster health and efficiency. This is a hybrid position, offering flexibility with remote work while still expecting you to engage with your team in the office 2-3 days a week, ensuring strong professional relationships and dependable delivery. If you’re eager to dive into this dynamic SRE role and make a difference for VCA data scientists, we can’t wait to hear from you!

Frequently Asked Questions (FAQs) for Site Reliability Engineering - Sr. Consultant - PRE Role at Visa

What are the primary responsibilities of a Site Reliability Engineering - Sr. Consultant at PRE?

As a Site Reliability Engineering - Sr. Consultant at PRE, you'll focus on single window support for Hadoop-related issues, perform root cause analyses, and recommend system changes while analyzing logs. You'll guide performance tuning, troubleshoot complex problems across tech teams, create reports defining performance metrics, and maintain effective communication for timely client delivery.