Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Site Reliability Engineering - Sr. Consultant - PRE image - Rise Careers
Job details

Site Reliability Engineering - Sr. Consultant - PRE - job 4 of 20

Single window support: Leverage deep understanding of Hadoop and its related tools specially Hive, SPARK, HDFS and do complete RCA be it platform or user code/config related.
System configuration: Recommend necessary changes to the system to DAP platform engineering by checking system activity and user logs for triaging and troubleshooting.
Performance Tuning: Direct team members on crafting efficient queries, leveraging expertise in performance tuning and optimization strategies for big data technologies.
Issue resolution across Tech teams: Troubleshoot and resolve complex technical issues. Identify root causes, finding which Tech/Data platform team can fix it and coordinating among those teams.
Reliability engineering: Creating reports to define performance and resolution metrics for proactively identifying issues and generating alerts.
Office hours and liaising: Calls across regions in multiple time zones to ensure timely client delivery.
Knowledge cataloging and sharing: Share knowledge and cross-train peers across geographic regions using Wikis and communications. Provide comms around issues/outages affecting multiple users.
Develop Standards: The team would prepare standard configuration for a variety of VCA workloads to make the jobs run with optimal settings to maintain good cluster health while executing the jobs efficiently.
Continuous Learning of VCA workload: Continuously learn and stay updated with the changing nature of data science jobs to help improve Cluster utilization.
 
With active engagement, collaboration, effective communication, quality, integrity, and reliable delivery, develop and maintain a trusted and valued relationship with the team, customers, and business partners.


This is a SRE Role to provide support for Technical hadoop related issues impacting VCA data scientists users on day to day basis. it includes performance tuning, SPARK optimization, Data Availability, Issue triages and user communication.

 

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Average salary estimate

$135000 / YEARLY (est.)
min
max
$120000K
$150000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Site Reliability Engineering - Sr. Consultant - PRE, Visa

At PRE in Austin, we're excited to welcome a Site Reliability Engineering - Sr. Consultant to our dynamic team! In this pivotal role, you'll be the go-to person for leveraging your deep understanding of Hadoop and its ecosystem—especially tools like Hive, SPARK, and HDFS—to provide single window support. You'll conduct thorough root cause analyses for both platform and user code/configurations. You'll also collaborate with our platform engineering team, recommending necessary system changes and troubleshooting based on system activity and user logs. Performance tuning is a key aspect; you'll guide team members in crafting efficient queries while applying your optimization strategies for big data technologies. As you provide issue resolution across tech teams, you'll be instrumental in identifying root causes and coordinating solutions among various platform teams. Your ability to create insightful reports to define performance and resolution metrics will help us proactively spot and address potential issues. You’ll be a vital part of our hybrid work model, engaging in calls across time zones, delivering timely support to our VCA data scientists. Your efforts will be key in standardizing configurations for different VCA workloads to ensure optimal cluster health. By continuously learning and sharing knowledge through Wikis and communications, you'll help cultivate a collaborative environment. Join us at PRE, where we emphasize quality, integrity, and reliable delivery, ensuring our relationships with the team, customers, and partners are trusted and valued. If you're ready for an exciting challenge, this hybrid position is the perfect fit for you!

Frequently Asked Questions (FAQs) for Site Reliability Engineering - Sr. Consultant - PRE Role at Visa
What skills are required for the Site Reliability Engineering - Sr. Consultant position at PRE?

To thrive as a Site Reliability Engineering - Sr. Consultant at PRE, candidates should possess a profound understanding of Hadoop and its related tools like Hive, SPARK, and HDFS. Additionally, expertise in performance tuning and optimization strategies is essential. Strong troubleshooting skills, particularly in technical issue resolution across tech teams, and experience in creating insightful reports for performance metrics will be highly valued.

Join Rise to see the full answer
What responsibilities does a Site Reliability Engineering - Sr. Consultant have at PRE?

As a Site Reliability Engineering - Sr. Consultant at PRE, your core responsibilities include performing root cause analysis of technical problems, recommending changes to system configurations, and guiding team members in crafting efficient data queries. You'll also be involved in troubleshooting complex technical issues, issue triages, and maintaining effective communication with users to ensure there's minimal impact on the data scientist community.

Join Rise to see the full answer
How does the hybrid work model function for the Site Reliability Engineering - Sr. Consultant role at PRE?

At PRE, the Site Reliability Engineering - Sr. Consultant role operates under a hybrid model, where employees alternate between remote work and in-office attendance. It is expected that you will work from the office 2-3 days a week, based on business needs, with a general aim of being present in the office at least 50% of the time to foster collaboration and enhance team dynamics.

Join Rise to see the full answer
What will be my role in performance tuning as a Site Reliability Engineering - Sr. Consultant at PRE?

In the capacity of a Site Reliability Engineering - Sr. Consultant at PRE, you'll take the lead on performance tuning processes. This involves directly advising team members on crafting efficient queries and optimizing the performance of big data technologies. Your insights will be crucial in helping maintain high cluster health during data processing tasks.

Join Rise to see the full answer
What opportunities for continuous learning are available for a Site Reliability Engineering - Sr. Consultant at PRE?

At PRE, as a Site Reliability Engineering - Sr. Consultant, you will have ample opportunities for continuous learning. You'll be encouraged to remain up-to-date with the evolving landscape of data science jobs while actively participating in knowledge sharing with peers. Utilizing platforms like Wikis and engaging in team communications will help you and your colleagues stay informed and competitive in the field.

Join Rise to see the full answer
Common Interview Questions for Site Reliability Engineering - Sr. Consultant - PRE
How do you approach troubleshooting Hadoop-related issues?

When troubleshooting Hadoop-related issues, I first gather all relevant logs and metrics to identify anomalies. I like to follow a structured root cause analysis approach, pinpointing whether the issue stems from the platform or user code, and then collaborate with necessary tech teams to resolve it effectively.

Join Rise to see the full answer
Can you discuss your experience with performance tuning in big data environments?

Certainly! In my previous roles, I focused on analyzing query performance and using tools to refine process configurations. For instance, I've implemented data partitioning and indexing strategies that significantly enhanced retrieval times. I believe that understanding the underlying nature of data is key to optimizing performance.

Join Rise to see the full answer
What strategies do you implement for user communication during outages?

I prioritize transparency and clarity when communicating with users during outages. I typically utilize status pages and direct communications, ensuring users are informed about the issues and expected resolution times. My approach is to maintain a line of open communication and provide regular updates until the problem is fully resolved.

Join Rise to see the full answer
Describe a time you coordinated between technical teams to resolve an issue.

In a previous role, I identified a performance bottleneck impacting multiple teams. I coordinated cross-functions by organizing a triage meeting, where I presented data gathered during my troubleshooting efforts. This collaboration led to a swift resolution, demonstrating the importance of teamwork in addressing complex issues.

Join Rise to see the full answer
What factors do you consider while recommending system configuration changes?

When recommending system configuration changes, I assess the current system activity data, user logs, and any performance issues reported. I also consider the overall goals of the workloads being run, ensuring that the proposed changes align with maintaining optimal cluster health and support efficient job execution.

Join Rise to see the full answer
How do you stay updated with advancements in big data technology?

I actively participate in industry webinars, read relevant literature, and engage with communities focused on big data technologies. Networking with professionals in the field also offers insights into emerging trends and best practices, helping me stay informed and capable in my role.

Join Rise to see the full answer
What approaches do you use for knowledge sharing and cataloging?

In my experience, utilizing internal Wikis and collaborative tools has been effective for knowledge sharing. I advocate creating comprehensive guides and documents that encapsulate best practices and lessons learned, encouraging team members to contribute actively, thus fostering a culture of continuous improvement.

Join Rise to see the full answer
Explain your methodology for crafting efficient SQL queries.

My methodology involves understanding the dataset and anticipated query outcomes first. I ensure proper indexing, utilize aggregate functions wisely, and limit the dataset by using WHERE clauses effectively. Reviewing execution plans helps me identify any bottlenecks and optimize for better performance.

Join Rise to see the full answer
What role does collaboration play in your work as a Site Reliability Engineer?

Collaboration is crucial as a Site Reliability Engineer. It ensures that knowledge is shared across teams and helps in quick issue resolution. Working together fosters a deeper understanding of the system as a whole and allows for innovative solutions that might not be evident when working in silos.

Join Rise to see the full answer
How will you define success as a Site Reliability Engineering - Sr. Consultant?

I define success as ensuring minimal disruption for our users, fostering efficiency in systems, and achieving high-performance levels consistently. Building strong relationships with both the team and stakeholders, while proactively addressing potential issues, is fundamental to my definition of success in this role.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 5 days ago

Elevate client satisfaction as a Client Care Consultant with a focus on channel strategy and project management in a dynamic environment.

Photo of the Rise User
Posted 5 days ago

Join our dynamic US Client Marketing team as a Business Operations Manager, leading financial operations and supporting marketing services growth in a hybrid work environment.

Photo of the Rise User
American Express Hybrid Phoenix, Arizona, United States
Posted 3 hours ago
Inclusive & Diverse
Empathetic
Collaboration over Competition
Growth & Learning
Transparent & Candid
Medical Insurance
Dental Insurance
Mental Health Resources
Life insurance
Disability Insurance
Child Care stipend
Employee Resource Groups
Learning & Development

Join American Express as a Public Cloud Database Engineer and lead the way in transforming database technology in a dynamic and inclusive environment.

Posted yesterday

Booz Allen is on the lookout for a Senior Mechanical Engineer with expertise in design and advanced technical skills to advance complex mechanical systems.

Posted 7 days ago

Join BD as an Associate Principal Engineer, focusing on innovative solutions in medical technology.

Photo of the Rise User
Posted 9 days ago

Join Scalable Capital as a Senior Frontend Engineer to help develop innovative financial services for our clients.

Photo of the Rise User
AECOM Hybrid Houston , TX, United States
Posted 20 hours ago

Join AECOM as a Senior Mechanical Engineer and play a key role in delivering sustainable design solutions for impactful projects across various sectors.

Photo of the Rise User
Posted yesterday

At Relativity Space, contribute to ambitious aerospace technology as a Senior Robotics Engineer focusing on autonomous hardware systems.

Photo of the Rise User
Posted 5 days ago

Join Kimley-Horn as a Civil CAD Operator where your expertise in drafting and engineering principles will contribute to innovative land development projects.

Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...

9778 jobs
MATCH
VIEW MATCH
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 4, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!