Single window support: Leverage deep understanding of Hadoop and its related tools specially Hive, SPARK, HDFS and do complete RCA be it platform or user code/config related.
System configuration: Recommend necessary changes to the system to DAP platform engineering by checking system activity and user logs for triaging and troubleshooting.
Performance Tuning: Direct team members on crafting efficient queries, leveraging expertise in performance tuning and optimization strategies for big data technologies.
Issue resolution across Tech teams: Troubleshoot and resolve complex technical issues. Identify root causes, finding which Tech/Data platform team can fix it and coordinating among those teams.
Reliability engineering: Creating reports to define performance and resolution metrics for proactively identifying issues and generating alerts.
Office hours and liaising: Calls across regions in multiple time zones to ensure timely client delivery.
Knowledge cataloging and sharing: Share knowledge and cross-train peers across geographic regions using Wikis and communications. Provide comms around issues/outages affecting multiple users.
Develop Standards: The team would prepare standard configuration for a variety of VCA workloads to make the jobs run with optimal settings to maintain good cluster health while executing the jobs efficiently.
Continuous Learning of VCA workload: Continuously learn and stay updated with the changing nature of data science jobs to help improve Cluster utilization.
With active engagement, collaboration, effective communication, quality, integrity, and reliable delivery, develop and maintain a trusted and valued relationship with the team, customers, and business partners.
This is a SRE Role to provide support for Technical hadoop related issues impacting VCA data scientists users on day to day basis. it includes performance tuning, SPARK optimization, Data Availability, Issue triages and user communication.
This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
If you're looking for an exciting opportunity to become a Site Reliability Engineering - Sr. Consultant at PRE in the vibrant city of Austin, you're in the right place! In this role, you'll serve as a single-window support champion, leveraging your deep understanding of Hadoop and its suite of tools like Hive, SPARK, and HDFS to tackle root cause analyses both for platform issues and user configurations. You’ll play a pivotal role in system configuration by recommending essential changes that enhance our DAP platform engineering, analyzing system activity and user logs to troubleshoot effectively. Performance tuning will be one of your main focuses, guiding team members in crafting efficient queries and optimizing big data technologies. You'll also act as a crucial link across technology teams, resolving complex technical issues by identifying root causes and coordinating solutions. As someone passionate about reliability engineering, you'll create insightful reports to define performance metrics and proactively identify potential issues that may arise. With your exceptional communication skills, you will handle calls across different time zones to ensure timely deliveries for our clients. You'll also share your extensive knowledge with peers through wikis, maintaining a culture of learning and collaboration. You'll develop standards to ensure optimal VCA workload configurations, ultimately enhancing cluster health and efficiency. This is a hybrid position, offering flexibility with remote work while still expecting you to engage with your team in the office 2-3 days a week, ensuring strong professional relationships and dependable delivery. If you’re eager to dive into this dynamic SRE role and make a difference for VCA data scientists, we can’t wait to hear from you!
Visa is looking for a Senior Manager to lead global talent acquisition compliance, focusing on governance and risk in hiring practices.
Elevate the reliability of our financial transaction systems as a Senior Site Reliability Engineer within a leading company committed to security and innovation.
Join Boeing's engineering team as an Electrical Design and Analyst Engineer focusing on the F-15 Lab Infrastructure.
TKDA is seeking a Senior Professional Mechanical Engineer to spearhead mechanical design projects in a new Milwaukee office, focusing on complex systems across diverse industries.
AECOM is looking for a skilled Construction Safety Specialist to travel to construction projects and ensure safety compliance.
Join American Packaging Corporation as a Maintenance Engineer to lead hands-on projects that enhance facility operations and maintenance.
Join Relativity Space as a Weld Engineer I, leveraging your welding expertise to advance cutting-edge aerospace technologies.
Lead the systems engineering initiatives at GE Aerospace focusing on innovative design solutions in a collaborative environment.
Join our dynamic team as a Site Reliability Engineer focused on enhancing our Big Data cloud platforms in a collaborative and innovative environment.
Join KS Engineers, P.C. as a Senior Bridge Engineer and play a crucial role in bridge design and construction management.
Visa Inc. operates as a payments technology company worldwide. The company facilitates commerce through the transfer of value and information among consumers, merchants, financial institutions, businesses, strategic partners, and government entiti...
11718 jobsSubscribe to Rise newsletter