Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Sr Big Data Engineer - Oozie and Pig (GCP) image - Rise Careers
Job details

Sr Big Data Engineer - Oozie and Pig (GCP)

About the Role 


We are seeking a Senior Big Data Engineer with deep expertise in distributed systems, batch data processing, and large-scale data pipelines. The ideal candidate has strong hands-on experience with Oozie, Pig, the Apache Hadoop ecosystem, and programming proficiency in Java (preferred) or Python. This role requires a deep understanding of data structures and algorithms, along with a proven track record of writing production-grade code and building robust data workflows. 


This is a fully remote position and requires an independent, self-driven engineer who thrives in complex technical environments and communicates effectively across teams. 


Work Location: US-Remote, Canada-Remote 


Key Responsibilities:
  • Design and develop scalable batch processing systems using technologies like Hadoop, Oozie, Pig, Hive, MapReduce, and HBase, with hands-on coding in Java or Python. 
  • Write clean, efficient, and production-ready code with a strong focus on data structures and algorithmic problem-solving applied to real-world data engineering tasks. 
  • Develop, manage, and optimize complex data workflows within the Apache Hadoop ecosystem, with a strong focus on Oozie orchestration and job scheduling. 
  • Leverage Google Cloud Platform (GCP) tools such as Dataproc, GCS, and Composer to build scalable and cloud-native big data solutions. 
  • Implement DevOps and automation best practices, including CI/CD pipelines, infrastructure as code (IaC), and performance tuning across distributed systems. 
  • Collaborate with cross-functional teams to ensure data pipeline reliability, code quality, and operational excellence in a remote-first environment. 


Qualifications:
  • Bachelors's degree in Computer Science, software engineering or related field of study.
  • Experience with managed cloud services and understanding of cloud-based batch processing systems are critical.
  • Proficiency in Oozie, Airflow, Map Reduce, Java.
  • Strong programming skills with Java (specifically Spark), Python, Pig, and SQL.
  • Expertise in public cloud services, particularly in GCP.
  • Proficiency in the Apache Hadoop ecosystem with Oozie, Pig, Hive, Map Reduce.
  • Familiarity with BigTable and Redis.
  • Experienced in Infrastructure and Applied DevOps principles in daily work. Utilize tools for continuous integration and continuous deployment (CI/CD), and Infrastructure as Code (IaC) like Terraform to automate and improve development and release processes.
  • Proven experience in engineering batch processing systems at scale.


Must Have: (Important)
  • 5+ years of experience in customer-facing software/technology or consulting. 
  • 5+ years of experience with “on-premises to cloud” migrations or IT transformations. 
  • 5+ years of experience building, and operating solutions built on GCP  
  • Proficiency in Oozie andPig 
  • Proficiency in Java or Python 


The following information is required by pay transparency legislation in the following states: CA, CO, HI, NY, and WA. This information applies only to individuals working in these states.

 

·       The anticipated starting pay range for Colorado is: $116,100 - $170,280.

·       The anticipated starting pay range for the states of Hawaii and New York (not including NYC) is: $123,600 - $181,280.

·       The anticipated starting pay range for California, New York City and Washington is: $135,300 - $198,440.


Unless already included in the posted pay range and based on eligibility, the role may include variable compensation in the form of bonus, commissions, or other discretionary payments. These discretionary payments are based on company and/or individual performance and may change at any time. Actual compensation is influenced by a wide array of factors including but not limited to skill set, level of experience, licenses and certifications, and specific work location. Information on benefits  offered is here.

#LI-VM1

#LI-Remote





About Rackspace Technology

We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

 

 

More on Rackspace Technology

Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

 

 


Average salary estimate

$157270 / YEARLY (est.)
min
max
$116100K
$198440K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Sr Big Data Engineer - Oozie and Pig (GCP), Rackspace

Are you a Senior Big Data Engineer with a knack for creating robust data pipelines? At Rackspace Technology, we’re on the lookout for someone just like you to join our remote team in the United States. In this role, you'll dive into the world of distributed systems and batch data processing, working primarily with technologies such as Oozie, Pig, and the Apache Hadoop ecosystem. Don’t worry, if you’re proficient in Java or Python, you’ll fit right in. Your creativity will shine as you design and develop scalable batch processing systems while writing clean and efficient code that’s ready for production. We want someone who can deftly manage and optimize complex data workflows using Oozie orchestration and job scheduling. Your expertise in Google Cloud Platform (GCP) tools like Dataproc, GCS, and Composer will be invaluable as you build cloud-native big data solutions. And since we’re committed to following DevOps best practices, your experience with CI/CD pipelines will play a crucial role in helping our team enhance performance and reliability. We value collaboration, so you’ll work alongside cross-functional teams, ensuring that our data pipelines are not just functional, but reliable as well. If you have a strong background in engineering batch processing systems and an enthusiasm for technology, we want to hear from you!

Frequently Asked Questions (FAQs) for Sr Big Data Engineer - Oozie and Pig (GCP) Role at Rackspace
What responsibilities does a Senior Big Data Engineer at Rackspace Technology have?

As a Senior Big Data Engineer at Rackspace Technology, your main responsibilities include designing and developing scalable batch processing systems, writing production-ready code, managing complex data workflows within the Apache Hadoop ecosystem, and leveraging Google Cloud Platform tools to build big data solutions. You will also be involved in implementing DevOps practices and collaborating with cross-functional teams to ensure operational excellence.

Join Rise to see the full answer
What qualifications are required for the Senior Big Data Engineer position at Rackspace Technology?

To qualify for the Senior Big Data Engineer role at Rackspace Technology, candidates should possess a Bachelor's degree in Computer Science or a related field, with at least 5 years of relevant experience in building and operating solutions on cloud platforms, particularly GCP. Proficiency in technologies like Oozie, Pig, Java, and Python, along with a strong understanding of the Apache Hadoop ecosystem and infrastructure as code (IaC) practices, is essential.

Join Rise to see the full answer
What programming languages should I be proficient in for the Senior Big Data Engineer role at Rackspace Technology?

For the Senior Big Data Engineer position at Rackspace Technology, proficiency in Java, especially with Spark, and Python is crucial. Familiarity with Pig and SQL is also essential, as these languages play a key role in developing data solutions and workflows within the Hadoop framework.

Join Rise to see the full answer
How important is experience with cloud services for the Senior Big Data Engineer at Rackspace Technology?

Experience with public cloud services is highly critical for the Senior Big Data Engineer role at Rackspace Technology. Knowledge of Google Cloud Platform (GCP) and managed cloud services, along with an understanding of cloud-based batch processing systems, will significantly aid in the design and implementation of scalable and efficient data solutions.

Join Rise to see the full answer
What kind of work environment can I expect as a Senior Big Data Engineer at Rackspace Technology?

As a Senior Big Data Engineer at Rackspace Technology, you can expect a remote-first work environment that promotes independence, collaboration, and a commitment to excellence. You’ll work closely with cross-functional teams, contributing to the company’s mission to deliver innovative technology solutions while fostering a dynamic and supportive culture.

Join Rise to see the full answer
Common Interview Questions for Sr Big Data Engineer - Oozie and Pig (GCP)
Can you explain your experience with Oozie and how you have utilized it in previous projects?

Certainly! I have utilized Oozie as a workflow scheduler in my previous projects to manage complex data processing jobs. Oozie's ability to orchestrate Hadoop jobs made it easy to define dependencies and ensure tasks were executed in the correct order. I can provide specific examples of workflows I designed that efficiently handled batch jobs, ensuring timely data availability.

Join Rise to see the full answer
How do you approach optimizing performance in batch processing systems?

Optimizing performance in batch processing systems involves a multi-faceted approach. I typically analyze data flow patterns, identify bottlenecks in processing, and utilize efficient data structures. I also leverage parallel processing capabilities of Hadoop and optimize queries in Pig and Hive to reduce execution time. Continuous testing and monitoring play key roles in ensuring performance improvements.

Join Rise to see the full answer
Describe a challenging data engineering project you worked on and how you overcame the challenges.

In a previous project, I was tasked with migrating an on-premises data processing system to GCP. The challenge lay in handling data volume and compatibility. I tackled this by incrementally migrating datasets, thoroughly testing integration with GCP services like Dataproc and ensuring minimal downtime during the transition. Communication with stakeholders was also crucial throughout the process.

Join Rise to see the full answer
What is your experience with Infrastructure as Code (IaC) and CI/CD in Big Data environments?

I have implemented Infrastructure as Code using tools like Terraform to automate provisioning and management of cloud resources. In terms of CI/CD, I developed pipelines that enable seamless code delivery and integration for data workflows. By automating testing and deployment, I ensured that our data processing systems were consistently high-quality and reliable.

Join Rise to see the full answer
How do you ensure data integrity and reliability within data pipelines?

To ensure data integrity and reliability, I implement checks at various stages of the data processing workflow. This includes validations, error logging, and utilizing monitoring tools to receive alerts on any discrepancies. My focus is on building resilient systems that can gracefully handle failures and allow for easy recovery and rerun of processes when needed.

Join Rise to see the full answer
What strategies do you use to communicate effectively with cross-functional teams?

Effective communication is key when working with cross-functional teams. I prioritize clarity by breaking down technical concepts into easily understandable segments for stakeholders from non-technical backgrounds. Regular check-ins and collaborative tools also facilitate openness and feedback. I find using visual aids, like diagrams, can help bridge gaps in understanding.

Join Rise to see the full answer
How do your skills in Java or Python contribute to your role as a Senior Big Data Engineer?

My skills in both Java and Python are instrumental in my role as a Senior Big Data Engineer. With Java, I leverage its robust libraries and performance efficiency for large-scale applications, while Python allows for rapid development and prototyping, especially when dealing with data manipulation and analysis tasks. Each language aids in addressing different aspects of data engineering challenges.

Join Rise to see the full answer
Explain your experience with the Apache Hadoop ecosystem.

I have extensive experience with the Apache Hadoop ecosystem, utilizing components like HDFS for storage, MapReduce for batch processing, and HBase for real-time data access. I regularly work with tools like Pig and Hive to write queries and manage data efficiently. This experience has provided me with insights into designing scalable data architectures that effectively handle massive datasets.

Join Rise to see the full answer
What do you consider best practices for developing batch processing solutions?

Best practices for developing batch processing solutions include designing for scalability, ensuring clear documentation, and maintaining strong monitoring and logging. I also emphasize modular coding practices, which facilitate easier debugging and testing. Regularly reviewing performance metrics and incorporating feedback helps refine the solutions to align with user needs.

Join Rise to see the full answer
What are your thoughts on the evolving landscape of big data technologies, and how do you keep your skills updated?

The big data landscape is continuously evolving with new tools and methodologies emerging regularly. I stay updated by engaging in online courses, participating in tech meetups, and following industry blogs and forums. Regular hands-on practice with emerging technologies and experimenting with new solutions is also key to maintaining a competitive edge in this dynamic field.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User

Join our team as an Azure Engineer II, where your expertise in Active Directory and Windows operations will play a crucial role in hybrid cloud management.

Photo of the Rise User
Posted 6 days ago

Join a talented team as an AI/ML Architect to drive advancements in machine learning and artificial intelligence from the comfort of your home in Vietnam.

Photo of the Rise User
Fuse Energy Remote No location specified
Posted 2 days ago

Fuse Energy is searching for a skilled Data Engineer to join their team and contribute to transforming data management within the energy sector.

Photo of the Rise User
FocusKPI, Inc. Remote Boston, Massachusetts, United States
Posted 4 hours ago

Join FocusKPI as a Data Engineer, where you'll help develop innovative AI solutions for a high-tech SaaS company in Boston.

Photo of the Rise User
Posted 4 days ago

Jobber seeks a Senior Data Engineer to enhance data solutions and empower small businesses across Canada.

Photo of the Rise User
Zingtree Remote India or Argentina
Posted 12 days ago

Zingtree is on the lookout for an experienced Senior Data Engineer to spearhead the design and development of data processing systems for enhanced customer service operations.

Photo of the Rise User
Tiger Analytics Remote No location specified
Posted 2 days ago

Join Tiger Analytics as an AWS Data Engineer, where you'll architect scalable data solutions for leading global brands.

Photo of the Rise User
NBCUniversal Remote 904 Sylvan Ave, Englewood Cliffs, NEW JERSEY
Posted 4 days ago

As a Staff Data Engineer at NBCUniversal, you'll architect cutting-edge data pipelines and influence innovative data solutions across a dynamic technology landscape.

Photo of the Rise User
Ubisoft Remote Bucharest, Romania
Posted 11 days ago

Join Ubisoft as a Senior Data Engineer, where you'll develop innovative data solutions as part of a dynamic and international team.

Photo of the Rise User
Archer Hybrid San Jose, California, United States
Posted 9 days ago
Dental Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance

Join Archer as a Data Engineer and contribute to revolutionizing aerospace technology through innovative data solutions.

Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Customer-Centric
Fast-Paced
Growth & Learning
Medical Insurance
Dental Insurance
401K Matching
Paid Time-Off
Maternity Leave
Paternity Leave
Mental Health Resources
Flex-Friendly
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Transparent & Candid
Growth & Learning
Fast-Paced
Collaboration over Competition
Take Risks
Friends Outside of Work
Passion for Exploration
Customer-Centric
Reward & Recognition
Feedback Forward
Rapid Growth
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Paternity Leave
Fully Distributed
Flex-Friendly
Some Meals Provided
Snacks
Social Gatherings
Pet Friendly
Company Retreats
Dental Insurance
Life insurance
Health Savings Account (HSA)
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Transparent & Candid
Growth & Learning
Fast-Paced
Collaboration over Competition
Take Risks
Friends Outside of Work
Passion for Exploration
Customer-Centric
Reward & Recognition
Feedback Forward
Rapid Growth
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Paternity Leave
Fully Distributed
Flex-Friendly
Some Meals Provided
Snacks
Social Gatherings
Pet Friendly
Company Retreats
Dental Insurance
Life insurance
Health Savings Account (HSA)
Photo of the Rise User
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Transparent & Candid
Growth & Learning
Fast-Paced
Collaboration over Competition
Take Risks
Friends Outside of Work
Passion for Exploration
Customer-Centric
Reward & Recognition
Feedback Forward
Rapid Growth
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Paternity Leave
Fully Distributed
Flex-Friendly
Some Meals Provided
Snacks
Social Gatherings
Pet Friendly
Company Retreats
Dental Insurance
Life insurance
Health Savings Account (HSA)

Founded in 1998, Rackspace provides multi-cloud computing solutions and services. Offering advising to customers based on business challenges, designing solutions, building, and managing solutions. The company is headquartered in San Antonio, Texa...

237 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
April 11, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Marysville just viewed Security Specialist at Anduril Industries
Photo of the Rise User
Someone from OH, Cincinnati just viewed Learning Content Designer at QuantHub
Photo of the Rise User
Someone from OH, Tallmadge just viewed Manufacturing and Process Engineer at CVRx
Q
Someone from OH, Columbus just viewed Part-Time Medical Assistant at QualDerm Partners
Photo of the Rise User
Someone from OH, Cincinnati just viewed Summer 2025 Intern – Finance – Michigan at Stryker