Job Title: Data Engineer
Job Description:
Seeking a skilled Data Engineer with a robust background in PySpark and extensive experience with AWS services, including Athena and EMR. The ideal candidate will be responsible for designing, developing, and optimizing large-scale data processing systems, ensuring efficient and reliable data flow and transformation.
Key Responsibilities:
• Data Pipeline Development: Design, develop, and maintain scalable data pipelines using PySpark to process and transform large datasets.
• AWS Integration: Utilize AWS services, including Athena and EMR, to manage and optimize data workflows and storage solutions.
• Data Management: Implement data quality, data governance, and data security best practices to ensure the integrity and confidentiality of data.
• Performance Optimization: Optimize and troubleshoot data processing workflows for performance, reliability, and scalability.
• Collaboration: Work closely with data scientists, analysts, and other stakeholders to understand data requirements and deliver solutions that meet business needs.
• Documentation: Create and maintain comprehensive documentation of data pipelines, ETL processes, and data architecture.
Required Skills and Qualifications:
• Education: Bachelor's or Master’s degree in Computer Science, Engineering, or a related field.
• Experience: 5+ years of experience as a Data Engineer or in a similar role, with a strong emphasis on PySpark.
• Technical Expertise:
o Proficient in PySpark for data processing and transformation.
o Extensive experience with AWS services, specifically Athena and EMR.
o Strong knowledge of SQL and database technologies.
o Experience with Apache Airflow is a plus
o Familiarity with other AWS services such as S3, Lambda, and Redshift.
• Programming: Proficiency in Python; experience with other programming languages is a plus.
• Problem-Solving: Excellent analytical and problem-solving skills with attention to detail.
• Communication: Strong verbal and written communication skills to effectively collaborate with team members and stakeholders.
• Agility: Ability to work in a fast-paced, dynamic environment and adapt to changing priorities.
Preferred Qualifications:
• Experience with data warehousing solutions and BI tools.
• Knowledge of other big data technologies such as Hadoop, Hive, and Kafka.
• Understanding of data modeling, ETL processes, and data warehousing concepts.
• Experience with DevOps practices and tools for CI/CD.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Are you a talented Data Engineer looking for your next big challenge? Join our innovative team and put your skills to the test! We're on the hunt for a skilled Data Engineer who has an impressive background in PySpark and extensive experience with AWS services, particularly Athena and EMR. In this exciting role, you'll have the opportunity to design, develop, and optimize large-scale data processing systems, ensuring that data flows seamlessly across the organization. Your day-to-day responsibilities will include building scalable data pipelines with PySpark, managing AWS services to enhance data workflows, and implementing best practices for data quality and security. Collaborating closely with data scientists and analysts, you'll ensure that the data solutions you deliver meet the high business needs. Plus, don't forget about documentation – creating comprehensive records of data pipelines, ETL processes, and architecture will be a significant part of your role. We believe in nurturing talent, so if you have a Bachelor's or Master’s degree in Computer Science or a related field, coupled with at least 5 years of experience as a Data Engineer, you're exactly who we're looking for! Your proficiency in Python, SQL, and familiarity with other AWS services will help you thrive in our fast-paced environment. When you join us, you’ll not only be working with data but helping to shape our data-driven future. If you're excited about building and optimizing complex data systems, we'd love to see your application and find out how you can contribute to our success!
Join our team as a Data and AI Regulatory Intelligence Expert to drive regulatory compliance and intelligence in an innovative, hybrid work environment.
As a Senior Infrastructure Development Engineer, you will play a pivotal role in managing and enhancing cloud services through cutting-edge technologies.
As a Senior Associate in Data Strategy & Transformation at American Express, you'll leverage data analytics to drive strategic initiatives in payment solutions.
Join VHB as a Geomatics Data Technician, providing innovative geomatics solutions for infrastructure projects.
Join Aegon as a Fund Relationship and Data Lead to oversee fund managers and data requirements in a hybrid work environment.
Become a vital part of Primerica as a Data Analyst Intern, contributing to innovative strategies for risk reduction and field growth.
Our IT solutions empower organizations and individuals throughout the world to maximize value and quality to succeed in today's challenging business environment. As a fast-growing new economy company, we focus our strengths to offer world-class so...
216 jobsSubscribe to Rise newsletter