Job Title: Data Engineer
Job Description:
Seeking a skilled Data Engineer with a robust background in PySpark and extensive experience with AWS services, including Athena and EMR. The ideal candidate will be responsible for designing, developing, and optimizing large-scale data processing systems, ensuring efficient and reliable data flow and transformation.
Key Responsibilities:
• Data Pipeline Development: Design, develop, and maintain scalable data pipelines using PySpark to process and transform large datasets.
• AWS Integration: Utilize AWS services, including Athena and EMR, to manage and optimize data workflows and storage solutions.
• Data Management: Implement data quality, data governance, and data security best practices to ensure the integrity and confidentiality of data.
• Performance Optimization: Optimize and troubleshoot data processing workflows for performance, reliability, and scalability.
• Collaboration: Work closely with data scientists, analysts, and other stakeholders to understand data requirements and deliver solutions that meet business needs.
• Documentation: Create and maintain comprehensive documentation of data pipelines, ETL processes, and data architecture.
Required Skills and Qualifications:
• Education: Bachelor's or Master’s degree in Computer Science, Engineering, or a related field.
• Experience: 5+ years of experience as a Data Engineer or in a similar role, with a strong emphasis on PySpark.
• Technical Expertise:
o Proficient in PySpark for data processing and transformation.
o Extensive experience with AWS services, specifically Athena and EMR.
o Strong knowledge of SQL and database technologies.
o Experience with Apache Airflow is a plus
o Familiarity with other AWS services such as S3, Lambda, and Redshift.
• Programming: Proficiency in Python; experience with other programming languages is a plus.
• Problem-Solving: Excellent analytical and problem-solving skills with attention to detail.
• Communication: Strong verbal and written communication skills to effectively collaborate with team members and stakeholders.
• Agility: Ability to work in a fast-paced, dynamic environment and adapt to changing priorities.
Preferred Qualifications:
• Experience with data warehousing solutions and BI tools.
• Knowledge of other big data technologies such as Hadoop, Hive, and Kafka.
• Understanding of data modeling, ETL processes, and data warehousing concepts.
• Experience with DevOps practices and tools for CI/CD.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Are you a talented Data Engineer looking for your next big challenge? Join our innovative team and put your skills to the test! We're on the hunt for a skilled Data Engineer who has an impressive background in PySpark and extensive experience with AWS services, particularly Athena and EMR. In this exciting role, you'll have the opportunity to design, develop, and optimize large-scale data processing systems, ensuring that data flows seamlessly across the organization. Your day-to-day responsibilities will include building scalable data pipelines with PySpark, managing AWS services to enhance data workflows, and implementing best practices for data quality and security. Collaborating closely with data scientists and analysts, you'll ensure that the data solutions you deliver meet the high business needs. Plus, don't forget about documentation – creating comprehensive records of data pipelines, ETL processes, and architecture will be a significant part of your role. We believe in nurturing talent, so if you have a Bachelor's or Master’s degree in Computer Science or a related field, coupled with at least 5 years of experience as a Data Engineer, you're exactly who we're looking for! Your proficiency in Python, SQL, and familiarity with other AWS services will help you thrive in our fast-paced environment. When you join us, you’ll not only be working with data but helping to shape our data-driven future. If you're excited about building and optimizing complex data systems, we'd love to see your application and find out how you can contribute to our success!
Our IT solutions empower organizations and individuals throughout the world to maximize value and quality to succeed in today's challenging business environment. As a fast-growing new economy company, we focus our strengths to offer world-class so...
57 jobsSubscribe to Rise newsletter