Job details

Semi-Senior Data Engineer (PySpark) - E-Learning

Get a free resume review

About Truelogic

At Truelogic we are a leading provider of nearshore staff augmentation services headquartered in New York. For over two decades, we’ve been delivering top-tier technology solutions to companies of all sizes, from innovative startups to industry leaders, helping them achieve their digital transformation goals.

Our team of 600+ highly skilled tech professionals, based in Latin America, drives digital disruption by partnering with U.S. companies on their most impactful projects. Whether collaborating with Fortune 500 giants or scaling startups, we deliver results that make a difference.

By applying for this position, you’re taking the first step in joining a dynamic team that values your expertise and aspirations. We aim to align your skills with opportunities that foster exceptional career growth and success while contributing to transformative projects that shape the future.

Our Client

At our company, we are committed to building cutting-edge solutions that drive efficiency and innovation. We thrive on a culture of continuous learning, collaboration, and proactive problem-solving. If you’re looking for a place where you can grow and make an impact, this is the team for you!

Job Summary

We are looking for a Junior to Semi-Senior PySpark Data Engineer who is eager to learn, take initiative, and contribute to the development of high-performance and scalable data pipelines. This role is perfect for someone who wants to enhance their technical skills while working on exciting projects within a collaborative team.

Responsibilities

Design, develop, and optimize data pipelines using PySpark and Apache Spark.
Integrate and process data from multiple sources (databases, APIs, files, streaming).
Implement efficient data transformations for Big Data in distributed environments.
Optimize code to improve performance, scalability, and efficiency in data processing.
Collaborate with Data Science, BI, and DevOps teams to ensure seamless integration.
Monitor and debug data processes to ensure quality and reliability.
Apply best practices in data engineering and maintain clear documentation.
Stay up to date with the latest trends in Big Data and distributed computing.

Qualifications and Job Requirements

3-5 years of experience working with PySpark and Apache Spark in Big Data environments.
Experience with SQL and relational and NoSQL databases (PostgreSQL, MySQL, MongoDB, etc.).
Knowledge of ETL processes and data processing in distributed environments.
Familiarity with Apache Hadoop, Hive, or Delta Lake.
Experience with cloud storage (AWS S3, Google Cloud Storage, Azure Blob).
Proficiency in Git and version control.
Strong problem-solving skills and a proactive attitude.
A passion for learning and continuous improvement.

What We Offer

100% Remote Work: Enjoy the freedom to work from the location that helps you thrive. All it takes is a laptop and a reliable internet connection.
Highly Competitive USD Pay: Earn an excellent, market-leading compensation in USD, that goes beyond typical market offerings.
Paid Time Off: We value your well-being. Our paid time off policies ensure you have the chance to unwind and recharge when needed.
Work with Autonomy: Enjoy the freedom to manage your time as long as the work gets done. Focus on results, not the clock.
Work with Top American Companies: Grow your expertise working on innovative, high-impact projects with Industry-Leading U.S. Companies.

Why You’ll Like Working Here

A Culture That Values You: We prioritize well-being and work-life balance, offering engagement activities and fostering dynamic teams to ensure you thrive both personally and professionally.
Diverse, Global Network: Connect with over 600 professionals in 25+ countries, expand your network, and collaborate with a multicultural team from Latin America.
Team Up with Skilled Professionals: Join forces with senior talent. All of our team members are seasoned experts, ensuring you're working with the best in your field.

Apply now!

Average salary estimate

$110000 / YEARLY (est.)

min

max

$100000K

$120000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Semi-Senior Data Engineer (PySpark) - E-Learning, Truelogic

At Truelogic, we're excited to open the door for a talented Semi-Senior Data Engineer (PySpark) to join our exceptional E-Learning team. With the rapidly evolving landscape of technology, your skills will be pivotal in shaping high-performance and scalable data pipelines that our clients rely on. For over two decades, Truelogic has empowered companies by providing top-notch technology solutions, and we want you to be a part of that success! Here, you will dive deep into designing and optimizing data pipelines, collaborating closely with our Data Science, BI, and DevOps teams, ensuring that the integration of data sources is seamless and efficient. You'll get hands-on experience with big data while utilizing your knowledge of PySpark and Apache Spark. We offer an inspiring environment where you'll not only enhance your technical abilities but also contribute to projects that truly make a difference in our clients’ operations. The best part? You’ll be working 100% remotely from wherever you thrive best, and enjoy competitive pay that goes beyond the standard offerings. This position is ideal for someone looking to innovate, learn continuously, and work as part of a dynamic and supportive team. Join us at Truelogic, where your passion for data and technology can flourish, and your contributions have a lasting impact!

Frequently Asked Questions (FAQs) for Semi-Senior Data Engineer (PySpark) - E-Learning Role at Truelogic

What are the responsibilities of a Semi-Senior Data Engineer (PySpark) at Truelogic?

As a Semi-Senior Data Engineer (PySpark) at Truelogic, you will be responsible for designing, developing, and optimizing data pipelines utilizing PySpark and Apache Spark. You'll be integrating data from various sources and ensuring data transformations are efficient for big data scenarios. Furthermore, collaborating with other teams, monitoring processes, and adhering to best practices in data engineering are integral to your role.