Job details

Software Data Engineer II

We are on a mission to rid the world of bad customer service by “mobilizing” the way help is delivered. Today’s consumers want an always-available customer service experience that leaves them feeling valued and respected. Helpshift helps B2B brands deliver this modern customer service experience through a mobile-first approach. We have changed how conversations take place, moving them away from a slow, outdated email and desktop experience to an in-app chat experience that lets users interact with brands on their own time. Through our market-leading AI-powered chatbots and automation, we help brands deliver rapid resolutions. Because agents play a key role in delivering help, our platform gives agents superpowers with automation and AI that simply works. Companies such as Scopely, Supercell, Brex, EA, and Square, along with hundreds of other leading brands, use the Helpshift platform to mobilize customer service delivery. Over 900 million monthly active consumers are enabled on 2B+ devices worldwide with Helpshift.

Some numbers that illustrate our scale:

85k requests per second (rps)

30ms response time

300 GB data transfer/hour

1000 VMs deployed at peak

About the team -

Consumers care first and foremost about having their time valued by brands. Brands need insights into their customer service operation to serve their consumers effectively. Such insights and analytics are delivered through various data products like in-app analytics dashboards and data-sharing integrations.

The data platform team is responsible for designing, building, and maintaining the data infrastructure that enables such data and analytics products at scale. We build and manage data pipelines, databases, and other data structures to ensure that the data is reliable, accurate, and easily accessible. We also support internal stakeholders with business intelligence, and machine learning teams with data ops. This team manages the platform that handles 2 million events per minute and processes 1+ terabytes of data daily.

About the role -

Build maintainable data pipelines, both for data ingestion and operational analytics, for data collected from 2 billion devices and 900M monthly active users
Build customer-facing analytics products that deliver actionable insights and data and make anomalies easy to detect
Collaborate with data stakeholders to understand their data needs and take part in the analysis process
Write design specifications and test, deployment, and scaling plans for the data pipelines
Mentor people in the team and the organization

3+ years of experience in building and running data pipelines that scale for TBs of data

Proficiency in a high-level object-oriented programming language (Python or Java) is a must
Experience with cloud data platforms like Snowflake and AWS (EMR/Athena) is a must
Experience in building modern data lakehouse architectures using Snowflake and open table and columnar formats like Apache Iceberg/Hudi, Parquet, etc.
Proficiency in data modeling, SQL query profiling, and data warehousing is a must
Experience with distributed data processing engines like Apache Spark, Apache Flink, Dataflow/Apache Beam, etc.

Knowledge of workflow orchestrators like Airflow, Dagster, etc. is a plus

Data visualization skills are a plus (PowerBI, Metabase, Tableau, Hex, Sigma, etc)
Excellent verbal and written communication skills
Bachelor’s Degree in Computer Science (or equivalent)

Hybrid setup
Worker's insurance
Paid time off
Other employee benefits to be discussed by our Talent Acquisition team in India.

Helpshift embraces diversity. We are proud to be an equal opportunity workplace and do not discriminate on the basis of sex, race, color, age, sexual orientation, gender identity, religion, national origin, citizenship, marital status, veteran status, or disability status.

Privacy Notice
By providing your information in this application, you understand that we will collect and process your information in accordance with our Applicant Privacy Notice. For more information, please see our Applicant Privacy Notice at https://www.keywordsstudios.com/en/applicant-privacy-notice.

What You Should Know About Software Data Engineer II, Keywords Studios

As a Software Data Engineer II at Helpshift, you’ll be at the forefront of revolutionizing customer service through innovative technology. Our mission is to empower B2B brands to offer a seamless, mobile-first customer support experience. You’ll play a crucial role by building and maintaining reliable data pipelines that support our platform, which currently serves over 900 million consumers around the globe. Your work will directly impact how brands interact with users in real time, providing actionable insights through customer-facing analytics products. You’ll collaborate closely with data stakeholders to ensure their needs are met, while also mentoring fellow team members to foster a collaborative learning environment. With your expertise in data modeling, cloud platforms like Snowflake and AWS, and advanced SQL skills, you’ll help us manage our robust data infrastructure, which processes over a terabyte of data daily. Additionally, your programming prowess in Python or Java, combined with experience in distributed data processing engines like Apache Spark, gives you the tools to thrive in this dynamic role. If you’re ready to take on the challenge of supporting one of the leading customer service platforms while enjoying a hybrid work setup, then Helpshift is the place for you!

Frequently Asked Questions (FAQs) for Software Data Engineer II Role at Keywords Studios

What are the responsibilities of a Software Data Engineer II at Helpshift?

As a Software Data Engineer II at Helpshift, your primary responsibilities will include designing, building, and maintaining data pipelines for ingestion and analytics, collaborating with stakeholders to understand their data needs, and creating customer-facing analytics products. You'll also be required to document design specifications and mentor team members, ensuring the delivery of accurate and reliable data in a fast-paced environment.