Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Software Data Engineer II image - Rise Careers
Job details

Software Data Engineer II

We are on a mission to rid the world of bad customer service by “mobilizing” the way help is delivered. Today’s consumers want an always-available customer service experience that leaves them feeling valued and respected. Helpshift helps B2B brands deliver this modern customer service experience through a mobile-first approach. We have changed how conversations take place, moving the conversation away from a slow, outdated email and desktop experience to an in-app chat experience that allows users to interact with brands in their own time. Through our market-leading AI-powered chatbots and automation, we help brands deliver instant and rapid resolutions. Because agents play a key role in delivering help, our platform gives agents superpowers with automation and AI that simply works. Companies such as Scopely, Supercell, Brex, EA, Square along with hundreds of other leading brands use the Helpshift platform to mobilize customer service delivery. Over 900 million active monthly consumers are enabled on 2B+ devices worldwide with Helpshift.

Some numbers that illustrate our scale:

85k/rps

30ms response time

300 GB data transfer/hour

1000 VMs deployed at peak

About the team -

Consumers care first and foremost about having their time valued by brands. Brands need insights into their customer service operation to serve their consumers effectively. Such insights and analytics are delivered through various data products like in-app analytics dashboards and data-sharing integrations. 

The data platform team is responsible for designing, building, and maintaining the data infrastructure that enables such data and analytics products at scale. We build and manage data pipelines, databases, and other data structures to ensure that the data is reliable, accurate, and easily accessible. We also enable internal stakeholders with business intelligence and machine learning teams with data ops. This team manages the platform that handles 2 Million events per minute and processes 1+ terabytes of data daily.

About Role  - 

  • Building maintainable data pipelines both for data ingestion and operational analytics for data collected from 2 billion devices and 900M Monthly active users
  • Building customer-facing analytics products that deliver actionable insights and data, easily detect anomalies
  • Collaborating with data stakeholders to see what their data needs are and being a part of the analysis process
  • Write design specifications, test, deployment, and scaling plans for the data pipelines
  • Mentor people in the team & organization

  • 3+ years of experience in building and running data pipelines that scale for TBs of data
  • Proficiency in high-level object-oriented programming language (Python or Java) is must
  • Experience in Cloud data platforms like Snowflake and AWS, EMR/Athena is a must
  • Experience in building modern data lakehouse architectures using Snowflake and columnar formats like Apache Iceberg/Hudi, Parquet, etc
  • Proficiency in Data modeling, SQL query profiling, and data warehousing skills is a must
  • Experience in distributed data processing engines like Apache Spark, Apache Flink, Datalfow/Apache Beam, etc
  • Knowledge of workflow orchestrators like Airflow, Dasgter, etc is a plus
  • Data visualization skills are a plus (PowerBI, Metabase, Tableau, Hex, Sigma, etc)
  • Excellent verbal and written communication skills
  • Bachelor’s Degree in Computer Science (or equivalent)
  • Hybrid setup
  • Worker's insurance
  • Paid Time Offs
  • Other employee benefits to be discussed by our Talent Acquisition team in India.

Helpshift embraces diversity. We are proud to be an equal opportunity workplace and do not discriminate on the basis of sex, race, color, age, sexual orientation, gender identity, religion, national origin, citizenship, marital status, veteran status, or disability status

Privacy Notice
By providing your information in this application, you understand that we will collect and process your information in accordance with our Applicant Privacy Notice. For more information, please see our Applicant Privacy Notice at https://www.keywordsstudios.com/en/applicant-privacy-notice.

What You Should Know About Software Data Engineer II, Keywords Studios

As a Software Data Engineer II at Helpshift, you’ll be at the forefront of revolutionizing customer service through innovative technology. Our mission is to empower B2B brands to offer a seamless, mobile-first customer support experience. You’ll play a crucial role by building and maintaining reliable data pipelines that support our platform, which currently serves over 900 million consumers around the globe. Your work will directly impact how brands interact with users in real-time, providing actionable insights through customer-facing analytics products. You’ll collaborate closely with data stakeholders to ensure their needs are met, while also mentoring fellow team members to foster a collaborative learning environment. With your expertise in data modeling, cloud platforms like Snowflake and AWS, and advanced SQL skills, you’ll help us manage our robust data infrastructure, processing over a terabyte of data daily. Additionally, your programming prowess in Python or Java, combined with experience in distributed data processing engines like Apache Spark, gives you the tools to thrive in this dynamic role. If you’re ready to take on the challenge of supporting one of the leading customer service platforms while enjoying a hybrid work setup, then Helpshift is the place for you!

Frequently Asked Questions (FAQs) for Software Data Engineer II Role at Keywords Studios
What are the responsibilities of a Software Data Engineer II at Helpshift?

As a Software Data Engineer II at Helpshift, your primary responsibilities will include designing, building, and maintaining data pipelines for ingestion and analytics, collaborating with stakeholders to understand their data needs, and creating customer-facing analytics products. You'll also be required to document design specifications and mentor team members, ensuring the delivery of accurate and reliable data in a fast-paced environment.

Join Rise to see the full answer
What qualifications are necessary for a Software Data Engineer II position at Helpshift?

To qualify for the Software Data Engineer II role at Helpshift, candidates should have a Bachelor’s Degree in Computer Science or a related field, along with at least 3 years of experience in building scalable data pipelines. Proficiency in programming languages such as Python or Java, as well as experience with cloud data platforms like Snowflake and AWS, are essential. Familiarity with data processing engines like Apache Spark and experience in data modeling and warehousing are also important.

Join Rise to see the full answer
What technical skills are needed for a Software Data Engineer II at Helpshift?

A Software Data Engineer II at Helpshift needs a solid foundation in high-level object-oriented programming languages, particularly Python or Java. Additionally, expertise in cloud platforms like Snowflake and AWS, distributed data processing frameworks like Apache Spark, and data modeling practices is essential. Knowledge of workflow orchestration tools and data visualization platforms will also be beneficial.

Join Rise to see the full answer
How does the Software Data Engineer II contribute to Helpshift’s mission?

As a Software Data Engineer II, your contributions are vital to Helpshift’s mission of improving customer service. By developing data pipelines that facilitate analytics and insights, you help brands understand and enhance their customer interactions. Your work supports real-time responses to user inquiries, thereby improving overall service delivery and customer satisfaction.

Join Rise to see the full answer
What benefits does Helpshift offer to its Software Data Engineer II employees?

Helpshift offers a range of benefits to its Software Data Engineer II employees, including a hybrid work setup, workers' insurance, paid time off, and various other employee benefits that are detailed by the Talent Acquisition team. We value our team members and provide a supportive work environment with opportunities for professional growth and career advancement.

Join Rise to see the full answer
Common Interview Questions for Software Data Engineer II
Can you explain how you would design a data pipeline for handling terabytes of data?

When designing a data pipeline for terabytes of data, I would first assess the data's source, format, and required processing. I would then use scalable cloud solutions, such as AWS or Snowflake, to store and process the data efficiently. Implementing distributed processing frameworks like Apache Spark would allow for parallel processing, minimizing latency. Additionally, I’d ensure regular monitoring and logging of the pipeline's performance to detect any bottlenecks.

Join Rise to see the full answer
What experience do you have with SQL and data warehousing?

I have extensive experience with SQL, including writing complex queries for data retrieval, optimization, and profiling. My background includes work in data warehousing environments where I’ve designed star and snowflake schema models to ensure data is structured for efficient querying. I've also optimized query performance by indexing and partitioning tables.

Join Rise to see the full answer
How do you ensure the reliability and accuracy of data in your pipelines?

To ensure the reliability and accuracy of data in my pipelines, I implement rigorous testing protocols, including unit testing and end-to-end testing, to identify any issues before production. Monitoring data quality metrics is also crucial, alongside establishing alerting systems for anomalies. Furthermore, regular audits and reviews of data sources and processes ensure ongoing accuracy.

Join Rise to see the full answer
Describe a challenging data problem you faced and how you solved it.

In a previous role, I encountered an issue where a data ingestion pipeline was lagging due to unexpected data volume surges. To address this, I optimized the existing workflow by implementing asynchronous processing, which reduced the load on the system. Additionally, I scaled out the processing components using AWS resources to efficiently handle the increased volume, ultimately stabilizing the pipeline.

Join Rise to see the full answer
Are you familiar with any data visualization tools?

Yes, I have utilized various data visualization tools such as Tableau and Power BI to create dashboards that present data insights effectively. I focus on making the visualizations intuitive and accessible for stakeholders, allowing them to derive actionable insights from complex datasets.

Join Rise to see the full answer
What programming languages do you prefer for data engineering tasks and why?

I prefer using Python for data engineering tasks due to its versatility and extensive library support for data manipulation and processing. Its readability and community support make it a great choice for building maintainable data pipelines. However, I also appreciate Java for its performance capabilities in large-scale systems.

Join Rise to see the full answer
Can you discuss your experience with distributed data processing engines?

I have hands-on experience with distributed data processing engines, particularly Apache Spark. I’ve utilized Spark for batch processing as well as streaming data applications, allowing for real-time data analysis. This experience has taught me how to leverage these engines to efficiently handle large datasets while optimizing resource allocation.

Join Rise to see the full answer
How do you keep abreast of new trends and technologies in data engineering?

I stay updated on trends and technologies in data engineering by subscribing to relevant tech blogs, participating in online forums, and attending industry conferences. I also engage in continuous learning through online courses focusing on emerging tools like data lakehouses and new frameworks, ensuring that my skills remain current and relevant.

Join Rise to see the full answer
Describe your experience with cloud data platforms.

My experience with cloud data platforms, particularly AWS and Snowflake, has involved setting up data warehouses and implementing ETL processes. Additionally, I have utilized these platforms for data storage and analytics, benefitting from their scalability and performance capabilities to handle large volumes of data effectively.

Join Rise to see the full answer
What strategies would you use to mentor a junior data engineer?

To mentor a junior data engineer, I would adopt hands-on teaching, guiding them through real-world projects to enhance their practical skills. I’d conduct regular knowledge-sharing sessions, encouraging them to ask questions and challenge themselves. Providing constructive feedback and fostering a supportive learning environment would be fundamental to their development.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 17 hours ago
Photo of the Rise User
ServiceNow Remote Hamerton House 18-20 Aharon Bart St., Bldg. B 3rd Floor, Petah Tikva, Israel
Posted 5 days ago
Inclusive & Diverse
Mission Driven
Rise from Within
Diversity of Opinions
Work/Life Harmony
Empathetic
Feedback Forward
Take Risks
Collaboration over Competition
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Conferences Stipend
Paid Time-Off
Maternity Leave
Equity
Photo of the Rise User
Sprout.ai Remote No location specified
Posted 45 minutes ago
Photo of the Rise User
Posted 5 days ago
Photo of the Rise User
Nearsure Remote Latin America - Remote
Posted 5 days ago
Posted 4 days ago

Keywords Studios continues to build world leading capabilities in services that video game and similar interactive content creators need. We stand shoulder to shoulder with our clients working as their external development partner, providing acces...

101 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
January 7, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!