We’re looking for a Staff Data Engineer to be the steward of our data layer, ensuring that our AI and ML models have clean, structured, and high-quality data. This is an opportunity for a high-performing engineer to take ownership of our data platform—designing and building scalable ingestion, transformation, and storage solutions for a fast-growing AI-driven sales intelligence product.
You’ll build and optimize data pipelines that ingest, transform, and correlate structured and unstructured data from multiple sources (CRM, public datasets, web scraping). You’ll work closely with ML and AI teams to ensure that our models are powered by the right data at the right time.
High ownership – You’ll be responsible for designing, maintaining, and evolving our data platform.
Be the expert – You’ll shape how data is structured, transformed, and optimized for ML models.
Direct impact – Your work will power AI-driven sales recommendations for enterprise users.
Own and maintain scalable data pipelines using Python, SQL, Airflow, and Spark (Databricks).
Develop data ingestion strategies using APIs, Airbyte, and web scraping.
Transform and clean data for ML models using Databricks (or Spark-based systems).
Optimize storage layers using a Medallion architecture (Bronze/Silver/Gold) approach.
Ensure data quality, governance, and observability across all pipelines.
Collaborate with ML, AI, and backend teams to integrate data into AI models.
Continuously refine and improve how data is structured, stored, and served.
5+ years of experience in data engineering with strong Python & SQL expertise.
Hands-on experience with Airflow, ETL pipelines, and Spark (Databricks preferred).
Experience integrating structured & unstructured data from APIs, CRMs, and web sources.
Ability to own and scale data infrastructure in a fast-growing AI-driven company.
Strong problem-solving skills and a desire to improve how data is structured for ML.
Exposure to Golang for API development (not required, but helpful).
Experience with MLOps (feature stores, model data versioning, SageMaker, ClearML).
Familiarity with Terraform, Kubernetes, or data pipeline automation.
Experience in database design to support customer-facing access patterns
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
At our thriving AI-driven company based in San Francisco, we are on the lookout for a passionate Sr. Data Engineer to take a significant role in shaping our data landscape. In this position, you'll be the steward of our data layer, ensuring that our AI and machine learning models have access to clean, structured, and high-quality data that they need to work their magic. This is an exciting opportunity to take ownership of our data platform, where you'll design and build scalable ingestion, transformation, and storage solutions for our cutting-edge sales intelligence product. Imagine building and optimizing data pipelines that handle both structured and unstructured data from a variety of sources, like CRMs and public datasets, while working closely with our brilliant ML and AI teams to guarantee that our models are powered by the right data at the right time. We value high ownership in our roles, meaning you will directly influence how our data is structured, transformed, and optimized for our ML models. Your contributions will have a direct impact on the effectiveness of our AI-driven sales recommendations for our enterprise users. If you have strong Python and SQL expertise and a desire to improve data infrastructure, we want to hear from you!
Subscribe to Rise newsletter