Eventual is a data platform that helps data scientists and engineers build data applications across ETL, analytics and ML/AI.
Our distributed data engine, Daft, is open source and runs on 800k CPU cores daily. That is more compute than Frontier, the world's largest supercomputer!
Daft is used at leading AI/ML companies such as Amazon, TogetherAI, EssentialAI, CloudKitchens and more. It makes ML/AI workloads easy and performant to run alongside traditional relational tabular workloads.
Today's “Big Data” tooling (Spark, Trino, Snowflake) was built for a world of tabular data analytics and does not generalize well to the needs of modern ML/AI data workloads. We built Daft to be the successor to these Big Data technologies, guided by the core principles below (a brief illustrative code sketch follows the list):
Python-native: Python is the native language of ML/AI and of most data engineering today
First-Class Local Development UX: Interactive development in a local Python notebook or script is where the magic happens
Multimodal Data Support: Modern workloads require support for operations on complex types such as long-form text, images, tensors and more
Heterogeneous Compute (GPUs): GPUs are a requirement for workloads that perform model batch inference as part of the overall query
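To make these principles concrete, here is a minimal sketch of what a Daft workload looks like in Python. It is purely illustrative and not taken from this posting: the tiny in-memory dataset, column names and UDF are assumptions, and real workloads would read multimodal data from object storage and scale out across a cluster.

# Minimal, illustrative sketch of Daft's Python-native style.
import daft
from daft import col

# A tiny in-memory DataFrame; real workloads would read Parquet/Iceberg data from object storage.
df = daft.from_pydict({
    "caption": ["a red truck", "two cats", "a mountain lake"],
    "likes": [12, 87, 5],
})

# A plain Python UDF that runs alongside ordinary relational operations
# (UDFs like this can also request GPUs for model batch inference).
@daft.udf(return_dtype=daft.DataType.int64())
def caption_length(captions):
    return [len(c) for c in captions.to_pylist()]

# Lazy query: filter rows, derive a new column, then materialize a preview.
df = df.where(col("likes") > 10).with_column("caption_len", caption_length(col("caption")))
df.show()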
As a Software Engineer on the Core Engine team, you will build key capabilities for the Daft distributed data engine.
You will work on the core architectural design and implementation of various components in Daft, including:
Planning/Query Optimizer: intelligently optimize users’ workloads with modern database techniques
Execution Engine: improve memory stability through the use of streaming computation and more efficient data structures
Distributed Scheduler: improve Daft’s resource utilization, task scheduling and fault tolerance
Storage: improve Daft's integrations with modern data lake technologies such as Apache Parquet, Apache Iceberg and Delta Lake (see the storage sketch after this list)
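As a rough, user-facing illustration of that storage work, the sketch below reads Parquet files from S3 and runs a simple lazy query; the bucket, path and column names are hypothetical. Because Daft queries are lazy, the planner and scan layer get the chance to push filters and column selection down toward the Parquet reader, which is the kind of optimizer and storage work described above.

# Hypothetical sketch: scanning Parquet files in S3 with a lazy query.
import daft
from daft import col

# Bucket, path and column names are made up for illustration.
df = daft.read_parquet("s3://my-bucket/events/*.parquet")

# Nothing executes until a materializing call such as .show() or .collect().
df = df.where(col("event_type") == "purchase").select("user_id", "amount")
df.show()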
Our goal is to build the world’s best open-source distributed query engine, becoming the leading framework for data engineering and analytics.
We are a young startup, so be prepared to wear many hats: tinkering with infrastructure, talking to customers and participating heavily in the core design of our product!
We are looking for a candidate with a strong foundation in systems programming and, ideally, experience building distributed data systems or databases (e.g. Hadoop, Spark, Dask, Ray, BigQuery, PostgreSQL):
3+ years of experience working with distributed data systems (query planning, optimization, workload pipelining, scheduling, networking, fault tolerance, etc.)
Strong fundamentals in systems programming (e.g. C++, Rust, C) and Linux
Familiarity and experience with cloud technologies (e.g. AWS S3)
Most importantly, we are looking for someone who works well in small, focused teams with fast iterations and lots of autonomy. If you are passionate, intellectually curious and excited to build the next generation of distributed data technologies, we want you on the team!
We believe in both the flexibility of remote work and the importance of in-person work, especially at the earliest stages of a startup. We take a flexible hybrid approach, with at least 3 days of in-person work per week, typically Monday through Wednesday, at our office in San Francisco.
We believe in providing employees with best-in-class compensation and benefits, including meal allowances and comprehensive health coverage (medical, dental, vision and more).
A short introductory call over video with one of our co-founders to get acquainted, understand your aspirations and evaluate whether there is a good fit with the type of role you are looking for.
A technical phone screen over video call to assess your technical abilities.
Technical interviews with the rest of the Eventual team to further explore your technical strengths, weaknesses and experience.
As many chats as necessary to get to know us - come have a coffee with our co-founders and existing team members to understand who we are and our goals, motivations and ambitions.
We look forward to meeting you!
We are well funded by investors such as Y Combinator, Caffeinated Capital, Array.vc and top angels in the valley from Databricks, Meta and Lyft.
Our team has deep expertise in high-performance computing, big data technologies, cloud infrastructure and machine learning. Our team members have previously worked at top technology companies such as Amazon, Databricks, Tesla and Lyft.
We are looking for exceptional individuals with a passion for technology and a strong sense of intellectual curiosity.
If that sounds like you, please reach out even if you don't see a specific role listed that matches your skillsets - we'd love to chat!