Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Software Engineer (Ray Data) image - Rise Careers
Job details

Software Engineer (Ray Data)

About Anyscale:


At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software developers of all skill levels. We’re commercializing Ray, a popular open-source project that's creating an ecosystem of libraries for scalable machine learning. Companies like OpenAIUberSpotifyInstacartCruise, and many more, have Ray in their tech stacks to accelerate the progress of AI applications out into the real world.


With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can scale an ML application from their laptop to the cluster without needing to be a distributed systems expert.


Proud to be backed by Andreessen Horowitz, NEA, and Addition with $250+ million raised to date.


About the role:

Ray aims to provide a universal API for building distributed applications (e.g. a machine learning pipeline of feature engineering, model training, and evaluation). Data is usually a core element connecting these different stages, and therefore plays a critical role in Ray’s usability, performance, and stability. We are looking for strong engineers to build, optimize, and scale Ray’s Datasets library and data processing capabilities in general.


About the Ray Data team:


The Ray Data team currently develops and maintains the Ray Datasets library, which is already powering critical production use cases (e.g. large scale data compaction at Amazon, and ML pipeline at Alibaba). Ray Datasets is a Python library built on top of Apache Arrow and Ray Core (Ray’s C++ backend), and the Ray Data team interacts closely with Ray Core components including the scheduler and the memory & I/O subsystems. The Ray Data team also works closely with Ray’s ML libraries including Train, RLlib, and Serve.


A snapshot of projects you will work on:

- Performance of Ray Datasets at large scale (leveraging Arrow primitives, optimizing Ray object manager, etc.)

- Integration with ML training and data sources

- Stability and stress testing infrastructure

- Lead future work integrating streaming workloads into Ray such as Beam on Ray

- Differentiate Data operations in Anyscale hosted Ray service

As part of this role, you will:

  • Develop high quality open source software to simplify distributed programming (Ray)

  • Identify, implement, and evaluate architectural improvements to Ray core and Datasets

  • Improve the testing process for Ray to make releases as smooth as possible

  • Communicate your work to a broader audience through talks, tutorials, and blog posts

We'd love to hear from you if have:

  • At least 2 year of relevant work experience

  • Solid background in algorithms, data structures, system design

  • Experience in building scalable and fault-tolerant distributed systems

  • Experience with data processing, database internals including Spark or Dask (streaming is a plus)

Compensation

  • At Anyscale, we take a market-based approach to compensation. We are data-driven, transparent, and consistent. The target salary for this role is $170,112 ~ $237,000. As the market data changes over time, the target salary for this role may be adjusted.

  • This role is also eligible to participate in Anyscale's Equity and Benefits offerings, including the following:

  • Stock Options

  • Healthcare plans, with premiums covered by Anyscale at 99%

  • 401k Retirement Plan

  • Education & Wellbeing Stipend

  • Paid Parental Leave

  • Fertility Benefits

  • Flexible Time Off

  • Commute reimbursement

  • 100% of in office meals covered

Anyscale Inc. is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. 


Anyscale Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish

Average salary estimate

$203556 / YEARLY (est.)
min
max
$170112K
$237000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Software Engineer (Ray Data), Anyscale

If you’re ready to elevate your career as a Software Engineer and make a real impact, Anyscale could be your next stop! At Anyscale, we’re on a bold mission to democratize distributed computing, ensuring that developers of all levels can harness the power of scalable machine learning through our flagship project, Ray. As a Software Engineer focused on Ray Data, you’ll work on the cutting-edge Ray Datasets library which plays a pivotal role in various high-stakes applications across major companies like Amazon and Alibaba. In this position, you’ll dive deep into building, optimizing, and scaling our data processing capabilities, enhancing performance at scale, and ensuring stability and usability of our libraries. You won’t just be writing code; you’ll be creating innovative solutions that drive our open-source software forward. You’ll collaborate closely with a talented team, leveraging knowledge of algorithms, system design, and fault-tolerance as you work on exciting projects involving both ML training integration and real-time data operations. We’re proud of our community-driven approach, and you’ll have the opportunity to communicate your work through talks and tutorials, sharing insights with fellow developers. With competitive compensation, equity offerings, and great benefits, joining Anyscale means working in an environment where you can grow and thrive while contributing to a mission-driven project. Let’s make computing accessible together!

Frequently Asked Questions (FAQs) for Software Engineer (Ray Data) Role at Anyscale
What are the responsibilities of a Software Engineer (Ray Data) at Anyscale?

As a Software Engineer (Ray Data) at Anyscale, your main responsibilities will include developing and optimizing the Ray Datasets library, building scalable and fault-tolerant distributed systems, and ensuring smooth performance and stability in data processing. You will also work on integration with machine learning training and data sources, implement architectural improvements, and enhance our testing processes to ensure high-quality releases.

Join Rise to see the full answer
What qualifications are required for the Software Engineer (Ray Data) position at Anyscale?

To qualify for the Software Engineer (Ray Data) position at Anyscale, candidates should possess at least two years of relevant work experience in software development. A solid understanding of algorithms, data structures, and system design is essential. Experience with scalable distributed systems, data processing frameworks like Spark or Dask, and familiarity with streaming workloads will significantly enhance your application.

Join Rise to see the full answer
What kind of projects can a Software Engineer (Ray Data) expect to work on at Anyscale?

As a Software Engineer (Ray Data) at Anyscale, you can expect to engage in influential projects such as improving the performance of the Ray Datasets library, integrating with machine learning training processes, and developing infrastructure for stress testing. You will also explore ways to incorporate streaming workloads within Ray, which contributes to the project's evolution and usability.

Join Rise to see the full answer
What is the work environment like for a Software Engineer (Ray Data) at Anyscale?

The work environment for a Software Engineer (Ray Data) at Anyscale is collaborative and open, with a focus on innovation and community-driven development. You will work alongside talented engineers in a culture that encourages knowledge sharing through talks, tutorials, and blog posts. With flexible time off and a range of benefits, Anyscale promotes a healthy work-life balance.

Join Rise to see the full answer
What are the compensation and benefits for the Software Engineer (Ray Data) role at Anyscale?

At Anyscale, compensation for the Software Engineer (Ray Data) role is based on a market-driven approach with a target salary ranging from $170,112 to $237,000, which is subject to change based on market data. Additionally, employees are eligible for stock options, extensive healthcare benefits with 99% premiums covered, a 401k retirement plan, and great perks such as a wellbeing stipend and paid parental leave.

Join Rise to see the full answer
Common Interview Questions for Software Engineer (Ray Data)
Can you explain your experience with scalable distributed systems?

When answering this question, highlight specific projects where you built or optimized distributed systems. Discuss the technologies you used, the challenges you faced, and how you ensured scalability and fault tolerance.

Join Rise to see the full answer
What algorithms do you consider important for processing large datasets?

Emphasize your understanding of algorithms that are specifically designed for handling large datasets, such as MapReduce or streaming algorithms. Providing examples of how you've applied these algorithms in previous projects will strengthen your answer.

Join Rise to see the full answer
How do you approach debugging in a distributed environment?

Discuss your methods for identifying and resolving issues in distributed systems, such as logging, tracing, and understanding the architecture. Explain how you prioritize issues and test for stability during the debugging process.

Join Rise to see the full answer
Can you provide an example of integrating data sources into a machine learning pipeline?

Detail a specific instance where you integrated data from various sources into a machine learning pipeline. Explain the steps you took and any challenges faced during integration and how you overcame them.

Join Rise to see the full answer
What is your experience with testing frameworks for open-source software?

Describe your familiarity with testing frameworks suited for open-source projects and emphasize any contributions you made towards improving testing processes in your previous roles. Highlight any testing methodologies you prefer.

Join Rise to see the full answer
How do you keep up with advancements in distributed computing?

Share your strategies for staying informed about the latest trends and technologies in distributed computing, such as blogging, attending webinars, or participating in relevant communities and discussions.

Join Rise to see the full answer
What are your thoughts on the future of machine learning in distributed systems?

Articulate your vision for how machine learning can evolve within distributed systems. Discuss emerging technologies and concepts such as real-time training and edge computing that you believe will shape the future.

Join Rise to see the full answer
What challenges have you faced when working with Ray or similar frameworks?

Provide a specific example of a challenge encountered while working with Ray or similar frameworks. Discuss how you highlighted the issue, the approach you took to solve it, and what was learned in the process.

Join Rise to see the full answer
How would you go about improving the performance of a data processing library?

Explain your strategies and considerations for boosting performance in data processing libraries, such as optimizing resource usage, parallel processing, and leveraging efficient algorithms.

Join Rise to see the full answer
Why do you want to work as a Software Engineer (Ray Data) at Anyscale?

Reflect on your motivations for joining Anyscale, emphasizing your passion for open-source software, your interest in distributed computing, and how you align with the company's mission to democratize computing for developers.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 3 days ago

Join Anyscale as the Head of Infrastructure to lead engineering teams in developing solutions for distributed AI applications using Ray in a remote work environment.

Photo of the Rise User

Anyscale seeks a Staff Software Engineer to spearhead the Model Training Infrastructure team, enhancing the scalability of machine learning applications.

Photo of the Rise User
Posted 9 days ago
Feedback Forward
Collaboration over Competition
Growth & Learning
Transparent & Candid

Become a pivotal part of BioRender's mission to transform scientific communication as a Senior/Lead Application Security Engineer.

Photo of the Rise User
Superduper Remote No location specified
Posted 13 days ago

Join Superduper as a Senior Frontend Developer and shape the future of crypto iGaming with cutting-edge technology and innovation.

Photo of the Rise User
Posted 5 days ago

Join MAG Aerospace as a Computer Programmer IV, where your expertise in software design and cybersecurity will contribute to vital operations at NIWC Atlantic in Jacksonville, FL.

xAI Hybrid Palo Alto, California, United States
Posted 12 days ago

Become a key player as an Android Engineer at xAI, where you will craft mobile experiences that redefine user interaction.

Photo of the Rise User
Posted 8 days ago

We're looking for a Senior Software Developer in Test (SDET) to lead our automated testing initiatives at Multi Media LLC.

Photo of the Rise User
Inclusive & Diverse
Startup Mindset
Collaboration over Competition
Growth & Learning

Join Outliant as a Solutions Architect II, driving the technical vision for our projects while utilizing your CMS expertise.

BMLL Technologies Remote No location specified
Posted 14 days ago

BMLL is looking for a Senior Lead C++ Developer to guide their engineering efforts in analytics data products within a cutting-edge FinTech environment.

Photo of the Rise User
Posted 12 days ago

Join Snke's dynamic team as a Senior Full Stack Engineer focusing on scalable AWS-based solutions while enjoying the flexibility of remote work.

We are building the future of software development.

54 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
April 3, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!
LATEST ACTIVITY
Photo of the Rise User
Someone from OH, Tiffin just viewed Game Operations Specialist at Genius Sports
u
Someone from OH, Loveland just viewed Customer Service Agent - Part Time at uhaul
Photo of the Rise User
Someone from OH, Cleveland just viewed HR Manager at Shearer's Foods
Photo of the Rise User
Someone from OH, Columbus just viewed Mid Level, System Administrator - (ETS) at Delivery Hero
Photo of the Rise User
173 people applied to Scrum Master-Remote at DICE
Photo of the Rise User
Someone from OH, Mason just viewed Inside Sales Co-Op at VEGA Americas
Photo of the Rise User
Someone from OH, Sandusky just viewed Director of IT at Kyo
Photo of the Rise User
Someone from OH, Delaware just viewed Practice Group Manager at LifeStance Health
Photo of the Rise User
Someone from OH, Avon Lake just viewed Advancement Specialist at Sierra Club
Photo of the Rise User
Someone from OH, Sidney just viewed Database Engineer Principal at Sagent
Photo of the Rise User
Someone from OH, North Canton just viewed Manager, Customer Success at impact.com
Photo of the Rise User
Someone from OH, Columbus just viewed Customer Experience Representative at MYOB
Photo of the Rise User
Someone from OH, Lakewood just viewed Production Scheduling Supervisor at Shearer's Foods
Photo of the Rise User
Someone from OH, Hilliard just viewed General Manager at Super Soccer Stars
Photo of the Rise User
Someone from OH, West Chester just viewed Independent Living Ambassador at Otterbein SeniorLife