About Anyscale:
At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software developers of all skill levels. We’re commercializing Ray, a popular open-source project that's creating an ecosystem of libraries for scalable machine learning. Companies like OpenAI, Uber, Spotify, Instacart, Cruise, and many more, have Ray in their tech stacks to accelerate the progress of AI applications out into the real world.
With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can scale an ML application from their laptop to the cluster without needing to be a distributed systems expert.
Proud to be backed by Andreessen Horowitz, NEA, and Addition with $250+ million raised to date.
About the role:
Ray aims to provide a universal API for building distributed applications (e.g. a machine learning pipeline of feature engineering, model training, and evaluation). Data is usually a core element connecting these different stages, and therefore plays a critical role in Ray’s usability, performance, and stability. We are looking for strong engineers to build, optimize, and scale Ray’s Datasets library and data processing capabilities in general.
About the Ray Data team:
The Ray Data team currently develops and maintains the Ray Datasets library, which is already powering critical production use cases (e.g. large scale data compaction at Amazon, and ML pipeline at Alibaba). Ray Datasets is a Python library built on top of Apache Arrow and Ray Core (Ray’s C++ backend), and the Ray Data team interacts closely with Ray Core components including the scheduler and the memory & I/O subsystems. The Ray Data team also works closely with Ray’s ML libraries including Train, RLlib, and Serve.
A snapshot of projects you will work on:
- Performance of Ray Datasets at large scale (leveraging Arrow primitives, optimizing Ray object manager, etc.)
- Integration with ML training and data sources
- Stability and stress testing infrastructure
- Lead future work integrating streaming workloads into Ray such as Beam on Ray
- Differentiate Data operations in Anyscale hosted Ray service
As part of this role, you will:
Develop high quality open source software to simplify distributed programming (Ray)
Identify, implement, and evaluate architectural improvements to Ray core and Datasets
Improve the testing process for Ray to make releases as smooth as possible
Communicate your work to a broader audience through talks, tutorials, and blog posts
We'd love to hear from you if have:
At least 2 year of relevant work experience
Solid background in algorithms, data structures, system design
Experience in building scalable and fault-tolerant distributed systems
Experience with data processing, database internals including Spark or Dask (streaming is a plus)
Compensation
At Anyscale, we take a market-based approach to compensation. We are data-driven, transparent, and consistent. The target salary for this role is $170,112 ~ $237,000. As the market data changes over time, the target salary for this role may be adjusted.
This role is also eligible to participate in Anyscale's Equity and Benefits offerings, including the following:
Stock Options
Healthcare plans, with premiums covered by Anyscale at 99%
401k Retirement Plan
Education & Wellbeing Stipend
Paid Parental Leave
Fertility Benefits
Flexible Time Off
Commute reimbursement
100% of in office meals covered
Anyscale Inc. is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law.
Anyscale Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
If you’re ready to elevate your career as a Software Engineer and make a real impact, Anyscale could be your next stop! At Anyscale, we’re on a bold mission to democratize distributed computing, ensuring that developers of all levels can harness the power of scalable machine learning through our flagship project, Ray. As a Software Engineer focused on Ray Data, you’ll work on the cutting-edge Ray Datasets library which plays a pivotal role in various high-stakes applications across major companies like Amazon and Alibaba. In this position, you’ll dive deep into building, optimizing, and scaling our data processing capabilities, enhancing performance at scale, and ensuring stability and usability of our libraries. You won’t just be writing code; you’ll be creating innovative solutions that drive our open-source software forward. You’ll collaborate closely with a talented team, leveraging knowledge of algorithms, system design, and fault-tolerance as you work on exciting projects involving both ML training integration and real-time data operations. We’re proud of our community-driven approach, and you’ll have the opportunity to communicate your work through talks and tutorials, sharing insights with fellow developers. With competitive compensation, equity offerings, and great benefits, joining Anyscale means working in an environment where you can grow and thrive while contributing to a mission-driven project. Let’s make computing accessible together!
Join Anyscale as the Head of Infrastructure to lead engineering teams in developing solutions for distributed AI applications using Ray in a remote work environment.
Anyscale seeks a Staff Software Engineer to spearhead the Model Training Infrastructure team, enhancing the scalability of machine learning applications.
Become a pivotal part of BioRender's mission to transform scientific communication as a Senior/Lead Application Security Engineer.
Join Superduper as a Senior Frontend Developer and shape the future of crypto iGaming with cutting-edge technology and innovation.
Join MAG Aerospace as a Computer Programmer IV, where your expertise in software design and cybersecurity will contribute to vital operations at NIWC Atlantic in Jacksonville, FL.
Become a key player as an Android Engineer at xAI, where you will craft mobile experiences that redefine user interaction.
We're looking for a Senior Software Developer in Test (SDET) to lead our automated testing initiatives at Multi Media LLC.
Join Outliant as a Solutions Architect II, driving the technical vision for our projects while utilizing your CMS expertise.
BMLL is looking for a Senior Lead C++ Developer to guide their engineering efforts in analytics data products within a cutting-edge FinTech environment.
Join Snke's dynamic team as a Senior Full Stack Engineer focusing on scalable AWS-based solutions while enjoying the flexibility of remote work.
We are building the future of software development.
54 jobsSubscribe to Rise newsletter