About Anyscale:
Anyscale provides a development platform intended to simplify distributed computing. This enables software developers of all skill levels to build applications that run at any scale from a laptop to the data center.
We're commercializing a popular open source project called
Ray - which is a framework for distributed computing as well as an ecosystem of libraries for scalable machine learning.
Anyscale is based in San Francisco, CA.
About the role:
Ray aims to provide a universal API for building distributed applications. To achieve this goal requires a distributed system with high levels of performance and reliability. We're looking for engineers with systems software experience that are interested in contributing to the Ray backend.
About the Runtime team :
The Runtime team develops and maintains the Ray C++ backend (e.g., distributed scheduler, language runtime integration, I/O and memory subsystems). We are responsible for the reliability, scalability, and performance of Ray as well as ensuring that Ray provides the right feature set to support higher level libraries and use cases. The team works on a balance of new features / distributed libraries, test infra improvements, debugging, and longer-term architectural improvements to Ray.
A snapshot of projects you can work on:
- Optimizing performance of large-scale workloads on Ray
- Stability and stress testing infrastructure
- Improving fault tolerance (HA)