ray · Apr 11, 2023 · 6:04 PM UTC

ray

ray

@raydistributed

11 Apr 2023

Distributed fine-tuning LLM is more cost effective than fine-tuning on a single instance! Check out the blog post on how to fine-tune and serve LLM simply, cost effectively using Ray + DeepSpeed and 🤗 hubs.ly/Q01K-BLT0

Blog | Anyscale

anyscale.com

288

49,920

ray · Apr 19, 2023 · 7:43 PM UTC

ray

@raydistributed

19 Apr 2023

Ray is a powerful ML framework, but with great power comes massive documentation. How can we make it more accessible? Now, using @langchain and Ray, we can build and deploy a doc search engine in about 100 lines of code -- with a self-hosted LLM! 1/n piped.video/watch?v=v7a8SR-s…

Open Source LLM Search Engine with LangChain on Ray

Waleed, Head of Engineering at Anyscale, explains how to use LangCh...

youtube.com

275

63,244

ray · Feb 10, 2021 · 7:11 PM UTC

ray

@raydistributed

10 Feb 2021

Announcing a new Ray + 🤗 @huggingface integration! RAG is a new NLP model that uses external documents to augment its knowledge. We’ve integrated Ray with RAG: - 🚄Speeding up retrieval calls by 2x - 💫Improving the scalability of fine tuning Blog: medium.com/distributed-compu…

Retrieval Augmented Generation with Huggingface Transformers and Ray

Huggingface Transformers recently added the Retrieval Augmented Generation (RAG) model, a new NLP architecture that leverages external…

medium.com

173

ray · Apr 7, 2020 · 5:26 PM UTC

ray

@raydistributed

7 Apr 2020

We're releasing RaySGD, a pytorch library that makes distributed training cheap and simple! Features: - fp16 training support - elastic training (automatic fault tolerance) - Integrated distributed HPO (w/ RayTune) - intuitive and pytorch-friendly APIs medium.com/distributed-compu…

Faster and Cheaper Pytorch with RaySGD

Distributed training is annoying to set up and expensive to run. Here’s a library to make distributed Pytorch training simple and cheap.

medium.com

170

ray · Apr 27, 2023 · 2:17 PM UTC

ray

@raydistributed

27 Apr 2023

Announcing Ray 2.4.0: Infrastructure for LLM training, tuning, inference, and serving. 🧠 LLM features 💽 Ray data for ease of use & stability 📊 Serve observability 🤖 RLlib’s module for custom reinforcement learning 🏢Ray scalability for large clusters hubs.ly/Q01MYBLr0

Announcing Ray 2.4.0: Infrastructure for LLM training, tuning, inference, and serving

anyscale.com

161

22,726

ray · Jul 1, 2020 · 6:09 PM UTC

ray

@raydistributed

1 Jul 2020

ML serving infra has evolved, and there are 3 key requirements - Framework agnostic (@TensorFlow, @PyTorch, pure Python, ...) - Pure Python (intuitive for developers) - Out of the box scalability Why? How does this relate to Ray and @huggingface? 🤗 👇 medium.com/distributed-compu…

The Simplest Way to Serve your NLP Model in Production with Pure Python

From scikit-learn to Hugging Face Pipelines, learn the simplest way to deploy ML models using Ray Serve.

medium.com

142

ray · Aug 15, 2023 · 4:02 PM UTC

ray

@raydistributed

15 Aug 2023

@BytedanceTalk, the company behind TikTok, uses Ray for fast & cheap offline inference with multi-modal #LLMs. They generate embeddings for a staggering 200 TB of image and text data using a model with >10B parameters. anyscale.com/blog/how-byteda… 🧵 Thread below 👇

How ByteDance Scales Offline Inference with Multi-Modal LLMs

ByteDance, the company behind Tiktok, leverages multi-modal models to enable many applications, such as text-based image retrieval or object detection.

anyscale.com

113

60,710

ray · Nov 2, 2020 · 4:19 PM UTC

ray

@raydistributed

2 Nov 2020

You can now tune your @huggingface transformer Trainer with RayTune (tune.io/) in 1 line of code! ⚡️Access Bayesian Optimization, Population-based Training to superpower your model 🧙‍♂️Use Multi-GPU and Multi-node support Blog post: anyscale.com/blog/hyperparam…

117

ray · Sep 30, 2020 · 5:01 PM UTC

ray

@raydistributed

30 Sep 2020

Ray 1.0 is up on Github and PyPI (w/ new beautiful docs - docs.ray.io/en/latest/index.…)! 🎉This is a huge and important release, with many new APIs and tons of new committers! 🔖 Read about Ray 1.0 on our blog post (anyscale.com/blog/announcing…)

107

ray · Aug 20, 2021 · 6:23 PM UTC

ray

@raydistributed

20 Aug 2021

🎉 Say hello to Ray Lightning — a faster and simpler path to multi-node distributed training for @PyTorchLightnin⚡️. Change 1 line to scale your PyTorch Lightning training to a multi-node GPU cluster. Give it a try and let us know what you think! anyscale.com/blog/introducin…

Introducing Ray Lightning: Multi-node PyTorch Lightning training made easy | Anyscale

anyscale.com

ray · May 2, 2023 · 6:38 PM UTC

ray

@raydistributed

2 May 2023

Part 2 of our Ray + LangChain Series is ready, in this part we’ll show you how to turbocharge generation of embeddings. See the video(9 minutes) at hubs.ly/Q01Np5sh0 and blog post at hubs.ly/Q01Np8090

This link will take you to a page that’s not on LinkedIn

lnkd.in

18,882

ray · Mar 7, 2025 · 9:23 PM UTC

ray

@raydistributed

7 Mar 2025

ByteScale is a new LLM training framework - Evaluated 7B to 141B param models - 256K to 2048K context lengths - 12,000 GPUs - Optimized for mixed long and short sequences The crux of it is a much more dynamic parallelism strategy (as opposed to a static mesh) to account for heterogeneity in sequence length. They call this strategy Hybrid Data Parallelism (HDP), which combines regular data parallelism with context parallelism in a dynamic manner. Their data loading strategy is very network and CPU-memory intensive and requires global coordination across workers (as opposed to each worker doing its own thing). They use Ray actor for this coordination. There are - Servers to fetch and preprocess raw data from HDFS and generate metadata - A scheduler to collect global metadata from all servers, figure out the the loading plan, and broadcast the plan to clients - Clients (on GPUs), which read the partial data from servers based on the loading plan

18,448

ray · Apr 24, 2025 · 9:53 PM UTC

ray

@raydistributed

24 Apr 2025

vLLM + Ray is a powerful combo for post-training.

vLLM

@vllm_project

24 Apr 2025

OpenRLHF is a pioneering framework to use vLLM for RLHF, driving many design and implementation of vLLM's features for RLHF, making vLLM a popular choice for many RLHF frameworks. Learn more about the story at blog.vllm.ai/2025/04/23/open…

8,613

ray · Aug 26, 2020 · 1:55 PM UTC

ray

@raydistributed

26 Aug 2020

hyperparameter tuning for #NLProc is often overlooked, but by using @huggingface transformers + tuning techniques such as PBT, you can increase model accuracy by up to 5% on certain fine-tuning tasks *without increasing your compute budget*! 🔖 read it: medium.com/@amog_97444/c4e32…

ray · Sep 7, 2023 · 8:19 PM UTC

ray

@raydistributed

7 Sep 2023

The team @MetaAI has done a tremendous amount to move the field forward with the Llama models. We're thrilled to collaborate to help grow the Llama ecosystem. anyscale.com/blog/anyscale-a…

Anyscale and Meta Collaborate to Advance the Llama-2 Ecosystem

We are excited to announce collaboration between Meta and Anyscale to bolster the Llama ecosystem.

anyscale.com

77,568

ray · May 13, 2021 · 8:01 PM UTC

ray

@raydistributed

13 May 2021

JAX is a system for high-performance machine learning research and numerical computing. At #RaySummit, @GoogleAI's @SingularMattrix will show how JAX is used in #neuralnet training, probabilistic programming & more. Register to join live or on-demand bit.ly/3vUpv9x

ray · Mar 9, 2022 · 6:00 PM UTC

ray

@raydistributed

9 Mar 2022

Data scientist != infra engineer. Thanks @marksaroufim for joining our Ray Meetup last week and sharing how to make it easier to train large-scale #ML jobs in #opensource. If you missed it, you can watch the recording here: ow.ly/SqcS50IewPm

Ray Train, PyTorch, TorchX, and distributed deep learning

Welcome to our second Ray meetup, where we focus on Ray’s native libraries for scaling machine learning workloads. We'll discuss Ray Train, a production-ready distributed training library for deep...

anyscale.com

ray · Aug 24, 2020 · 5:59 PM UTC

ray

@raydistributed

24 Aug 2020

excited to see Ray Tune integrated into the awesome 🤗@huggingface Transformers!

Sylvain Gugger @GuggerSylvain

24 Aug 2020

Hyperparameter search with optuna or Ray Tune is now fully integrated in Trainer (support for TF coming soon!) Tutorials coming soon but in the meantime the docs are a good way to get started with it huggingface.co/transformers/…

ray · Mar 27, 2023 · 2:29 PM UTC

ray

@raydistributed

27 Mar 2023

ICYM our blogs on Ray and Generative AI. We have a three-part series on how to use Ray to productionize common generative AI model workloads. Here are parts 1 and 2: 👉 hubs.ly/Q01JcZyd0 👉hubs.ly/Q01JcYmq0 #Ray for #GenerativeAI #workloads

50,135

ray · Sep 27, 2023 · 4:28 PM UTC

ray

@raydistributed

27 Sep 2023

🎉 Announcing Ray Serve and Anyscale Services general availability! Teams at @LinkedIn, @Samsara, @AntGroup + many more have been using Ray to serve LLMs & multi-modal applications in a flexible, performant and scalable way. Read more about the GA release and how companies have been using it for both: - 🛠️ development (python API + local testing, model composition & multiplexing, heterogeneous cluster support, etc.) - 🚀 production (high availability, observability tools, autoscaling, canary rollouts, etc.) anyscale.com/blog/tackling-t…

10,865

ray · May 7, 2024 · 4:38 PM UTC

ray

@raydistributed

7 May 2024

Ray operates at two levels: Ray Core, which scales Python functions and classes with tasks and actors, and its libraries, offering easy-to-use abstractions tailored for ML workloads. #Ray #ML #DistributedComputing

16,600

ray · Jul 26, 2024 · 4:51 PM UTC

ray

@raydistributed

26 Jul 2024

We don't hear the term *exabyte* too frequently. This is an impressive use case. aws.amazon.com/blogs/opensou…

Amazon’s Exabyte-Scale Migration from Apache Spark to Ray on Amazon EC2 | Amazon Web Services

Large-scale, distributed compute framework migrations are not for the faint of heart. There are backwards-compatibility constraints to maintain, performance expectations to meet, scalability limits...

aws.amazon.com

25,560

ray · May 9, 2024 · 4:21 PM UTC

ray

@raydistributed

9 May 2024

We pretrained a stable diffusion model on 2 billion images for under $40K. Here's what we learned. anyscale.com/blog/scalable-a…

Reducing the Cost of Pre-training Stable Diffusion by 3.7x with Anyscale

In this blog post, we introduce an advanced pre-training solution for Stable Diffusion v2 models, leveraging the power of the Ray and Anyscale Platform to enhance scalability and cost efficiency.

anyscale.com

24,114

ray · Sep 14, 2023 · 5:51 PM UTC

ray

@raydistributed

14 Sep 2023

Cloud TPUs from @googlecloud are one of the most cost-effective ways to train and serve LLMs. In 2.7, Ray finally will support TPUs natively -- Ray enables a more intuitive TPU developer experience, allowing you to train and serve on massive TPU pods with ease. Learn more at Ray Summit raysummit.anyscale.com/agend…

8,774

ray · Jun 7, 2021 · 1:16 PM UTC

ray

@raydistributed

7 Jun 2021

Deep RL has become fairly capable of optimizing reward; however, how do you choose the reward function to be optimized? @pabbeel will discuss some recent progress in this area in his #RaySummit talk "Human-in-the-Loop Reinforcement Learning" Register: bit.ly/3ij3OMw

ray · May 17, 2024 · 8:26 PM UTC

ray

@raydistributed

17 May 2024

🚀 Announcing the Ray Distributed Debugger! 🚀 An integrated debugging experience within VSCode. 1⃣ Set breakpoints to pause tasks and inspect variables. 2⃣ Post-mortem debugging: Analyze state after an error. More: anyscale.com/blog/ray-distri… piped.video/watch?v=EiGHHUXL…

Easily Debug Ray Applications with Ray Distributed Debugger

anyscale.com

12,208

ray · May 4, 2023 · 3:18 PM UTC

ray

@raydistributed

4 May 2023

Offline Batch Inference: Comparing Ray, Apache Spark & SageMaker. Image classification benchmarks show that #Ray outperforms while linearly scaling to TB-level data sizes 💽 📈 SageMaker Batch Transform by 17x 📊 Apache Spark by 2x and 3x hubs.ly/Q01NHV2K0

Offline Batch Inference | Ray, Apache Spark & SageMaker

We conduct a comparison of three different solutions for offline batch inference: AWS SageMaker Batch Transform, Apache Spark, and Ray Data.

anyscale.com

9,040

ray · May 9, 2020 · 4:56 PM UTC

ray

@raydistributed

9 May 2020

What enables Ray to be so much faster than Python multiprocessing? A combination of efficient handling of numerical data through @ApacheArrow and a set of abstractions more appropriate for building stateful services/actors. towardsdatascience.com/10x-f…

ray · Apr 22, 2021 · 10:48 PM UTC

ray

@raydistributed

22 Apr 2021

🎉🍾🥳 Ray 1.3 is out! Featuring: * Published scalability limits (github.com/ray-project/ray/t…) * Ray Client enabled by default * Object spilling is now turned on by default. * Faster autoscaling for Ray Tune * R2D2 @PyTorch and TF implementation for RLlib github.com/ray-project/ray/r…

ray · Apr 19, 2024 · 12:47 AM UTC

ray

@raydistributed

19 Apr 2024

With Ray 2.11.0, we switched to weekly releases (previously every 6 weeks)! This is a huge change and will get features and fixes to users faster. This has been a big investment in our overall velocity. github.com/ray-project/ray/r…

Release Ray-2.11.0 · ray-project/ray

Release Highlights [data] Support reading Avro files with ray.data.read_avro [train] Added experimental support for AWS Trainium (Neuron) and Intel HPU. Ray Libraries Ray Data 🎉 New Features: S...

github.com

5,965

ray · Mar 9, 2021 · 6:13 PM UTC

ray

@raydistributed

9 Mar 2021

🔥 Modin (github.com/modin-project/mod…) is a popular library that can scale your pandas workflows by changing one line of code -- using Ray! Learn how below: medium.com/distributed-compu…

GitHub - modin-project/modin: Modin: Scale your Pandas workflows by changing a single line of code

Modin: Scale your Pandas workflows by changing a single line of code - modin-project/modin

github.com

ray · May 6, 2024 · 4:45 PM UTC

ray

@raydistributed

6 May 2024

Ray is emerging as a standard for AI workloads, powering AI at companies like OpenAI, Uber, and Netflix. What sets Ray apart is its rich ecosystem of libraries tailored for various distributed computing tasks across the AI lifecycle. docs.ray.io/en/latest/index.…

6,502

ray · May 6, 2021 · 7:35 PM UTC

ray

@raydistributed

6 May 2021

Growing demand for applications & HW specialization create huge burdens for learning systems at the center of intelligent applications today. At #RaySummit, see how @tqchenml addresses these challenges using the @XGBoostProject @ApacheTVM systems he built bit.ly/3b28Atg

ray · Aug 24, 2021 · 8:14 PM UTC

ray

@raydistributed

24 Aug 2021

🎉 Microsoft Researchers have developed FLAML (Fast Lightweight AutoML) which can now utilize Ray Tune for distributed hyperparameter tuning to scale up FLAML's resource-efficient & easily parallelizable algorithms across a cluster! 🎉 Learn more: anyscale.com/blog/fast-autom…

Fast AutoML with FLAML + Ray Tune | Anyscale

anyscale.com

ray · Nov 16, 2022 · 4:13 PM UTC

ray

@raydistributed

16 Nov 2022

Use gang-scheduling on Ray Clusters on #Kubernetes w/ #KubeRay & Multi-Cluster-App-Dispatcher (MCAD) to scale training #GLUE workloads 👉 Easy MCAD + KubeRay integration to scale Ray Clusters on #k8s 👉 Accelerate fine-tune #NLU tasks w/ multiple GPUs anyscale.com/blog/gang-sched…

Gang Scheduling Ray Clusters on Kubernetes with MCAD

Unblock your ML workload using KubeRay with the Multi-Cluster-App-Dispatcher (MCAD) Kubernetes controller.

anyscale.com

ray · May 21, 2024 · 4:35 PM UTC

ray

@raydistributed

21 May 2024

See how we optimized large-scale ML training in Part 3 of our Stable Diffusion series! We used Ray Train, Ray Data, and PyTorch Lightning to train on 2B images with fault tolerance, data streaming, and advanced strategies like FSDP and DDP. Read more: anyscale.com/blog/we-pre-tra…

We Pre-Trained Stable Diffusion Models on 2 billion Images and Didn't Break the Bank - Definitive...

anyscale.com

9,803

ray · Apr 19, 2023 · 7:43 PM UTC

ray

@raydistributed

19 Apr 2023

Blog: anyscale.com/blog/llm-open-s… @langchain provides an amazing suite of tools for everything around LLMs. There are tools (chains) for prompting, indexing, generating and summarizing text. While an amazing tool, using Ray with it can make LangChain even more powerful. 2/n

Building an LLM Open-Source Search Engine in 100 Lines

In part 1 of a new blog series, we show how to build a search engine in 100 lines using LLM embeddings and a vector database.

anyscale.com

2,319

ray · Mar 19, 2021 · 6:47 PM UTC

ray

@raydistributed

19 Mar 2021

✨Ray is becoming a critical component for the next generation of ML platforms! Check out this recent blog post about how @Uber is leveraging Ray for elastic deep learning with Horovod to enable their rapidly growing usage of deep learning: eng.uber.com/horovod-ray/?ut…

ray · May 8, 2024 · 10:43 PM UTC

ray

@raydistributed

8 May 2024

Just dropped Ray 2.21.0, our 4th weekly release (since we started doing weekly releases). - Ray Data @lancedb connector - Ray Data improved retries - Ray Serve improved batching - Many fixes across libraries - Improved dashboard Tons of contributors over the past week 👇

4,765

ray · May 15, 2020 · 7:10 PM UTC

ray

@raydistributed

15 May 2020

Imagine if your random forest classifier training/tuning was 30x faster while getting 5% more accurate. Wouldn't that be awesome? Today, by leveraging the RAPIDS library with Ray Tune, you can do that. See how in exciting new post: medium.com/rapids-ai/30x-fas… #GTC2020 #RayTune

30x Faster Hyperparameter Search with RayTune and RAPIDS

With RayTune and RAPIDS you can now tune Random Forest Classifiers 30x faster — while getting a 5% accuracy boost.

medium.com

RAPIDS AI @RAPIDSai

15 May 2020

With @rapidsai and @raydistributed #RayTune, you can now tune Random Forest Classifiers 30x faster -- while getting a 5% accuracy boost. See how. nvda.ws/2WCsqou

ray · Aug 24, 2022 · 5:49 PM UTC

ray

@raydistributed

24 Aug 2022

Exciting talk from @dariogila with @IBM on the future of quantum computing, and how @raydistributed could be the key for its success.

ray · Jun 24, 2020 · 7:25 PM UTC

ray

@raydistributed

24 Jun 2020

0.8.6 is out! - Support for Windows (alpha)! - Releasing Ray Serve, a scalable model-serving library! Check out a tutorial for serving @PyTorch models: docs.ray.io/en/master/serve/… - Ray Dashboard now supports GPU monitoring! And more! Release notes: github.com/ray-project/ray/r…

ray · Sep 16, 2021 · 4:47 PM UTC

ray

@raydistributed

16 Sep 2021

As technology has advanced, ML architectures have evolved. One way to see it is in terms of generations: - 1st gen involved "fixed function" pipelines - 2nd gen involved programmability within the pipeline What will be the next gen of ML architectures? bit.ly/2XyG9zR

The 3rd Generation of Production ML Architectures | Anyscale

anyscale.com

ray · Jul 27, 2021 · 6:40 PM UTC

ray

@raydistributed

27 Jul 2021

🎉🍾🥳 Ray 1.5 is out! Featuring: - Ray Datasets now in alpha - LightGBM on Ray in beta - The Ray cluster launcher now has support for launching clusters on Aliyun - RLlib added an improved "input API" for customizing offline datasets Learn more ⬇️ github.com/ray-project/ray/r…

ray · Jun 13, 2023 · 2:29 PM UTC

ray

@raydistributed

13 Jun 2023

Announcing Ray 2.5 release features: 👉 Support #LLMs training with Ray Train 👉 Serve #LLMs with Ray Serve 👉 Multi-GPU learner stack in #RLlib for cost efficiency & scalable RL-agent training 👉 Performant & improved approach to batch inference at scale hubs.ly/Q01TjbM00

Ray 2.5 | Training & Serving for LLMs, Multi-GPU Training & More

anyscale.com

2,506

ray · Feb 9, 2021 · 8:23 PM UTC

ray

@raydistributed

9 Feb 2021

Check out the new @MLflow and @raydistributed integrations for tuning models, tracking experiments, and deploying models. medium.com/distributed-compu…

Ray & MLflow: Taking Distributed Machine Learning Applications to Production

By Amog Kamsetty and Archit Kulkarni

medium.com

ray · Mar 25, 2021 · 6:18 PM UTC

ray

@raydistributed

25 Mar 2021

First sessions for #RaySummit program are up! Join the annual gathering of the global @raydistributed community for the latest in distributed computing. Speakers include @TravisAddair @eric_brewer @tqchenml @slbird @dawnsongtweets & more ➡️raysummit.org

ray · May 6, 2020 · 3:16 PM UTC

ray

@raydistributed

6 May 2020

"A Step-by-Step Guide to Scaling Your First Python Application in the Cloud" by Bill Chambers link.medium.com/W0Yj2hbNg6. You'll learn how to install Ray, create an app, test on your local machine, spin up a Ray cluster in the cloud, deploy your app, ... and more!

A Step-by-Step Guide to Scaling Your First Python Application in the Cloud

Every idea needs a Medium

link.medium.com

ray · May 11, 2023 · 3:52 PM UTC

ray

@raydistributed

11 May 2023

Streaming distributed execution across CPUs and GPUs: Learn how Ray Data streaming works and how to use it for your own ML pipelines. hubs.ly/Q01Px5WS0

How to Stream Distributed Execution Across CPUs & GPUs

This blog post delves into how Ray Data streaming works and how to use it for your own ML pipelines distributed across both CPU and GPU devices.

anyscale.com

2,046

ray · Jun 7, 2024 · 5:50 PM UTC

ray

@raydistributed

7 Jun 2024

Just dropped Ray 2.24.0 🥂🥳 🎗️ Tons of new work on observability, particularly around machine failures. Why did a node die (failure, scaling down, spot preemption, etc). 🔥 Critical bug fixes across Ray core and the Ray dashboard. 🎂 New features in Ray Data and Ray Serve.

5,328

ray · Nov 30, 2021 · 5:18 PM UTC

ray

@raydistributed

30 Nov 2021

Distributed libraries allow improved performance by exploiting the full bandwidth of distributed memory, and giving greater programmability. But how does that actually work? What does the code look like? Learn more ⬇️ bit.ly/3o6a6l8

ray · Aug 26, 2021 · 4:55 PM UTC

ray

@raydistributed

26 Aug 2021

🎉 New Introductory Tutorial on Reinforcement Learning (RL) with OpenAI Gym, RLlib, and Google Colab! 🎉 bit.ly/2Wlnx5W The tutorial explores: - What is RL - The OpenAI Gym CartPole Environment - The Role of Agents in RL & how to train them using RLlib

ray · Jun 14, 2024 · 1:47 AM UTC

ray

@raydistributed

14 Jun 2024

AI for protein design.

Robert Nishihara

@robertnishihara

14 Jun 2024

Amazing work by @DreamFoldAI. FoldFlow-2 is a generative model for protein structure, which is important for protein design. Trained on @anyscalecompute.

2,862

ray · Jul 22, 2020 · 5:24 PM UTC

ray

@raydistributed

22 Jul 2020

ML serving is broken - Ray Serve can fix it! Thread (1/n) 🙁Wrapping models in Flask doesn’t scale 🙁TorchServe, TFServing requires setting up a traditional web server 😊 Ray Serve lets you deploy your ML models with a simple Python interface! medium.com/distributed-compu…

Machine Learning Serving is Broken

And How Ray Serve Can Fix it

medium.com

ray · Dec 6, 2021 · 5:56 PM UTC

ray

@raydistributed

6 Dec 2021

💥🎉 Ray version 1.9 is here! Featuring: ✅ Ray Train is now in beta! ✅ Ray Datasets now supports groupby and aggregations! ✅ Ray Docker images for multiple CUDA versions are now provided! bit.ly/3pzpIgr

ray · Jun 8, 2021 · 10:36 PM UTC

ray

@raydistributed

8 Jun 2021

🎉🍾🥳 Ray 1.4 is out! Highlights include: - Ray Serve has a new deployment centric API! - Ray now has support for namespaces. (Docs: docs.ray.io/en/master/namesp…) - RLlib now has multi-GPU support for PyTorch models! Learn more ⬇️ github.com/ray-project/ray/r…

ray · Jul 13, 2021 · 4:14 PM UTC

ray

@raydistributed

13 Jul 2021

🎉 New Tutorial on Serverless Kafka Stream Processing with Ray! Featuring: - Ray Clusters that autoscale to meet the demands of a stream processing job - How Ray can be paired with @apachekafka Learn more ⬇️ anyscale.com/blog/serverless…

ray · Nov 4, 2021 · 4:34 PM UTC

ray

@raydistributed

4 Nov 2021

💥🎉 Ray version 1.8 is here! Featuring: ✅ Ray SGD has been renamed to Ray Train ✅ Ray Datasets, now beta, has a new integration with Ray Train for scalable ML ingest ✅ Experimental support for Ray on Apple Silicon (M1 Macs) bit.ly/3k3eIGw

ray · Apr 4, 2025 · 6:26 PM UTC

ray

@raydistributed

4 Apr 2025

Uber built a unified ML platform that abstracts away infra complexity — letting teams run Ray jobs without worrying about clusters or resource placement. @raydistributed + @kubernetesio handle orchestration and scaling across @Uber's fleet. 🤝 Full setup breakdown 👇 uber.com/blog/ubers-journey-…

Uber’s Journey to Ray on Kubernetes: Ray Setup

Uber’s taken steps to enhance and modernize its machine learning platform. As part of this enhancement, in early 2024, Uber migrated its machine learning workloads to Kubernetes®. This blog is the...

uber.com

8,021

ray · Feb 24, 2023 · 3:02 PM UTC

ray

@raydistributed

24 Feb 2023

Ray 2.3.0 Released with: ⭐️ Observability enhancements ⭐️ Ray Dataset Streaming ⭐️Boost in Ray core performance ⭐️Gym/Gymnasium library in #RLlib ⭐️ Support ARM & Python 3.11 ⭐️ Support multiple applications in Ray Serve (developer preview) anyscale.com/blog/announcing…

Announcing Ray 2.3: performance improvements, new features and new platforms

The Ray 2.3 release features exciting improvements across the Ray ecosystem. In this blog post, we will highlight new features, performance enhancements, and support for new platforms.

anyscale.com

1,454

ray · Feb 1, 2023 · 5:52 PM UTC

ray

@raydistributed

1 Feb 2023

Ray continues to enable #ML teams innovate at scale & unleash new use cases. @Spotify shares how #Ray helps #ML practitioners innovate & how they built ML platform atop Ray.

Spotify Engineering

@SpotifyEng

1 Feb 2023

"Our goal for Spotify’s ML Platform has always been to create a seamless user experience for ML practitioners who want to take an ML application from development to production..." And so, we introduced @raydistributed to our @Spotify ecosystem. engineering.atspotify.com/20…

2,693

ray · Oct 15, 2020 · 6:39 PM UTC

ray

@raydistributed

15 Oct 2020

🙌🙌 With the v3.0 release, you can use Ray to train @spacy_io on one or more remote machines, potentially speeding up your training process. explosion.ai/blog/spacy-v3-n…

Introducing spaCy v3.0 · Explosion

spaCy v3.0 is a huge release! It features new transformer-based pipelines that get spaCy's accuracy right up to the current state-of-the-art, and a new workflow system to help you take projects from...

explosion.ai

You’re unable to view this Post because this account owner limits who can view their Posts.

ray · Aug 23, 2022 · 6:25 PM UTC

ray

@raydistributed

23 Aug 2022

Co-creator of @PyTorch at Meta AI @soumithchintala shares how various project co-exist with @raydistributed at #raysummit.

ray · May 30, 2020 · 11:43 PM UTC

ray

@raydistributed

30 May 2020

Surprisingly, most popular key-value stores don't support shared-memory! The Plasma Store, part of @ApacheArrow, does. In conjunction with Arrow’s data layout, this enables super fast sharing of data between multiple processes on the same machine. ray-project.github.io/2017/0…

ray · May 3, 2024 · 3:56 AM UTC

ray

@raydistributed

3 May 2024

Ray and Uber have a long history, including 1. uber.com/blog/horovod-ray/ 2. uber.com/blog/elastic-xgboos… 3. uber.com/blog/from-predictiv…

Elastic Deep Learning with Horovod on Ray

Uber Sites

uber.com

Robert Nishihara

@robertnishihara

3 May 2024

This is a fantastic read on Uber's AI 8 year AI journey. From (1) predictive ML on tabular data to (2) adopting deep learning to (3) venturing into generative AI. It's amazing to see that @raydistributed has played a role in enabling deep learning and LLM training at Uber.

3,437

ray · Mar 3, 2021 · 3:29 PM UTC

ray

@raydistributed

3 Mar 2021

Announcing a collaboration between PyCaret + Ray! 🔥PyCaret (pycaret.org/) is a popular low-code ML library in Python. A new contributed blog shows how #PyCaret integrated Ray's tune-sklearn (github.com/ray-project/tune-…) to simplify model tuning! medium.com/distributed-compu…

ray · Jun 8, 2021 · 6:26 PM UTC

ray

@raydistributed

8 Jun 2021

At #RaySummit, @vanpelt will discuss the @wandb tool Tables + new Artifacts features that let you visualize & query datasets & model evaluations at the example level as well as integrate with Ray. Register: bit.ly/3fLnRBA

ray · Jul 22, 2021 · 7:38 PM UTC

ray

@raydistributed

22 Jul 2021

🎉New blog post on the most popular RL talks from Ray Summit 2021! Including: - 24x Speedup for RL (Raoul Khouri) - Orchestrating Robotics Operations with SageMaker + RLlib (@SahikaGenc) - Offline RL with RLlib (@edilmop) - Neural MMO (@jsuarez) bit.ly/3hXrF3I

Blog | Anyscale

anyscale.com

ray · Mar 2, 2020 · 1:15 AM UTC

ray

@raydistributed

2 Mar 2020

New blog post, "Scaling Python Asyncio with Ray" by Simon Mo link.medium.com/GTWU6LxWv4

Scaling Python Asyncio with Ray

Every idea needs a Medium

link.medium.com

ray · Feb 28, 2025 · 7:28 PM UTC

ray

@raydistributed

28 Feb 2025

Awesome turnout for @anyscalecompute @CodyHaoYu presentation at the @vllm_project meetup—nearly 300 people joined to hear about the vLLM roadmap and our team's release of new LLM APIs in Ray Data and Ray Serve.🙌 The new batch inference APIs seamlessly integrate vLLM, improving both speed and scalability. See the APIs here: Ray Data + LLMs-docs.ray.io/en/master/data/w… Ray Serve for LLMs- docs.ray.io/en/master/serve/…

3,452

ray · Aug 24, 2022 · 10:55 PM UTC

ray

@raydistributed

24 Aug 2022

The brains behind the operation 🧠

ray · Jun 26, 2025 · 8:40 PM UTC

ray

@raydistributed

26 Jun 2025

Incredible meetup last night. Thank you to @netflix for hosting! Great talks from - Lingyi Liu on Netflix's ML platform - Pablo Delgado on multimodal data curation at Netflix - Lei Xu on LanceDB's multimodal lakehouse - Richard Liaw on Ray Data for AI data processing

4,434

ray · Feb 16, 2021 · 7:43 PM UTC

ray

@raydistributed

16 Feb 2021

Ray 1.2 is up on GitHub and PyPI (github.com/ray-project/ray/r…)! 🎉 This is an important release with many new APIs and tons of new committers. Some highlights 👇

Release Release ray-1.2.0 · ray-project/ray

Release v1.2.0 Notes Highlights Ray client is now in beta! Check out more details here: https://docs.ray.io/en/master/ray-client.html XGBoost-Ray is now in beta! Check out more details about this ...

github.com

ray · Jul 25, 2023 · 3:46 PM UTC

ray

@raydistributed

25 Jul 2023

The Ray 2.6.1 released with : 🎏 Streaming responses in Serve for real-time capabilities 🎏 📀🏃‍♀️Ray Data streaming integration w/Train 🏃‍♀️☁️Distributed Training & Tuning sync with cloud storage persistence 🤖 Alpha release of the Multi-GPU Learner API 📙 Ray Gallery examples 👇

6,938

ray · Feb 14, 2025 · 6:55 AM UTC

ray

@raydistributed

14 Feb 2025

ByteDance has recently shipped some very impressive models: - OmniHuman-1 for high quality deepfake videos - Seed-Music for music generation Today they hosted an incredible meetup and went into detail on how they use Ray for their (1) audio processing, (2) video pipelines, and (3) RLHF.

4,191

ray · Nov 17, 2020 · 3:26 PM UTC

ray

@raydistributed

17 Nov 2020

In Ray 1.0.1, we're releasing Population-based Bandits (PB2), a new method for tuning neural networks published in #NeurIPS2020 by @jparkerholder and @nguyentienvu! 🚀 PB2 can perform up to 6x more efficiently than methods like Hyperband, PBT. 🔖 Read: anyscale.com/blog/population…

Population Based Bandits: Provably Efficient Online Hyperparameter Optimization | Anyscale

anyscale.com

ray · Oct 11, 2021 · 4:29 PM UTC

ray

@raydistributed

11 Oct 2021

💥🎉 Ray version 1.7 is here! Featuring: ✅ Ray SGD v2, now alpha, introduces APIs that focus on ease of use and composability ✅ Ray Workflows is in alpha. Try it out for your large data, ML, and business workflows ✅ Major enhancements to the C++ API bit.ly/3mH4tIm

ray · Nov 11, 2021 · 10:06 PM UTC

ray

@raydistributed

11 Nov 2021

Distributed C++ systems are more difficult to put into production than single machine systems due to communication, deployment, and fault tolerance issues. The new Ray C++ API was designed to help to address these problems. Learn more ⬇️ bit.ly/3oeWTFq

Modern Distributed C++ with Ray | Anyscale

anyscale.com

ray · Feb 18, 2021 · 3:50 PM UTC

ray

@raydistributed

18 Feb 2021

⚡️In Ray 1.2, we’re improving Ray support for distributed data processing! Featuring: - 💿External storage support - ✨Support for Python data processing libraries Use @ApacheSpark , @dask_dev DataFrames alongside ML libraries on Ray like Horovod! Blog: medium.com/distributed-compu…

Data processing support in Ray

Authors: Sang Cho, Alex Wu, Clark Zinzow, Eric Liang, Stephanie Wang

medium.com

ray · Jun 16, 2021 · 8:02 PM UTC

ray

@raydistributed

16 Jun 2021

🎉 Introducing Distributed XGBoost Training with Ray! Featuring: - Distributed training by only changing three lines of code - Distributed hyperparameter tuning with Ray Tune - Support for Pandas, Modin, & even Dask Dataframes! Learn more ⬇️ bit.ly/35rjtlc

Introducing Distributed XGBoost Training with Ray | Anyscale

anyscale.com

ray · Mar 2, 2023 · 4:37 PM UTC

ray

@raydistributed

2 Mar 2023

As part of our efforts on #observability, a novel feature: "Automatic and optimistic memory scheduling for ML workloads in Ray" 👉 minimal configuration 👉 policy-based mitigation of #OOM errors w/retriable tasks 👉 debug OOM problems w/ the monitor anyscale.com/blog/automatic-…

Automatic Memory Scheduling for ML Workloads in Ray

Learn about Ray's new out of memory (OOM) monitor and detection feature — all part of our efforts to make Ray easy to observe & debug for ML engineers.

anyscale.com

1,573

ray · Oct 1, 2021 · 5:15 PM UTC

ray

@raydistributed

1 Oct 2021

4 common patterns of serving ML models in production are: pipeline, ensemble, business logic, & online learning. Implementing these patterns typically involves a tradeoff between easy development and production readiness. Learn how Ray Serve changes this bit.ly/3ipEsMi

ray · Apr 3, 2024 · 4:29 PM UTC

ray

@raydistributed

3 Apr 2024

Very impressive to see how @canva is using LLMs and image generation to transform the design world.

Anyscale

@anyscalecompute

3 Apr 2024

Canva is a leader in generative AI and modernized their AI platform with @raydistributed. Some key challenges - Scaling training on more GPUs and far more data. - Unifying generative AI and non-generative models. - Flexibility to support different clouds and accelerators. This enabled @canva to speed up training by an order of magnitude and fully saturate GPU utilization. anyscale.com/resources/case-…

9,135

ray · Sep 8, 2021 · 4:29 PM UTC

ray

@raydistributed

8 Sep 2021

🎉 Ant Group has developed Ant Ray Serving which is an online service framework based on Ray, which provides users with a Serverless platform to publish Java/Python code as online services & allows them to focus on their own business logic 🎉 Learn more: bit.ly/3yUkKxy

ray · Aug 2, 2022 · 5:01 PM UTC

ray

@raydistributed

2 Aug 2022

There’s an even divide between developers choosing a generic #Python web server such as @FastAPI and a specialized ML serving solution framework. Check out our latest blog post for more on each option and explore why you might choose one over the other: ow.ly/tlhG50K7I1m

Ray Serve + FastAPI: The best of both worlds | Anyscale

anyscale.com

ray · Jan 26, 2021 · 5:54 PM UTC

ray

@raydistributed

26 Jan 2021

You can configure and Scale ML with @Hydra_Framework and Ray on AWS or local Ray clusters. Blog Post: medium.com/distributed-compu…

Configuring and Scaling ML with Hydra + Ray

Launch your Hydra applications on the cloud with the new Hydra-Ray integration!

medium.com

ray · Jul 7, 2021 · 6:37 PM UTC

ray

@raydistributed

7 Jul 2021

🎉 Really exciting blog from @UberEng on moving distributed @XGBoostProject onto Ray along with parallel efforts to move Elastic #Horovod onto Ray! This is a critical step towards a unified distributed compute backend for end-to-end machine learning workflows at Uber!

Uber Engineering

@UberEng

7 Jul 2021

New on our blog today! Members of our engineering team describe how they co-developed Distributed XGBoost on Ray with the Ray team @raydistributed to tackle various production challenges of doing distributed machine learning at scale. read more: eng.uber.com/elastic-xgboost…

ray · May 20, 2021 · 4:58 PM UTC

ray

@raydistributed

20 May 2021

After training a #MachineLearning model, the model needs to be deployed for online serving and offline processing. At #RaySummit, @simon_mo_ will walk through the journey of deploying ML models in production and how Ray Serve was built. Register: bit.ly/3wdvWo2

ray · Jun 16, 2022 · 6:00 PM UTC

ray

@raydistributed

16 Jun 2022

As modern hardware systems get more complex, it’s becoming more difficult to design integrated circuit implementations. Check out the blog post from the @IBMResearch team to learn how they use AI/ML-driven chip design and Ray to solve this challenge: ow.ly/PX1Q50Jzc8O

Infusing AI and ML into integrated circuit design for faster chip delivery, better chip performance...

anyscale.com

ray · Oct 29, 2025 · 6:42 PM UTC

ray

@raydistributed

29 Oct 2025

Read this blog to learn about Composer, Cursor's latest frontier model built with Ray. For the technical deep dive, come to Ray Summit next week! cursor.com/blog/composer

Composer: Building a fast frontier model with RL · Cursor

Composer is our new agent model designed for software engineering intelligence and speed.

cursor.com

4,653

ray · Nov 1, 2025 · 12:16 AM UTC

ray

@raydistributed

1 Nov 2025

SGLang 🤝 Ray! We're super excited to have @ying11231 and @liin1211 talk about SGLang and its new features at Ray Summit! They'll highlight the newest SGLang features and also talk about SGLang's integration with Ray Data LLM. Hope to see you there!

LMSYS Org

@lmsysorg

31 Oct 2025

SGLang at Ray Summit 2025 is coming! 📍 San Francisco • Nov 3–5 • Hosted by @anyscalecompute 🗓 On Nov 5, SGLang is invited to give a talk on Efficient LLM Serving 🎤 @ying11231 & @liin1211 will introduce core features, high-throughput & low-latency tricks, real-world deployment lessons, and the future roadmap. ✨ Use RaySGLang50 for 50% off! For anyone who cares about: Distributed AI at scale, Performance & efficiency, Open-source evolution - Tag a friend who should join! #SGLang #RaySummit2025 #RayData #DistributedAI anyscale.com/ray-summit/2025

5,175

ray · Aug 15, 2022 · 2:01 PM UTC

ray

@raydistributed

15 Aug 2022

#RaySummit is almost here! Don’t miss out on: 🌁 In-person networking in SF 🎒 3 in-depth Ray training sessions ⚙️ 40+ technical sessions and lightning talks 🎤 Speakers from @MetaAI, @Spotify, @IBM & more ...and much more! ow.ly/RpO050KiKQZ

Ray Summit 2026 | Hosted by Anyscale

Join Ray Summit in San Francisco, Aug 24–26, for technical talks on foundation model training, multimodal AI, RL, and other AI in production systems.

anyscale.com

ray · Jun 14, 2021 · 5:59 PM UTC

ray

@raydistributed

14 Jun 2021

Ray has many ML integrations such as Horovod and 🤗 to data processing frameworks such as Spark, Modin, and Dask. But what does it mean to be "integrated with Ray"? And what benefits does it provide to library developers and users? Learn more ⬇️ bit.ly/2TvQxGh

ray · Aug 13, 2020 · 1:09 PM UTC

ray

@raydistributed

13 Aug 2020

Since it was first released Ray Tune is a leading way of scaling ML tuning. But there's a gap - experiment management & ML tracking. To close this, we're happy to announce an integration with @wandb ! Read about it here: medium.com/distributed-compu…

ray · Apr 14, 2022 · 10:45 PM UTC

ray

@raydistributed

14 Apr 2022

🎉 Ray 1.12 is here! This release includes the alpha of Ray AI Runtime (AIR), a new, unified experience for seamless integration across the Ray ecosystem. 📢 Shoutout to all of the community members who supported this release. Learn all about it here: ow.ly/OF3350IKnw4

Ray 1.12: Ray AI Runtime (alpha), usage data collection, and more | Anyscale

anyscale.com

ray · Mar 23, 2021 · 8:25 PM UTC

ray

@raydistributed

23 Mar 2021

A distributed shuffle is a data intensive-operation that usually calls for a system built specifically for that purpose. Even though its core API contains no shuffle operations, Ray can do it in just a few lines of Python. Learn how 👇 medium.com/distributed-compu…

Executing a distributed shuffle without a MapReduce system

Author: Stephanie Wang

medium.com

ray · Jan 14, 2020 · 1:28 AM UTC

ray

@raydistributed

14 Jan 2020

The performance numbers resulting from the ongoing re-architecture are impressive! Here's why: medium.com/distributed-compu…

How Ray Uses gRPC (and Arrow) to Outperform gRPC

This blog post explains how the Ray 0.8 release uses gRPC and Arrow to provide a distributed Python API that is both fast and simple.

medium.com

ray · Apr 2, 2020 · 2:07 AM UTC

ray

@raydistributed

2 Apr 2020

Using @raydistributed with @scikit_learn . @AmeerHajAli shows you how. medium.com/distributed-compu…. The technique leverages Ray's implementation of joblib. He also shows performance measurements of Ray vs. other tools, Loky, Multiprocessing, and Dask.

Easy Distributed Scikit-Learn Training with Ray

TL;DR: Scale your scikit-learn applications to a cluster with Ray’s implementation of joblib’s backend.

medium.com

ray · Jun 24, 2025 · 6:33 PM UTC

ray

@raydistributed

24 Jun 2025

PyTorch + vLLM + Kubernetes + Ray is a great combination.

PyTorch

@PyTorch

24 Jun 2025

An #OpenSource Stack for #AI Compute: @kubernetesio + @raydistributed + @pytorch + @vllm_project ➡️ This Anyscale blog post by @robertnishihara describes a snapshot of that emerging stack based on experience working with Ray users + case studies from Pinterest, Uber, Roblox, and 5 popular open source post-training frameworks hubs.la/Q03tnBZD0

1,630

ray · May 14, 2024 · 4:23 PM UTC

ray

@raydistributed

14 May 2024

Here is part 2, zooming way in on data preparation (along with runnable code). anyscale.com/blog/processing…

Processing 2 Billion Images for Stable Diffusion Model Training - Definitive Guides with Ray Series

anyscale.com

8,904