Distributed
Ray
Run StatsForecast in a distributed fashion on top of Ray.
Installation
As long as Ray is installed and configured, StatsForecast will be able
to use it. If executing on a distributed Ray cluster, make sure the
statsforecast library is installed on all the workers.
StatsForecast on Pandas
Before running on Ray, it’s recommended to test on a smaller Pandas dataset to make sure everything is working. This example also helps show the small differences when using Ray.
unique_id | ds | AutoETS |
---|---|---|
0 | 2000-08-10 | 5.261609 |
0 | 2000-08-11 | 6.196357 |
0 | 2000-08-12 | 0.282309 |
0 | 2000-08-13 | 1.264195 |
0 | 2000-08-14 | 2.262453 |
Executing on Ray
To run the forecasts distributed on Ray, pass in a Ray Dataset instead
of a Pandas DataFrame. The unique_id must be a column rather than an
index, because a Ray Dataset has no index.
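The hand-off to Ray could be sketched as below. The helper names unique_id_as_column and forecast_on_ray are hypothetical, the string cast of unique_id is a precaution assumed here rather than a documented requirement, and the model settings are the same illustrative ones as in a local run. Ray and StatsForecast are imported inside the second function so the Pandas helper can be tried on its own.

```python
import pandas as pd


def unique_id_as_column(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df where unique_id is a regular string column."""
    # Ray Datasets have no index, so move unique_id out of the index if needed.
    if df.index.name == "unique_id":
        df = df.reset_index()
    out = df.copy()
    # Assumption: plain string ids are the safest type to ship to workers.
    out["unique_id"] = out["unique_id"].astype(str)
    return out


def forecast_on_ray(df: pd.DataFrame, h: int = 7):
    """Hypothetical wrapper: forecast a Pandas frame through a Ray Dataset."""
    # Lazy imports: only needed when actually running on Ray.
    import ray
    from statsforecast import StatsForecast
    from statsforecast.models import AutoETS

    ray.init(ignore_reinit_error=True)  # start or reuse a Ray runtime
    ray_df = ray.data.from_pandas(unique_id_as_column(df))

    sf = StatsForecast(models=[AutoETS(season_length=7)], freq="D")
    # Passing a Ray Dataset instead of a DataFrame triggers the distributed path.
    return sf.forecast(df=ray_df, h=h)
```

Calling forecast_on_ray(series) on the frame used for the local test is expected to return a distributed result rather than a Pandas DataFrame, so inspect it with Ray's own methods, for example take(5).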