Installation
As long as Dask is installed and configured, StatsForecast will be able to use it. If executing on a distributed Dask cluster, make use thestatsforecast library is installed across all the workers.
StatsForecast on Pandas
Before running on Dask, it’s recommended to test on a smaller Pandas dataset to make sure everything is working. This example also helps show the small differences when using Dask.| unique_id | ds | AutoETS | |
|---|---|---|---|
| 0 | 0 | 2000-08-10 | 5.261609 |
| 1 | 0 | 2000-08-11 | 6.196357 |
| 2 | 0 | 2000-08-12 | 0.282309 |
| 3 | 0 | 2000-08-13 | 1.264195 |
| 4 | 0 | 2000-08-14 | 2.262453 |
Executing on Dask
To run the forecasts distributed on Dask, just pass in a Dask DataFrame instead.| unique_id | ds | AutoETS | |
|---|---|---|---|
| 0 | 0 | 2000-08-10 00:00:00 | 5.261609 |
| 1 | 0 | 2000-08-11 00:00:00 | 6.196357 |
| 2 | 0 | 2000-08-12 00:00:00 | 0.282309 |
| 3 | 0 | 2000-08-13 00:00:00 | 1.264195 |
| 4 | 0 | 2000-08-14 00:00:00 | 2.262453 |

