In this example we will show how to perform electricity load forecasting on the ERCOT (Texas) market for detecting daily peaks.
NHITS
model on historic load data to forecast day-ahead peaks on September
2022. Multiple seasonality is traditionally present in low sampled
electricity data. Demand exhibits daily and weekly seasonality, with
clear patterns for specific hours of the day such as 6:00pm vs 3:00am or
for specific days such as Sunday vs Friday.
First, we will load ERCOT historic demand, then we will use the
Neuralforecast.cross_validation
method to fit the model and forecast
daily load during September. Finally, we show how to use the forecasts
to detect the coincident peak.
Outline
Tip You can use Colab to run this Notebook interactively![]()
pip install neuralforecast
unique_id
, ds
and y
:
unique_id
(string, int or category) represents an identifier
for the series.
ds
(datestamp or int) column should be either an integer
indexing time or a datestamp ideally like YYYY-MM-DD for a date or
YYYY-MM-DD HH:MM:SS for a timestamp.
y
(numeric) represents the measurement we wish to forecast. We
will rename the
NeuralForecast
class and the models you need.
AutoNHITS
you need to define:
h
: forecasting horizonloss
: training loss. Use the
DistributionLoss
to produce probabilistic forecasts. Default:
MAE
.config
: hyperparameter search space. If None
, the
AutoNHITS
class will use a pre-defined suggested hyperparameter space.num_samples
: number of configurations explored.NeuralForecast
object with the following required parameters:
models
: a list of models. Select the models you want from
models and import them.
freq
: a string indicating the frequency of the data. (See panda’s
available
frequencies.)
cross_validation
method allows the user to simulate multiple
historic forecasts, greatly simplifying pipelines by replacing for loops
with fit
and predict
methods. This method re-trains the model and
forecast each window. See this
tutorial
for an animation of how the windows are defined.
Use the cross_validation
method to produce all the daily forecasts for
September. To produce daily forecasts set the forecasting horizon h
as
24. In this example we are simulating deploying the pipeline during
September, so set the number of windows as 30 (one for each day).
Finally, set the step size between windows as 24, to only produce one
forecast per day.
unique_id | ds | cutoff | AutoNHITS | y | |
---|---|---|---|---|---|
0 | ERCOT | 2022-09-01 00:00:00 | 2022-08-31 23:00:00 | 45841.601562 | 45482.471757 |
1 | ERCOT | 2022-09-01 01:00:00 | 2022-08-31 23:00:00 | 43613.394531 | 43602.658043 |
2 | ERCOT | 2022-09-01 02:00:00 | 2022-08-31 23:00:00 | 41968.945312 | 42284.817342 |
3 | ERCOT | 2022-09-01 03:00:00 | 2022-08-31 23:00:00 | 41038.539062 | 41663.156771 |
4 | ERCOT | 2022-09-01 04:00:00 | 2022-08-31 23:00:00 | 41237.203125 | 41710.621904 |
Important When usingcross_validation
make sure the forecasts are produced at the desired timestamps. Check thecutoff
column which specifices the last timestamp before the forecasting window.
crossvaldation_df
to detect the daily
hourly demand peaks. For each day, we set the detected peaks as the
highest forecasts. In this case, we want to predict one peak (npeaks
);
depending on your setting and goals, this parameter might change. For
example, the number of peaks can correspond to how many hours a battery
can be discharged to reduce demand.
Important In this example we only include September. However,NHITS
can correctly predict the peaks for the 4 months of 2022. You can try this by increasing thenwindows
parameter ofcross_validation
or filtering theY_df
dataset. The complete run for all months take only 10 minutes.