generate_daily_series
| Name | Type | Description | Default |
|---|---|---|---|
n_series | int | Number of series for synthetic panel. | required |
min_length | int, default=50 | Minimum length of synthetic panel’s series. | 50 |
max_length | int, default=500 | Maximum length of synthetic panel’s series. | 500 |
n_static_features | int, default=0 | Number of static exogenous variables for synthetic panel’s series. | 0 |
equal_ends | bool, default=False | Series should end in the same date stamp ds. | False |
static_as_categorical | bool, default=True | Static features should have a categorical data type. | True |
with_trend | bool, default=False | Series should have a (positive) trend. | False |
seed | int, default=0 | Random seed used for generating the data. | 0 |
engine | str, default=‘pandas’ | Output Dataframe type. | ‘pandas’ |
| Type | Description |
|---|---|
DataFrame | pandas or polars DataFrame: Synthetic panel with columns [unique_id, ds, y] and exogenous features. |
| unique_id | ds | y | |
|---|---|---|---|
| 0 | id_00 | 2000-01-01 | 0.395863 |
| 1 | id_00 | 2000-01-02 | 1.264447 |
| 2 | id_00 | 2000-01-03 | 2.284022 |
| 3 | id_00 | 2000-01-04 | 3.462798 |
| 4 | id_00 | 2000-01-05 | 4.035518 |
| … | … | … | … |
| 12446 | id_19 | 2002-03-11 | 0.309275 |
| 12447 | id_19 | 2002-03-12 | 1.189464 |
| 12448 | id_19 | 2002-03-13 | 2.325032 |
| 12449 | id_19 | 2002-03-14 | 3.333198 |
| 12450 | id_19 | 2002-03-15 | 4.306117 |
static_0) is
relevant to the target.
| unique_id | ds | y | static_0 | static_1 | |
|---|---|---|---|---|---|
| 0 | id_00 | 2000-01-01 | 7.521388 | 18 | 10 |
| 1 | id_00 | 2000-01-02 | 24.024502 | 18 | 10 |
| 2 | id_00 | 2000-01-03 | 43.396423 | 18 | 10 |
| 3 | id_00 | 2000-01-04 | 65.793168 | 18 | 10 |
| 4 | id_00 | 2000-01-05 | 76.674843 | 18 | 10 |
| … | … | … | … | … | … |
| 12446 | id_19 | 2002-03-11 | 27.834771 | 89 | 42 |
| 12447 | id_19 | 2002-03-12 | 107.051746 | 89 | 42 |
| 12448 | id_19 | 2002-03-13 | 209.252845 | 89 | 42 |
| 12449 | id_19 | 2002-03-14 | 299.987801 | 89 | 42 |
| 12450 | id_19 | 2002-03-15 | 387.550536 | 89 | 42 |
equal_ends=False (the default) then every serie has a different end
date.
equal_ends=True.
generate_prices_for_series
| ds | unique_id | price | |
|---|---|---|---|
| 0 | 2000-10-05 | id_00 | 0.548814 |
| 1 | 2000-10-06 | id_00 | 0.715189 |
| 2 | 2000-10-07 | id_00 | 0.602763 |
| 3 | 2000-10-08 | id_00 | 0.544883 |
| 4 | 2000-10-09 | id_00 | 0.423655 |
| … | … | … | … |
| 5009 | 2001-05-17 | id_19 | 0.288027 |
| 5010 | 2001-05-18 | id_19 | 0.846305 |
| 5011 | 2001-05-19 | id_19 | 0.791284 |
| 5012 | 2001-05-20 | id_19 | 0.578636 |
| 5013 | 2001-05-21 | id_19 | 0.288589 |

