API Reference
SDK Reference
NixtlaClient
Client to interact with the Nixtla API.
Type | Default | Details | |
---|---|---|---|
api_key | Optional | None | The authorization api_key interacts with the Nixtla API. If not provided, will use the NIXTLA_API_KEY environment variable. |
base_url | Optional | None | Custom base_url. If not provided, will use the NIXTLA_BASE_URL environment variable. |
timeout | int | 60 | Request timeout in seconds. Set this to None to disable it. |
max_retries | int | 6 | The maximum number of attempts to make when calling the API before giving up. It defines how many times the client will retry the API call if it fails. Default value is 6, indicating the client will attempt the API call up to 6 times in total |
retry_interval | int | 10 | The interval in seconds between consecutive retry attempts. This is the waiting period before the client tries to call the API again after a failed attempt. Default value is 10 seconds, meaning the client waits for 10 seconds between retries. |
max_wait_time | int | 360 | The maximum total time in seconds that the client will spend on all retry attempts before giving up. This sets an upper limit on the cumulative waiting time for all retry attempts. If this time is exceeded, the client will stop retrying and raise an exception. Default value is 360 seconds, meaning the client will cease retrying if the total time spent on retries exceeds 360 seconds. The client throws a ReadTimeout error after 60 seconds of inactivity. If you want to catch these errors, use max_wait_time >> 60. |
NixtlaClient.validate_api_key
Returns True if your api_key is valid.
NixtlaClient.plot
Plot forecasts and insample values.
Type | Default | Details | |
---|---|---|---|
df | Union | None | The DataFrame on which the function will operate. Expected to contain at least the following columns: - time_col: Column name in df that contains the time indices of the time series. This is typically a datetimecolumn with regular intervals, e.g., hourly, daily, monthly data points. - target_col: Column name in df that contains the target variable of the time series, i.e., the variable wewish to predict or analyze. Additionally, you can pass multiple time series (stacked in the dataframe) considering an additional column: - id_col: Column name in df that identifies unique time series. Each unique value in this columncorresponds to a unique time series. |
forecasts_df | Union | None | DataFrame with columns [unique_id , ds ] and models. |
id_col | str | unique_id | Column that identifies each serie. |
time_col | str | ds | Column that identifies each timestep, its values can be timestamps or integers. |
target_col | str | y | Column that contains the target. |
unique_ids | Union | None | Time Series to plot. If None, time series are selected randomly. |
plot_random | bool | True | Select time series to plot randomly. |
max_ids | int | 8 | Maximum number of ids to plot. |
models | Optional | None | list of models to plot. |
level | Optional | None | list of prediction intervals to plot if paseed. |
max_insample_length | Optional | None | Max number of train/insample observations to be plotted. |
plot_anomalies | bool | False | Plot anomalies for each prediction interval. |
engine | Literal | matplotlib | Library used to plot. ‘matplotlib’, ‘plotly’ or ‘plotly-resampler’. |
resampler_kwargs | Optional | None | Kwargs to be passed to plotly-resampler constructor. For further custumization (“show_dash”) call the method, store the plotting object and add the extra arguments to its show_dash method. |
ax | Union | None | Object where plots will be added. |
NixtlaClient.forecast
Forecast your time series using TimeGPT.
Type | Default | Details | |
---|---|---|---|
df | AnyDFType | The DataFrame on which the function will operate. Expected to contain at least the following columns: - time_col: Column name in df that contains the time indices of the time series. This is typically a datetimecolumn with regular intervals, e.g., hourly, daily, monthly data points. - target_col: Column name in df that contains the target variable of the time series, i.e., the variable wewish to predict or analyze. Additionally, you can pass multiple time series (stacked in the dataframe) considering an additional column: - id_col: Column name in df that identifies unique time series. Each unique value in this columncorresponds to a unique time series. | |
h | Annotated | Forecast horizon. | |
freq | Optional | None | Frequency of the data. By default, the freq will be inferred automatically. See pandas’ available frequencies. |
id_col | str | unique_id | Column that identifies each serie. |
time_col | str | ds | Column that identifies each timestep, its values can be timestamps or integers. |
target_col | str | y | Column that contains the target. |
X_df | Optional | None | DataFrame with [unique_id , ds ] columns and df ’s future exogenous. |
level | Optional | None | Confidence levels between 0 and 100 for prediction intervals. |
quantiles | Optional | None | Quantiles to forecast, list between (0, 1).level and quantiles should not be used simultaneously.The output dataframe will have the quantile columns formatted as TimeGPT-q-(100 * q) for each q. 100 * q represents percentiles but we choose this notation to avoid having dots in column names. |
finetune_steps | Annotated | 0 | Number of steps used to finetune learning TimeGPT in the new data. |
finetune_depth | Literal | 1 | |
finetune_loss | Literal | default | Loss function to use for finetuning. Options are: default , mae , mse , rmse , mape , and smape . |
clean_ex_first | bool | True | Clean exogenous signal before making forecasts using TimeGPT. |
hist_exog_list | Optional | None | Column names of the historical exogenous features. |
validate_api_key | bool | False | If True, validates api_key before sending requests. |
add_history | bool | False | Return fitted values of the model. |
date_features | Union | False | Features computed from the dates. Can be pandas date attributes or functions that will take the dates as input. If True automatically adds most used date features for the frequency of df . |
date_features_to_one_hot | Union | False | Apply one-hot encoding to these date features. If date_features=True , then all date features areone-hot encoded by default. |
model | Literal | timegpt-1 | Model to use as a string. Options are: timegpt-1 , and timegpt-1-long-horizon .We recommend using timegpt-1-long-horizon for forecastingif you want to predict more than one seasonal period given the frequency of your data. |
num_partitions | Optional | None | Number of partitions to use. If None, the number of partitions will be equal to the available parallel resources in distributed environments. |
feature_contributions | bool | False | |
Returns | AnyDFType | DataFrame with TimeGPT forecasts for point predictions and probabilistic predictions (if level is not None). |
NixtlaClient.cross_validation
Perform cross validation in your time series using TimeGPT.
Type | Default | Details | |
---|---|---|---|
df | AnyDFType | The DataFrame on which the function will operate. Expected to contain at least the following columns: - time_col: Column name in df that contains the time indices of the time series. This is typically a datetimecolumn with regular intervals, e.g., hourly, daily, monthly data points. - target_col: Column name in df that contains the target variable of the time series, i.e., the variable wewish to predict or analyze. Additionally, you can pass multiple time series (stacked in the dataframe) considering an additional column: - id_col: Column name in df that identifies unique time series. Each unique value in this columncorresponds to a unique time series. | |
h | Annotated | Forecast horizon. | |
freq | Optional | None | Frequency of the data. By default, the freq will be inferred automatically. See pandas’ available frequencies. |
id_col | str | unique_id | Column that identifies each serie. |
time_col | str | ds | Column that identifies each timestep, its values can be timestamps or integers. |
target_col | str | y | Column that contains the target. |
level | Optional | None | Confidence level between 0 and 100 for prediction intervals. |
quantiles | Optional | None | Quantiles to forecast, list between (0, 1).level and quantiles should not be used simultaneously.The output dataframe will have the quantile columns formatted as TimeGPT-q-(100 * q) for each q. 100 * q represents percentiles but we choose this notation to avoid having dots in column names. |
validate_api_key | bool | False | If True, validates api_key before sending requests. |
n_windows | Annotated | 1 | Number of windows to evaluate. |
step_size | Optional | None | Step size between each cross validation window. If None it will be equal to h . |
finetune_steps | Annotated | 0 | Number of steps used to finetune TimeGPT in the new data. |
finetune_depth | Literal | 1 | |
finetune_loss | Literal | default | Loss function to use for finetuning. Options are: default , mae , mse , rmse , mape , and smape . |
clean_ex_first | bool | True | Clean exogenous signal before making forecasts using TimeGPT. |
date_features | Union | False | Features computed from the dates. Can be pandas date attributes or functions that will take the dates as input. If True automatically adds most used date features for the frequency of df . |
date_features_to_one_hot | Union | False | Apply one-hot encoding to these date features. If date_features=True , then all date features areone-hot encoded by default. |
model | Literal | timegpt-1 | Model to use as a string. Options are: timegpt-1 , and timegpt-1-long-horizon .We recommend using timegpt-1-long-horizon for forecastingif you want to predict more than one seasonal period given the frequency of your data. |
num_partitions | Optional | None | Number of partitions to use. If None, the number of partitions will be equal to the available parallel resources in distributed environments. |
Returns | AnyDFType | DataFrame with cross validation forecasts. |
NixtlaClient.detect_anomalies
Detect anomalies in your time series using TimeGPT.
Type | Default | Details | |
---|---|---|---|
df | AnyDFType | The DataFrame on which the function will operate. Expected to contain at least the following columns: - time_col: Column name in df that contains the time indices of the time series. This is typically a datetimecolumn with regular intervals, e.g., hourly, daily, monthly data points. - target_col: Column name in df that contains the target variable of the time series, i.e., the variable wewish to predict or analyze. Additionally, you can pass multiple time series (stacked in the dataframe) considering an additional column: - id_col: Column name in df that identifies unique time series. Each unique value in this columncorresponds to a unique time series. | |
freq | Optional | None | Frequency of the data. By default, the freq will be inferred automatically. See pandas’ available frequencies. |
id_col | str | unique_id | Column that identifies each serie. |
time_col | str | ds | Column that identifies each timestep, its values can be timestamps or integers. |
target_col | str | y | Column that contains the target. |
level | Union | 99 | Confidence level between 0 and 100 for detecting the anomalies. |
clean_ex_first | bool | True | Clean exogenous signal before making forecasts using TimeGPT. |
validate_api_key | bool | False | If True, validates api_key before sending requests. |
date_features | Union | False | Features computed from the dates. Can be pandas date attributes or functions that will take the dates as input. If True automatically adds most used date features for the frequency of df . |
date_features_to_one_hot | Union | False | Apply one-hot encoding to these date features. If date_features=True , then all date features areone-hot encoded by default. |
model | Literal | timegpt-1 | Model to use as a string. Options are: timegpt-1 , and timegpt-1-long-horizon .We recommend using timegpt-1-long-horizon for forecastingif you want to predict more than one seasonal period given the frequency of your data. |
num_partitions | Optional | None | Number of partitions to use. If None, the number of partitions will be equal to the available parallel resources in distributed environments. |
Returns | AnyDFType | DataFrame with anomalies flagged by TimeGPT. |