Improve documentation on forecast evaluation #3238

Open · 7dy15 opened this issue Jan 8, 2025 · 1 comment
Labels: documentation (This item involves documentation issues)

7dy15 commented Jan 8, 2025

## Description

When I finished training the model and got the prediction result, I wanted to evaluate it with `forecast_it, ts_it = make_evaluation_predictions()` as the tutorial suggests. But when I tried to convert `forecast_it` and `ts_it` with the `list()` function, an error was raised.

## To Reproduce

```python
forecast_it, ts_it = make_evaluation_predictions(
    dataset=test_data,  # test dataset
    predictor=predictor,  # predictor
    num_samples=100,  # number of sample paths we want for evaluation
)
forecasts_ev = list(forecast_it)
tss = list(ts_it)
```

```python
import json
from pathlib import Path

import matplotlib.pyplot as plt
import pandas as pd

from gluonts.dataset.pandas import PandasDataset
from gluonts.dataset.split import split
from gluonts.evaluation import Evaluator, make_evaluation_predictions
from gluonts.model.predictor import Predictor
from gluonts.torch import DeepAREstimator

df = pd.read_csv(
    "C:/Users/leo/Downloads/AirPassengers.csv",
    index_col=0,
    parse_dates=True,
)

dataset = PandasDataset(df, target="target", freq="d")

# Split the data for training and testing
training_data, test_gen = split(dataset, date=pd.Period("2002/10/25", freq="1D"))
test_data = test_gen.generate_instances(prediction_length=168, windows=1)

# Train the model and make predictions
model = DeepAREstimator(
    prediction_length=168,
    freq="d",
    trainer_kwargs={"max_epochs": 1},
    num_layers=5,
    batch_size=100,
    hidden_size=336,
    context_length=24,
)
predictor = model.train(training_data)
forecasts = list(predictor.predict(test_data.input))
predictor.serialize(Path("D:/BaiduNetdiskDownload/auto_bnn/tmp"))

# Plot predictions
plt.plot(df["1954":], color="black")
for forecast in forecasts:
    forecast.plot()
plt.legend(["True values"], loc="upper left", fontsize="xx-large")
plt.show()

# Predictor saved and metrics evaluated
predictor_deserialized = Predictor.deserialize(Path("D:/BaiduNetdiskDownload/auto_bnn/tmp"))
forecast_it, ts_it = make_evaluation_predictions(
    dataset=test_data,  # test dataset
    predictor=predictor,  # predictor
    num_samples=100,  # number of sample paths we want for evaluation
)
forecasts_ev = list(forecast_it)
tss = list(ts_it)

evaluator = Evaluator(quantiles=[0.1, 0.5, 0.9])
agg_metrics, item_metrics = evaluator(tss, forecasts_ev)
print(json.dumps(agg_metrics, indent=4))
```


## Error message or code output

```
Traceback (most recent call last):
  File "D:\BaiduNetdiskDownload\auto_bnn\c.py", line 55, in <module>
    forecasts_ev =list(forecast_it)
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\torch\model\predictor.py", line 90, in predict
    yield from self.forecast_generator(
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\model\forecast_generator.py", line 172, in __call__
    for batch in inference_data_loader:
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\transform\_base.py", line 111, in __iter__
    yield from self.transformation(
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\transform\_base.py", line 132, in __call__
    for data_entry in data_it:
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\dataset\loader.py", line 50, in __call__
    yield from batcher(data, self.batch_size)
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\itertools.py", line 128, in get_batch
    return list(itertools.islice(it, batch_size))
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\transform\_base.py", line 132, in __call__
    for data_entry in data_it:
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\transform\_base.py", line 186, in __call__
    for data_entry in data_it:
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\transform\_base.py", line 132, in __call__
    for data_entry in data_it:
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\transform\_base.py", line 132, in __call__
    for data_entry in data_it:
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\transform\_base.py", line 132, in __call__
    for data_entry in data_it:
  [Previous line repeated 8 more times]
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\dataset\split.py", line 414, in __iter__
    for input, _label in self.test_data:
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\dataset\split.py", line 386, in __iter__
    yield from self.splitter.generate_test_pairs(
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\dataset\split.py", line 251, in generate_test_pairs
    test = self.test_pair(
  File "D:\anaconda3\envs\glu39\lib\site-packages\gluonts\dataset\split.py", line 289, in test_pair
    offset_ += entry[FieldName.TARGET].shape[-1]
TypeError: tuple indices must be integers or slices, not str
```
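The failing line in `split.py` runs `entry[FieldName.TARGET]`, but here each entry is an `(input, label)` pair produced by `generate_instances`, not a dict-style data entry. A minimal illustration of the mismatch, reusing the `test_data` variable from the script above:

```python
# TestData (the result of generate_instances) iterates as (input, label)
# pairs of data entries, not as single dict entries, so indexing an
# element with a string field name fails.
entry = next(iter(test_data))
print(type(entry))  # a tuple-like (input, label) pair
# entry["target"]   # would raise: TypeError: tuple indices must be integers or slices, not str
```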


## Environment
- Operating system: Windows 11
- Python version: 3.10
- GluonTS version: 1.6
- MXNet version: not installed

lostella commented Jan 15, 2025

@7dy15 thanks for opening the issue. Indeed, make_evaluation_predictions cannot be used directly with the output of split: we have added an improved evaluation setup that works with it. An example follows.

```python
import pandas as pd
from gluonts.dataset.pandas import PandasDataset
from gluonts.dataset.split import split
from gluonts.model.npts import NPTSPredictor
from gluonts.model.seasonal_naive import SeasonalNaivePredictor
from gluonts.model.evaluation import evaluate_forecasts
from gluonts.ev.metrics import MASE, RMSE, MeanWeightedSumQuantileLoss

url = "https://raw.githubusercontent.com/numenta/NAB/master/data/realTweets/Twitter_volume_AMZN.csv"
df = pd.read_csv(
    url, header=0, index_col="timestamp", parse_dates=True
).resample("1h").sum()
dataset = PandasDataset(df, target="value")

prediction_length = 24

training_dataset, test_template = split(
    dataset, date=pd.Period("2015-04-07 00:00:00", freq="1h")
)

test_data = test_template.generate_instances(
    prediction_length=prediction_length,
    windows=3,
)

seasonal_naive = SeasonalNaivePredictor(
    prediction_length=prediction_length, season_length=7 * 24
)
npts = NPTSPredictor(prediction_length=prediction_length)

metrics = [MASE(), RMSE(), MeanWeightedSumQuantileLoss(quantile_levels=[0.1, 0.5, 0.9])]

forecasts_seasonal_naive = list(seasonal_naive.predict(test_data.input))
eval_seasonal_naive = evaluate_forecasts(
    forecasts_seasonal_naive, test_data=test_data, metrics=metrics
)
print(f"seasonal naive:\n {eval_seasonal_naive}")

forecasts_npts = list(npts.predict(test_data.input))
eval_npts = evaluate_forecasts(forecasts_npts, test_data=test_data, metrics=metrics)
print(f"npts:\n {eval_npts}")
```

which will output

```
seasonal naive:
       MASE[0.5]  RMSE[mean]  mean_weighted_sum_quantile_loss
None   1.373187  321.770277                         0.275504

npts:
       MASE[0.5]  RMSE[mean]  mean_weighted_sum_quantile_loss
None   0.967116  261.304356                         0.148618
```
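A shorter variant, assuming your GluonTS version also exports the `evaluate_model` helper from the same `gluonts.model.evaluation` module (it generates the forecasts internally, so the explicit predict-and-list step goes away):

```python
from gluonts.model.evaluation import evaluate_model  # assumed available alongside evaluate_forecasts

# Runs prediction and evaluation in one call; test_data and metrics are
# the same objects as in the example above.
eval_npts = evaluate_model(npts, test_data=test_data, metrics=metrics)
print(eval_npts)
```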

I'm turning this into a documentation issue, since it's not really a bug. Again, thanks for spotting this!

Steps to solving this could include:

- adding an evaluation example (similar to the above) to the README
- adding the above evaluation example to the tutorial on split here: https://ts.gluon.ai/stable/tutorials/data_manipulation/dataset_splitting_example.html
- updating other tutorials to rely on split and evaluate_forecasts like the above example (the challenge here being that the datasets provided by GluonTS are structured in a way that fits make_evaluation_predictions naturally; see the sketch below for that workflow)
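For contrast, the `make_evaluation_predictions` workflow that the bundled datasets fit into looks roughly like this (a sketch; the dataset name and predictor choice here are illustrative, not from the thread):

```python
from gluonts.dataset.repository import get_dataset
from gluonts.evaluation import Evaluator, make_evaluation_predictions
from gluonts.model.seasonal_naive import SeasonalNaivePredictor

# Bundled datasets come with a ready-made train/test layout, so the test
# split is a plain dataset of dict entries -- exactly what
# make_evaluation_predictions expects (unlike the TestData returned by split).
dataset = get_dataset("m4_hourly")  # illustrative dataset name

predictor = SeasonalNaivePredictor(
    prediction_length=dataset.metadata.prediction_length,
    season_length=24,
)

forecast_it, ts_it = make_evaluation_predictions(
    dataset=dataset.test,
    predictor=predictor,
    num_samples=100,
)
forecasts = list(forecast_it)
tss = list(ts_it)

agg_metrics, item_metrics = Evaluator(quantiles=[0.1, 0.5, 0.9])(tss, forecasts)
print(agg_metrics["MASE"])
```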
