Timer (Large Time-Series Model)

This repo provides official code, datasets and checkpoints for Timer: Generative Pre-trained Transformers Are Large Time Series Models. [Poster], [Slides].

Updates

🚩 News (2025.2) We release a open codebase OpenLTM, which contains a simple pipeline to pre-train your own large time-series models :)

🚩 News (2024.12) Timer-XL for unified forecasting is accepted as ICLR 2025. We released a pre-trained model on 260B time points [Performance] [Checkpoint] [Quickstart].

🚩 News (2024.10) We release numpy format UTSD. An simple dataloader for large-scale pre-training is here.

🚩 News (2024.6) Pre-training dataset (UTSD) is available in HuggingFace.

🚩 News (2024.5) Accepted by ICML 2024, a camera-ready version of 31 pages.

🚩 News (2024.2) Releasing model checkpoints and code for fine-tuning.

Introduction

Time Series Transformer (Timer) is a Generative Pre-trained Transformer for general time series analysis.

Zero-Shot Forecasting

We provide the checkpoint to make predictions without training samples. See our HuggingFace Repo for the detialed information and usage.

Example

import torch
from transformers import AutoModelForCausalLM

# load pretrain model
model = AutoModelForCausalLM.from_pretrained('thuml/timer-base-84m', trust_remote_code=True)

# prepare input
batch_size, lookback_length = 1, 2880
seqs = torch.randn(batch_size, lookback_length)

# generate forecast
prediction_length = 96
normed_output = model.generate(normed_seqs, max_new_tokens=prediction_length)

print(output.shape)

There's room for improvement in this small model. We are actively working around it and are glad to see constructive suggestions and noteworthy cases.

Datasets

We collect Unified Time Series Datasets (UTSD), which encompass well-curated time series to facilitate the research on large time-series models. Our dataset is released in HuggingFace.

Usage

You can access the data from HuggingFace and load the data in the style of TSLib:

# huggingface-cli login
# export HF_ENDPOINT=https://hf-mirror.com 

python ./scripts/UTSD/download_dataset.py

# dataloader
python ./scripts/UTSD/utsdataset.py

If you meet troubles when accessing the data, you can also download UTSD in numpy from [Tsinghua Cloud] and use UTSD_Npy dataloader from [OpenLTM].

For Developers

For developers interest in fine-tune large model, we provide fine-tuning code for different example tasks. The checkpoint is pre-trained and fine-tuned using TSLib.

Note

We recommend using this checkpoints on HuggingFace for evaluation and deployment (e.g., zero-shot forecasting). However, this huggingface checkpoint is not compatiable with the following fine-tuning code.

Supported Tasks

Forecasting: We provide scripts for full- or few-shot forecasting.

Imputation: We adopt segment-level imputation, which is more challenging than point-level imputation.

Anomaly Detection: We build a benchmark using UCR Anomaly Archive. The task aims to predict normal future series and detect anomalies in advance.

We provide the README files illustrating each task under the folder ./scripts/.

Code for Fine-tuning

Use Python 3.10 and install necessary dependencies.

pip install -r requirements.txt

Put downstream datasets from Google Drive or Baidu Drive under the folder ./dataset/.
Put the checkpoint from Google Drive or Baidu Drive under the folder ./checkpoints/.
Train and evaluate the model. We provide the above tasks under the folder ./scripts/.

# forecasting
bash ./scripts/forecast/ECL.sh

# segement-level imputation
bash ./scripts/imputation/ECL.sh

# anomaly detection
bash ./scripts/anomaly_detection/UCR.sh

Train on Custom Dataset

To fine-tune on your time series dataset, you can try out the following steps:

The key is to reload the customized dataloader and load the pre-trained checkpoint (See ./scripts/ folder).
CIDatasetBenchmark/CIAutoRegressionDatasetBenchmark in the data_provider folder can train and evaluate models in direct / iterative multi-step mode.

Note

For developers interested in developing and improving your own large time-series models, we recomment this repo OpenLTM.

Approach

Unified Pre-training

To pre-train on heterogeneous time series, we propose single-series sequence (S3), reserving series variations into the unified 1D context. Further, we convert forecasting, imputation, and anomaly detection into a unified generative task.

Model Architecture

We evaluate various candidate backbones and eventually adopt the decoder-only Transformer, which provides notable generalization performance and flexibility that accommodate varying-length time series.

Performance

Timer achieves state-of-the-art performance in zero-shot forecasting and few-shot adaptation.

Scalability

By scaling, Timer achieves notable performance improvement. Currently, we provide the base version containing 84M paramaters that is pre-trained on 260B time points, which supports a maximum context length of 2880.

Subsequent Work

We proposed Timer-XL for unified forecasting in this [Paper]. It can be used for task-specific training or scalable pre-training, handling arbitrary-length and any-variable time series [Repo].

Citation

If you find this repo helpful, please cite our paper.

@inproceedings{liutimer,
  title={Timer: Generative Pre-trained Transformers Are Large Time Series Models},
  author={Liu, Yong and Zhang, Haoran and Li, Chenyu and Huang, Xiangdong and Wang, Jianmin and Long, Mingsheng},
  booktitle={Forty-first International Conference on Machine Learning}
}

@article{liu2024timer,
  title={Timer-XL: Long-Context Transformers for Unified Time Series Forecasting},
  author={Liu, Yong and Qin, Guo and Huang, Xiangdong and Wang, Jianmin and Long, Mingsheng},
  journal={arXiv preprint arXiv:2410.04803},
  year={2024}
}

Contributors

If you have any questions or want to use the code, feel free to contact:

Yong Liu ([email protected])
Guo Qin ([email protected])
Haoran Zhang ([email protected])
Chenyu Li ([email protected])

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
data_provider		data_provider
examples		examples
exp		exp
figures		figures
layers		layers
models		models
scripts		scripts
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Timer (Large Time-Series Model)

Updates

Introduction

Zero-Shot Forecasting

Datasets

Usage

For Developers

Supported Tasks

Code for Fine-tuning

Train on Custom Dataset

Approach

Unified Pre-training

Model Architecture

Performance

Scalability

Subsequent Work

Citation

Contributors

About

Releases

Packages

Contributors 2

Languages

License

thuml/Large-Time-Series-Model

Folders and files

Latest commit

History

Repository files navigation

Timer (Large Time-Series Model)

Updates

Introduction

Zero-Shot Forecasting

Datasets

Usage

For Developers

Supported Tasks

Code for Fine-tuning

Train on Custom Dataset

Approach

Unified Pre-training

Model Architecture

Performance

Scalability

Subsequent Work

Citation

Contributors

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages