[ENH] Implementing D2 data module layer for `tslib` models. #1836

PranavBhatP · 2025-05-15T19:32:17Z

Description

This PR fixes issue #1833, implements a D2 for tslib.

This PR involves the following changes:

New dataset and datamodule for tslib - link
Implementation of TimeXer with new data module. - link
Base class implementation of a TslibBaseModel - link.
Tests for the tslib data module. - link
Example notebook - link
Restructure codebase to include a layers directory for module containing architectural deep learning layer classes.

Checklist

Linked issues (if existing)
Amended changelog for large changes (and added myself there as contributor)
Added/modified tests
Used pre-commit hooks when committing to ensure that code is compliant with hooks. Install hooks with pre-commit install.
To run hooks independent of commit, execute pre-commit run --all-files

Make sure to have fun coding!

fkiraly

Nice!

Minor structure request: can you kindly make the attention etc modules in layers actual folders, in those private submodules with a single class each?

this completes the d2 pipeline for timexer, prediction has not been tested, also revert the loss dtype to nn.Module for now.

review-notebook-app · 2025-05-25T20:24:14Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

PranavBhatP · 2025-05-25T20:26:11Z

FYI @agobbifbk @fkiraly @phoeenniixx. Rough Pipeline for tslib is complete. I have implemented TimeXer on this pipeline. Tests are pending. There are still some bugs, but training and prediction are working.

…notebook

this change introduces sub-directories to provide a single module for a group of layers rather than dumping them in a single file with the same as the sub-directory

fkiraly

Great!

there is an empty file modules.py, can you remove this from the PR?
can you ensure the new base class has the "experimental" warning for now, similar to the v2 base model
could you put the base class in another private file, in base?

PranavBhatP · 2025-05-28T12:41:39Z

Hi @fkiraly @phoeenniixx

I went through the PR #1841, and considering that @phoeenniixx is implementing a v2 version of the test_all_estimators.py, this surely means that I will depend on these changes to implement a newer version test_timexer.py for this PR (currently the v1 version is in #1797)

Is it advisable to stack upon PR #1841? Or should I wait for a merge and work only on the independent tests for the tslib datamodule?

tests still failing

fkiraly · 2025-05-29T20:57:17Z

(code formatting is failing)

…llate and window creation

PranavBhatP · 2025-05-30T07:43:34Z

Did code quality tests change @fkiraly? The pre commit tests are passing on my local repo, but here it is failing.

EDIT: Resolved.

PranavBhatP · 2025-05-30T14:55:24Z

Hi @agobbifbk , continuing the discussion from the meet about the windowing, this is the code I was talking about
https://github.com/thuml/Time-Series-Library/blob/main/data_provider/data_loader.py

Specifically the implementation in the class Dataset_ETT_hour and Dataset_M4. Here are they are using fixed window sampling for the first one (Dataset_ETT_hour) and random window sampling ( Dataset_M4) for the second, this seems to be a difference but I might be mistaken. The reason they might be using different windowing is because of the forecasting task for which the dataset is used. (long-term or short-tem forecasting etc).

After going through the code deeper, I understand what you were saying, I don't think it matters what windowing we use as long as it matches with model input and makes it convenient to keep single one in the d2. But it would be nice if you could review the current implementation - link and some comments on whether we need separate windowing strategy on top of this.

Also FYI @phoeenniixx , since you were also involved in the discussion.

…pes between end_time and cutoff_time in _create_windows

phoeenniixx and others added 30 commits April 6, 2025 18:43

D1, D2 layer commit

252598d

remove one comment

d0d1c3e

model layer commit

80e64d2

update docstring

6364780

Merge branch 'refactor-d1-d2' into refactor-model

82b3dc7

update data_module.py

257183c

update data_module.py

9cdcb19

Merge branch 'refactor-d1-d2' into refactor-model

a83bf32

Add disclaimer

ac56d4f

Merge branch 'refactor-d1-d2' into refactor-model

0e7e36f

update docstring

4bfff21

Merge branch 'refactor-d1-d2' into refactor-model

ef98273

Add tests for D1,D2 layer

8a53ed6

Merge branch 'main' into refactor-d1-d2

9f9df31

Code quality

cdecb77

Merge branch 'refactor-d1-d2' into refactor-model

86360fd

refactor file

20aafb7

warning

043820d

linting

1720a15

move coercion to utils

af44474

linting

a3cb8b7

Update _timeseries_v2.py

75d7fb5

Update __init__.py

1b946e6

Update __init__.py

3edb08b

Merge branch 'main' into pr/1811

a4bc9d8

Merge branch 'pr/1811' into pr/1812

4c0d570

update tests

e350291

Merge branch 'refactor-d1-d2' into refactor-model

f90c94f

update tft_v2

3099691

warnings and init attr handling

77cb979

PranavBhatP requested a review from yarnabrina as a code owner May 23, 2025 06:13

fkiraly requested changes May 23, 2025

View reviewed changes

PranavBhatP added 2 commits May 26, 2025 01:51

add example notebook and fix buy in _timexer.py

1831bcb

this completes the d2 pipeline for timexer, prediction has not been tested, also revert the loss dtype to nn.Module for now.

clear cell outputs on trainer.fit()

a896b3f

remove unnecessary squeeze method and add prediction demo in example …

420de37

…notebook

fkiraly moved this from PR in progress to PR under review in May - Sep 2025 mentee projects May 26, 2025

restructure layers directory

3b07263

this change introduces sub-directories to provide a single module for a group of layers rather than dumping them in a single file with the same as the sub-directory

fkiraly requested changes May 27, 2025

View reviewed changes

PranavBhatP added 2 commits May 27, 2025 21:58

delete empty modules.py

010298e

add warning and move tslib base model to a new file

d0aa444

PranavBhatP requested a review from fkiraly May 27, 2025 16:30

fix wrong import statement in _timexer.py

fef4113

fkiraly moved this from PR under review to PR in progress in May - Sep 2025 mentee projects May 28, 2025

Merge branch 'main' into tslib-d2-refactor

8daeb95

PranavBhatP and others added 4 commits May 28, 2025 18:19

fix circular dependency error in en_embedding.py

5142d52

Merge branch 'main' into tslib-d2-refactor

efbbc09

add prelimnary tests for tslib d2

8a680df

add collate, setup and dataset tests

0ccb078

tests still failing

PranavBhatP added 2 commits May 30, 2025 11:02

fix failing setup and tslib_dataset tests

826ac31

fix incorrect metadata handling in tslib dataset and fix tests for co…

d70b07c

…llate and window creation

PranavBhatP added 3 commits May 30, 2025 13:15

Merge branch 'main' into tslib-d2-refactor

5ce4553

fix code to comply with new linting syntax rules

7b41140

add tests for checking custom train test split

e3e5bb8

add tests for multitarget dataset and fix handling of incosistent dty…

d67ccae

…pes between end_time and cutoff_time in _create_windows

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ENH] Implementing D2 data module layer for `tslib` models. #1836

[ENH] Implementing D2 data module layer for `tslib` models. #1836

PranavBhatP commented May 15, 2025 •

edited

Loading

Uh oh!

fkiraly left a comment

Uh oh!

review-notebook-app bot commented May 25, 2025

Uh oh!

PranavBhatP commented May 25, 2025 •

edited

Loading

Uh oh!

fkiraly left a comment

Uh oh!

PranavBhatP commented May 28, 2025

Uh oh!

fkiraly commented May 29, 2025

Uh oh!

PranavBhatP commented May 30, 2025 •

edited

Loading

Uh oh!

PranavBhatP commented May 30, 2025

Uh oh!

Uh oh!

[ENH] Implementing D2 data module layer for tslib models. #1836

Are you sure you want to change the base?

[ENH] Implementing D2 data module layer for tslib models. #1836

Conversation

PranavBhatP commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

review-notebook-app bot commented May 25, 2025

Uh oh!

PranavBhatP commented May 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

PranavBhatP commented May 28, 2025

Uh oh!

fkiraly commented May 29, 2025

Uh oh!

PranavBhatP commented May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PranavBhatP commented May 30, 2025

Uh oh!

Uh oh!

[ENH] Implementing D2 data module layer for `tslib` models. #1836

[ENH] Implementing D2 data module layer for `tslib` models. #1836

PranavBhatP commented May 15, 2025 •

edited

Loading

PranavBhatP commented May 25, 2025 •

edited

Loading

PranavBhatP commented May 30, 2025 •

edited

Loading