[ENH] TimeXer model from thuml #1797
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #1797 +/- ##
=======================================
Coverage ? 85.71%
=======================================
Files ? 68
Lines ? 6580
Branches ? 0
=======================================
Hits ? 5640
Misses ? 940
Partials ? 0
@fkiraly the implementation of
FYI @agobbifbk, @benHeid
A couple of comments:
Hi @agobbifbk thanks for the review!
Yes, I think I forgot to credit the original authors. Will do :).
I had similar questions about how this model would handle quantile predictions. The original architecture doesn't seem to handle them, so I just patched it up with this line of code (which might be a bad approach). I am not sure what changes I should make to fix this. Could you help me here?
It is multi-output indeed, the _forecast_multi method is native to the
Sure, usually it is sufficient to increase the number of output channels. Suppose the model ends with something in the shape My suggestion is to start from the implementation of the quantile loss and check its definition (in DSIPTS there is a multi-output version of it, which just sums the contribution of each channel) and then play with the output shape of the model! Let me know if that is sufficient to finish the job :-)
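To make the suggestion above concrete, here is a minimal sketch of a multi-output quantile (pinball) loss that sums the contribution of each channel. It is written in plain Python on nested lists so it runs without dependencies; it is not the DSIPTS or PTF implementation, and the function names, the axis order (batch, horizon, channels, quantiles), and the per-channel averaging are assumptions made for illustration.

```python
def pinball(y_true, y_pred, q):
    """Pinball loss of one prediction at quantile level q."""
    diff = y_true - y_pred
    return max(q * diff, (q - 1.0) * diff)


def multi_output_quantile_loss(y_true, y_pred, quantiles):
    """Sum over channels of the mean pinball loss of each channel.

    y_true: nested lists, shape (batch, horizon, channels)
    y_pred: nested lists, shape (batch, horizon, channels, n_quantiles)
    (hypothetical layout, assumed for this sketch)
    """
    n_channels = len(y_true[0][0])
    total = 0.0
    for c in range(n_channels):
        channel_sum, count = 0.0, 0
        for b, series in enumerate(y_true):
            for t, step in enumerate(series):
                for k, q in enumerate(quantiles):
                    channel_sum += pinball(step[c], y_pred[b][t][c][k], q)
                    count += 1
        # Average within the channel, then add the channel's contribution.
        total += channel_sum / count
    return total


# Toy data: 1 sample, 2 time steps, 2 output channels, 3 quantiles.
quantiles = [0.1, 0.5, 0.9]
y_true = [[[1.0, 10.0], [2.0, 20.0]]]
y_pred = [[
    [[0.5, 1.0, 1.5], [9.0, 10.0, 11.0]],
    [[1.5, 2.0, 2.5], [19.0, 20.0, 21.0]],
]]
loss = multi_output_quantile_loss(y_true, y_pred, quantiles)
```

In a real model the same computation would be done with tensor operations; the point of the sketch is only that the prediction carries one extra trailing axis for the quantiles, and each output channel contributes its own term to the loss.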
Hi @agobbifbk
I have added comments to better explain the shape of the final output tensors. Do let me know if more changes are required.
Since this change would probably affect all the future
OK, it seems reasonable to close this and open a new one. You wrote: (batch_size, prediction_length, n_quantiles). It seems to me that there is a missing dimension (there should be 4, one for the number of output channels), isn't there?
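The four-dimensional layout mentioned here can be illustrated with a small sketch. It assumes, purely for illustration, that a model head emits one flat vector per sample in row-major order and that the target layout is (batch_size, prediction_length, n_targets, n_quantiles); the helper names are hypothetical and are not part of the PR.

```python
def unflatten_head(flat, pred_len, n_targets, n_quantiles):
    """Reshape one sample's flat head output into nested lists of shape
    (pred_len, n_targets, n_quantiles), assuming row-major ordering."""
    expected = pred_len * n_targets * n_quantiles
    if len(flat) != expected:
        raise ValueError(f"expected {expected} values, got {len(flat)}")
    step = n_targets * n_quantiles  # values consumed per time step
    return [
        [
            flat[t * step + c * n_quantiles : t * step + (c + 1) * n_quantiles]
            for c in range(n_targets)
        ]
        for t in range(pred_len)
    ]


def shape_of(nested):
    """Return the shape of a regular nested list as a tuple."""
    dims = []
    while isinstance(nested, list):
        dims.append(len(nested))
        nested = nested[0]
    return tuple(dims)


# 4 samples, 2 time steps, 2 output channels, 3 quantiles.
batch = [
    unflatten_head(list(range(12)), pred_len=2, n_targets=2, n_quantiles=3)
    for _ in range(4)
]
batch_shape = shape_of(batch)
```

With a single target the channel axis has length one, which may be why a three-dimensional description such as (batch_size, prediction_length, n_quantiles) looks complete even though the general case needs four axes.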
Maybe the language of the comment is confusing.
Hi @fkiraly, this PR is ready to be merged.
Updated from main. It seems like the tests are failing, possibly due to the recent changes to the metadata layer (datamodule method)?
Description
This PR works on #1793 and aims to align and implement the TimeXer model within PTF's design.
Checklist
pre-commit install.
To run hooks independent of commit, execute pre-commit run --all-files
Make sure to have fun coding!