yuqinie98 / PatchTST

An official implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730
Apache License 2.0

Runtime long #55

Closed (DIKSHAAGARWAL2015 closed this issue 11 months ago)

DIKSHAAGARWAL2015 commented 1 year ago

Is there any estimate of how long supervised and self-supervised training take to run with the default model and parameters?

yuqinie98 commented 1 year ago

It depends on the dataset, the number of epochs, the GPU, and so on, so it is hard to give a single answer. The fastest run may take half an hour, while the largest model takes a day on multiple GPUs.
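Since the runtime depends on the dataset, epoch count, and hardware, one way to get a rough number for a specific setup is to time a few training steps and extrapolate. The snippet below is only a minimal sketch of that idea and is not part of the PatchTST repository: `estimate_training_seconds`, `run_step`, and all parameter names are hypothetical, and `run_step` stands in for whatever performs one training iteration in your own script. On a GPU, kernels run asynchronously, so `run_step` would likely need to wait for the device to finish (for example by calling `torch.cuda.synchronize()` in PyTorch) for the timing to be meaningful.

```python
import time
from typing import Callable


def estimate_training_seconds(
    run_step: Callable[[], None],   # hypothetical: runs ONE training step
    steps_per_epoch: int,
    epochs: int,
    warmup_steps: int = 2,
    timed_steps: int = 5,
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Extrapolate total training time from a few timed steps.

    This ignores validation, checkpointing, and data-loading stalls,
    so treat the result as a rough lower bound, not a prediction.
    """
    if timed_steps < 1:
        raise ValueError("timed_steps must be at least 1")

    # Warm-up steps are not timed: the first iterations are often
    # slower because of caching and one-time initialization.
    for _ in range(warmup_steps):
        run_step()

    start = clock()
    for _ in range(timed_steps):
        run_step()
    elapsed = clock() - start

    seconds_per_step = elapsed / timed_steps
    return seconds_per_step * steps_per_epoch * epochs


if __name__ == "__main__":
    # Stand-in workload; replace with a real training step.
    def dummy_step() -> None:
        sum(range(10_000))

    total = estimate_training_seconds(dummy_step, steps_per_epoch=100, epochs=10)
    print(f"Estimated total: {total:.2f} s")
```

The step count per epoch is roughly the number of training samples divided by the batch size, so the same measurement can be reused to compare datasets or batch sizes before committing to a full run.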