BennyTMT / LLMsForTimeSeries


I am confused #4

Closed nightforwar closed 4 weeks ago

nightforwar commented 1 month ago

Thank you for the article. I have reproduced some of the results, and I plan to present this paper at a group meeting next week. However, after reading it I am still a bit confused: it seems that after removing the LLM, the model is essentially just a simple linear layer, yet its prediction performance is even better. It even outperforms far more complex architectures, such as Transformers or CNN-based models like TimesNet. This is hard to understand. Why hasn't anyone used a simple linear layer to achieve these results before? Thanks to the authors!

BennyTMT commented 1 month ago

There is a highly influential paper that might be helpful to you: "Are Transformers Effective for Time Series Forecasting?" (Zeng et al., AAAI 2023). It showed that simple one-layer linear models (DLinear/NLinear) can match or outperform sophisticated Transformer-based forecasters on the standard long-horizon benchmarks, so the observation you describe is not unprecedented.
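To make the "just a simple linear layer" point concrete, here is a minimal DLinear/NLinear-style sketch in PyTorch. It is illustrative only, not the exact ablation code in this repo; the class name and shapes are assumptions. A single `nn.Linear` maps the lookback window directly to the forecast horizon, independently per channel:

```python
# Minimal sketch of a linear-only forecaster (illustrative, not the repo's code).
import torch
import torch.nn as nn

class LinearForecaster(nn.Module):
    def __init__(self, seq_len: int, pred_len: int):
        super().__init__()
        # One weight matrix: lookback length -> horizon length
        self.proj = nn.Linear(seq_len, pred_len)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: [batch, seq_len, channels]
        x = x.permute(0, 2, 1)       # [batch, channels, seq_len]
        y = self.proj(x)             # [batch, channels, pred_len]
        return y.permute(0, 2, 1)    # [batch, pred_len, channels]

# Example: forecast 96 steps from a 336-step lookback, 7 channels
model = LinearForecaster(seq_len=336, pred_len=96)
out = model(torch.randn(32, 336, 7))
print(out.shape)  # torch.Size([32, 96, 7])
```

That is the entire model: no attention, no pretrained weights, just a learned linear map from history to future per channel.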

From my personal experience and experiments, patching is also a very useful technique and a key contributor to time series forecasting (TSF) performance. For more details, please refer to the final section of our paper.
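For reference, here is a minimal sketch of what patching typically looks like, in the PatchTST style. The names and hyperparameters (`patch_len`, `stride`, `d_model`) are illustrative assumptions, not the implementation in this repo. The series is sliced into overlapping patches, each patch is embedded with a shared linear layer, and a flat head maps the patch embeddings to the horizon:

```python
# Minimal patching sketch (PatchTST-style; illustrative assumptions only).
import torch
import torch.nn as nn

class PatchLinear(nn.Module):
    def __init__(self, seq_len=336, pred_len=96, patch_len=16, stride=8, d_model=128):
        super().__init__()
        self.patch_len, self.stride = patch_len, stride
        n_patches = (seq_len - patch_len) // stride + 1
        self.embed = nn.Linear(patch_len, d_model)        # shared patch embedding
        self.head = nn.Linear(n_patches * d_model, pred_len)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: [batch, seq_len, channels], processed channel-independently
        x = x.permute(0, 2, 1)                            # [B, C, seq_len]
        p = x.unfold(-1, self.patch_len, self.stride)     # [B, C, n_patches, patch_len]
        z = self.embed(p)                                 # [B, C, n_patches, d_model]
        y = self.head(z.flatten(-2))                      # [B, C, pred_len]
        return y.permute(0, 2, 1)                         # [B, pred_len, C]

model = PatchLinear()
print(model(torch.randn(32, 336, 7)).shape)  # torch.Size([32, 96, 7])
```

Patches give each token a local window of context rather than a single time step, which is one plausible reason patching contributes so much to performance.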

Thanks for your interest in the repo and paper. Good luck with your presentation :)