ameliawong1996 / From_News_to_Forecast

This repository is for the paper entitled: From News to Forecast: Iterative Event Reasoning in LLM-Based Time Series Forecasting (NeurIPS 2024)
MIT License
22 stars 3 forks source link

Request additional information (val_num, train_data of Exchange-Rate, trafic, and Bitcoin) #3

Open 137809404 opened 2 days ago

137809404 commented 2 days ago

I have been exploring your project and noticed some missing details in the code regarding the Exchange-Rate, traffic, and Bitcoin datasets. Specifically, I would appreciate clarification on the validation split ratio and the number of samples used, particularly for the Exchange-Rate dataset. Additionally, could you specify which columns are intended for prediction?

If possible, could you also provide the final datasets of Exchange-Rate, traffic, and Bitcoin used for LoRA fine-tuning? Your help would be greatly appreciated!

Thank you for your time.

ameliawong1996 commented 2 days ago

Thank you for your interest in our project and for your detailed questions!

For all datasets (including Electricity, Exchange Rate, Traffic, and Bitcoin), we used a validation split ratio of 20% of the total samples, with the remaining 80% for training. The exact number of samples may vary slightly depending on any preprocessing steps you perform, but the dataset configuration in the code should align closely with these ratios.

Target Variables

Exchange Rate Dataset Column Descriptions

To further clarify, here are the detailed descriptions for each column in the Exchange Rate dataset:

Please feel free to reach out if you have further questions or encounter any issues with the setup. In the meantime, you can generate training datasets based on the examples provided in Appendix 6.2 of the paper.

137809404 commented 2 days ago

Thank you so much for your detailed response!