Closed sonaalKant closed 4 months ago
@sonaalKant What is the expected timeline?
In the models sheet, Evaluated on benchmark
meant what other test set they have evaluated on besides the test split.
@sonaalKant Could you please review the docs here?
Also, could you please provide your feedback on the following:
This looks good. I will close this issue as it has covered preliminary research in models and datasets. I will file another issue about next steps.
Good stuff.
This is the first step towards PEOE. I think we have some preliminary findings /survey here https://drive.google.com/drive/u/3/folders/1FKEkRZwshZhj_hSB-n0HaOkeneBiFnOY
The task involves leveraging a Large Language Model (LLM) for sentiment analysis on financial news. The primary objective is to:
Locate and acquire a publicly available dataset containing financial news articles (see if there are some on crypto). Create a spreadsheet with columns
[ dataset name, dataset source, size of dataset, Cost(Free/paid), label, any benchmark available or not, Contains crypto news ]
Start with creating List of LLMs which are currently finetuned on financial data/news. Create a spreadsheet with columns
[Model Name, Model download link, Model trained/finetuned on datasets, evaluated on benchmark]
For reference on dataset and publicly available models you can refer https://paperswithcode.com/ , https://huggingface.co/models and https://huggingface.co/datasets.
FYI @gpsaggese @samarth9008 @jsmerix