Open davidsbatista opened 4 months ago
Regarding the two datasets for which we already have working evaluation code:
The ARAGOG dataset:
The SQuAD dataset:
For the mini ESG dataset:
The main files are all public:
https://sustainability.aboutamazon.com/2022-sustainability-report.pdf
https://www.apple.com/environment/pdf/Apple_Environmental_Progress_Report_2022.pdf
https://sustainability.fb.com/wp-content/uploads/2022/06/Meta-2021-Sustainability-Report.pdf
https://sustainability.google/reports/google-2022-environmental-report/
and the question-answer pairs are here: https://github.com/run-llama/llama-datasets/blob/main/llama_datasets/mini_esg_bench/rag_dataset.json
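
A minimal sketch of how the question-answer pairs could be pulled out of a local copy of `rag_dataset.json` for evaluation. It assumes the file follows the usual llama-datasets layout: a top-level `examples` list whose items carry `query` and `reference_answer` keys. I have not verified that against this specific file, so check the keys first. The function names `parse_qa_pairs` and `load_qa_pairs` are hypothetical helpers, not part of any library.

```python
import json
from typing import Any, Dict, List, Tuple


def parse_qa_pairs(raw: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Extract (question, answer) pairs from an already-parsed dataset dict.

    Assumption: `raw` has an "examples" list whose items contain
    "query" and "reference_answer". Items missing either key are skipped.
    """
    pairs = []
    for example in raw.get("examples", []):
        question = example.get("query")
        answer = example.get("reference_answer")
        if question is not None and answer is not None:
            pairs.append((question, answer))
    return pairs


def load_qa_pairs(path: str) -> List[Tuple[str, str]]:
    """Read a local copy of rag_dataset.json and return its QA pairs."""
    with open(path, encoding="utf-8") as fh:
        return parse_qa_pairs(json.load(fh))
```

Usage would be something like `pairs = load_qa_pairs("rag_dataset.json")` after downloading the file from the link above.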
Issue opened on the llama-datasets repo: https://github.com/run-llama/llama-datasets/issues/54