Closed jon-chuang closed 1 year ago
Hi, @jon-chuang! I'm Dosu, and I'm here to help the LlamaIndex team manage their backlog. I wanted to let you know that we are marking this issue as stale.
From what I understand, the issue you created discusses the evaluation of integrations for the project. You outlined various tasks to investigate, such as retrieval, RAG/Knowledge-Intensive QA, and MMLU. You also provided references and areas for further research. However, there hasn't been any activity on the issue since then.
Before we close this issue, we wanted to check with you if it is still relevant to the latest version of the LlamaIndex repository. If it is, please let us know by commenting on the issue. Otherwise, feel free to close the issue yourself, or the issue will be automatically closed in 7 days.
Thank you for your contribution, and we look forward to hearing from you soon!
Best regards, Dosu
Feature Description
TODO: investigate more RAG-specific benchmarks rather than retrieval-only or generation-only.
References / Areas of Exploration
t=t_1 to t=t_2
) v.s. an aggregate over time fromt=0
. Otherwise the dataset is too massive (110GB compressed). Even the index is massive (400MB+). See issue: https://github.com/deepmind/streamingqa/issues/2Target Domains
TODO: