Open Jonathan-Adly opened 6 days ago
@Jonathan-Adly
Evaluation:
python evaluate.py --api-key="my-key"
and run the full Vidore evaluations.python evaluate.py --api-key="my-key" --collection_name arxivqa_collection
and run just the arxivqa portion. The same should be with the other benchmarksUpsertion: Code is done in main.py. to-do list here:
Report:
We are almost there!
Report:
- We want a table similar to the one in the ColPali paper with the current leader in the Vidore leaderboard, the original ColPali numbers/comparsions, and ours. (Currently pursuing)
Done
we still have upsertion edits.
@HalemoGPA - as discussed, here is the goals of this task. We want to run the full eval via a simple command:
Evaluation:
python evaluate.py --api-key="my-key"
and run the full Vidore evaluations.python evaluate.py --api-key="my-key" --collection_name arxivqa_collection
and run just the arxivqa portion. The same should be with the other benchmarksUpsertion:
python upsert.py --api-key="my-key"
and this will upsert the vidore documents in the backgroundpython upsert.py --api-key="my-key" --collection_name arxivqa_collection
and this will upsert only the arxiqa_collectionReport: