Open Silence-o0 opened 4 weeks ago
Collect a set of PDF or DOCX files suitable for the project task, ensuring variation in content types.
Partially done. Collected so far:
https://huggingface.co/datasets/anakib1/mango-truth
I will also look for other data, possibly the data @AntonGog171 suggested.
(@AntonGog171, please link your data here.)