microsoft / LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Other
1.29k stars 148 forks source link

support multiprocessing and pmc file removal to save disk space #38

Closed tgisaturday closed 2 months ago

tgisaturday commented 6 months ago

For faster download of pmc articles and image extraction, I've applied multiprocessing.

I've also added additional flag that removes pmc_articles after image extraction to save disk space.