NVIDIA / NeMo-Curator

Scalable toolkit for data curation
Apache License 2.0
327 stars 32 forks source link

Where to obtain domain and quality model files? #86

Open randerzander opened 1 month ago

randerzander commented 1 month ago

I'm trying to follow the data classification tutorial, which refers to some .pth model files.

Where can I obtain these from? I'd have expected the library to automatically pull them from Hugging Face?

ayushdg commented 1 month ago

Possibly a duplicate of #72

ryantwolf commented 4 days ago

Domain classifier available here. I will close the issue when we release the quality classifier as well.