NVIDIA / NeMo-Curator

Scalable data pre-processing and curation toolkit for LLMs
Apache License 2.0
597 stars, 81 forks

Improve NeMo Curator Experience for PyTorch Models (with crossfit) #288

Open VibhuJawa opened 1 month ago

VibhuJawa commented 1 month ago

Is your feature request related to a problem? Please describe.

Based on user feedback, we need to fix the following to improve the user experience:

Christina-Young-NVIDIA commented 1 month ago

At risk because not all of the items above will be completed in this sprint. Vibhu to break this into two issues.