rapidsai / rapids.ai

rapids.ai web site
https://rapids.ai

Add NeMo Curator section #376

Open exactlyallan opened 4 months ago

exactlyallan commented 4 months ago

Add NeMo Curator to the RAPIDS Accelerated section with the following description: _"NeMo Curator is a Python library designed for scalable and efficient dataset preparation, enhancing LLM training accuracy through GPU-accelerated data curation using Dask and RAPIDS. It offers a customizable and modular interface that simplifies pipeline expansion and accelerates model convergence by preparing high-quality tokens."_ Include links to its GitHub repository and docs.