huggingface / datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Apache License 2.0
1.97k stars 139 forks source link

Can you make a release with the latest batch version? #272

Closed nldxtd closed 1 month ago

nldxtd commented 1 month ago

Good lib to use! I am using the datatrove to handle my data, can you make the a latest release with the main branch? thx!

guipenedo commented 1 month ago

Hi, we've just pushed version 0.3.0 to pypi :)