allenai / dolma

Data and tools for generating and inspecting OLMo pre-training data.
https://allenai.github.io/dolma/
Apache License 2.0
894 stars 90 forks source link

PyPI release #167

Open baberabb opened 3 months ago

baberabb commented 3 months ago

Hi! Is it possible to cut a new version to PyPI. The current one installs all the optional dependencies and some of them have specific build requirements (e.g. LTpycld2 requires build-essential on Linux systems.)