argilla-io / biome-text

Custom Natural Language Processing with big and small models 🌲🌱
https://recognai.github.io/biome-text/
Other
68 stars 7 forks source link
allennlp data-science natural-language-processing nlp pytorch



CI GitHub Documentation GitHub release

Natural Language Processing library built with AllenNLP

## Quick Links - [Documentation](https://recognai.github.io/biome-text/) ## Features * State-of-the-art and not so state-of-the-art models trained with **your own data** with simple workflows. * **Efficient data reading** for (large) datasets in multiple formats and sources (CSV, Parquet, JSON, etc.). * **Modular configuration and extensibility** of models, datasets and training runs programmatically or via config files. * Use via **`cli`** or as plain Python (e.g., inside a Jupyter Notebook) * **Compatible with AllenNLP** ## Installation For the installation we recommend setting up a fresh [conda](https://docs.conda.io/en/latest/miniconda.html) environment: ```shell script conda create -n biome python~=3.7.0 pip>=20.3.0 conda activate biome ``` Once the conda environment is activated, you can install the latest release via pip: ````shell script pip install -U biome-text ```` After installing *biome.text*, the best way to test your installation is by running the *biome.text* cli command: ```shell script biome --help ``` ## Get started The best way to see how *biome.text* works is to go through our [first tutorial](https://recognai.github.io/biome-text/master/documentation/tutorials/1-Training_a_text_classifier.html). Please refer to our [documentation](https://recognai.github.io/biome-text) for more tutorials, detailed user guides and how you can [contribute](https://recognai.github.io/biome-text/master/documentation/community/1-contributing.html) to *biome.text*. ## Licensing The code in this project is licensed under Apache 2 license.