piskvorky / gensim-data

Data repository for pretrained NLP models and NLP corpora.
https://rare-technologies.com/new-api-for-pretrained-nlp-models-and-datasets-in-gensim/
GNU Lesser General Public License v2.1

Own Corpus #50

Open marcgehring1995 opened 2 years ago

marcgehring1995 commented 2 years ago

Hello everyone,

I would like to create my own corpus. The problem is that I'm not allowed to share the data that the corpus will be built on.

Is there any way I can create my own corpus without sharing the data?

Kind regards

Marc