Closed siebeniris closed 2 years ago
Given the nature and sources of the data, we are legally unable to share these datasets. This is unfortunate for reproducability, but there is nothing we can do about it. I hope you understand.
Okay, thanks for the quick reply :)
Hello,
thank you very much for BERTje!
We would like to do some analysis regarding BERT models in different languages. Is it possible to release the data you used for pre-training the model. Especially the ones without citations:
Thank you very much!