allenai / scibert

A BERT model for scientific text.
https://arxiv.org/abs/1903.10676
Apache License 2.0
1.48k stars, 217 forks

Language Distribution #57

Open fhaase2 opened 5 years ago

fhaase2 commented 5 years ago

Can you provide any information about the language distribution of the Semantic Scholar corpus? I can't find any information on whether languages other than English are included.

Thanks!

ibeltagy commented 4 years ago

It is just English.