h1alexbel / srdataset

GitHub repositories dataset that contains sample repositories (SRs), with their metrics and metadata
MIT License
4 stars 0 forks

skip non-English repositories with langdetect #13

Closed h1alexbel closed 2 weeks ago

h1alexbel commented 2 weeks ago

Let's use langdetect to skip repositories whose textual data is not in English.
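A minimal sketch of how such a filter might look, not the project's actual implementation. It assumes each repository is a dict with a text field; the field names `repo` and `readme` and the helpers `is_english` and `keep_english` are hypothetical. Treating empty or undetectable text as non-English is also an assumption. The default detector relies on `detect`, `DetectorFactory.seed`, and `LangDetectException` from the `langdetect` package.

```python
from typing import Callable, Dict, Iterable, List, Optional


def _langdetect(text: str) -> Optional[str]:
    """Return a language code such as 'en', or None when detection fails."""
    # Imported lazily so this module loads even if langdetect is not installed.
    from langdetect import DetectorFactory, detect
    from langdetect.lang_detect_exception import LangDetectException

    # langdetect is non-deterministic unless a seed is set.
    DetectorFactory.seed = 0
    try:
        return detect(text)
    except LangDetectException:
        # Raised when the text has no usable features (e.g. only digits).
        return None


def is_english(
    text: Optional[str],
    detector: Callable[[str], Optional[str]] = _langdetect,
) -> bool:
    """True only when the detector reports English for non-blank text."""
    if not text or not text.strip():
        return False  # assumption: blank text counts as non-English
    return detector(text) == "en"


def keep_english(
    repos: Iterable[Dict[str, str]],
    field: str = "readme",  # hypothetical field name
    detector: Callable[[str], Optional[str]] = _langdetect,
) -> List[Dict[str, str]]:
    """Keep repositories whose text field is detected as English."""
    return [r for r in repos if is_english(r.get(field), detector)]
```

With `langdetect` installed, `keep_english(repos)` would filter on the `readme` field. Passing a custom `detector` makes it possible to swap the library or to test the filter without it. Detection on very short strings tends to be unreliable, so a minimum-length check may be worth adding.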