h1alexbel / srdataset

GitHub repositories dataset that contains sample repositories (SRs), with their metrics and metadata
MIT License
4 stars 0 forks source link

check language of the `readme` only after translation to the plain text #22

Closed h1alexbel closed 3 months ago

h1alexbel commented 3 months ago

We need to check the language using english(text) only after readme got translated to the text, since in this case we detecting language for plain text, not markdowd, thereby filtering gets more clean