h1alexbel / srdataset

GitHub repositories dataset that contains sample repositories (SRs), with their metrics and metadata
MIT License
4 stars 0 forks source link

preprocess markdown into text via HTML translation #14

Closed h1alexbel closed 2 weeks ago

h1alexbel commented 2 weeks ago

Let's use markdown and beautifulsoup4 for this purpose