h1alexbel / srdataset

GitHub repositories dataset that contains sample repositories (SRs), with their metrics and metadata
MIT License
4 stars 0 forks source link

feat(#13): english + null filter #21

Closed h1alexbel closed 2 weeks ago

h1alexbel commented 2 weeks ago

closes #13


PR-Codex overview

This PR updates dependencies, refactors scripts, and enhances workflow automation. It introduces language detection, formalizes data collection, and improves repository filtering.

Detailed summary

The following files were skipped due to too many changes: tests/filter-expected.csv, tests/test.csv

✨ Ask PR-Codex anything about this PR by commenting with /codex {your question}

h1alexbel commented 2 weeks ago

@rultor merge

rultor commented 2 weeks ago

@rultor merge

@h1alexbel OK, I'll try to merge now. You can check the progress of the merge here

rultor commented 2 weeks ago

@rultor merge

@h1alexbel Done! FYI, the full log is here (took me 8min)