This PR updates dependencies, refactors scripts, and enhances workflow automation. It introduces language detection, formalizes data collection, and improves repository filtering.
Detailed summary
Added langdetect for language detection
Replaced metrics.sh with structure.py and filter.py
Updated workflow in Makefile
Renamed workflow file to copyrights.yml
Added filtering functionality in apply_filter.py
Added tests for filtering and English language detection
The following files were skipped due to too many changes: tests/filter-expected.csv, tests/test.csv
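For reviewers unfamiliar with langdetect, here is a minimal sketch of what an English-language check might look like. It is not the code from this PR: the helper name `is_english` and its injectable `detect` parameter are hypothetical, chosen so the check can be tried out without the library installed. Only `langdetect.detect` and `DetectorFactory.seed` are actual langdetect API.

```python
from typing import Callable, Optional


def is_english(text: str, detect: Optional[Callable[[str], str]] = None) -> bool:
    """Return True when `text` is detected as English.

    `detect` is a hypothetical injection point: any callable that maps a
    string to a language code. When omitted, langdetect is used.
    """
    # Empty or whitespace-only text carries no signal, so treat it as non-English.
    if not text or not text.strip():
        return False

    if detect is None:
        # Imported lazily so the helper can be exercised with a stub detector.
        from langdetect import DetectorFactory, detect as langdetect_detect

        # langdetect is non-deterministic by default; a fixed seed makes
        # repeated runs return the same result.
        DetectorFactory.seed = 0
        detect = langdetect_detect

    try:
        return detect(text) == "en"
    except Exception:
        # langdetect raises LangDetectException for text with no usable
        # features (for example digits only); count that as non-English.
        return False
```

Passing a stub such as `detect=lambda t: "en"` keeps tests independent of the detector, which can help because langdetect results on very short strings are unreliable.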
closes #13