The focus of this PR is to rename files and variables for consistency and clarity, update dataset building process details, and add new files for different types of embeddings.
Detailed summary
Renamed numerics.py to numerical.py
Updated dataset building process details in README.md
Added new files for different types of embeddings: readme-embeddings.csv, description-embeddings.csv, topics-embeddings.csv
Changed the environment variable GITHUB_TOKEN to HF_TOKEN
Updated print statements for dataset preparation in Python script
✨ Ask PR-Codex anything about this PR by commenting with /codex {your question}
ref #29
PR-Codex overview
The focus of this PR is to rename files and variables for consistency and clarity, update dataset building process details, and add new files for different types of embeddings.
Detailed summary
numerics.py
tonumerical.py
README.md
readme-embeddings.csv
,description-embeddings.csv
,topics-embeddings.csv
GITHUB_TOKEN
toHF_TOKEN