issues
search
h1alexbel
/
srdataset
GitHub repositories dataset that contains sample repositories (SRs), with their metrics and metadata
MIT License
4
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
feat(#49): more english tests
#50
h1alexbel
closed
1 day ago
3
test_english.py:55-57: Language detection produces...
#49
0pdd
opened
1 day ago
0
embed.py:31-35: Generate embeddings for all the text in...
#48
0pdd
opened
1 day ago
0
feat(#37, #29): numericals
#47
h1alexbel
closed
1 day ago
8
feat(#43): look for CLUSTER=true
#46
h1alexbel
closed
4 days ago
3
feat(#41): push to hf
#45
h1alexbel
closed
4 days ago
3
feat(#37): agglomerative, dbscan, gmm for numerics.csv, simple plots
#44
h1alexbel
closed
4 days ago
3
Makefile:43-45: Look for CLUSTER=true option in order to...
#43
0pdd
closed
4 days ago
1
feat(#37): kmeans on numerical.csv
#42
h1alexbel
closed
1 week ago
3
push all aggregated CSVs to the HuggingFace
#41
h1alexbel
closed
4 days ago
0
Add an ability to run clustering on prepared data in HuggingFace/GitHub release
#40
h1alexbel
opened
1 week ago
0
add linters
#39
h1alexbel
opened
1 week ago
1
better readme
#38
h1alexbel
opened
1 week ago
0
analyze collected datasets with clustering models
#37
h1alexbel
opened
1 week ago
2
feat(#29): texts, text-embeddings
#36
h1alexbel
closed
1 week ago
3
chore(deps): update dependency sentence-transformers to v3
#35
renovate[bot]
opened
1 week ago
0
feat(#33): semantic_similarity.py
#34
h1alexbel
closed
1 week ago
3
output similar repositories for provided head
#33
h1alexbel
closed
1 week ago
0
doc(#29): numerical in data.sh
#32
h1alexbel
closed
1 week ago
3
doc(#29): docs for generated CSVs, numerical
#31
h1alexbel
closed
1 week ago
3
feat(#29): numerics, filter push to repos.csv directly
#30
h1alexbel
closed
1 week ago
3
define three datasets: numerics, textual embeddings, numerics + embeddings
#29
h1alexbel
opened
1 week ago
1
feat(#15): embed.py, infer
#28
h1alexbel
closed
1 week ago
6
feat(#22): md_to_text before filter by language
#27
h1alexbel
closed
1 week ago
3
feat(#25): apply_structure.py, structure.py
#26
h1alexbel
closed
1 week ago
3
structure data obtained with `ghminer` into a dataset that suitable for embedding generation
#25
h1alexbel
closed
1 week ago
0
feat(#23): replace null topics with `[]`
#24
h1alexbel
closed
1 week ago
3
replace null topics with an empty array
#23
h1alexbel
closed
1 week ago
0
check language of the `readme` only after translation to the plain text
#22
h1alexbel
closed
1 week ago
0
feat(#13): english + null filter
#21
h1alexbel
closed
1 week ago
3
chore(deps): update dependency numpy to v2
#20
renovate[bot]
opened
1 week ago
0
chore(deps): update actions/setup-python action to v5
#19
renovate[bot]
closed
2 weeks ago
6
chore(deps): update dependency pytest to v8.2.2
#18
renovate[bot]
closed
2 weeks ago
6
feat(#14): md_to_text.py
#17
h1alexbel
closed
2 weeks ago
0
push `embeddings.csv` with generated embeddings to Hugging Face dataset
#16
h1alexbel
closed
1 week ago
0
encode textual data into embeddings via Hugging Face inference endpoint
#15
h1alexbel
closed
1 week ago
0
preprocess markdown into text via HTML translation
#14
h1alexbel
closed
2 weeks ago
0
skip non-English repositories with langdetect
#13
h1alexbel
closed
1 week ago
0
feat(#11): collect, how it works
#12
h1alexbel
closed
3 weeks ago
0
collect script
#11
h1alexbel
closed
3 weeks ago
0
feat(#7): install all requirements
#10
h1alexbel
closed
3 weeks ago
0
metrics.py:26-29: Compute CPD and RC metrics too. We need...
#9
0pdd
closed
1 week ago
2
feat(#7): metrics.sh
#8
h1alexbel
closed
3 weeks ago
0
compute metrics
#7
h1alexbel
closed
3 weeks ago
2
Publish to Zenodo/HuggingFace
#6
h1alexbel
opened
1 month ago
0
Makefile:31-35: Create test script for the whole package....
#5
0pdd
closed
2 weeks ago
1
chore(deps): update dependency ubuntu to v22
#4
renovate[bot]
opened
1 month ago
0
chore(deps): update actions/checkout action to v4
#3
renovate[bot]
closed
2 weeks ago
6
Dependency Dashboard
#2
renovate[bot]
opened
1 month ago
0
skeleton
#1
h1alexbel
closed
3 weeks ago
14
Next