-
@audrism
When analyzing a project's activity over time, commit counts and _unique_ author counts per month (not counting one author multiple time if they have made multiple commits) are great metr…
-
## GitHub Issues
Dataset URL - [here](https://huggingface.co/datasets/lewtun/github-issues)
Does the dataset exists in a scraped format ?
URL if Yes - [here](https://huggingface.co/datasets/…
-
Currently, this is the behavior of clicking on a node:
**Github Icon**: Go to github.com
**Issue title**: Go to that issue on github.com
**Clicking/Hovering depends/related/blocks icons**: Nothin…
-
-
(Housekeeping - I move the original issue written here by @abitrolly into #8)
Enhance the OSCI algorithm to filter only projects with open-source licenses.
This will require some external dataset…
-
* Wikipedia 내에서 20개 언어(국가)의 과학기술 구조 차이를 본 이후, 이를 풀어가기 위해 다른 류의 국가별 상관관계와 매칭을 해 보는 것이 좋을 것 같습니다.
* 이를 위해 현재 가지고 있는 데이터에 대해 공유하고, 사용 가능성에 대해 논의했으면 싶습니다.
-
@agcolom requested that I open a single issue with all requests.
- [ ] Make the tracking code public
- [ ] Monitor all repositories
- [ ] Track pull requests and issues
- [ ] Track opens and closes, n…
-
Consider:
* Can the GH archive timeline tell us about specific files?
* Can we determine likely repositories to scan using some heuristics?
* Is the size of the archive a plus or a negative, or b…
-
We can take some rule-based approach as a benchmark: email contains `bot` word or `no-reply`. However, there are emails like `tensorflow-gardener@tensorflow.org` that is hard to find. So some ML shoul…
-
# Rovers
Get also if the repository is a fork and which is the parent repository. This can be done checking if `"fork": true` in the JSON and getting with the api the repository and checking `sourc…