leonbusy / Hanliang_Project

0 stars 0 forks source link

Experiment with API Programming for Gathering Data from GitHub #5

Open twsswt opened 2 years ago

twsswt commented 2 years ago

This contains a reasonably good code base for downloading data from the GitHub API.

https://git.dcs.gla.ac.uk/cornichon/github-pipeline

The actual API is here:

https://docs.github.com/en/rest

Existing code base for github-pipeline uses this Python client:

https://github.com/PyGithub/PyGithub

The GitHub pipeline code deals with throttling management to avoid getting blocked. You only need to write the component that downloads the pull request data.

Write some prototype code for downloading pull requests from selected repositories.

twsswt commented 2 years ago

If you don't use anything else, strongly suggest using the rate limiter:

https://git.dcs.gla.ac.uk/cornichon/github-pipeline/-/blob/master/github_pipeline/api_wrapper/functions.py