ls1intum / Hephaestus

Hephaestus - A Generative AI Mentor for Agile Software Development
MIT License
7 stars 1 forks source link

Caching and UPDATED-sorting for the Github Data Sync Algorithm #94

Closed GODrums closed 1 week ago

GODrums commented 2 weeks ago

Motivation

The current data fetching algorithm didn't collect all the required data and had massive performance issues. This PR aims to fix both of these issues. With this PR, the table should be on-par with the data from the original Python script.

Description

This PR includes multiple improvements:

Performance Test

#PRs develop-branch PR (no caching) PR (caching)
8 90s - -
53 impossible 5 min 3 min

In my test runs the new fetching process now takes about 3 minutes (53 PRs), in comparison to the original 1 minute (8 PRs) and 5 minutes without caching (53 PRs).

Screenshots (if applicable)

Checklist

General

Server (if applicable)