SwissDataScienceCenter / renku-graph

https://renku.readthedocs.io/en/latest/reference/services/graph-services.html?highlight=graph#graph-services
Apache License 2.0

Abandoned projects clean-up #1181

Closed jachro closed 11 months ago

jachro commented 2 years ago

As a renku user, I'd like search results not to include projects that are very old or were created only to try renku out or for testing purposes.

The problem:

The amount of data in our deployments (renkulab.io specifically) is growing quite fast. That is a great thing, but it has consequences. One is the general responsiveness of the API; the other is data quality and noise. A good number of projects are old and no longer working (for instance, created with very old versions of the renku CLI), some are test projects created just to get a feel for what renku is, and some are test projects we created ourselves to verify certain features. All of them are discoverable through the search API, which makes search both slow and polluted with low-quality results.

The solution:

It seems doable to denoise our data by simply removing the data of abandoned projects from the KG, while keeping the projects in GitLab so users can bring them back if needed.

Acceptance criteria:

jachro commented 11 months ago

Done as part of the token expiration removal feature.