whotracksme / whotracks.me

Data from the largest and longest measurement of online tracking.
https://www.ghostery.com/whotracksme
MIT License

Migrate datasets from Git LFS to Amazon S3 #239

Closed y3ti closed 3 years ago

y3ti commented 3 years ago

We decided to migrate WhoTracks.me datasets from Git LFS to Amazon S3.

Git LFS worked very well for us, but the amount of data grows every month, and we started hitting the limits described by Philipp in #231.

I have also changed the GitHub Actions workflow. It now fetches much smaller datasets, because we were running out of free disk space: the GitHub-hosted runner has only 14 GB of SSD disk space.
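To illustrate the disk-space constraint, a workflow step could check the free space before fetching a dataset. This is only a minimal sketch and not code from this PR: the helper name `has_room`, its `reserve_bytes` parameter, and the 2 GiB size in the usage example are all hypothetical.

```python
import shutil

GIB = 1024 ** 3  # bytes in one gibibyte


def has_room(required_bytes, path=".", reserve_bytes=0):
    """Return True if `path` has room for `required_bytes`.

    `reserve_bytes` is a hypothetical safety margin that is kept free
    for the rest of the job (checkout, caches, build output).
    """
    free = shutil.disk_usage(path).free
    return free - reserve_bytes >= required_bytes


if __name__ == "__main__":
    # Assumed example: a 2 GiB dataset with a 1 GiB safety margin.
    if has_room(2 * GIB, reserve_bytes=1 * GIB):
        print("enough space, fetching dataset")
    else:
        print("not enough space, skipping dataset")
```

A check like this fails early with a clear message instead of letting the job die halfway through a download.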

What's changed:

Fixes #231