this adds a one_off:backfill_nuget_published_at rake task that will...
1) pull a list of all NuGet Version records where published_at: '1900-01-01 00:00:00', which means "unlisted" on NuGet (or "Deprecated" in Libraries terms)
2) iterate over release events in NuGet's Catalog API endpoint
3) if a release from the API is in the list from step (1) and its API date is not 1900, treat the API date as the original "published" date: update the Version record to that date and mark it as status: "Deprecated"
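
For reviewers, the matching rule in steps (1) to (3) can be sketched roughly as below. This is an in-memory illustration only, not the task's actual code: `VersionRow`, `backfill_from_events`, and the event hash keys (`:id`, `:version`, `:published`) are hypothetical names, and "first non-1900 date wins" and case-insensitive name matching are assumptions of the sketch.

```ruby
require "time"

# Year NuGet uses in published dates to signal an unlisted release.
UNLISTED_SENTINEL_YEAR = 1900

# Hypothetical in-memory stand-in for a Version record (not the real model).
VersionRow = Struct.new(:name, :number, :published_at, :status)

# unlisted_versions: rows whose published_at is the 1900 sentinel (step 1).
# events: hashes shaped like { id:, version:, published: } (step 2).
# Returns the rows that were updated (step 3).
def backfill_from_events(unlisted_versions, events)
  # Index by [lowercased package name, version number] for O(1) lookups.
  pending = unlisted_versions.to_h { |v| [[v.name.downcase, v.number], v] }
  updated = []

  events.each do |event|
    key = [event[:id].downcase, event[:version]]
    version = pending[key]
    next if version.nil? # not one of the unlisted versions

    published = Time.parse(event[:published]).utc
    next if published.year == UNLISTED_SENTINEL_YEAR # still no real date

    version.published_at = published
    version.status = "Deprecated"
    pending.delete(key) # assumption: the first real date found wins
    updated << version
  end

  updated
end
```

Rows that never show up with a non-1900 date are left untouched, so the sketch is safe to re-run over the same events.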
the catalog_start_idx argument lets you skip over catalog pages, so you don't redo work if you restart the task
the threads argument lets you adjust the level of parallelism when fetching release pages
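
A minimal sketch of how the two arguments could interact, assuming catalog pages are addressed by position in a list and fetched by a fixed pool of worker threads. `fetch_pages_in_parallel` and its keyword defaults are hypothetical, and the block stands in for whatever actually fetches a page:

```ruby
# Hypothetical helper: processes pages from catalog_start_idx onward,
# using `threads` workers. The block receives a page URL and returns a result.
def fetch_pages_in_parallel(page_urls, catalog_start_idx: 0, threads: 4, &fetch)
  jobs = Queue.new
  page_urls.each_with_index do |url, idx|
    # Pages before the start index are skipped entirely.
    jobs << [idx, url] if idx >= catalog_start_idx
  end
  jobs.close # workers get nil from pop once the queue is drained

  results = Queue.new
  workers = Array.new(threads) do
    Thread.new do
      while (job = jobs.pop)
        idx, url = job
        results << [idx, fetch.call(url)]
      end
    end
  end
  workers.each(&:join)

  # Workers finish in arbitrary order, so sort by page index.
  Array.new(results.size) { results.pop }.sort_by(&:first)
end
```

Keeping the page index alongside each result means a restarted run can pass the last completed index back in as catalog_start_idx.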
(followup to https://github.com/librariesio/libraries.io/pull/3398 )