purarue / google_takeout_parser

A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)
https://pypi.org/project/google-takeout-parser/
MIT License
84 stars 14 forks source link

add parser for subscriptions.json/likes.csv files #2

Open purarue opened 3 years ago

karlicoss commented 2 years ago

pre-2019: seems that subscriptions were in YouTube/subscriptions/subscriptions.opml but actually it's not an opml file at all, looks like HTML for the youtube subscription page as if it was downloaded without logging into the account so it basically just has the google login form o_o

karlicoss commented 2 years ago

for me, liked videos are

also perhaps makes sense to parse other playlists?

purarue commented 2 years ago

yeah, can parse other playlists

one other concern is how to merge this across takeouts, not sure if playlists would return individual items or the entire playlist as one object/event

not sure how to track which playlist is newer/if playlist items should be overwritten or not (probably not?)

karlicoss commented 2 years ago

yeah, I guess for playlists makes sense to return the latest only not sure what happens to deleted videos, might be interesting to check if they disappear from playlists