tweag / nix-hour

Questions for the weekly Nix Hour
MIT License
78 stars 3 forks source link

Scrape all existing episodes from YouTube #49

Open YorikSar opened 1 year ago

YorikSar commented 1 year ago

Split off minimal useful piece from https://github.com/tweag/nix-hour/pull/48.

Create episodes directory with Nix module definition and Go tool that can talk to YouTube API. Impement scrape command for the tool that fetches all data from YouTube and generates all local files.

In a separate commit are results of running that command.

Inspired by https://github.com/tweag/nix-hour/pull/47 by @con-f-use.

con-f-use commented 1 year ago

In response to @YorikSar in #47:

@con-f-use Thanks a lot for this! It inspired me to cut off an MVP of my big project of storing all possible metadata and just scrape existing episodes into separate dirs here: #49. I didn't generate one big file with all chapters though. Do you think it would be helpful? It should be easy to generate it if needed.

Hi Yuriy, glad it helpd at least a little bit. I think the one big index file is useful, if someone is looking for a specific topic but doesn't know which nixhour it was discussed in. I suppose one could use github's reposity search instead. That would search the individual files as well, so a unified index is not really necessary. However, it would still be nice for offline or 3rd-party tool usage, if one could download a single file, or you could just mention in the readme that people looking for a specific topic should use the github repo-search. Either way is fine, I think.

YorikSar commented 1 year ago

@con-f-use good point. I’ll add an index file there as well then.