Open devesh opened 3 years ago
@devesh Doing some preliminary research at the moment:
https://www.nhk.or.jp/school/keyword/?kw=メダカ&cat=all&from=1&sort=ranking
https://www.nhk.or.jp/school/keyword/?kw=<query>&cat=<category>&from=<page>&sort=<sorting>
`cat` values: `all`, `b` (program), `c` (clip)
`sort` values: `ranking` (most popular), `update` (most recent)
Besides `bangumi`, we also have `clip`:
https://www2.nhk.or.jp/school/movie/bangumi.cgi?das_id=D0005150191_00000 (program)
https://www2.nhk.or.jp/school/movie/clip.cgi?das_id=D0005311329_00000&p=box (clip)
https://www2.nhk.or.jp/school/movie/<type>.cgi?das_id=<id>
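
To make the two URL templates above concrete, here is a rough sketch of building a search URL and pulling the type and `das_id` out of a video URL. The function names and the regex are my own placeholders, not existing extractor code, and the pattern has only been checked against the two example URLs in this comment.

```python
import re
from urllib.parse import parse_qs, urlencode, urlparse

SEARCH_BASE = "https://www.nhk.or.jp/school/keyword/"

# Values observed in the notes above; anything else is untested.
CATEGORIES = {"all", "b", "c"}      # all, program, clip
SORTINGS = {"ranking", "update"}    # most popular, most recent


def build_search_url(query, category="all", page=1, sorting="ranking"):
    """Fill in the keyword-search URL template (hypothetical helper)."""
    if category not in CATEGORIES:
        raise ValueError(f"unknown category: {category!r}")
    if sorting not in SORTINGS:
        raise ValueError(f"unknown sorting: {sorting!r}")
    params = {"kw": query, "cat": category, "from": page, "sort": sorting}
    # urlencode percent-encodes non-ASCII queries as UTF-8.
    return SEARCH_BASE + "?" + urlencode(params)


# Assumed pattern: only `bangumi` and `clip` have been seen so far.
MOVIE_URL_RE = re.compile(
    r"https?://www2\.nhk\.or\.jp/school/movie/(?P<type>bangumi|clip)\.cgi"
)


def parse_movie_url(url):
    """Return (type, das_id) for a video URL, or None if it does not match."""
    match = MOVIE_URL_RE.match(url)
    if match is None:
        return None
    # Extra parameters such as `p=box` are ignored.
    das_id = parse_qs(urlparse(url).query).get("das_id", [None])[0]
    if das_id is None:
        return None
    return match.group("type"), das_id
```

Reading `das_id` from the parsed query string rather than from the regex means trailing parameters like `&p=box` don't need special handling.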
`kokugo` pages have a JSON version of the page (https://www.nhk.or.jp/school/kokugo/drill/meta/program.json), which will make them super easy to scrape.

`ouchi` pages, unlike `kokugo` pages, aren't very standardized on the frontend or backend, so I'm probably going to have to skip parsing them; sorry.

I created playlist ID 9178 if you want to try to implement support for it. I do not plan to use it myself, but somebody else might.
Similarly, I do not plan to download from `ouchi` pages and instead plan to use only the `kokugo` pages, so what you're proposing to implement support for is fantastic.
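
For the `kokugo` JSON endpoint mentioned earlier, a first step might look roughly like this. It only downloads the file and reports its top-level shape; I haven't mapped the schema, so nothing here assumes particular keys, and `fetch_json` and `describe` are placeholder names.

```python
import json
from urllib.request import urlopen

PROGRAM_JSON_URL = "https://www.nhk.or.jp/school/kokugo/drill/meta/program.json"


def fetch_json(url, timeout=10):
    """Download a URL and decode its body as JSON."""
    with urlopen(url, timeout=timeout) as resp:
        # utf-8-sig also tolerates a leading byte-order mark, if there is one.
        return json.loads(resp.read().decode("utf-8-sig"))


def describe(data):
    """Summarize the top level of decoded JSON without assuming a schema."""
    if isinstance(data, dict):
        return {"type": "object", "keys": sorted(data)}
    if isinstance(data, list):
        return {"type": "array", "length": len(data)}
    return {"type": type(data).__name__}


# Usage (needs network access):
#   print(describe(fetch_json(PROGRAM_JSON_URL)))
```

Once the real keys are known, `describe` can be swapped for code that walks the entries and collects the video IDs.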
Aside from the issue itself: they use Akamai!? They must be rich!
Anyway, I'll try to write an extractor for that later.