webrecorder / browsertrix-crawler

Run a high-fidelity browser-based web archiving crawler in a single Docker container
https://crawler.docs.browsertrix.com
GNU Affero General Public License v3.0
611 stars 79 forks source link

Crawling brightcove videos #89

Open mattwelly opened 2 years ago

mattwelly commented 2 years ago

Hi there, I'm trying to crawl an education website that streams the videos using brightcove player. My crawling attempts keep timing out on pages and I end up with no video pages when I play it back.

Any suggestion on how this can be resolved?

Cheers

ikreymer commented 2 years ago

Hi, can you share an example page that has the issue, or another page that has this player?

mattwelly commented 2 years ago

sure, try this: https://www.tvnz.co.nz/shows/home-learning-tv/episodes/sage-12-15-e1

ikreymer commented 2 years ago

I'm not able to access that outside of NZ, so I'll need to try something else..

mattwelly commented 2 years ago

there is one on Brightcove's website too: https://www.brightcove.com/en/products/player/

mattwelly commented 2 years ago

any clues?