mozilla / readability

A standalone version of the readability lib

YouTube videos not being extracted. #822

Open jkinggg opened 11 months ago

jkinggg commented 11 months ago

Hi All,

I'm encountering an issue with video iframes: YouTube videos embedded in Medium articles are not being extracted. Per the documentation, I would expect them to be extracted successfully. Does anyone have any advice on this?

Here is an example of an article: https://towardsdatascience.com/quick-fire-guide-to-multi-modal-ml-with-openais-clip-2dad7e398ac0

Here is the iframe I would expect to be extracted: <iframe id="player" frameborder="0" allowfullscreen="" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" title="Fast intro to multi-modal ML with OpenAI's CLIP" width="640" height="360" src="https://www.youtube.com/embed/989aKUVBfbk?wmode=opaque&amp;widget_referrer=https%3A%2F%2Ftowardsdatascience.com%2Fquick-fire-guide-to-multi-modal-ml-with-openais-clip-2dad7e398ac0&amp;enablejsapi=1&amp;origin=https%3A%2F%2Fcdn.embedly.com&amp;widgetid=1"></iframe>

Thanks, Johnny

gijsk commented 11 months ago

Can you provide a code sample that reproduces the problem, and perhaps the debug output (from passing debug: true in the options object/dictionary)?
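
As a starting point for such a reproduction, here is a minimal sketch of a check that compares the YouTube embeds in the HTML given to Readability with those in the HTML it returns. The helper names `findYouTubeEmbeds` and `compareEmbeds` are hypothetical and not part of Readability; the regex-based matching is an assumption chosen to keep the example dependency-free, not a full HTML parser.

```javascript
// Hypothetical helper (not part of Readability): collect the src of every
// YouTube iframe found in an HTML string. Regex matching is an assumption
// made for brevity; a real DOM parser would be more robust.
function findYouTubeEmbeds(html) {
  const iframeSrc = /<iframe\b[^>]*\bsrc\s*=\s*(["'])(.*?)\1/gi;
  const youtube = /^(https?:)?\/\/(www\.)?(youtube(-nocookie)?\.com\/embed\/|youtu\.be\/)/i;
  const srcs = [];
  let match;
  while ((match = iframeSrc.exec(html)) !== null) {
    // Undo the &amp; escaping used inside HTML attributes.
    const src = match[2].replace(/&amp;/g, "&");
    if (youtube.test(src)) {
      srcs.push(src);
    }
  }
  return srcs;
}

// Hypothetical helper: report which embeds exist in the source HTML
// but are absent from the extracted HTML.
function compareEmbeds(sourceHtml, extractedHtml) {
  const found = findYouTubeEmbeds(sourceHtml);
  const kept = new Set(findYouTubeEmbeds(extractedHtml));
  return { found, missing: found.filter((src) => !kept.has(src)) };
}
```

In an actual reproduction, `sourceHtml` would be the exact markup the document was built from, and `extractedHtml` would be the `content` field of the object returned by `new Readability(document, { debug: true }).parse()`. If `found` is already empty, the iframe never reached Readability; that is worth checking here, since the `origin=https%3A%2F%2Fcdn.embedly.com` parameter suggests the player may be loaded inside another frame rather than being present in the article's own HTML.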