postlight / parser

📜 Extract meaningful content from the chaos of a web page
https://reader.postlight.com
Apache License 2.0
5.41k stars 442 forks source link

convertLazyLoadedImages should be consider No Protocol-Relative URLs #575

Open lljxx1 opened 4 years ago

lljxx1 commented 4 years ago

https://github.com/postlight/mercury-parser/blob/15f7fa1e27fe6b47c87da40ba4fce9b2db7934ec/src/resource/utils/dom/convert-lazy-loaded-images.js#L17

<img src="" width="900" height="506" alt="" itemprop="url contentUrl" data-src="//cdn.iz.ru/sites/default/files/styles/900x506/public/news-2020-08/TASS_40788895.jpg?itok=pqXTqwzC" class="owl-lazy" />

Here is an example