j0k3r / graby

Graby helps you extract article content from web pages
MIT License

Rebuild siteconfig if host for singlepage is not the same as the so… #246

Closed: Kdecherf closed this 3 years ago

Kdecherf commented 3 years ago

Consider a single_page_link directive on twitter.com that follows any link found in a tweet. Here is an example with the following tweet:

https://twitter.com/Cloudflare/status/1341353044504694787

This eventually leads to the following page:

https://blog.cloudflare.com/beat-an-acoustics-inspired-ddos-attack/

Until now, Graby incorrectly sent the HTTP headers defined for twitter.com to cloudflare.com. With this change, the site config is rebuilt when the single-page link points to a different host.
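
To make the problem concrete, here is a minimal sketch of the idea in Python. It is not Graby's actual code (Graby is written in PHP). The SITE_CONFIGS table, the header value, and both function names are hypothetical and exist only for illustration. The sketch compares the host of the original URL with the host of the single-page link and rebuilds the config when they differ, so headers defined for one host are not sent to another.

```python
from urllib.parse import urlparse

# Hypothetical per-host configs; the header value is made up for illustration.
SITE_CONFIGS = {
    "twitter.com": {"http_headers": {"user-agent": "ExampleBot/1.0"}},
    "blog.cloudflare.com": {"http_headers": {}},
}


def build_site_config(url):
    """Return the config for the URL's host, or an empty default."""
    host = urlparse(url).hostname or ""
    return SITE_CONFIGS.get(host, {"http_headers": {}})


def headers_for_single_page(original_url, single_page_url):
    """Pick the headers to send when fetching the single-page link."""
    config = build_site_config(original_url)
    original_host = urlparse(original_url).hostname
    single_page_host = urlparse(single_page_url).hostname
    if single_page_host != original_host:
        # Different host: rebuild the config so that the headers
        # defined for the original host are not reused.
        config = build_site_config(single_page_url)
    return config["http_headers"]
```

Called with the tweet URL and the blog post URL from the example above, headers_for_single_page returns the (empty) headers of the blog.cloudflare.com entry rather than the ones defined for twitter.com.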

coveralls commented 3 years ago

Coverage increased (+0.03%) to 96.482% when pulling 898a6be4a6638b3d17470cb2f54890f7f072b614 on Kdecherf:singlepage-rebuild-site-config into 1c581bb80076d933d184dbaf540ee63dec74501c on j0k3r:master.