WildPHP / module-linksniffer

Provides details about links posted in a channel
MIT License

Sometimes Link Descriptions Include HTML #10

Archer70 closed 6 years ago

Archer70 commented 6 years ago

Linking this page: https://www.mozilla.org/en-US/firefox/developer/ results in the following. I've seen this before, and I'm not sure what's causing it.

[Screenshot from 2017-09-26 18-16-32]

NanoSector commented 6 years ago

Just ran some tests but they check out: https://regex101.com/r/kKkxHi/1/

Not sure what is causing it in this case.

Archer70 commented 6 years ago

¯\_(ツ)_/¯ It's either fixed, or the page has changed, because it's not doing it anymore.

NanoSector commented 6 years ago

It tends to happen on other pages too, though, so I'll leave this open for now.

NanoSector commented 6 years ago

Botergos still does this:

<NanoSector> https://www.mozilla.org/en-US/firefox/developer/ Botergos 
<Botergos> [NanoSector] Firefox Developer Edition</title> <meta name="description" content="Firefox Developer Edition is the blazing fast browser that offers cutting edge developer tools and the latest features like CSS Grid support and framework debugging"> <meta property="og:type" content="website"> <meta property="og:site_name" content="Mozilla"> <meta property="og:locale" content="en_US"> <meta property="og:url" content="https

EDIT: It works properly on a test bot though... Very strange :\

NanoSector commented 6 years ago

Oh, the Firefox Developer page has multiple title tags in it. Botergos is likely slower at looping over the response because of the device it runs on, which allows more page data to be downloaded before the match is attempted, so it ends up hitting the second tag.
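
For anyone hitting the same symptom, here is a minimal sketch of how this could happen. It is not the module's actual code: the page content, the two patterns, and the `extract_title` helper are all hypothetical, and it assumes the title is pulled out with a greedy regex over however much of the page has been buffered so far. With a short buffer the greedy pattern only sees one closing tag and looks correct; with a longer buffer it runs on to the last closing tag. A non-greedy pattern stops at the first closing tag in both cases.

```python
import re

# Hypothetical page: a real <title> in the head, plus a second <title>
# further down (for example inside an inline SVG).
PAGE = (
    '<html><head><title>Firefox Developer Edition</title> '
    '<meta name="description" content="A browser for developers"> '
    '</head><body><svg><title>Logo</title></svg></body></html>'
)

# Assumed patterns, for illustration only.
GREEDY = re.compile(r"<title[^>]*>(.*)</title>", re.IGNORECASE | re.DOTALL)
LAZY = re.compile(r"<title[^>]*>(.*?)</title>", re.IGNORECASE | re.DOTALL)


def extract_title(buffer, pattern):
    """Return the captured title text from the buffered HTML, or None."""
    match = pattern.search(buffer)
    return match.group(1).strip() if match else None


# A fast reader stops right after the first closing tag arrives.
short_buffer = PAGE[: PAGE.index("</title>") + len("</title>")]
# A slow reader has already buffered the whole page by the time it matches.
long_buffer = PAGE

print(extract_title(short_buffer, GREEDY))  # clean title
print(extract_title(long_buffer, GREEDY))   # title plus trailing markup
print(extract_title(long_buffer, LAZY))     # clean title
```

If the real pattern turns out to be greedy, switching the capture group to `.*?` (or stopping the download at the first closing tag) would make the result independent of how much data was buffered.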