persistory / browserparrot-issues

Public issues, suggestions, and support for BrowserParrot
https://browserparrot.com/
5 stars 1 forks source link

Duplicate URLs in search results #11

Open iansinnott opened 2 years ago

iansinnott commented 2 years ago

It's possible for URLs to return multiple times. each exact URL should only show up once. if a URL is matched on the URL itself as well as page body those should be unified.

see https://github.com/persistory/browserparrot-issues/discussions/7

nfcampos commented 2 years ago

Another consideration here is what to do with websites whose content changes over time, eg the homepage of a blog, I think it's fine to just let the latest content replace the earlier, as if you were interested in it at the time you probably clicked into a specific link. But other folks may have other opinions

iansinnott commented 2 years ago

Another consideration here is what to do with websites whose content changes over time

Yeah, my thinking here is that overwriting is fine. For historic versions i'd like to add a plugin that will automatically submit URLs you've visited to the internet archive so that you can still access historical versions of the full page if desired. Likely opt-in, since it's a third party service neither the user nor BP control.