evolvingweb / sitediff

SiteDiff makes it easy to see differences between two versions of a website.
http://sitediff.io
GNU General Public License v2.0

Drupal: Paginated views are not crawled #190

Open ProCycleDev opened 5 months ago

ProCycleDev commented 5 months ago

If a view in Drupal has multiple pages, it seems that only the first page is crawled and the rest are ignored. Is this by design? Is there a setting to enable crawling these pages? A large portion of my site is missing because the crawler ignores all links of the form /some/url/view?page=1

I tried adding .*\?page\=\d* to the include, but that had no effect.

kirk-brown-ew commented 5 months ago

SiteDiff ignores all query parameters and fragments. Supporting them is a feature we'd like to include in the future.
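
To illustrate why an include pattern cannot help here, below is a minimal sketch in Python. It is not SiteDiff's actual code (SiteDiff is written in Ruby); the helper name `strip_query_and_fragment` and the sample links are hypothetical. It assumes the crawler drops the query string and fragment from each link before applying any include pattern, which is one plausible reading of the behavior described above.

```python
import re
from urllib.parse import urlsplit, urlunsplit


def strip_query_and_fragment(url: str) -> str:
    """Hypothetical normalization: keep scheme, host and path only."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


# The include pattern from the report above.
INCLUDE = re.compile(r".*\?page\=\d*")

# Sample links as a paginated Drupal view might emit them (illustrative).
links = [
    "/some/url/view",
    "/some/url/view?page=1",
    "/some/url/view?page=2#top",
]

# After normalization every pager link collapses to the same path.
normalized = [strip_query_and_fragment(u) for u in links]
unique_paths = sorted(set(normalized))

# The pattern matches the raw links, but never the normalized ones.
matches_before = [bool(INCLUDE.search(u)) for u in links]
matches_after = [bool(INCLUDE.search(u)) for u in normalized]

print(unique_paths)
print(matches_before)
print(matches_after)
```

Under that assumption, all three links reduce to a single path, so only one page is crawled, and the pattern has nothing left to match because `?page=` is gone before it is applied.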