survos / survos

Example of a mono repo
MIT License
0 stars 1 forks source link

fix crawler issues #25

Open tacman opened 9 months ago

tacman commented 9 months ago

https://mus.wip/en/crawlerdata

image

Crawler should take an optional --locale argument that limits the pages being searched to only that locale. Otherwise, we're overwhelmed with duplicate links.

_profiler/ links should be excluded for the json -- they should never even get to the link list.

image

There are still pages missing, like https://www.mus.wip/en/mus/countries, which is in the footer. Maybe because of so many langugage and profile links.

tacman commented 7 months ago

Specifically, the DX for crawler should be:

tacman commented 6 months ago

image

Switch User should be excluded, and may be part of the problem above.

image