Closed MKorostoff closed 5 years ago
So this seems to work better on smaller sites, but larger sites we end up getting timeouts and network issues from fetch. You can see in the recording below that no nodes, forms, or languages are logged, but you can see in the Status Codes
the types of errors that were caught.
https://asciinema.org/a/245625
I added a few catches for errors I was seeing to reduce or handle as many as I can, but we might want to look at using something like Promise Queue instead to limit the number of Promises being executed at once.
Some sitemaps to test:
Improves scrape time by roughly 8x