Open saumier opened 4 days ago
@saumier This is happening because the october events are now missing in the Capitol.nb website and the Artsdata Pipeline is configured to stop looking for event URLs when it doesn't find any new ones in a page.
This can be solved in 2 ways:
Let me know what you think.
I think crawling the first page was ok in the beginning when we wanted to load data and see what it contained. But now we need a better solution that gets all their pages of events.
Please consider adding a parameter like "offset" to increment the pagination by more than one. In this case the offset would be 12 and the API calls would be https://capitol.nb.ca/en/tickets-events?start=1 followed by https://capitol.nb.ca/en/tickets-events?start=12 and then https://capitol.nb.ca/en/tickets-events?start=24
The workflow for capitol-nb-ca is failing after being updated. The website has improved their JSON-LD. Gregory removed the custom location from the workflow and set the mode "fetch-push".
https://github.com/culturecreates/artsdata-orion/actions/workflows/capitolnb-events.yml