etalab / transport-validator

GTFS validator
https://transport.data.gouv.fr/validation/
MIT License
37 stars 10 forks source link

Raise an error when files are in a subfolder #160

Closed thbar closed 10 months ago

thbar commented 1 year ago

While doing a pass of data quality on https://transport.data.gouv.fr, I noticed a number of GTFS are considered valid despite containing a subfolder:

I have opened this issue to discuss this at the spec level:

Which raised a return so far:

We get lots of support requests for exactly this topic in the OpenTripPlanner chat room. I would welcome the spec being more clear on this.

The crate used to read GTFS in the validator is smoothly handling that, but other tools may not etc, causing them trouble to ingest the data.

I would personally be happy to consider a file invalid on https://transport.data.gouv.fr if there is a subfolder, since it complicates the ingestion.

Given how the crate works, would it be complicated to return an error in that case (i.e. detect the fact that while data is valid, it wasn't stored at the root)?

To be discussed with the team (@fchabouis @AntoineAugusti), just raising the point with concrete data.

antoine-de commented 1 year ago

hum, it seems quite common for a GTFS to be nested, so not sure about this :thinking:

Maybe we can just raise a warning for this though ?

what are your thoughts on this @Tristramg ?

Tristramg commented 1 year ago

The spec might be more explicit here.

We used to be quite tolerant in order to start getting people to publish data, but that era is now over. There seems to be a consensus towards forbidding the sub-folders.

I suggest to wait until it get really into the spec. In the meantime, a warning should be good

On Wed, 7 Jun 2023 at 15:46, Antoine D @.***> wrote:

hum, it seems quite common for a GTFS to be nested, so not sure about this 🤔

Maybe we can just raise a warning for this though ?

what are your thoughts on this @Tristramg https://github.com/Tristramg ?

— Reply to this email directly, view it on GitHub https://github.com/etalab/transport-validator/issues/160#issuecomment-1580858926, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAAC7SZ3J53MMHZYYN4MAYLXKCA4JANCNFSM6AAAAAAYB75SYE . You are receiving this because you were mentioned.Message ID: @.***>

thbar commented 1 year ago

Quick update:

The spec is now more explicit:

The MobilityData validator has implemented that as an ERROR:

AntoineAugusti commented 10 months ago

Hi @antoine-de @Tristramg, would it be possible to add an error in the coming weeks with this rule?

antoine-de commented 10 months ago

yep, sure completly forgotten about this 👍