datopian / datahub

🌀 Rapidly build rich data portals using a modern frontend framework
https://datahub.io/opensource
MIT License
2.19k stars 326 forks source link

Not recognizing project as a dataset project if add datapackage after initial project create #1196

Open rufuspollock opened 3 months ago

rufuspollock commented 3 months ago

Bug description

[!warning] 🚩 2024-06-20 I've updated the repo names for the broken and working projects so working project is at the nice url. Broken is now ttps://datahub.io/@rufuspollock/jaan-tallinn-donations-broken. I've updated text below.

i've just published https://datahub.io/@rufuspollock/jaan-tallinn-donations-broken from https://github.com/rufuspollock/jaan-tallinn-donations - it should show up as a dataset project given i have a datapackage.yaml but that is not happening. Here's the result.

image

I've now just created the site again new and it works 🎉 https://datahub.io/@rufuspollock/jaan-tallinn-donations

image

Debugging

OK, so i think the source here are the steps by which i created the repo. Steps were something like:

So my guess here is that when it got initially published with just README and frontmatter it wasn't "seen" as a dataset project. And then even when the datapackage.yaml were added it didn't change the "type" of the project in the database (which is a 🐛)

Thoughts

olayway commented 3 months ago

This is happening because there was no changes made to the corresponding README.md and so the sync workflow skipped it and its datapackage file wasn't processed. It will also happen if you make some changes to the datapackage that was previously correctly parsed if you don't make any changes to the README.md file (= datapackage changes won't go live). It's a bug of course.

rufuspollock commented 3 months ago

@olayway great analysis.

IMO it's low priority as quite a rare bug so changing status.