Open tlambert03 opened 2 years ago
Hey Talley, thanks for looking into our data!
There's some good stuff in Organelles.tsx. I wonder if you might consider an api/organelles.json file to facilitate programmatic access to the table at https://openorganelle.janelia.org/organelles?
As you note, putting all that data inOrganelles.tsx
is... less than ideal, to put it mildly. Totally agree that this info should be persisted externally, e.g. in this repo (which is a pseudo-API) or as an endpoint for a real API.
I know that all of that info is repeated inside of each manifest.sources, but is there a better place that one could go to generally see what organelles and labels are available "somewhere" in the datasets and what each key means?
At the moment no, but this will be fixed when I complete the work of this branch: https://github.com/janelia-cosem/fibsem-metadata/tree/fastapi
The plan is to put all this metadata in a postgresql database (making it queryable) and expose that with fastapi (making it documented and discoverable). Openorganelle will then use this API to discover and sort datasets.
Your second concern would be addressed even more comprehensively by expanding the metadata for individual volumes, e.g. giving each volume within a dataset its own description
field, which would basically give some human-comprehensible explanation for a name like er
.
Thanks again for the feedback! This is super helpful, and coming at the perfect time :)
oooh ❤️ fantastic. happy to wait for that stuff. looks like an ideal long term solution.
(side note, did you happen to see https://sqlmodel.tiangolo.com/ ? ... I've been wanting to redo some older database projects after seeing how lovely that is 😄 )
ohhh wow I had not seen that... looks like my googling "pydantic + sql" didn't uncover it. I will definitely look into it. This is my first time doing anything with a database so there's a lot of learning in involved, but it's long overdue :)
Hey @d-v-b and @avweigel! Finally started digging into this dataset... just amazing. Thank you all so much for making this heroic effort so easily available! Also appreciate the ease of loading into xarray with metadata from fibsem-tools 👍
I found myself digging into
openorganelle
source code to find a bit of info, and I wonder if you'd consider putting some of that metadata in this repo, for ease of querying (outside of OpenOrganelle).There's some good stuff in Organelles.tsx. I wonder if you might consider an
api/organelles.json
file to facilitate programmatic access to the table at https://openorganelle.janelia.org/organelles? Something like:I found myself digging a bit for a general key of the sources names, i.e something like:
I know that all of that info is repeated inside of each
manifest.sources
, but is there a better place that one could go to generally see what organelles and labels are available "somewhere" in the datasets and what each key means?thanks again!