datalad / datalad-catalog

Create a user-friendly data catalog from structured metadata
https://datalad-catalog.netlify.app
MIT License
15 stars 12 forks source link

Support datalad-based generation and deployment scenario #314

Open mih opened 1 year ago

mih commented 1 year ago

Having gone through https://github.com/datalad/datalad-catalog/issues/311 I want to make a suggestion for a (possibly already possible) workflow that would avoid the repeated specification of essential setup properties (like --catalog-dir), and align the handling better with datalad tooling and features.

I think the task of maintaining a catalog is rather similar to the task of maintaining a software package repository (see https://datalad-debian.readthedocs.io). In a datalad context we could have the following setup:

catalog_maint/ [ds]
    www/ [ds]
    superds/ [ds]
         (records/)
         subds/ [ds]
            (records/)

where

So conceptually we have a constellation of 3 datasets:

a maintenance workflow would be

within the scope of (A), all configuration for that specific catalog could be captured and stored, avoiding any per-call necessity to redeclare constants

jsheunis commented 10 months ago

Sample structure: https://github.com/psychoinformatics-de/abcdj-catalog/tree/6ac29c8f0e0cb9a10eb094a7d9b7c95a3daa6520

Consider structuring this as:

Notes: