Closed Oufattole closed 3 weeks ago
The changes introduce a new markdown template for datasets within the src/MEDS_DEV/templates/dataset.md
file. This template is structured to include sections for the dataset name, description, supported tasks, resources, access details, and a compliance checklist. The template aims to standardize the documentation process for datasets in the MEDS-DEV project.
File Path | Change Summary |
---|---|
src/MEDS_DEV/templates/dataset.md | Added a new markdown template for datasets, including sections for name, description, supported tasks, resources, access, and a compliance checklist. |
src/MEDS_DEV/datasets/README.md
file clarify the dataset contribution workflow, which is related to the new dataset template introduced in the main PR, as both aim to enhance the documentation and guidance for dataset contributions.🐇 In the meadow, templates bloom,
For datasets, they chase away gloom.
With sections neat and checklists bright,
Documentation shines, a pure delight!
Hoppy changes, let’s all cheer,
For structured data, we hold dear! 🌼
Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?
I like this, but chat with @prockenschaub and @rvandewater -- they've added similar dataset templating information in #27, @Oufattole
@Oufattole do you prefer to take over our changes as you see fit?
@prockenschaub @rvandewater I tried to merge the two versions for the dataset template (sorry this turned out to be concurrent!) and included suggestions from @Simonlee711 and coderabbit. Let me know if you have any thoughts!
@Jeanselme maybe we should also think about and request more information on any fairness-related considerations for the dataset (e.g. which groups are possible, request their definitions, etc.)
Also cc: @Oufattole @mmcdermott
Looks good @kamilest
Thoughts on the dataset template?
We should add that the user is expected to commit in the pull request:
datasets/${dataset_name}/predicates.yaml
datasets/${dataset_name}/README.md
Summary by CodeRabbit