yarikoptic closed this issue 6 years ago
Triples would also solve this. In general, such a task is not very rewarding. It is an attempt to bring order to a world that doesn't want to be structured.
I think our search should try to make this a non-issue for consumers, without having to hand-tweak each dataset.
FTR: in #1630 we push users towards unified metadata keys, but there is complete freedom for tags/keywords. At the same time, the new search is pretty powerful, so we likely don't care.
Three months in, no great ideas have emerged. With the new parser in #1630 the complexity of the situation goes up, if anything. I think focusing on the beautification of the stored data is a futile exercise -- unless constrained to a very specific domain or use case. But if the data source we are pulling from is a complicated mess, we would have to spend a disproportionate amount of energy to fix things. I am confident that our new search implementation is better suited to dealing with messy data.
ATM it is quite wild:
I believe at some point we were talking about templating dataset descriptors so people could easily start composing them. I wondered whether we should maintain some kind of list of suggested keywords. Someone (could be a nice student project) could come up with a suggested list of keywords based on the dataset at hand (e.g. checking if it is BIDS, if it has func, etc.).
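
To make the keyword-suggestion idea a bit more concrete, here is a minimal sketch of what such a helper might look like. Everything in it is an assumption for illustration: the function name `suggest_keywords`, the `DATATYPE_KEYWORDS` mapping, and the keyword strings are hypothetical and not part of any existing code. It only relies on two BIDS layout conventions: a `dataset_description.json` file at the dataset root, and datatype directories such as `func` or `anat` under `sub-*` (optionally below a `ses-*` level).

```python
from pathlib import Path

# Hypothetical mapping from BIDS datatype directory names to suggested keywords.
# The keyword strings are illustrative only, not an agreed vocabulary.
DATATYPE_KEYWORDS = {
    "anat": "anatomical MRI",
    "dwi": "diffusion MRI",
    "func": "functional MRI",
}


def suggest_keywords(dataset_root):
    """Return a list of suggested keywords based on a shallow look at the layout."""
    root = Path(dataset_root)
    keywords = []

    # A BIDS dataset carries dataset_description.json at its root.
    if (root / "dataset_description.json").is_file():
        keywords.append("BIDS")

    # Look for datatype directories directly under subjects, or under sessions.
    found = set()
    for pattern in ("sub-*/*", "sub-*/ses-*/*"):
        for path in root.glob(pattern):
            if path.is_dir() and path.name in DATATYPE_KEYWORDS:
                found.add(path.name)

    # Sort for a stable, reproducible suggestion order.
    keywords.extend(DATATYPE_KEYWORDS[name] for name in sorted(found))
    return keywords
```

Calling `suggest_keywords("/path/to/dataset")` would return suggestions only; the idea would be to offer them to whoever composes the descriptor rather than to write them into metadata automatically.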