Open anitacaron opened 1 year ago
Here is some ancient code I wrote:
https://github.com/VirtualFlyBrain/vfb-pipeline-dumps/blob/master/scripts/obographs-solr.py
Something that's maybe irrelevant: https://github.com/VirtualFlyBrain/vfb-solr
Be very careful with this, don't just copy blindly: https://github.com/VirtualFlyBrain/Docker-VFB-SOLR/blob/master/loadOLS.sh
Use sssom CLI to turn mapping sets to JSON
This also needs a script to flat the data. The decision would be to add all mapping_set information to each of its mappings, but there are the same attributes' names in both levels. Discussing with Nico, he said that the policy (not official) is to prioritise the field at the mapping level. And only take the mapping_set field if it's not available in the mapping. However, this is still in discussion on a case by case.
The list of terms present in mapping set and mapping: TBD