GenomicsStandardsConsortium / mixs

“Minimum Information about any (X) Sequence” (MIxS) specification
https://w3id.org/mixs
Creative Commons Zero v1.0 Universal

Requests for improved performance of LinkML tools in MIxS #629

Open turbomam opened 8 months ago

turbomam commented 8 months ago

MIxS is a large schema. There are hundreds of classes (including the checklist/extension combinations) and about 1000 MIxS terms/LinkML slots. Some of the tools that run in our build process are slow, especially

turbomam commented 8 months ago

@turbomam to provide metrics

cpauvert commented 7 months ago

Hi @turbomam, thanks for the ongoing work on MIxS, really appreciated! I also noticed that validation was slow when using the example (with linkml v1.6.3).

linkml-validate -s mixs.yaml MixsCompliantData-MimsSoil-example.yaml

with compliant example

real    2m21,002s
user    2m20,868s
sys     0m0,112s

with incorrect metadata

I changed https://github.com/GenomicsStandardsConsortium/mixs/blob/18794c4997aeb189d442d965beac00da073e8664/examples/MixsCompliantData-MimsSoil-example.yaml#L5 to an invalid value: "Column 1"

real    2m19,171s
user    2m19,085s
sys     0m0,072s
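
In case it helps with collecting comparable metrics, here is a minimal sketch of how timings like the ones above could be gathered repeatedly and compared. It only uses the Python standard library and treats `linkml-validate` as an opaque external command. The helper names `time_command` and `parse_time_output` are made up for this illustration and are not part of LinkML.

```python
import subprocess
import sys
import time


def time_command(cmd, repeats=3):
    """Run cmd `repeats` times; return (wall-clock seconds per run, last exit code)."""
    timings = []
    returncode = None
    for _ in range(repeats):
        start = time.perf_counter()
        completed = subprocess.run(cmd, capture_output=True, text=True)
        timings.append(time.perf_counter() - start)
        returncode = completed.returncode
    return timings, returncode


def parse_time_output(value):
    """Convert a bash `time` value such as '2m21,002s' to seconds.

    Accepts both ',' and '.' as the decimal separator, since the
    separator depends on the locale of the shell that printed it.
    """
    minutes, seconds = value.rstrip("s").split("m")
    return int(minutes) * 60 + float(seconds.replace(",", "."))


if __name__ == "__main__" and len(sys.argv) > 1:
    # Everything after the script name is the command to time, e.g.
    #   python time_validate.py linkml-validate -s mixs.yaml <data file>
    # (the script name time_validate.py is hypothetical)
    timings, code = time_command(sys.argv[1:])
    for i, seconds in enumerate(timings, start=1):
        print(f"run {i}: {seconds:.3f}s")
    print(f"last exit code: {code}")
```

`parse_time_output` makes it possible to compare previously reported numbers: the two `real` values above differ by less than two seconds, so the valid and invalid inputs take essentially the same time.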

This was tested on Xubuntu 20.04 LTS with an 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz and 32,00 GB of RAM (2x 16GB DDR4

Hope that helps! Best,