Open turbomam opened 3 months ago
looks like I had probably written some yq for listing the slots in a schema file
wget https://raw.githubusercontent.com/GenomicsStandardsConsortium/mixs/v6.2.0/src/mixs/schema/mixs.yam
yq e '.slots | keys' mixs.yaml | sed 's/^- //' | sort > mixs.6.2.slots.txt
also did some analysis of used mixs slots. might have obtained used-73-mixs-slots.csv
from a SPARQL query?
awk -F',' 'NR>1 {print $2}' used-73-mixs-slots.csv | sort > used-73-mixs-slot-names.txt
and obtained the sizes of files like nmdc_mga0rre721_centrifuge_classification.tsv
from perlmutter.nersc.gov:/global/cfs/cdirs/m3408/results/nmdc:mga0rre721/ReadbasedAnalysis