eQTL-Catalogue / eQTL-SumStats

eQTL Catalogue Summary Statistics
3 stars 1 forks source link

investigate the use of far fewer HDF5 files with more indices #14

Closed jdhayhurst closed 3 years ago

jdhayhurst commented 4 years ago

fewer files with more indices will make the files more portable and will likely improve querying. For instance, if we combine all studies in one, and only separate on quantification method and chromosome, data retrieval should be far faster. The cost is then the time to index like this. It would be necessary to maintain study level files, so that when new studies are added, we can regenerate the file each time because appending will not an option.