nsalomonis / altanalyze

AltAnalyze is a multi-functional and easy-to-use software package for automated single-cell and bulk gene and splicing analyses. Easy-to-use precompiled graphical user-interface versions available from our website.
http://www.altanalyze.org
Apache License 2.0
99 stars 30 forks source link

Reduce database size #10

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Species databases are particularly large for human, mouse and rat. To increase 
usability for systems with minimal hard-drive space, we can collapse 
junction/exon/reciprocal-junction annotation flat files into combined entries. 
To do this, we will need to combine unique IDs (e.g., exon/junction/junction 
pairs) into a single entry where the values are the same. This will eliminate 
the subfolders "exon" and "junction" for RNASeq and junction array databases 
and reduce database size by an estimated 20-40%.

Original issue reported on code.google.com by nsalomo...@gmail.com on 21 Mar 2011 at 6:40