divonlan / genozip

A modern compressor for genomic files (FASTQ, SAM/BAM/CRAM, VCF, FASTA, GFF/GTF/GVF, 23andMe...), up to 5x better than gzip and faster too
Other
159 stars 12 forks source link

binding multiple files defined in file #10

Closed joshuamschmidt closed 3 years ago

joshuamschmidt commented 3 years ago

Hi @divonlan.

Currently genozip can bind all files in current directory using * wildcard. It would be useful to be able to specify paths of files for binding - either piped from cmd or stored in a file cf tar:

tar -cvf my_bams.tar -T my_bams.txt

This would be a handy feature when wanting to bind files that are located within myriad subdirectories.

divonlan commented 3 years ago

Thanks @joshuamschmidt for the excellent suggestion. I have added it to my list.

In the mean time, as a work around, you can compress the files individually, and then tar together the compressed files. Something like this might work:

for a in `find .`; do if [ -f $a ] ; then genozip $a ;fi; done

tar cvf mytar.tar `find . -name *.genozip`

divonlan commented 3 years ago

@joshuamschmidt this feature is now released. Please see: https://genozip.com/archiving.html