Closed HenrikBengtsson closed 9 years ago
Another reference is http://gatkforums.broadinstitute.org/discussion/1601/how-can-i-prepare-a-fasta-file-to-use-as-reference, which says: "a text file with one record per line for each of the fasta contigs. Each record is of the: contig, size, location, basesPerLine, bytesPerLine"
Note: *.fai files only work with non-compressed FASTA files.
Add a FastaReferenceIndexFile class for *.fai FASTA index files. They are short tabular text files. I don't know of a formal reference for the file format, but the columns appear to be (the column names are mine):
- sequence: the name of the sequence
- length: the length of the sequence
- fileOffset: the offset of the first base in the FASTA file
- lengthPerEntry: the number of bases in each FASTA line
- bytesPerEntry: the number of bytes in each FASTA line
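
To make the five columns concrete, here is a minimal Python sketch of reading such an index and using it to locate a base in the FASTA file. It is not the proposed FastaReferenceIndexFile class. The names `FaiRecord`, `parse_fai` and `base_offset` are hypothetical, the sample index content is made up for illustration, and the sketch assumes the fields are tab-separated, that fileOffset is counted in bytes, and that bytesPerEntry includes the line terminator.

```python
from typing import Dict, NamedTuple


class FaiRecord(NamedTuple):
    """One line of a *.fai index (field names are illustrative)."""
    sequence: str          # name of the sequence
    length: int            # number of bases in the sequence
    file_offset: int       # assumed: byte offset of the first base in the FASTA file
    length_per_entry: int  # number of bases in each FASTA line
    bytes_per_entry: int   # assumed: bytes in each FASTA line, line terminator included


def parse_fai(text: str) -> Dict[str, FaiRecord]:
    """Parse the content of a *.fai file into records keyed by sequence name."""
    records: Dict[str, FaiRecord] = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        fields = line.split("\t")
        name = fields[0]
        length, offset, bases, nbytes = (int(x) for x in fields[1:5])
        records[name] = FaiRecord(name, length, offset, bases, nbytes)
    return records


def base_offset(rec: FaiRecord, pos: int) -> int:
    """Return the byte offset in the FASTA file of the 0-based base `pos`."""
    if not 0 <= pos < rec.length:
        raise IndexError(f"position {pos} is outside sequence {rec.sequence!r}")
    # Full lines before the base cost bytes_per_entry each; the remainder
    # is the position of the base within its own line.
    full_lines, within_line = divmod(pos, rec.length_per_entry)
    return rec.file_offset + full_lines * rec.bytes_per_entry + within_line


# Made-up index for a FASTA file with two sequences, 60 bases per line
# and a one-byte line terminator.
SAMPLE_FAI = "chr1\t100\t6\t60\t61\nchr2\t50\t114\t60\t61\n"
index = parse_fai(SAMPLE_FAI)
```

With an index like this, a reader can seek directly to `base_offset(index["chr1"], pos)` in the uncompressed FASTA file instead of scanning it from the start, which is also why the offsets stop being meaningful once the file is compressed.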