BIO-DIKU / SeqScan

Pattern matching in biological sequences
GNU General Public License v2.0
4 stars 0 forks source link

README refers to a nonexistent DOT rule #113

Open selkovjr opened 7 years ago

selkovjr commented 7 years ago

It does so in this tetraloop example:

seqscan -p 'p1=4..6 .{4} ~p1' -o hits.tsv input.fna

The only rules involving DOT are

INT DOT DOT DOT INT INT DOT DOT INT

maasha commented 7 years ago

Yeah, I am afraid that was never implemented. But IIRC N{4} and 4...4 should work.

RasmusFonseca commented 7 years ago

I'm not sure why we'd need more than one wildcard notation. Is it ok that I change the tetraloop example to p1=4..6 4..4 ~p1 for now?

maasha commented 7 years ago

The .. and ... notation is a leftover from the original scanformatches. The new notation is a bit more versatile. I don't see a problem in supporting both.