BIO-DIKU / SeqScan

Pattern matching in biological sequences
GNU General Public License v2.0

Add benchmark protein dataset #49

Closed maasha closed 8 years ago

maasha commented 9 years ago

This file:

http://genome.fungalgenomics.ca/downloadFiles/Aurpu2/Aurpu2p4.representatives.faa.zip

contains protein predictions from the fungus Aureobasidium pullulans.

It is possible to locate one or two hydrophobins with the pattern:

[^C]{25,158} C [^C]{5,9} CC [^C]{4,44} C [^C]{7,23} C [^C]{5,7} CC [^C]{6,18} C [^C]{2,13}
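
For a quick cross-check of the pattern outside SeqScan, here is a minimal Python sketch. It assumes the pattern can be read as an ordinary regular expression once the spaces between its units are removed; that reading is an assumption, not a statement about SeqScan's own pattern semantics. The helper names `read_fasta` and `find_hits` are hypothetical and exist only for this illustration.

```python
import re

# Pattern from this issue; the spaces only separate pattern units,
# so they are removed before compiling (assumed regex-compatible).
PATTERN = ("[^C]{25,158} C [^C]{5,9} CC [^C]{4,44} C [^C]{7,23} "
           "C [^C]{5,7} CC [^C]{6,18} C [^C]{2,13}")
HYDROPHOBIN_RE = re.compile(PATTERN.replace(" ", ""))


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)


def find_hits(lines):
    """Return (header, start, end) for the first match in each sequence."""
    hits = []
    for header, seq in read_fasta(lines):
        match = HYDROPHOBIN_RE.search(seq)
        if match:
            hits.append((header, match.start(), match.end()))
    return hits
```

To try it on the dataset, one could unzip the archive and pass an open handle to the extracted FASTA file to `find_hits`. Only the first match per sequence is reported, which should be enough to check whether a protein is a candidate at all.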

maasha commented 8 years ago

done