mdshw5 / fastqp

Simple FASTQ quality assessment using Python
https://pypi.python.org/pypi/fastqp
MIT License

Match a list of known adapter sequences #11

Closed mdshw5 closed 9 years ago

mdshw5 commented 9 years ago

A fast strategy would be to break the adapter sequences into k-mers, then flag any previously counted k-mers that match one of them as "possible adapter" k-mers.
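A rough sketch of that idea, not the fastqp implementation. The function names, the value of `k`, and the adapter and read sequences are all made up for illustration; it assumes the k-mer counts are already available as a `Counter` keyed by k-mer string.

```python
from collections import Counter


def kmers(seq, k):
    """Yield every overlapping k-mer in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_adapter_index(adapters, k):
    """Map each adapter k-mer to the set of adapter names that contain it."""
    index = {}
    for name, seq in adapters.items():
        for kmer in kmers(seq.upper(), k):
            index.setdefault(kmer, set()).add(name)
    return index


def classify_counted_kmers(kmer_counts, adapter_index):
    """Sum the counts of already-counted k-mers that match an adapter k-mer."""
    hits = Counter()
    for kmer, count in kmer_counts.items():
        for name in adapter_index.get(kmer, ()):
            hits[name] += count
    return hits


# Placeholder data: a made-up sequence, NOT a real adapter.
adapters = {"placeholder_adapter": "ACGTTGCAAGCT"}
k = 5
reads = ["GGGGACGTTGCA", "TTTTTTTTTT"]

# Stand-in for the k-mer counts collected while scanning the FASTQ file.
kmer_counts = Counter(km for read in reads for km in kmers(read, k))

adapter_index = build_adapter_index(adapters, k)
hits = classify_counted_kmers(kmer_counts, adapter_index)
print(hits.most_common())
```

Since the adapter index is built once and each lookup is a dictionary access, the extra cost is one pass over the distinct k-mers already counted, with no need to rescan the reads.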

mdshw5 commented 9 years ago

The list from Trimmomatic is probably good. 👎 It's GPLv3, and does not specify whether the adapter sequences are licensed differently. Jeesh.

mdshw5 commented 9 years ago

Illumina's PDF says I can redistribute the sequences with a copyright notice:

Oligonucleotide sequences © 2007-2013 Illumina, Inc. All rights reserved. Derivative works created by Illumina customers are authorized for use with Illumina instruments and products only. All other uses are strictly prohibited.