vertesy / TheCorvinas

Various Scripts and Summaries
http://vertesy.github.io/TheCorvinas
2 stars 0 forks source link

Fixing incorrect gene names in the dog transcriptome using regular expressions and in-place editing #26

Closed vertesy closed 3 years ago

vertesy commented 8 years ago

Use SED to in-place change gene names in 2 steps

Incorrect name

Example:

>rna17099 gene=TMEM63C CDS=299-2713

Remove prefix, keep >

>rna.+\ gene=

result:

>TMEM63C CDS=299-2713

Remove suffix

CDS=.*

result:

gene=TMEM63C