OstfriesenBI / PredmiRNA

A set of scripts and tools to train a classifier for pre-miRNA Recognition

Feature calculation: GC Content #10

Closed Finesim97 closed 5 years ago

Finesim97 commented 5 years ago

Needed: an R function. Input: a CSV file with the sequences:

"comment","sequence","realmiRNA"
"mmu-mir-380 MI0000797 Mus musculus miR-380 stem-loop","AAGAUG",1
"mmu-mir-381 MI0000798 Mus musculus miR-381 stem-loop","AAUUC",1

Output: csv file with the sequence identifier and the GC content

"comment","gccont"
"mmu-mir-380 MI0000797 Mus musculus miR-380 stem-loop",0.3333
"mmu-mir-381 MI0000798 Mus musculus miR-381 stem-loop",0.2

This function already does part of the work; only the seqinr package needs to be installed.
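For reference, the input/output contract described above could be sketched in base R roughly as follows. This is only an illustration, not the code that ended up in the repository: the names `gc_content` and `write_gc_content` are hypothetical, it avoids seqinr on purpose so that it runs without extra packages, and rounding to four decimals is an assumption taken from the `0.3333` in the example output.

```r
# Hypothetical helper: fraction of G and C characters in one sequence string.
# Works for RNA (A/C/G/U) and DNA alike, since only G and C are counted.
gc_content <- function(sequence) {
  bases <- strsplit(toupper(sequence), "", fixed = TRUE)[[1]]
  if (length(bases) == 0) return(NA_real_)  # empty sequence: GC content undefined
  sum(bases %in% c("G", "C")) / length(bases)
}

# Hypothetical wrapper: read the input CSV ("comment","sequence","realmiRNA")
# and write a CSV with the columns "comment","gccont".
write_gc_content <- function(input_csv, output_csv) {
  seqs <- read.csv(input_csv, stringsAsFactors = FALSE)
  gc <- vapply(seqs$sequence, gc_content, numeric(1), USE.NAMES = FALSE)
  result <- data.frame(
    comment = seqs$comment,
    gccont = round(gc, 4),  # assumption: four decimals, as in the example output
    stringsAsFactors = FALSE
  )
  write.csv(result, output_csv, row.names = FALSE)
  invisible(result)
}
```

Called as `write_gc_content("sequences.csv", "gccont.csv")` (both file names are placeholders), this should give 0.3333 for "AAGAUG" (2 of 6 bases are G) and 0.2 for "AAUUC" (1 of 5 bases is C).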

mariusrueve commented 5 years ago

The result can be found in dc7c485.