kheyer / Genomic-ULMFiT

ULMFiT for Genomic Sequence Data

Tips on using model for Regression #6

Open sn88798 opened 4 years ago

sn88798 commented 4 years ago

Hi, I learned about your model through the fast.ai forum. I am interested in using your genomic model for a Kaggle competition: https://www.kaggle.com/c/stanford-covid-vaccine/overview

As the competition only lasts 3 weeks, I need some tips:

1) For RNA sequences of about 100 bases, with letters A, G, C, U, how do I generate a vocab?

2) There are some loop types that need to be predicted:
   S: paired "Stem"
   M: Multiloop
   I: Internal loop
   B: Bulge
   H: Hairpin loop
   E: dangling End
   X: eXternal loop
   It is not clear how to use the model to predict the loop types.
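To make the vocab question concrete, here is a minimal sketch of what such a vocab could look like, assuming a simple k-mer tokenization over the four RNA letters. The function names (`build_kmer_vocab`, `tokenize`, `numericalize`), the special tokens, and the label mapping are hypothetical and written in plain Python; they are not taken from the Genomic-ULMFiT code.

```python
from itertools import product

def build_kmer_vocab(alphabet="AGCU", k=1, specials=("<pad>", "<unk>")):
    """Enumerate every k-mer over the alphabet and assign integer ids.

    The special tokens are an assumption; a real pipeline may use others.
    """
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    itos = list(specials) + kmers              # id -> token
    stoi = {tok: i for i, tok in enumerate(itos)}  # token -> id
    return itos, stoi

def tokenize(seq, k=1, stride=1):
    """Split a sequence into k-mers, moving `stride` bases at a time."""
    return [seq[i:i + k] for i in range(0, len(seq) - k + 1, stride)]

def numericalize(tokens, stoi, unk="<unk>"):
    """Map tokens to ids, falling back to the unknown token."""
    return [stoi.get(t, stoi[unk]) for t in tokens]

# Hypothetical label mapping for the seven loop types listed above.
LOOP_TYPES = "SMIBHEX"
loop_stoi = {c: i for i, c in enumerate(LOOP_TYPES)}

itos, stoi = build_kmer_vocab(k=1)
seq = "GGAAAUCC"  # made-up example sequence
ids = numericalize(tokenize(seq, k=1), stoi)
print(itos)
print(ids)
```

With `k=1` every base is its own token, so the token ids line up one-to-one with per-position annotations such as loop types. Larger `k` gives a bigger vocab (4^k k-mers) but breaks that one-to-one alignment unless the stride and labels are adjusted accordingly.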

cheers