japonicusdb / japonicus-curation

Data files for JaponicusDB
0 stars 1 forks source link

finding expected missing small genes (example, nce1,and missing list) #32

Open ValWood opened 3 years ago

ValWood commented 3 years ago

Finding expected missing small genes example, nce1 (this should work for most small, reasonably conserved proteins even if we don't have a clue about the location)

Pombe. SPAC12G12.17 /nce1

Screenshot 2021-07-27 at 21 51 24

clearly missing from japonicus

Screenshot 2021-07-27 at 21 53 28

tbalxstN identifies 2 fragments in region:

Screenshot 2021-07-26 at 13 26 55 Screenshot 2021-07-26 at 13 27 27

create CDS and FASTA/adjust until matches orthologs:

CDS - 1322452: 1322658 MW: 6130.2935 MNLISKRLDPVFGLAVGVYAYILYERKQPRPAGRSLRELLARAWLRPSSKPSST 10 20 30 40 50

Screenshot 2021-07-27 at 22 09 51 Screenshot 2021-07-27 at 22 06 19
ValWood commented 3 years ago

These are some of the small missing proteins we need to look for. There are more, but this is some of the major obvious missing ones:

ValWood commented 3 years ago

SJAG_02161 phosphatidylglycerol phospholipase C Pgc1 /SMN complex subunit Gem6 tandem fusion (I can't see a way to split. May be a sequence error)