IEDB / ONTIE

Ontology for Immune Epitopes

IEDB SRC proteins not in ONTIE #58

Open beckyjackson opened 3 years ago

beckyjackson commented 3 years ago

There are 400 SRC proteins in the IEDB source table that are not in ONTIE. Of these, 189 are inactive or related, so they don't appear anywhere in the protein tree. Of the remaining 211, 73 of them are assigned to parent proteins so their taxon-protein IRIs don't appear in the tree anyway. That leaves 138 SRC proteins that are not in ONTIE (see table below).
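The breakdown above (400 missing, minus 189 inactive or related, minus 73 with parent proteins, leaving 138) can be sketched as a simple filter. This is only an illustration: the `SourceRow` fields (`status`, `parent`) and the function name `funnel` are hypothetical and are not taken from the actual IEDB source table schema or the ONTIE build scripts.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class SourceRow:
    # Hypothetical shape of one row in the IEDB source table.
    accession: str                 # e.g. "SRC528250"
    status: str                    # assumed values: "active", "inactive", "related"
    parent: Optional[str] = None   # accession of an assigned parent protein, if any


def funnel(rows: Iterable[SourceRow], ontie: set[str]) -> dict[str, list[str]]:
    """Split SRC accessions that are absent from ONTIE into three groups."""
    groups: dict[str, list[str]] = {
        "inactive_or_related": [],  # never shown in the protein tree
        "has_parent": [],           # rolled up under a parent protein
        "needs_review": [],         # the remainder that is really missing
    }
    for row in rows:
        if row.accession in ontie:
            continue  # already in ONTIE, nothing to report
        if row.status in ("inactive", "related"):
            groups["inactive_or_related"].append(row.accession)
        elif row.parent is not None:
            groups["has_parent"].append(row.accession)
        else:
            groups["needs_review"].append(row.accession)
    return groups
```

With the counts reported here, the three groups would have sizes 189, 73, and 138, which sum to the 400 SRC proteins not in ONTIE.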

Questions:

| Name | Accession | Org ID | Org Name |
| --- | --- | --- | --- |
| Fibroblast Activation Protein 2 | SRC528250 | 851 | Fusobacterium nucleatum |
| 14 kDa antigen | SRC546476 | 1773 | Mycobacterium tuberculosis |
| cation-transporting P-type ATPase | SRC552198 | 1773 | Mycobacterium tuberculosis |
| thioredoxin family protein | SRC551284 | 2261 | Pyrococcus furiosus |
| DNA translocase | SRC536882 | 5476 | Candida albicans |
| Tryptophan-Threonine-rich plasmodium antigen C terminal | SRC528077 | 5821 | Plasmodium berghei |
| Saposin B domain-containing protein | SRC522283 | 6182 | Schistosoma japonicum |
| micro exon gene 12 | SRC536327 | 6182 | Schistosoma japonicum |
| SJCHGC06871 protein | SRC536328 | 6182 | Schistosoma japonicum |
| cathepsin D (lysosomal aspartyl protease) | SRC536329 | 6182 | Schistosoma japonicum |
| hypothetical protein EWB00_009944 | SRC536330 | 6182 | Schistosoma japonicum |
| micro exon gene 11 | SRC536331 | 6182 | Schistosoma japonicum |
| micro exon gene 14 | SRC536332 | 6182 | Schistosoma japonicum |
| micro exon gene 15 | SRC536333 | 6182 | Schistosoma japonicum |
| micro exon gene 19 | SRC536334 | 6182 | Schistosoma japonicum |
| micro exon gene 22 | SRC536335 | 6182 | Schistosoma japonicum |
| micro exon gene 26.1 | SRC536336 | 6182 | Schistosoma japonicum |
| micro exon gene 26.2 | SRC536337 | 6182 | Schistosoma japonicum |
| micro exon gene 26.4 | SRC536338 | 6182 | Schistosoma japonicum |
| micro exon gene 26.5 | SRC536339 | 6182 | Schistosoma japonicum |
| micro exon gene 26.6 | SRC536340 | 6182 | Schistosoma japonicum |
| micro exon gene 29 | SRC536341 | 6182 | Schistosoma japonicum |
| micro exon gene 4.1 C | SRC536342 | 6182 | Schistosoma japonicum |
| micro exon gene 4.1 N | SRC536343 | 6182 | Schistosoma japonicum |
| micro exon gene 4.2 | SRC536344 | 6182 | Schistosoma japonicum |
| micro exon gene 8.1 C | SRC536345 | 6182 | Schistosoma japonicum |
| micro exon gene 8.2 C | SRC536346 | 6182 | Schistosoma japonicum |
| micro exon gene 8.3 C | SRC536347 | 6182 | Schistosoma japonicum |
| micro exon gene 8.4 | SRC536348 | 6182 | Schistosoma japonicum |
| micro exon gene 9 | SRC536349 | 6182 | Schistosoma japonicum |
| micro exon gene n.1 | SRC536350 | 6182 | Schistosoma japonicum |
| micro exon gene n.2 | SRC536351 | 6182 | Schistosoma japonicum |
| Group XV phospholipase A2 | SRC536352 | 6182 | Schistosoma japonicum |
| palmitoyl-protein thioesterase 1 | SRC536353 | 6182 | Schistosoma japonicum |
| tetraspanin-CD63 receptor isoform 1 | SRC536354 | 6182 | Schistosoma japonicum |
| 25 kDa integral membrane protein isoform 2 | SRC536355 | 6182 | Schistosoma japonicum |
| SJCHGC01839 protein | SRC536356 | 6182 | Schistosoma japonicum |
| phospholipase A1 | SRC536883 | 6182 | Schistosoma japonicum |
| integrin beta-1 | SRC536939 | 9031 | Gallus gallus (chicken) |
| Ptal-N*01:01 | SRC516267 | 9402 | Pteropus alecto (black flying fox) |
| APOBEC-3C | SRC552219 | 9541 | Macaca fascicularis (crab eating macaque) |
| HLA class I A2 | SRC505873 | 9606 | Homo sapiens (human) |
| ribonuclease P 25kDa subunit | SRC517406 | 9606 | Homo sapiens (human) |
| Histone H3 | SRC522013 | 9606 | Homo sapiens (human) |
| collagen alpha-1(IV) chain | SRC522285 | 9606 | Homo sapiens (human) |
| serine/threonine kinase 35 | SRC526348 | 9606 | Homo sapiens (human) |
| tau-tubulin kinase 1 | SRC526349 | 9606 | Homo sapiens (human) |
| Septin-1 | SRC526350 | 9606 | Homo sapiens (human) |
| mannose-1-phosphate guanyltransferase alpha | SRC526351 | 9606 | Homo sapiens (human) |
| basigin (Ok blood group) | SRC526352 | 9606 | Homo sapiens (human) |
| Complement component 4 binding protein, alpha | SRC526353 | 9606 | Homo sapiens (human) |
| 8.2 kDa differentiation factor | SRC526354 | 9606 | Homo sapiens (human) |
| Neuregulin 3 | SRC526355 | 9606 | Homo sapiens (human) |
| TRAF3 interacting protein 3 | SRC526356 | 9606 | Homo sapiens (human) |
| Band 4.1-like protein 2 | SRC526357 | 9606 | Homo sapiens (human) |
| ATP synthase subunit beta, mitochondrial | SRC526358 | 9606 | Homo sapiens (human) |
| Galectin-1 | SRC526359 | 9606 | Homo sapiens (human) |
| fibronectin type III domain-containing protein 1 | SRC528028 | 9606 | Homo sapiens (human) |
| wingless-type MMTV integration site family, member 11 | SRC528029 | 9606 | Homo sapiens (human) |
| apolipoprotein A-I | SRC529349 | 9606 | Homo sapiens (human) |
| PDZ domain-containing protein MAGIX | SRC530329 | 9606 | Homo sapiens (human) |
| high-mobility group nucleosome binding domain 1 | SRC530330 | 9606 | Homo sapiens (human) |
| Melanocyte Protein Pmel 17 | SRC536071 | 9606 | Homo sapiens (human) |
| CEA cell adhesion molecule 18 | SRC541537 | 9606 | Homo sapiens (human) |
| 5'-nucleotidase, cytosolic IIIB | SRC541538 | 9606 | Homo sapiens (human) |
| FAM161 centrosomal protein A | SRC541539 | 9606 | Homo sapiens (human) |
| serine protease 3 | SRC541540 | 9606 | Homo sapiens (human) |
| leukocyte immunoglobulin like receptor B3 | SRC541541 | 9606 | Homo sapiens (human) |
| pentraxin 4 | SRC541542 | 9606 | Homo sapiens (human) |
| N-acetyllactosaminide alpha-1,3-galactosyltransferase | SRC541543 | 9606 | Homo sapiens (human) |
| leukocyte immunoglobulin like receptor A6 | SRC541544 | 9606 | Homo sapiens (human) |
| ATP-dependent RNA helicase DHX33 | SRC541699 | 9606 | Homo sapiens (human) |
| protein YIPF1 | SRC541700 | 9606 | Homo sapiens (human) |
| serine/threonine-protein kinase 32C | SRC541701 | 9606 | Homo sapiens (human) |
| Proline and serine-rich protein 1 | SRC542907 | 9606 | Homo sapiens (human) |
| calcitonin | SRC542908 | 9606 | Homo sapiens (human) |
| usherin | SRC543193 | 9606 | Homo sapiens (human) |
| Golgin subfamily A member 6-like protein 2 | SRC543194 | 9606 | Homo sapiens (human) |
| glutamate receptor 3 | SRC546506 | 9606 | Homo sapiens (human) |
| Calreticulin | SRC547807 | 9606 | Homo sapiens (human) |
| Zinc finger protein 568 | SRC549076 | 9606 | Homo sapiens (human) |
| Sorting nexin 11 | SRC549078 | 9606 | Homo sapiens (human) |
| heterogeneous nuclear ribonucleoprotein M4 | SRC549097 | 9606 | Homo sapiens (human) |
| multidrug resistance-associated protein 9 | SRC549098 | 9606 | Homo sapiens (human) |
| NACHT, LRR and PYD domains-containing protein | SRC549099 | 9606 | Homo sapiens (human) |
| hairy-related transcription factor 3 | SRC549100 | 9606 | Homo sapiens (human) |
| GnTV intron | SRC549105 | 9606 | Homo sapiens (human) |
| Cytoskeleton-associated protein 2 | SRC549757 | 9606 | Homo sapiens (human) |
| solute carrier family 35 member G2 | SRC549760 | 9606 | Homo sapiens (human) |
| zinc finger FYVE-type containing 19 | SRC549763 | 9606 | Homo sapiens (human) |
| Double PHD fingers 1 | SRC549764 | 9606 | Homo sapiens (human) |
| TGFB induced factor homeobox 1 | SRC549765 | 9606 | Homo sapiens (human) |
| G protein subunit alpha i2 | SRC549766 | 9606 | Homo sapiens (human) |
| transforming growth factor, beta receptor II | SRC550094 | 9606 | Homo sapiens (human) |
| TATA box-binding protein-associated factor RNA polymerase I subunit B | SRC550095 | 9606 | Homo sapiens (human) |
| alternative protein ASTE1 | SRC550096 | 9606 | Homo sapiens (human) |
| double homeobox protein 4-like | SRC551377 | 9606 | Homo sapiens (human) |
| Myristoylated alanine-rich C-kinase substrate | SRC551530 | 9606 | Homo sapiens (human) |
| Parathyroid hormone-related protein | SRC551531 | 9606 | Homo sapiens (human) |
| Putative uncharacterized protein GAFA-1 | SRC551532 | 9606 | Homo sapiens (human) |
| Solute carrier family 22 member 9 | SRC551533 | 9606 | Homo sapiens (human) |
| ADAMTS-like protein 1 | SRC551534 | 9606 | Homo sapiens (human) |
| Activin receptor type-2A | SRC551535 | 9606 | Homo sapiens (human) |
| Protein BANP | SRC551536 | 9606 | Homo sapiens (human) |
| Protein asteroid homolog 1 | SRC551537 | 9606 | Homo sapiens (human) |
| SMAD5 antisense gene protein 1 | SRC551538 | 9606 | Homo sapiens (human) |
| Putative uncharacterized protein encoded by LINC00615 | SRC551539 | 9606 | Homo sapiens (human) |
| C1q receptor protein | SRC551541 | 9606 | Homo sapiens (human) |
| Insulin and Receptor-type tyrosine-protein phosphatase-like N | SRC551884 | 9606 | Homo sapiens (human) |
| Receptor-type tyrosine-protein phosphatase-like N | SRC551885 | 9606 | Homo sapiens (human) |
| Neuroendocrine protein 7B2 | SRC551886 | 9606 | Homo sapiens (human) |
| Hepatocyte Nuclear Factor 6 | SRC551887 | 9606 | Homo sapiens (human) |
| RNA exonuclease 2 | SRC551888 | 9606 | Homo sapiens (human) |
| Bone Morphogenetic Protein Receptor Type 1B | SRC557624 | 9606 | Homo sapiens (human) |
| mlrq-like protein | SRC505861 | 10090 | Mus musculus (mouse) |
| spectrin | SRC507716 | 10090 | Mus musculus (mouse) |
| N-myc | SRC507717 | 10090 | Mus musculus (mouse) |
| myelin P2 protein | SRC543542 | 10116 | Rattus norvegicus (brown rat) |
| VACAC2_210 | SRC549096 | 10245 | Vaccinia virus (vaccinia virus VV) |
| polymerase | SRC506611 | 10407 | Hepatitis B virus (hepatitis B virus (HBV)) |
| envelope glycoprotein | SRC499792 | 11642 | Simian foamy virus |
| gag protein | SRC528257 | 11723 | Simian immunodeficiency virus (Chimpanzee immunodeficiency virus) |
| ORF1ab polyprotein | SRC522284 | 28344 | Porcine reproductive and respiratory syndrome virus |
| envelope glycoprotein Q | SRC549211 | 32604 | Human betaherpesvirus 6B (Human herpes virus 6B) |
| unknown protein eluted from bat MHC allele | SRC499793 | 32644 | Unidentified |
| sigma A protein | SRC516038 | 38170 | Avian orthoreovirus (Avian reovirus) |
| apoptosis protein | SRC550050 | 46221 | Porcine circovirus |
| Three-finger toxin 3FTx | SRC520746 | 54390 | Micrurus corallinus (painted coral snake) |
| 2-oxoacid:acceptor oxidoreductase | SRC549212 | 63363 | Aquifex aeolicus |
| PrgI family protein | SRC543255 | 97253 | Eubacterium plexicaudatum |
| response regulator transcription factor | SRC543256 | 97253 | Eubacterium plexicaudatum |
| Exo m 1 | SRC522165 | 345840 | Palaemon modestus |
| Large T antigen | SRC517438 | 1891726 | Human polyomavirus 5 |

jamesaoverton commented 3 years ago

@rvita?

rvita commented 2 years ago

We add new SRC every week, and all of them need to be added to ONTIE. As part of the protein tree recuration task, we have also been editing the term names and synonyms of many existing SRC, so ONTIE will need to be updated with the correct labels and synonyms for any SRC already in ONTIE, and all new SRC will need to be added. This ONTIE activity should wait until we are done, because we have a large set of SRC that are not in use, and we will be deleting those after we confirm why they are not in use. I will update this ticket when we are done.
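
Once the recuration is finished, the update described above amounts to two sets of changes: SRC to add and SRC whose labels need refreshing. A rough sketch follows, assuming both sides can be exported as plain accession-to-label mappings; the function name `plan_sync` and the mapping format are hypothetical, and synonyms could be compared the same way.

```python
from __future__ import annotations


def plan_sync(
    source: dict[str, str], ontie: dict[str, str]
) -> tuple[dict[str, str], dict[str, str]]:
    """Compare accession -> label mappings and return (to_add, to_update).

    `source` is the curated IEDB source table, `ontie` is the current
    ontology; both formats are assumed for illustration only.
    """
    # SRC present in the source table but absent from ONTIE.
    to_add = {acc: label for acc, label in source.items() if acc not in ontie}
    # SRC already in ONTIE whose label no longer matches the source table.
    to_update = {
        acc: label
        for acc, label in source.items()
        if acc in ontie and ontie[acc] != label
    }
    return to_add, to_update
```

Running this only after the unused SRC have been deleted from the source table would keep entries that are about to be removed out of `to_add`.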