monarch-initiative / monarch-ingest

Data ingest application for Monarch Initiative knowledge graph using Koza
https://monarchinitiative.org
14 stars 1 forks source link

Review and Complete Mapping of All Namespaces that We Ingest #305

Closed putmantime closed 1 year ago

putmantime commented 2 years ago
kevinschaper commented 2 years ago
q -t "select subject from monarch-kg-dangling-edges.tsv union select object from monarch-kg-dangling-edges.tsv" | grep ":" | cut -d':' -f 1 | sort | uniq -c | sort -rn
146812 AspGD
77679 UniProtKB
38787 CLO
37992 OBO
35357 http
35086 ZP
22961 UMLS
22170 GO
21396 MONDO
20533 MGI
20406 ZFIN
20063 XPO
19288 ENSEMBL
17005 HGNC
15884 RGD
14974 NCBITaxon
11456 FB
10707 WB
9973 DOID
9796 dictyBase
9312 MESH
8888 OMIM
8641 HP
7427 NCIT
5802 FYPO
5002 PomBase
4867 SGD
4275 EMAPA
4078 UBERON
4032 POMBASE
3678 FBbt
2838 EFO
2610 PW
2448 CL
2418 WBbt
2376 SO
2080 REACT
1943 ECO
1917 NCBIGene
1903 PATO
1862 MP
1610 ZFA
1435 MEDDRA
1054 CHEBI
 966 WBPhenotype
 676 XCO
 653 RO
 604 PR
 578 Xenbase
 445 OBI
 400 ENVO
 365 NBO
 288 SNOMED
 283 GENO
 273 PO
 233 IAO
 230 FBcv
 171 ECTO
 165 FOODON
 148 BSPO
 115 SEPIO
 100 dct
  77 foaf
  72 MAXO
  71 https
  65 PCO
  65 DDANAT
  60 FMA
  58 BFO
  56 UBERON_CORE
  50 ORPHA
  47 DECIPHER
  33 FAO
  25 faldo
  20 OIO
  19 GOP
  16 dc
  16 HsapDv
  13 dctypes
  11 OMIMPS
  10 PMID
   8 xsd
   7 pav
   6 rdfs
   5 prov
   5 owl
   5 SCTID
   4 SIO
   4 Orphanet
   4 ORCID
   4 GOREL
   3 schema
   3 rdf
   3 STATO
   3 OGMS
   3 ExO
   2 UPHENO
   2 IDO
   1 skos
   1 doi
   1 TAXRANK
   1 EDAM-DATA
   1 DOI
   1 !/hgnc_id/HGNC
putmantime commented 1 year ago

This analysis needs to be redone on an ingest by ingest basis.