To help with context, I did more of a dive into current metastasis mappings. Whatever decisions we end up making, I believe it's clear the current state is not consistent.
Initial thoughts: Even if we synchronize these vocabularies into a singular hierarchical form, we're still going to need an additional qualifier on other types of concepts that are indicative of the existence of metastasis if we are going to facilitate comprehensive phenotyping. To that end, one solution would be to also create relationships to those concepts:
have a relation of type "indicates" (or something)
that points to "metastasis".
Anyway...
With the caveat of defining the "source metastasis condition concepts" as :
select concept_id, concept_name, domain_id, vocabulary_id, concept_code, standard_concept
FROM concept
WHERE Concept_id in (
SELECT concept_id_2
FROM concept_relationship cr
WHERE concept_id_1 = 44498002 -- ICDO Neoplasm, metastatic
AND relationship_id = 'Histology of ICDO'
)
union
select concept_id, concept_name, domain_id, vocabulary_id, concept_code, standard_concept
from concept
where vocabulary_id in ('ICD10CM', 'ICD9CM')
and lower(concept_name) like 'secondary%'
and lower(concept_name) like '%neoplasm%'
union
select concept_id, concept_name, domain_id, vocabulary_id, concept_code, standard_concept
from concept
where concept_id in (
select descendant_concept_id
from concept_ancestor ca
where ancestor_concept_id = 432851 -- SNOMED: secondary malignant neoplastic disease
)
1) Counts source concepts, and the corresponding domain mapping, by vocabulary
Takeaway: There is significant inconsistency as to which domain(s) these are mapped to.
vocabulary_id
source_concept_count
mapped_to_condition
mapped_to_measurement
ICD10CM
98
46
52
ICD9CM
43
31
12
ICDO3
229
5
224
Nebraska Lexicon
7
6
0
SNOMED
847
847
0
2) Counts source concepts, and the corresponding vocabulary mapping
Takeaway: There is significant inconsistency as to which vocabularies these are mapped to.
To help with context, I did more of a dive into current metastasis mappings. Whatever decisions we end up making, I believe it's clear the current state is not consistent.
Initial thoughts: Even if we synchronize these vocabularies into a singular hierarchical form, we're still going to need an additional qualifier on other types of concepts that are indicative of the existence of metastasis if we are going to facilitate comprehensive phenotyping. To that end, one solution would be to also create relationships to those concepts:
e.g. All concepts under: https://athena.ohdsi.org/search-terms/terms/1635142
Anyway...
With the caveat of defining the "source metastasis condition concepts" as :
<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns="http://www.w3.org/TR/REC-html40">
1) Counts source concepts, and the corresponding domain mapping, by vocabulary Takeaway: There is significant inconsistency as to which domain(s) these are mapped to.
2) Counts source concepts, and the corresponding vocabulary mapping Takeaway: There is significant inconsistency as to which vocabularies these are mapped to.
Concept level mapping counts attached: mets_source_mapping_compare.csv