Open AngharadGreen opened 3 days ago
Explore the different tables from the extract to identify which ones contain the relevant data and matching concept table to tables that contain the relevant microbiology data
First look at the following tables to identify microbiology data:
Concept
Condition_Occurence
Specimen
Observation
Measurement
Visit_Occurrence
Provider
I read in these four tables as data frames into R:
library(arrow)
concept_df <- read_parquet("concept.parquet")
condition_df <- read_parquet("CONDITION_OCCURRENCE.parquet")
specimen_df <- read_parquet("SPECIMEN.parquet")
observation_df <- read_parquet("OBSERVATION.parquet")
measurement_df <- read_parquet("MEASUREMENT.parquet")
visit_occurence_df <- read_parquet("VISIT_OCCURRENCE.parquet")
provider_df<- read_parquet("VISIT_OCCURRENCE.parquet")
Looking at structure of the concept
table - when filtering the concept_name
column based on the search term microbiology, this column contains useful information about corresponds to the result of a microbiology lab result and each concept_name
has an associated concept_id
and a concept_code
. There is also a domain_id
column that indicates the OMOP table this information would be linked to.
Currently filtering information from the OMOP data frames based on concept IDs related to Microbiology - using E.coli as an example to understand how microbiology lab test results are stored across the different OMOP tables
Here I want to:
concept
table to tables that contain the relevant microbiology dataconcept_relationship
table matches to relevant tables