chartgerink / 2017ctgov

Creative Commons Zero v1.0 Universal
0 stars 0 forks source link

Parse data from ctgov #3

Open chartgerink opened 7 years ago

chartgerink commented 7 years ago

In the data/ folder there are raw xml files downloaded from ClinicalTrials.gov. When parsing works, the latest results can be downloaded in full here. This only returns Completed | Studies With Results | Interventional Studies.

chartgerink commented 7 years ago

This is the information parsed from each trial

  nct_id <- z$id_info$nct_id
  study_design <- z$study_design
  randomized <- grepl(study_design, pattern = 'randomized', ignore.case = TRUE)
  start_date <- z$start_date
  completion_date <- z$completion_date
  overall_status <- z$overall_status
  trial_condition <- z$condition
  trial_phase <- z$phase
  oversight_authority <- z$oversight$authority
  trial_source <- z$source
  fda_regulated <- z$is_fda_regulated
  number_of_arms <- z$number_of_arms
  intervention_type <- z$intervention$intervention_type

Plus the baseline measurements.

chartgerink commented 7 years ago

Completed in commit 3223ffe4ea5bfb82f639688ae6fa16a2d4947c67

Minor changes might be necessary still though.