biocaddie / prototype_issues

Used to report and track bioCADDIE prototype issues
3 stars 5 forks source link

Clinical trials #266

Open DataMedFeedback opened 7 years ago

DataMedFeedback commented 7 years ago

When I click on Clinical Trials.gov from here https://datamed.org/repository_list.php I am only given 3854 results but there are over 26000 result in clinicaltrials.gov which have results. What are you actually pulling in?

bozyurt commented 7 years ago

Only publicly available datasets are indexed (checking for dataset.available=True in the transformed document before indexing).

Burak

On 05/19/2017 10:46 AM, DataMedFeedback wrote:

When I click on Clinical Trials.gov from here https://datamed.org/repository_list.php I am only given 3854 results but there are over 26000 result in clinicaltrials.gov which have results. What are you actually pulling in?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/biocaddie/prototype_issues/issues/266, or mute the thread https://github.com/notifications/unsubscribe-auth/AHLczNEjw4NoX4IWDr63V7i-JYryREKmks5r7dWFgaJpZM4Ng0Sr.

-- I. Burak Ozyurt PhD Project Scientist University of California, San Diego 9500 Gilman Drive, M/C 0608 La Jolla, CA 92093-0608

ianfore commented 7 years ago

We certainly wanted to filter clinical trials.gov for those trials that have data. But Datamed also has the capability to record accessibility metadata for a dataset. Do the options for "accessibility" include options that could be assigned to the clinical trials.gov datasets which are not publicly available. That would allow the larger set (>26000) to be imported.

Here's an example of a dataset from another repository where the data do not appear to be publicly accessible - but the requirements for access are provided. https://datamed.org/display-item.php?repository=0061&id=58dc290a5152c678ad50d62a&query=a[Dimension]