biopragmatics / bioregistry

📮 An integrative registry of biological databases, ontologies, and nomenclatures.
https://bioregistry.io
MIT License
120 stars 53 forks source link

ClinicalTrials.gov: 2023 URI update and ctgov prefix #897

Closed dhimmel closed 1 year ago

dhimmel commented 1 year ago

refs https://github.com/greenelab/covid19-review/issues/1210

Two changes (one per commit) that should be treated as separate (I can remove either of the changes from this PR based on review).

dhimmel commented 1 year ago

Possible confusion with Connecticut's https://portal.ct.gov/.

ChatGPT answers with ClinicalTrials.gov for "In the context of life science databases, ontologies, and other resources, what does ctgov stand for?"

cthoyt commented 1 year ago

@dhimmel thank you for the contribution, the new URI format is great. Can you give me an example where you encountered ctgov as a prefix?

codecov[bot] commented 1 year ago

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (0b45492) 40.62% compared to head (630d987) 40.62%.

Additional details and impacted files ```diff @@ Coverage Diff @@ ## main #897 +/- ## ======================================= Coverage 40.62% 40.62% ======================================= Files 135 135 Lines 7790 7790 Branches 1809 1809 ======================================= Hits 3165 3165 Misses 4429 4429 Partials 196 196 ``` | [Impacted Files](https://app.codecov.io/gh/biopragmatics/bioregistry/pull/897?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=biopragmatics) | Coverage Δ | | |---|---|---| | [src/bioregistry/version.py](https://app.codecov.io/gh/biopragmatics/bioregistry/pull/897?src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=biopragmatics#diff-c3JjL2Jpb3JlZ2lzdHJ5L3ZlcnNpb24ucHk=) | `68.42% <100.00%> (ø)` | |

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Do you have feedback about the report comment? Let us know in this issue.

dhimmel commented 1 year ago

Can you give me an example where you encountered ctgov as a prefix?

I don't know if I've seen it as a CURIE prefix anywhere, but I think it is common to use in column names like ctgov_id. Also when writing ClinicalTrials.gov, people will often abbreviate it as CT.gov or CT-gov. Searching via pubmed, about 25-30 publications do this per year. Admittedly that is low compared to the 20k articles that mention ClinicalTrials.gov per year.

Partly this addition is motivated by the existing prefixes at https://bioregistry.io/registry/clinicaltrials being somewhat unsatisfactory, where clinicaltrials looks out of place as a prefix in the Clinical Trial Registries collection. This change is a potential gateway into a future changing the preferred prefix to ctgov, which could be evaluated in the future based on adoption of the ctgov prefix.

Also interested in whether anyone knows someone more involved with CT.gov data that could advise? Tagging @micronix in case they have any insights here (I see they are a recent contributor to https://github.com/ctti-clinicaltrials/aact).

dhimmel commented 1 year ago

If you're referring to the giant AACT schema it looks like it uses both nct and ctgov.

All good on my end, so merge away. More feedback still welcome, but can always make changes later if anything is actionable.