MIT-LCP / physionet-build

The new PhysioNet platform.
https://physionet.org/
BSD 3-Clause "New" or "Revised" License

Formatting issue with description field in Google Datasets #866

Open tompollard opened 4 years ago

tompollard commented 4 years ago

There is a formatting issue with the Description field in at least one project on Google Datasets (MIMIC-III).

See the list under "It is notable for three factors:": https://datasetsearch.research.google.com/search?query=mimic-iii&docid=O7qvsX7ueG%2FITwCJAAAAAA%3D%3D

[Image: Screen Shot 2020-02-07 at 13 33 23]

The content is generated from the schema.org metadata embedded in the project page, so whatever the fix is, it will involve updating the metadata.
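For illustration only, here is a rough sketch of how schema.org `Dataset` metadata might be embedded in a page as JSON-LD. The helper name `dataset_jsonld` and its parameters are hypothetical and not taken from physionet-build; the point is just that the `description` value is a single string, so its formatting is whatever that string contains.

```python
import json


def dataset_jsonld(name, url, plain_description):
    """Hypothetical helper: build a JSON-LD <script> tag for a schema.org Dataset."""
    metadata = {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "url": url,
        # Assumption: the description has already been converted to plain text.
        "description": plain_description,
    }
    # Escape "</" so the payload cannot close the surrounding <script> element.
    payload = json.dumps(metadata, indent=2).replace("</", "<\\/")
    return '<script type="application/ld+json">\n' + payload + "\n</script>"


# Placeholder values, not real project metadata.
print(dataset_jsonld("Example dataset", "https://example.org/", "A short description."))
```

If the description string contains leading tabs or leftover markup, a consumer of the metadata will see exactly that.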

bemoody commented 4 years ago

This is also relevant to DOIs, as mentioned at https://github.com/MIT-LCP/physionet-build/pull/855#issuecomment-583191271

I'm gonna guess that "tab at start of line means monospace" is a Google quirk, but either way... what we need here is a "plain text" version of the abstract, which is not the same as HTML with the tags stripped out.
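A minimal sketch of what "plain text, not just stripped tags" could look like, using only the Python standard library. The names `PlainTextExtractor` and `html_to_plain_text` are hypothetical and not part of physionet-build, and the set of tags handled is an assumption about what an abstract might contain. Block elements become line breaks, list items get a `- ` prefix, and source indentation (including tabs) is collapsed.

```python
from html.parser import HTMLParser


class PlainTextExtractor(HTMLParser):
    """Hypothetical helper: render a small subset of HTML as plain text."""

    # Assumed set of block-level tags; extend as needed.
    BLOCK_TAGS = {"p", "div", "ul", "ol", "li", "br", "h1", "h2", "h3", "h4"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            # Start each list item on its own line with a plain-text bullet.
            self.parts.append("\n- ")
        elif tag in self.BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        # Newlines in the HTML source are not meaningful; treat them as spaces.
        self.parts.append(data.replace("\n", " "))


def html_to_plain_text(html):
    parser = PlainTextExtractor()
    parser.feed(html)
    parser.close()
    text = "".join(parser.parts)
    # Normalise whitespace per line (drops tabs and indentation), skip blank lines.
    lines = [" ".join(line.split()) for line in text.split("\n")]
    return "\n".join(line for line in lines if line)


# Made-up list items; only the lead-in sentence comes from the abstract.
sample = (
    "<p>It is notable for three factors:</p>\n"
    "<ul>\n\t<li>first <b>item</b></li>\n\t<li>second &amp; last item</li>\n</ul>"
)
print(html_to_plain_text(sample))
```

Naively stripping the tags from `sample` would leave the `\n\t` indentation before each item, which may be what ends up rendered oddly; this version emits no tabs at all.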