We have to inherently create some sort of mapping between what the mentions originally looked like in CORD-19 (e.g., ['Statistical Package for Social Sciences (SPSS)', 'SPSS', 'SPSS Statistics'] and what they look like in a normalized fashion in our new dataset (e.g., SPSS).
It would probably be very useful for other projects that may reuse our dataset to also have access to the mapping. Therefore, it would be nice to provide this mapping in some consumable form.
We have to inherently create some sort of mapping between what the mentions originally looked like in CORD-19 (e.g.,
['Statistical Package for Social Sciences (SPSS)', 'SPSS', 'SPSS Statistics']
and what they look like in a normalized fashion in our new dataset (e.g.,SPSS
).It would probably be very useful for other projects that may reuse our dataset to also have access to the mapping. Therefore, it would be nice to provide this mapping in some consumable form.