MIT-LCP / mimic-code

MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases
https://mimic.mit.edu
MIT License
2.64k stars 1.53k forks source link

MIMIC-IV ED Triage 'pain' field #1063

Closed divyashan closed 3 years ago

divyashan commented 3 years ago

Prerequisites

* [X] Checked the online documentation: https://mimic.mit.edu/
* [X] Checked that your issue isn't already addressed: https://github.com/MIT-LCP/mimic-code/issues?utf8=%E2%9C%93&q=

Description

Looking at the pain field in the triage table, ~2.6% of the rows contain reported pain values > 10. About 97% of these rows have a pain value of exactly 13. Do you know what these higher pain values correspond to?

Thanks again for your work on this dataset!! It's great to work with.

alistairewj commented 3 years ago

Most likely one of two things: (1) they intended to write 1-3 and omitted the dash resulting in 13 (it seems too frequent for that), or (2) they wrote a value into the wrong field... also seems too frequent for that. Not sure!

Worth mentioning that this field was originally free-text. The deidentification of this field required values to be integers - but the original data allows for any free-text, e.g. here's a snippet (rounded to the nearest 100):

                           pain                           |   n
----------------------------------------------------------+--------
 0                                                        | 164400
 8                                                        |  48100
 10                                                       |  47200
 7                                                        |  36400
 5                                                        |  33800
 6                                                        |  29100
 9                                                        |  23400
 4                                                        |  22800
 3                                                        |  18600
 2                                                        |  16000
                                                          |  14000
 13                                                       |  11700
 1                                                        |   6800
 unable                                                   |   4000
 uta                                                      |   2500
 Critical                                                 |   2400
 UTA                                                      |   2300
 ua                                                       |   1400
 UA                                                       |    600
 c                                                        |    500
 u/a                                                      |    400
 unable                                                   |    300
 Unable                                                   |    300
 critical                                                 |    200
 crit                                                     |    200
 Non-verbal                                               |    100
 7.5                                                      |    100
 4-5                                                      |    100
 ?                                                        |    100
 9.5                                                      |    100
 o                                                        |    100
 8.5                                                      |    100
 1-2                                                      |    100
 ett                                                      |    100

I realize now that I should have put ___ instead of NULL in this column to indicate that deid had been performed.