umcu / negation-detection

Negation detection in Dutch clinical text.
GNU General Public License v3.0
3 stars 0 forks source link

Transform EMC DCC dataset to spaCy format #4

Closed lcreteig closed 3 years ago

lcreteig commented 3 years ago

N.B. The code addressing this issue should not be added to this repo, but to https://github.com/umcu/emc-dcc-transformation instead

For the context module from medspacy to recognise negated entities, we first need to transform the ECC dataset to Doc objects that can be fed to a (med)spaCy pipeline: