dalgu90 / icd-coding-benchmark

Automatic ICD coding benchmark based on the MIMIC dataset
MIT License

Change the macro AUC metric to multi-process method (branch issue cleared) #39

Closed dalgu90 closed 2 years ago

dalgu90 commented 2 years ago

Hi, I would like to change the MacroAUC metric, since computing it with a single thread takes too long when there are many labels (as in the MIMIC-III full dataset). The number of processes used to compute macro AUC can be configured with the num_process parameter. If the parameter is not specified, it is automatically set to min(# CPU cores, 16). To disable multiprocessing, set it to a value less than or equal to 1. Here's the change in computation time on the MIMIC-III full dataset (the machine has 32 CPU cores, so num_process is 16).
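For readers who want a feel for the num_process behaviour described above, here is a minimal, standard-library-only sketch. It is not the code from this PR: the function names (`resolve_num_process`, `binary_auc`, `macro_auc`), the rank-sum AUC formula, and the choice to skip labels that have only one class are all assumptions made for illustration. The repository may well compute per-label AUC differently.

```python
import os
from multiprocessing import Pool


def resolve_num_process(num_process=None, max_default=16):
    """Hypothetical helper: default to min(# CPU cores, 16) when unset."""
    if num_process is None:
        return min(os.cpu_count() or 1, max_default)
    return num_process


def binary_auc(args):
    """AUC for one label via the rank-sum (Mann-Whitney U) formula.

    `args` is a (y_true, y_score) pair; y_true holds 0/1 integers.
    Returns None when only one class is present (AUC is undefined);
    skipping such labels is an assumption of this sketch.
    """
    y_true, y_score = args
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    if n_pos == 0 or n_neg == 0:
        return None
    order = sorted(range(len(y_score)), key=lambda i: y_score[i])
    ranks = [0.0] * len(y_score)
    i = 0
    while i < len(order):
        j = i
        # Extend j over a run of tied scores.
        while j + 1 < len(order) and y_score[order[j + 1]] == y_score[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # average 1-based rank for the tied run
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    pos_rank_sum = sum(r for r, t in zip(ranks, y_true) if t == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def macro_auc(y_true_cols, y_score_cols, num_process=None):
    """Average per-label AUC; one (y_true, y_score) column pair per label."""
    tasks = list(zip(y_true_cols, y_score_cols))
    n = resolve_num_process(num_process)
    if n <= 1:
        # num_process <= 1 disables multiprocessing.
        aucs = [binary_auc(t) for t in tasks]
    else:
        with Pool(n) as pool:
            aucs = pool.map(binary_auc, tasks)
    valid = [a for a in aucs if a is not None]
    return sum(valid) / len(valid) if valid else float("nan")
```

When calling `macro_auc` with more than one process from a script, wrap the call in an `if __name__ == "__main__":` guard so that worker processes can import the module safely on platforms that spawn rather than fork.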