dalgu90 / icd-coding-benchmark

Automatic ICD coding benchmark based on the MIMIC dataset
MIT License
35 stars 5 forks source link

Change the macro AUC metric to multi-process method #38

Closed dalgu90 closed 2 years ago

dalgu90 commented 2 years ago

NOTE: I re-created a new branch and a new PR #39 since this branch has an issue. Please review #39 instead.

Hi, I would like to change the MacroAUC metric since computing takes too much time with only one thread when there are a lot of labels (like the MIMIC-III full dataset). The number of processes to compute macro AUC can be configured by the parameter num_process. If the parameter is not specified, it will automatically be set to min(# CPU cores, 16). You can configure not to use multiprocessing by setting it less than or equal to 1. Here's the change in the computation time on the MIMIC-III full dataset (The machine has 32 CPU cores, so num_process is 16)

dalgu90 commented 2 years ago

Oh. It all got mixed up with the fusion branch. Let me clear things up.