bigscience-workshop / biomedical

Tools for curating biomedical training data for large-scale language modeling
454 stars 116 forks source link

Closes #888 - Bug in CARDIO:DE loader #890

Closed nachollorca closed 1 year ago

nachollorca commented 1 year ago

Closes bug #888, which was causing some documents to be parsed incorrectly for CARDIO:DE.

INFO:__main__:args: Namespace(bypass_keys=[], bypass_split_key_pairs=[], bypass_splits=[], config_name=None, data_dir='/dhc/home/ignacio.rodriguez/cardio_de_corpus', dataset_name='cardiode', test_local=True)
INFO:__main__:Running (Local) Unit Test
INFO:__main__:all_config_names: ['cardiode_source', 'cardiode_bigbio_kb']
INFO:__main__:self.DATASET_NAME: bigbio/hub/hub_repos/cardiode/cardiode.py
INFO:__main__:self.CONFIG_NAME: cardiode_source
INFO:__main__:self.DATA_DIR: /dhc/home/ignacio.rodriguez/cardio_de_corpus
INFO:__main__:importing module .... 
INFO:__main__:imported module <module 'datasets_modules.datasets.cardiode.4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69.cardiode' from '/dhc/home/ignacio.rodriguez/.cache/huggingface/modules/datasets_modules/datasets/cardiode/4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69/cardiode.py'>
INFO:__main__:Checking for _SUPPORTED_TASKS ...
INFO:__main__:Found _SUPPORTED_TASKS=['NAMED_ENTITY_RECOGNITION']
INFO:__main__:_SUPPORTED_TASKS implies _MAPPED_SCHEMAS={'KB'}
INFO:__main__:Checking load_dataset with config name cardiode_source
WARNING:datasets.builder:Using custom data configuration cardiode_source-c8bb5852e63a7387
Dataset =  cardiode
DatasetModule(module_path='datasets_modules.datasets.cardiode.4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69.cardiode', hash='4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69', builder_kwargs={'hash': '4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69', 'base_path': 'bigbio/hub/hub_repos/cardiode'})
Downloading and preparing dataset cardiode/cardiode_source to /dhc/home/ignacio.rodriguez/.cache/huggingface/datasets/cardiode/cardiode_source-c8bb5852e63a7387/5.0.0/4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69...

Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 1 examples [00:00,  1.62 examples/s]
Generating train split: 2 examples [00:01,  1.80 examples/s]
Generating train split: 3 examples [00:01,  1.64 examples/s]
Generating train split: 4 examples [00:02,  1.87 examples/s]
Generating train split: 5 examples [00:02,  1.94 examples/s]
Generating train split: 6 examples [00:03,  1.92 examples/s]
Generating train split: 7 examples [00:03,  2.40 examples/s]
Generating train split: 8 examples [00:03,  2.27 examples/s]
Generating train split: 9 examples [00:04,  2.50 examples/s]
Generating train split: 10 examples [00:04,  2.30 examples/s]
Generating train split: 11 examples [00:05,  2.38 examples/s]
Generating train split: 12 examples [00:05,  2.32 examples/s]
Generating train split: 13 examples [00:06,  2.18 examples/s]
Generating train split: 14 examples [00:06,  2.44 examples/s]
Generating train split: 15 examples [00:06,  2.93 examples/s]
Generating train split: 16 examples [00:06,  3.57 examples/s]
Generating train split: 17 examples [00:07,  2.51 examples/s]
Generating train split: 18 examples [00:08,  2.00 examples/s]
Generating train split: 19 examples [00:08,  2.25 examples/s]
Generating train split: 20 examples [00:08,  2.12 examples/s]
Generating train split: 21 examples [00:09,  1.63 examples/s]
Generating train split: 22 examples [00:10,  1.94 examples/s]
Generating train split: 23 examples [00:10,  1.80 examples/s]
Generating train split: 24 examples [00:12,  1.08 examples/s]
Generating train split: 25 examples [00:14,  1.08s/ examples]
Generating train split: 26 examples [00:14,  1.05 examples/s]
Generating train split: 27 examples [00:15,  1.21 examples/s]
Generating train split: 28 examples [00:15,  1.37 examples/s]
Generating train split: 29 examples [00:15,  1.77 examples/s]
Generating train split: 30 examples [00:16,  2.18 examples/s]
Generating train split: 31 examples [00:16,  2.09 examples/s]
Generating train split: 32 examples [00:17,  2.31 examples/s]
Generating train split: 33 examples [00:18,  1.68 examples/s]
Generating train split: 34 examples [00:18,  1.68 examples/s]
Generating train split: 35 examples [00:19,  1.52 examples/s]
Generating train split: 36 examples [00:19,  1.68 examples/s]
Generating train split: 37 examples [00:20,  1.81 examples/s]
Generating train split: 38 examples [00:21,  1.63 examples/s]
Generating train split: 39 examples [00:21,  1.50 examples/s]
Generating train split: 40 examples [00:22,  1.57 examples/s]
Generating train split: 41 examples [00:23,  1.31 examples/s]
Generating train split: 42 examples [00:24,  1.39 examples/s]
Generating train split: 43 examples [00:24,  1.42 examples/s]
Generating train split: 44 examples [00:25,  1.50 examples/s]
Generating train split: 45 examples [00:26,  1.35 examples/s]
Generating train split: 46 examples [00:26,  1.67 examples/s]
Generating train split: 47 examples [00:27,  1.42 examples/s]
Generating train split: 48 examples [00:28,  1.20 examples/s]
Generating train split: 49 examples [00:29,  1.17 examples/s]
Generating train split: 50 examples [00:30,  1.21 examples/s]
Generating train split: 51 examples [00:30,  1.61 examples/s]
Generating train split: 52 examples [00:31,  1.53 examples/s]
Generating train split: 53 examples [00:32,  1.39 examples/s]
Generating train split: 54 examples [00:32,  1.82 examples/s]
Generating train split: 55 examples [00:33,  1.57 examples/s]
Generating train split: 56 examples [00:33,  1.62 examples/s]
Generating train split: 57 examples [00:34,  1.39 examples/s]
Generating train split: 58 examples [00:35,  1.37 examples/s]
Generating train split: 59 examples [00:35,  1.68 examples/s]
Generating train split: 60 examples [00:36,  1.66 examples/s]
Generating train split: 61 examples [00:36,  1.68 examples/s]
Generating train split: 62 examples [00:37,  1.86 examples/s]
Generating train split: 63 examples [00:37,  2.25 examples/s]
Generating train split: 64 examples [00:37,  2.39 examples/s]
Generating train split: 65 examples [00:38,  2.65 examples/s]
Generating train split: 66 examples [00:38,  2.91 examples/s]
Generating train split: 67 examples [00:38,  3.23 examples/s]
Generating train split: 68 examples [00:38,  3.22 examples/s]
Generating train split: 69 examples [00:39,  3.30 examples/s]
Generating train split: 70 examples [00:39,  3.65 examples/s]
Generating train split: 71 examples [00:39,  3.76 examples/s]
Generating train split: 72 examples [00:39,  4.08 examples/s]
Generating train split: 73 examples [00:39,  4.34 examples/s]
Generating train split: 74 examples [00:40,  2.44 examples/s]
Generating train split: 75 examples [00:41,  2.08 examples/s]
Generating train split: 76 examples [00:41,  2.35 examples/s]
Generating train split: 77 examples [00:42,  1.52 examples/s]
Generating train split: 78 examples [00:44,  1.28 examples/s]
Generating train split: 79 examples [00:44,  1.32 examples/s]
Generating train split: 80 examples [00:44,  1.72 examples/s]
Generating train split: 81 examples [00:45,  1.82 examples/s]
Generating train split: 82 examples [00:46,  1.28 examples/s]
Generating train split: 83 examples [00:47,  1.40 examples/s]
Generating train split: 84 examples [00:48,  1.32 examples/s]
Generating train split: 85 examples [00:48,  1.40 examples/s]
Generating train split: 86 examples [00:49,  1.56 examples/s]
Generating train split: 87 examples [00:49,  1.48 examples/s]
Generating train split: 88 examples [00:50,  1.45 examples/s]
Generating train split: 89 examples [00:51,  1.44 examples/s]
Generating train split: 90 examples [00:51,  1.76 examples/s]
Generating train split: 91 examples [00:51,  2.07 examples/s]
Generating train split: 92 examples [00:52,  2.60 examples/s]
Generating train split: 93 examples [00:52,  2.14 examples/s]
Generating train split: 94 examples [00:53,  2.20 examples/s]
Generating train split: 95 examples [00:54,  1.55 examples/s]
Generating train split: 96 examples [00:54,  1.71 examples/s]
Generating train split: 97 examples [00:55,  1.50 examples/s]
Generating train split: 98 examples [00:55,  1.79 examples/s]
Generating train split: 99 examples [00:56,  1.42 examples/s]
Generating train split: 100 examples [00:57,  1.43 examples/s]
Generating train split: 101 examples [00:58,  1.24 examples/s]
Generating train split: 102 examples [00:58,  1.55 examples/s]
Generating train split: 103 examples [00:59,  1.54 examples/s]
Generating train split: 104 examples [01:00,  1.38 examples/s]
Generating train split: 105 examples [01:00,  1.54 examples/s]
Generating train split: 106 examples [01:01,  1.40 examples/s]
Generating train split: 107 examples [01:02,  1.27 examples/s]
Generating train split: 108 examples [01:03,  1.60 examples/s]
Generating train split: 109 examples [01:03,  1.96 examples/s]
Generating train split: 110 examples [01:03,  2.28 examples/s]
Generating train split: 111 examples [01:03,  2.63 examples/s]
Generating train split: 112 examples [01:04,  2.20 examples/s]
Generating train split: 113 examples [01:04,  2.14 examples/s]
Generating train split: 114 examples [01:05,  2.47 examples/s]
Generating train split: 115 examples [01:06,  1.80 examples/s]
Generating train split: 116 examples [01:06,  2.33 examples/s]
Generating train split: 117 examples [01:06,  2.87 examples/s]
Generating train split: 118 examples [01:06,  2.33 examples/s]
Generating train split: 119 examples [01:07,  2.05 examples/s]
Generating train split: 120 examples [01:08,  1.55 examples/s]
Generating train split: 121 examples [01:08,  1.80 examples/s]
Generating train split: 122 examples [01:09,  2.02 examples/s]
Generating train split: 123 examples [01:09,  2.43 examples/s]
Generating train split: 124 examples [01:09,  2.82 examples/s]
Generating train split: 125 examples [01:10,  3.02 examples/s]
Generating train split: 126 examples [01:10,  3.20 examples/s]
Generating train split: 127 examples [01:10,  2.43 examples/s]
Generating train split: 128 examples [01:11,  2.91 examples/s]
Generating train split: 129 examples [01:11,  2.79 examples/s]
Generating train split: 130 examples [01:12,  1.84 examples/s]
Generating train split: 131 examples [01:12,  2.04 examples/s]
Generating train split: 132 examples [01:13,  1.86 examples/s]
Generating train split: 133 examples [01:13,  2.36 examples/s]
Generating train split: 134 examples [01:14,  2.11 examples/s]
Generating train split: 135 examples [01:14,  1.98 examples/s]
Generating train split: 136 examples [01:15,  2.35 examples/s]
Generating train split: 137 examples [01:15,  2.71 examples/s]
Generating train split: 138 examples [01:15,  3.17 examples/s]
Generating train split: 139 examples [01:15,  3.40 examples/s]
Generating train split: 140 examples [01:15,  3.83 examples/s]
Generating train split: 141 examples [01:16,  2.64 examples/s]
Generating train split: 142 examples [01:17,  2.09 examples/s]
Generating train split: 143 examples [01:17,  2.28 examples/s]
Generating train split: 144 examples [01:18,  1.97 examples/s]
Generating train split: 145 examples [01:19,  1.54 examples/s]
Generating train split: 146 examples [01:19,  1.68 examples/s]
Generating train split: 147 examples [01:19,  2.14 examples/s]
Generating train split: 148 examples [01:20,  2.59 examples/s]
Generating train split: 149 examples [01:20,  2.87 examples/s]
Generating train split: 150 examples [01:20,  3.02 examples/s]
Generating train split: 151 examples [01:20,  3.39 examples/s]
Generating train split: 152 examples [01:21,  3.72 examples/s]
Generating train split: 153 examples [01:21,  3.81 examples/s]
Generating train split: 154 examples [01:21,  3.89 examples/s]
Generating train split: 155 examples [01:21,  3.90 examples/s]
Generating train split: 156 examples [01:22,  3.95 examples/s]
Generating train split: 157 examples [01:22,  2.42 examples/s]
Generating train split: 158 examples [01:23,  1.95 examples/s]
Generating train split: 159 examples [01:24,  1.74 examples/s]
Generating train split: 160 examples [01:25,  1.41 examples/s]
Generating train split: 161 examples [01:27,  1.00 examples/s]
Generating train split: 162 examples [01:27,  1.08 examples/s]
Generating train split: 163 examples [01:28,  1.33 examples/s]
Generating train split: 164 examples [01:29,  1.26 examples/s]
Generating train split: 165 examples [01:29,  1.60 examples/s]
Generating train split: 166 examples [01:29,  1.65 examples/s]
Generating train split: 167 examples [01:30,  1.62 examples/s]
Generating train split: 168 examples [01:30,  1.79 examples/s]
Generating train split: 169 examples [01:31,  1.73 examples/s]
Generating train split: 170 examples [01:31,  2.09 examples/s]
Generating train split: 171 examples [01:31,  2.52 examples/s]
Generating train split: 172 examples [01:32,  2.97 examples/s]
Generating train split: 173 examples [01:32,  2.88 examples/s]
Generating train split: 174 examples [01:32,  2.88 examples/s]
Generating train split: 175 examples [01:33,  2.51 examples/s]
Generating train split: 176 examples [01:33,  2.54 examples/s]
Generating train split: 177 examples [01:34,  2.75 examples/s]
Generating train split: 178 examples [01:34,  3.22 examples/s]
Generating train split: 179 examples [01:34,  3.67 examples/s]
Generating train split: 180 examples [01:34,  3.92 examples/s]
Generating train split: 181 examples [01:35,  3.54 examples/s]
Generating train split: 182 examples [01:35,  3.76 examples/s]
Generating train split: 183 examples [01:35,  4.15 examples/s]
Generating train split: 184 examples [01:35,  3.98 examples/s]
Generating train split: 185 examples [01:35,  3.97 examples/s]
Generating train split: 186 examples [01:36,  3.95 examples/s]
Generating train split: 187 examples [01:36,  2.78 examples/s]
Generating train split: 188 examples [01:37,  3.21 examples/s]
Generating train split: 189 examples [01:37,  3.59 examples/s]
Generating train split: 190 examples [01:37,  3.95 examples/s]
Generating train split: 191 examples [01:37,  4.17 examples/s]
Generating train split: 192 examples [01:37,  4.60 examples/s]
Generating train split: 193 examples [01:38,  4.38 examples/s]
Generating train split: 194 examples [01:38,  3.66 examples/s]
Generating train split: 195 examples [01:38,  3.59 examples/s]
Generating train split: 196 examples [01:39,  3.51 examples/s]
Generating train split: 197 examples [01:39,  3.15 examples/s]
Generating train split: 198 examples [01:39,  3.29 examples/s]
Generating train split: 199 examples [01:39,  3.54 examples/s]
Generating train split: 200 examples [01:40,  3.83 examples/s]
Generating train split: 201 examples [01:40,  4.25 examples/s]
Generating train split: 202 examples [01:40,  2.89 examples/s]
Generating train split: 203 examples [01:42,  1.66 examples/s]
Generating train split: 204 examples [01:42,  2.04 examples/s]
Generating train split: 205 examples [01:42,  2.33 examples/s]
Generating train split: 206 examples [01:42,  2.43 examples/s]
Generating train split: 207 examples [01:43,  2.18 examples/s]
Generating train split: 208 examples [01:44,  1.69 examples/s]
Generating train split: 209 examples [01:45,  1.55 examples/s]
Generating train split: 210 examples [01:45,  1.96 examples/s]
Generating train split: 211 examples [01:45,  2.42 examples/s]
Generating train split: 212 examples [01:45,  2.70 examples/s]
Generating train split: 213 examples [01:46,  3.05 examples/s]
Generating train split: 214 examples [01:46,  3.30 examples/s]
Generating train split: 215 examples [01:46,  3.66 examples/s]
Generating train split: 216 examples [01:46,  3.35 examples/s]
Generating train split: 217 examples [01:47,  2.00 examples/s]
Generating train split: 218 examples [01:48,  1.79 examples/s]
Generating train split: 219 examples [01:48,  2.13 examples/s]
Generating train split: 220 examples [01:49,  2.12 examples/s]
Generating train split: 221 examples [01:50,  1.46 examples/s]
Generating train split: 222 examples [01:51,  1.21 examples/s]
Generating train split: 223 examples [01:52,  1.44 examples/s]
Generating train split: 224 examples [01:52,  1.47 examples/s]
Generating train split: 225 examples [01:53,  1.46 examples/s]
Generating train split: 226 examples [01:53,  1.54 examples/s]
Generating train split: 227 examples [01:54,  1.36 examples/s]
Generating train split: 228 examples [01:55,  1.74 examples/s]
Generating train split: 229 examples [01:55,  1.94 examples/s]
Generating train split: 230 examples [01:55,  2.36 examples/s]
Generating train split: 231 examples [01:55,  2.73 examples/s]
Generating train split: 232 examples [01:56,  3.30 examples/s]
Generating train split: 233 examples [01:56,  3.54 examples/s]
Generating train split: 234 examples [01:56,  3.80 examples/s]
Generating train split: 235 examples [01:56,  3.87 examples/s]
Generating train split: 236 examples [01:56,  4.17 examples/s]
Generating train split: 237 examples [01:57,  2.15 examples/s]
Generating train split: 238 examples [01:58,  1.74 examples/s]
Generating train split: 239 examples [02:00,  1.21 examples/s]
Generating train split: 240 examples [02:00,  1.23 examples/s]
Generating train split: 241 examples [02:02,  1.14 examples/s]
Generating train split: 242 examples [02:03,  1.07 examples/s]
Generating train split: 243 examples [02:04,  1.22s/ examples]
Generating train split: 244 examples [02:05,  1.15s/ examples]
Generating train split: 245 examples [02:06,  1.14 examples/s]
Generating train split: 246 examples [02:07,  1.11 examples/s]
Generating train split: 247 examples [02:07,  1.26 examples/s]
Generating train split: 248 examples [02:08,  1.35 examples/s]
Generating train split: 249 examples [02:09,  1.12 examples/s]
Generating train split: 250 examples [02:10,  1.28 examples/s]
Generating train split: 251 examples [02:11,  1.19 examples/s]
Generating train split: 252 examples [02:11,  1.51 examples/s]
Generating train split: 253 examples [02:11,  1.65 examples/s]
Generating train split: 254 examples [02:11,  2.09 examples/s]
Generating train split: 255 examples [02:12,  1.69 examples/s]
Generating train split: 256 examples [02:14,  1.27 examples/s]
Generating train split: 257 examples [02:14,  1.30 examples/s]
Generating train split: 258 examples [02:15,  1.33 examples/s]
Generating train split: 259 examples [02:16,  1.26 examples/s]
Generating train split: 260 examples [02:16,  1.37 examples/s]
Generating train split: 261 examples [02:18,  1.16 examples/s]
Generating train split: 262 examples [02:19,  1.14 examples/s]
Generating train split: 263 examples [02:20,  1.03s/ examples]
Generating train split: 264 examples [02:21,  1.00s/ examples]
Generating train split: 265 examples [02:21,  1.29 examples/s]
Generating train split: 266 examples [02:22,  1.06 examples/s]
Generating train split: 267 examples [02:24,  1.01 examples/s]
Generating train split: 268 examples [02:24,  1.18 examples/s]
Generating train split: 269 examples [02:25,  1.35 examples/s]
Generating train split: 270 examples [02:25,  1.31 examples/s]
Generating train split: 271 examples [02:26,  1.64 examples/s]
Generating train split: 272 examples [02:26,  1.90 examples/s]
Generating train split: 273 examples [02:26,  2.22 examples/s]
Generating train split: 274 examples [02:27,  2.53 examples/s]
Generating train split: 275 examples [02:27,  2.83 examples/s]
Generating train split: 276 examples [02:27,  3.32 examples/s]
Generating train split: 277 examples [02:27,  3.51 examples/s]
Generating train split: 278 examples [02:27,  3.63 examples/s]
Generating train split: 279 examples [02:28,  2.67 examples/s]
Generating train split: 280 examples [02:29,  2.27 examples/s]
Generating train split: 281 examples [02:29,  2.70 examples/s]
Generating train split: 282 examples [02:29,  2.83 examples/s]
Generating train split: 283 examples [02:30,  2.60 examples/s]
Generating train split: 284 examples [02:30,  2.22 examples/s]
Generating train split: 285 examples [02:31,  1.85 examples/s]
Generating train split: 286 examples [02:31,  2.12 examples/s]
Generating train split: 287 examples [02:32,  1.92 examples/s]
Generating train split: 288 examples [02:32,  2.20 examples/s]
Generating train split: 289 examples [02:33,  1.81 examples/s]
Generating train split: 290 examples [02:34,  1.62 examples/s]
Generating train split: 291 examples [02:34,  1.90 examples/s]
Generating train split: 292 examples [02:35,  1.74 examples/s]
Generating train split: 293 examples [02:36,  1.53 examples/s]
Generating train split: 294 examples [02:36,  1.52 examples/s]
Generating train split: 295 examples [02:37,  1.73 examples/s]
Generating train split: 296 examples [02:37,  1.75 examples/s]
Generating train split: 297 examples [02:37,  2.12 examples/s]
Generating train split: 298 examples [02:38,  1.85 examples/s]
Generating train split: 299 examples [02:38,  2.25 examples/s]
Generating train split: 300 examples [02:39,  2.45 examples/s]
Generating train split: 301 examples [02:39,  2.73 examples/s]
Generating train split: 302 examples [02:39,  3.13 examples/s]
Generating train split: 303 examples [02:39,  3.50 examples/s]
Generating train split: 304 examples [02:40,  3.59 examples/s]
Generating train split: 305 examples [02:40,  3.31 examples/s]
Generating train split: 306 examples [02:40,  3.02 examples/s]
Generating train split: 307 examples [02:41,  3.15 examples/s]
Generating train split: 308 examples [02:41,  3.44 examples/s]
Generating train split: 309 examples [02:41,  3.61 examples/s]
Generating train split: 310 examples [02:41,  3.77 examples/s]
Generating train split: 311 examples [02:42,  3.40 examples/s]
Generating train split: 312 examples [02:42,  3.65 examples/s]
Generating train split: 313 examples [02:43,  2.70 examples/s]
Generating train split: 314 examples [02:43,  2.15 examples/s]
Generating train split: 315 examples [02:44,  2.02 examples/s]
Generating train split: 316 examples [02:44,  2.01 examples/s]
Generating train split: 317 examples [02:45,  2.13 examples/s]
Generating train split: 318 examples [02:46,  1.78 examples/s]
Generating train split: 319 examples [02:46,  1.47 examples/s]
Generating train split: 320 examples [02:48,  1.22 examples/s]
Generating train split: 321 examples [02:49,  1.01s/ examples]
Generating train split: 322 examples [02:50,  1.05 examples/s]
Generating train split: 323 examples [02:51,  1.08 examples/s]
Generating train split: 324 examples [02:51,  1.22 examples/s]
Generating train split: 325 examples [02:52,  1.26 examples/s]
Generating train split: 326 examples [02:53,  1.15 examples/s]
Generating train split: 327 examples [02:54,  1.17 examples/s]
Generating train split: 328 examples [02:54,  1.37 examples/s]
Generating train split: 329 examples [02:55,  1.40 examples/s]
Generating train split: 330 examples [02:56,  1.50 examples/s]
Generating train split: 331 examples [02:56,  1.48 examples/s]
Generating train split: 332 examples [02:57,  1.49 examples/s]
Generating train split: 333 examples [02:58,  1.53 examples/s]
Generating train split: 334 examples [02:59,  1.31 examples/s]
Generating train split: 335 examples [03:00,  1.05s/ examples]
Generating train split: 336 examples [03:01,  1.02 examples/s]
Generating train split: 337 examples [03:02,  1.03s/ examples]
Generating train split: 338 examples [03:03,  1.12 examples/s]
Generating train split: 339 examples [03:04,  1.20 examples/s]
Generating train split: 340 examples [03:04,  1.50 examples/s]
Generating train split: 341 examples [03:05,  1.41 examples/s]
Generating train split: 342 examples [03:05,  1.67 examples/s]
Generating train split: 343 examples [03:05,  2.00 examples/s]
Generating train split: 344 examples [03:06,  1.82 examples/s]
Generating train split: 345 examples [03:07,  1.57 examples/s]
Generating train split: 346 examples [03:08,  1.31 examples/s]
Generating train split: 347 examples [03:08,  1.53 examples/s]
Generating train split: 348 examples [03:09,  1.33 examples/s]
Generating train split: 349 examples [03:10,  1.33 examples/s]
Generating train split: 350 examples [03:10,  1.47 examples/s]
Generating train split: 351 examples [03:11,  1.47 examples/s]
Generating train split: 352 examples [03:12,  1.61 examples/s]
Generating train split: 353 examples [03:12,  1.97 examples/s]
Generating train split: 354 examples [03:12,  2.29 examples/s]
Generating train split: 355 examples [03:12,  2.52 examples/s]
Generating train split: 356 examples [03:13,  2.82 examples/s]
Generating train split: 357 examples [03:13,  3.16 examples/s]
Generating train split: 358 examples [03:13,  3.12 examples/s]
Generating train split: 359 examples [03:14,  2.53 examples/s]
Generating train split: 360 examples [03:14,  2.81 examples/s]
Generating train split: 361 examples [03:14,  3.11 examples/s]
Generating train split: 362 examples [03:15,  3.08 examples/s]
Generating train split: 363 examples [03:15,  3.44 examples/s]
Generating train split: 364 examples [03:15,  3.60 examples/s]
Generating train split: 365 examples [03:15,  3.64 examples/s]
Generating train split: 366 examples [03:16,  4.08 examples/s]
Generating train split: 367 examples [03:16,  4.00 examples/s]
Generating train split: 368 examples [03:16,  4.37 examples/s]
Generating train split: 369 examples [03:16,  3.86 examples/s]
Generating train split: 370 examples [03:17,  4.13 examples/s]
Generating train split: 371 examples [03:17,  4.22 examples/s]
Generating train split: 372 examples [03:17,  4.29 examples/s]
Generating train split: 373 examples [03:17,  4.05 examples/s]
Generating train split: 374 examples [03:18,  3.91 examples/s]
Generating train split: 375 examples [03:18,  2.86 examples/s]
Generating train split: 376 examples [03:19,  2.51 examples/s]
Generating train split: 377 examples [03:19,  2.96 examples/s]
Generating train split: 378 examples [03:19,  3.35 examples/s]
Generating train split: 379 examples [03:19,  3.72 examples/s]
Generating train split: 380 examples [03:20,  3.50 examples/s]
Generating train split: 381 examples [03:20,  2.29 examples/s]
Generating train split: 382 examples [03:21,  1.83 examples/s]
Generating train split: 383 examples [03:21,  2.31 examples/s]
Generating train split: 384 examples [03:22,  2.12 examples/s]
Generating train split: 385 examples [03:23,  1.91 examples/s]
Generating train split: 386 examples [03:23,  2.32 examples/s]
Generating train split: 387 examples [03:23,  2.88 examples/s]
Generating train split: 388 examples [03:23,  2.51 examples/s]
Generating train split: 389 examples [03:24,  2.00 examples/s]
Generating train split: 390 examples [03:24,  2.38 examples/s]
Generating train split: 391 examples [03:25,  2.84 examples/s]
Generating train split: 392 examples [03:25,  2.45 examples/s]
Generating train split: 393 examples [03:25,  2.62 examples/s]
Generating train split: 394 examples [03:26,  2.35 examples/s]
Generating train split: 395 examples [03:27,  2.07 examples/s]
Generating train split: 396 examples [03:27,  2.54 examples/s]
Generating train split: 397 examples [03:27,  2.19 examples/s]
Generating train split: 398 examples [03:28,  1.99 examples/s]
Generating train split: 399 examples [03:29,  1.73 examples/s]
Generating train split: 400 examples [03:29,  2.12 examples/s]

Dataset cardiode downloaded and prepared to /dhc/home/ignacio.rodriguez/.cache/huggingface/datasets/cardiode/cardiode_source-c8bb5852e63a7387/5.0.0/4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69. Subsequent calls will reuse this data.

  0%|          | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:00<00:00, 108.29it/s]
INFO:__main__:schema = source
.
----------------------------------------------------------------------
Ran 1 test in 212.404s

OK
INFO:__main__:self.DATASET_NAME: bigbio/hub/hub_repos/cardiode/cardiode.py
INFO:__main__:self.CONFIG_NAME: cardiode_bigbio_kb
INFO:__main__:self.DATA_DIR: /dhc/home/ignacio.rodriguez/cardio_de_corpus
INFO:__main__:importing module .... 
INFO:__main__:imported module <module 'datasets_modules.datasets.cardiode.4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69.cardiode' from '/dhc/home/ignacio.rodriguez/.cache/huggingface/modules/datasets_modules/datasets/cardiode/4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69/cardiode.py'>
INFO:__main__:Checking for _SUPPORTED_TASKS ...
INFO:__main__:Found _SUPPORTED_TASKS=['NAMED_ENTITY_RECOGNITION']
INFO:__main__:_SUPPORTED_TASKS implies _MAPPED_SCHEMAS={'KB'}
INFO:__main__:Checking load_dataset with config name cardiode_bigbio_kb
WARNING:datasets.builder:Using custom data configuration cardiode_bigbio_kb-c8bb5852e63a7387
Downloading and preparing dataset cardiode/cardiode_bigbio_kb to /dhc/home/ignacio.rodriguez/.cache/huggingface/datasets/cardiode/cardiode_bigbio_kb-c8bb5852e63a7387/1.0.0/4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69...

Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 1 examples [00:00,  1.32 examples/s]
Generating train split: 2 examples [00:01,  1.39 examples/s]
Generating train split: 3 examples [00:02,  1.26 examples/s]
Generating train split: 4 examples [00:02,  1.44 examples/s]
Generating train split: 5 examples [00:03,  1.47 examples/s]
Generating train split: 6 examples [00:04,  1.46 examples/s]
Generating train split: 7 examples [00:04,  1.80 examples/s]
Generating train split: 8 examples [00:05,  1.71 examples/s]
Generating train split: 9 examples [00:05,  1.87 examples/s]
Generating train split: 10 examples [00:06,  1.72 examples/s]
Generating train split: 11 examples [00:06,  1.79 examples/s]
Generating train split: 12 examples [00:07,  1.73 examples/s]
Generating train split: 13 examples [00:08,  1.66 examples/s]
Generating train split: 14 examples [00:08,  1.82 examples/s]
Generating train split: 15 examples [00:08,  2.19 examples/s]
Generating train split: 16 examples [00:08,  2.68 examples/s]
Generating train split: 17 examples [00:09,  1.92 examples/s]
Generating train split: 18 examples [00:10,  1.54 examples/s]
Generating train split: 19 examples [00:11,  1.71 examples/s]
Generating train split: 20 examples [00:11,  1.59 examples/s]
Generating train split: 21 examples [00:13,  1.23 examples/s]
Generating train split: 22 examples [00:13,  1.47 examples/s]
Generating train split: 23 examples [00:14,  1.35 examples/s]
Generating train split: 24 examples [00:16,  1.21s/ examples]
Generating train split: 25 examples [00:18,  1.42s/ examples]
Generating train split: 26 examples [00:19,  1.25s/ examples]
Generating train split: 27 examples [00:20,  1.07s/ examples]
Generating train split: 28 examples [00:20,  1.06 examples/s]
Generating train split: 29 examples [00:21,  1.35 examples/s]
Generating train split: 30 examples [00:21,  1.65 examples/s]
Generating train split: 31 examples [00:22,  1.54 examples/s]
Generating train split: 32 examples [00:22,  1.67 examples/s]
Generating train split: 33 examples [00:23,  1.22 examples/s]
Generating train split: 34 examples [00:24,  1.22 examples/s]
Generating train split: 35 examples [00:25,  1.12 examples/s]
Generating train split: 36 examples [00:26,  1.21 examples/s]
Generating train split: 37 examples [00:27,  1.28 examples/s]
Generating train split: 38 examples [00:28,  1.16 examples/s]
Generating train split: 39 examples [00:29,  1.04 examples/s]
Generating train split: 40 examples [00:30,  1.10 examples/s]
Generating train split: 41 examples [00:31,  1.08s/ examples]
Generating train split: 42 examples [00:32,  1.00s/ examples]
Generating train split: 43 examples [00:33,  1.05 examples/s]
Generating train split: 44 examples [00:34,  1.09 examples/s]
Generating train split: 45 examples [00:35,  1.02s/ examples]
Generating train split: 46 examples [00:35,  1.24 examples/s]
Generating train split: 47 examples [00:36,  1.07 examples/s]
Generating train split: 48 examples [00:38,  1.12s/ examples]
Generating train split: 49 examples [00:39,  1.13s/ examples]
Generating train split: 50 examples [00:40,  1.11s/ examples]
Generating train split: 51 examples [00:40,  1.19 examples/s]
Generating train split: 52 examples [00:41,  1.15 examples/s]
Generating train split: 53 examples [00:42,  1.06 examples/s]
Generating train split: 54 examples [00:43,  1.37 examples/s]
Generating train split: 55 examples [00:44,  1.20 examples/s]
Generating train split: 56 examples [00:44,  1.26 examples/s]
Generating train split: 57 examples [00:46,  1.06 examples/s]
Generating train split: 58 examples [00:47,  1.04 examples/s]
Generating train split: 59 examples [00:47,  1.28 examples/s]
Generating train split: 60 examples [00:48,  1.24 examples/s]
Generating train split: 61 examples [00:49,  1.22 examples/s]
Generating train split: 62 examples [00:49,  1.35 examples/s]
Generating train split: 63 examples [00:50,  1.61 examples/s]
Generating train split: 64 examples [00:50,  1.71 examples/s]
Generating train split: 65 examples [00:51,  1.91 examples/s]
Generating train split: 66 examples [00:51,  2.13 examples/s]
Generating train split: 67 examples [00:51,  2.29 examples/s]
Generating train split: 68 examples [00:52,  2.21 examples/s]
Generating train split: 69 examples [00:52,  2.40 examples/s]
Generating train split: 70 examples [00:52,  2.67 examples/s]
Generating train split: 71 examples [00:53,  2.80 examples/s]
Generating train split: 72 examples [00:53,  3.08 examples/s]
Generating train split: 73 examples [00:53,  3.24 examples/s]
Generating train split: 74 examples [00:54,  1.90 examples/s]
Generating train split: 75 examples [00:55,  1.65 examples/s]
Generating train split: 76 examples [00:55,  1.84 examples/s]
Generating train split: 77 examples [00:57,  1.26 examples/s]
Generating train split: 78 examples [00:58,  1.00 examples/s]
Generating train split: 79 examples [00:59,  1.03 examples/s]
Generating train split: 80 examples [00:59,  1.32 examples/s]
Generating train split: 81 examples [01:00,  1.40 examples/s]
Generating train split: 82 examples [01:02,  1.02 examples/s]
Generating train split: 83 examples [01:02,  1.10 examples/s]
Generating train split: 84 examples [01:04,  1.04 examples/s]
Generating train split: 85 examples [01:04,  1.08 examples/s]
Generating train split: 86 examples [01:05,  1.19 examples/s]
Generating train split: 87 examples [01:06,  1.19 examples/s]
Generating train split: 88 examples [01:07,  1.20 examples/s]
Generating train split: 89 examples [01:08,  1.19 examples/s]
Generating train split: 90 examples [01:08,  1.43 examples/s]
Generating train split: 91 examples [01:08,  1.69 examples/s]
Generating train split: 92 examples [01:08,  2.09 examples/s]
Generating train split: 93 examples [01:09,  1.70 examples/s]
Generating train split: 94 examples [01:10,  1.75 examples/s]
Generating train split: 95 examples [01:11,  1.27 examples/s]
Generating train split: 96 examples [01:12,  1.33 examples/s]
Generating train split: 97 examples [01:13,  1.15 examples/s]
Generating train split: 98 examples [01:13,  1.34 examples/s]
Generating train split: 99 examples [01:15,  1.08 examples/s]
Generating train split: 100 examples [01:16,  1.08 examples/s]
Generating train split: 101 examples [01:17,  1.05s/ examples]
Generating train split: 102 examples [01:17,  1.18 examples/s]
Generating train split: 103 examples [01:18,  1.18 examples/s]
Generating train split: 104 examples [01:19,  1.04 examples/s]
Generating train split: 105 examples [01:20,  1.14 examples/s]
Generating train split: 106 examples [01:21,  1.02 examples/s]
Generating train split: 107 examples [01:23,  1.09s/ examples]
Generating train split: 108 examples [01:23,  1.14 examples/s]
Generating train split: 109 examples [01:23,  1.39 examples/s]
Generating train split: 110 examples [01:24,  1.61 examples/s]
Generating train split: 111 examples [01:24,  1.89 examples/s]
Generating train split: 112 examples [01:25,  1.55 examples/s]
Generating train split: 113 examples [01:26,  1.49 examples/s]
Generating train split: 114 examples [01:26,  1.69 examples/s]
Generating train split: 115 examples [01:27,  1.26 examples/s]
Generating train split: 116 examples [01:28,  1.62 examples/s]
Generating train split: 117 examples [01:28,  1.97 examples/s]
Generating train split: 118 examples [01:29,  1.64 examples/s]
Generating train split: 119 examples [01:30,  1.44 examples/s]
Generating train split: 120 examples [01:31,  1.15 examples/s]
Generating train split: 121 examples [01:31,  1.39 examples/s]
Generating train split: 122 examples [01:32,  1.54 examples/s]
Generating train split: 123 examples [01:32,  1.81 examples/s]
Generating train split: 124 examples [01:32,  2.07 examples/s]
Generating train split: 125 examples [01:33,  2.17 examples/s]
Generating train split: 126 examples [01:33,  2.29 examples/s]
Generating train split: 127 examples [01:34,  1.72 examples/s]
Generating train split: 128 examples [01:34,  2.08 examples/s]
Generating train split: 129 examples [01:35,  1.98 examples/s]
Generating train split: 130 examples [01:36,  1.35 examples/s]
Generating train split: 131 examples [01:37,  1.48 examples/s]
Generating train split: 132 examples [01:38,  1.38 examples/s]
Generating train split: 133 examples [01:38,  1.75 examples/s]
Generating train split: 134 examples [01:39,  1.56 examples/s]
Generating train split: 135 examples [01:39,  1.48 examples/s]
Generating train split: 136 examples [01:40,  1.74 examples/s]
Generating train split: 137 examples [01:40,  2.00 examples/s]
Generating train split: 138 examples [01:40,  2.31 examples/s]
Generating train split: 139 examples [01:41,  2.45 examples/s]
Generating train split: 140 examples [01:41,  2.80 examples/s]
Generating train split: 141 examples [01:42,  1.86 examples/s]
Generating train split: 142 examples [01:43,  1.55 examples/s]
Generating train split: 143 examples [01:43,  1.74 examples/s]
Generating train split: 144 examples [01:44,  1.50 examples/s]
Generating train split: 145 examples [01:45,  1.22 examples/s]
Generating train split: 146 examples [01:46,  1.33 examples/s]
Generating train split: 147 examples [01:46,  1.66 examples/s]
Generating train split: 148 examples [01:46,  1.98 examples/s]
Generating train split: 149 examples [01:47,  2.17 examples/s]
Generating train split: 150 examples [01:47,  2.28 examples/s]
Generating train split: 151 examples [01:47,  2.48 examples/s]
Generating train split: 152 examples [01:48,  2.70 examples/s]
Generating train split: 153 examples [01:48,  2.69 examples/s]
Generating train split: 154 examples [01:48,  2.75 examples/s]
Generating train split: 155 examples [01:49,  2.75 examples/s]
Generating train split: 156 examples [01:49,  2.83 examples/s]
Generating train split: 157 examples [01:50,  1.75 examples/s]
Generating train split: 158 examples [01:51,  1.40 examples/s]
Generating train split: 159 examples [01:52,  1.25 examples/s]
Generating train split: 160 examples [01:54,  1.05 examples/s]
Generating train split: 161 examples [01:56,  1.37s/ examples]
Generating train split: 162 examples [01:57,  1.29s/ examples]
Generating train split: 163 examples [01:57,  1.01s/ examples]
Generating train split: 164 examples [01:58,  1.05s/ examples]
Generating train split: 165 examples [01:59,  1.21 examples/s]
Generating train split: 166 examples [02:00,  1.25 examples/s]
Generating train split: 167 examples [02:00,  1.21 examples/s]
Generating train split: 168 examples [02:01,  1.32 examples/s]
Generating train split: 169 examples [02:02,  1.28 examples/s]
Generating train split: 170 examples [02:02,  1.53 examples/s]
Generating train split: 171 examples [02:03,  1.81 examples/s]
Generating train split: 172 examples [02:03,  2.12 examples/s]
Generating train split: 173 examples [02:03,  2.09 examples/s]
Generating train split: 174 examples [02:04,  2.05 examples/s]
Generating train split: 175 examples [02:05,  1.80 examples/s]
Generating train split: 176 examples [02:05,  1.82 examples/s]
Generating train split: 177 examples [02:05,  1.93 examples/s]
Generating train split: 178 examples [02:06,  2.25 examples/s]
Generating train split: 179 examples [02:06,  2.61 examples/s]
Generating train split: 180 examples [02:06,  2.72 examples/s]
Generating train split: 181 examples [02:07,  2.38 examples/s]
Generating train split: 182 examples [02:07,  2.50 examples/s]
Generating train split: 183 examples [02:07,  2.79 examples/s]
Generating train split: 184 examples [02:08,  2.64 examples/s]
Generating train split: 185 examples [02:08,  2.59 examples/s]
Generating train split: 186 examples [02:09,  2.61 examples/s]
Generating train split: 187 examples [02:10,  1.95 examples/s]
Generating train split: 188 examples [02:10,  2.25 examples/s]
Generating train split: 189 examples [02:10,  2.53 examples/s]
Generating train split: 190 examples [02:10,  2.75 examples/s]
Generating train split: 191 examples [02:11,  2.90 examples/s]
Generating train split: 192 examples [02:11,  3.22 examples/s]
Generating train split: 193 examples [02:11,  3.01 examples/s]
Generating train split: 194 examples [02:12,  2.61 examples/s]
Generating train split: 195 examples [02:12,  2.47 examples/s]
Generating train split: 196 examples [02:13,  2.58 examples/s]
Generating train split: 197 examples [02:13,  2.24 examples/s]
Generating train split: 198 examples [02:14,  2.24 examples/s]
Generating train split: 199 examples [02:14,  2.36 examples/s]
Generating train split: 200 examples [02:14,  2.54 examples/s]
Generating train split: 201 examples [02:15,  2.84 examples/s]
Generating train split: 202 examples [02:15,  2.01 examples/s]
Generating train split: 203 examples [02:17,  1.20 examples/s]
Generating train split: 204 examples [02:17,  1.46 examples/s]
Generating train split: 205 examples [02:18,  1.77 examples/s]
Generating train split: 206 examples [02:18,  1.79 examples/s]
Generating train split: 207 examples [02:19,  1.48 examples/s]
Generating train split: 208 examples [02:20,  1.15 examples/s]
Generating train split: 209 examples [02:22,  1.07 examples/s]
Generating train split: 210 examples [02:22,  1.36 examples/s]
Generating train split: 211 examples [02:22,  1.65 examples/s]
Generating train split: 212 examples [02:23,  1.84 examples/s]
Generating train split: 213 examples [02:23,  2.10 examples/s]
Generating train split: 214 examples [02:23,  2.27 examples/s]
Generating train split: 215 examples [02:23,  2.52 examples/s]
Generating train split: 216 examples [02:24,  2.31 examples/s]
Generating train split: 217 examples [02:25,  1.44 examples/s]
Generating train split: 218 examples [02:26,  1.32 examples/s]
Generating train split: 219 examples [02:27,  1.56 examples/s]
Generating train split: 220 examples [02:27,  1.59 examples/s]
Generating train split: 221 examples [02:29,  1.09 examples/s]
Generating train split: 222 examples [02:30,  1.11s/ examples]
Generating train split: 223 examples [02:31,  1.08 examples/s]
Generating train split: 224 examples [02:32,  1.08 examples/s]
Generating train split: 225 examples [02:33,  1.10 examples/s]
Generating train split: 226 examples [02:33,  1.18 examples/s]
Generating train split: 227 examples [02:34,  1.11 examples/s]
Generating train split: 228 examples [02:35,  1.39 examples/s]
Generating train split: 229 examples [02:35,  1.52 examples/s]
Generating train split: 230 examples [02:35,  1.79 examples/s]
Generating train split: 231 examples [02:36,  2.03 examples/s]
Generating train split: 232 examples [02:36,  2.45 examples/s]
Generating train split: 233 examples [02:36,  2.53 examples/s]
Generating train split: 234 examples [02:37,  2.70 examples/s]
Generating train split: 235 examples [02:37,  2.75 examples/s]
Generating train split: 236 examples [02:37,  2.99 examples/s]
Generating train split: 237 examples [02:39,  1.55 examples/s]
Generating train split: 238 examples [02:40,  1.28 examples/s]
Generating train split: 239 examples [02:42,  1.12s/ examples]
Generating train split: 240 examples [02:43,  1.05s/ examples]
Generating train split: 241 examples [02:44,  1.13s/ examples]
Generating train split: 242 examples [02:45,  1.21s/ examples]
Generating train split: 243 examples [02:48,  1.54s/ examples]
Generating train split: 244 examples [02:49,  1.46s/ examples]
Generating train split: 245 examples [02:49,  1.13s/ examples]
Generating train split: 246 examples [02:51,  1.18s/ examples]
Generating train split: 247 examples [02:51,  1.06s/ examples]
Generating train split: 248 examples [02:52,  1.00s/ examples]
Generating train split: 249 examples [02:54,  1.19s/ examples]
Generating train split: 250 examples [02:55,  1.04s/ examples]
Generating train split: 251 examples [02:56,  1.11s/ examples]
Generating train split: 252 examples [02:56,  1.14 examples/s]
Generating train split: 253 examples [02:57,  1.25 examples/s]
Generating train split: 254 examples [02:57,  1.58 examples/s]
Generating train split: 255 examples [02:58,  1.28 examples/s]
Generating train split: 256 examples [03:00,  1.01 examples/s]
Generating train split: 257 examples [03:01,  1.04s/ examples]
Generating train split: 258 examples [03:02,  1.02s/ examples]
Generating train split: 259 examples [03:03,  1.07s/ examples]
Generating train split: 260 examples [03:04,  1.03 examples/s]
Generating train split: 261 examples [03:05,  1.15s/ examples]
Generating train split: 262 examples [03:06,  1.17s/ examples]
Generating train split: 263 examples [03:08,  1.40s/ examples]
Generating train split: 264 examples [03:10,  1.36s/ examples]
Generating train split: 265 examples [03:10,  1.03s/ examples]
Generating train split: 266 examples [03:12,  1.22s/ examples]
Generating train split: 267 examples [03:13,  1.21s/ examples]
Generating train split: 268 examples [03:13,  1.04s/ examples]
Generating train split: 269 examples [03:14,  1.10 examples/s]
Generating train split: 270 examples [03:15,  1.04 examples/s]
Generating train split: 271 examples [03:15,  1.27 examples/s]
Generating train split: 272 examples [03:16,  1.44 examples/s]
Generating train split: 273 examples [03:16,  1.67 examples/s]
Generating train split: 274 examples [03:17,  1.87 examples/s]
Generating train split: 275 examples [03:17,  2.08 examples/s]
Generating train split: 276 examples [03:17,  2.47 examples/s]
Generating train split: 277 examples [03:18,  2.56 examples/s]
Generating train split: 278 examples [03:18,  2.71 examples/s]
Generating train split: 279 examples [03:19,  2.09 examples/s]
Generating train split: 280 examples [03:19,  1.82 examples/s]
Generating train split: 281 examples [03:20,  2.13 examples/s]
Generating train split: 282 examples [03:20,  2.15 examples/s]
Generating train split: 283 examples [03:21,  1.93 examples/s]
Generating train split: 284 examples [03:22,  1.62 examples/s]
Generating train split: 285 examples [03:23,  1.37 examples/s]
Generating train split: 286 examples [03:23,  1.57 examples/s]
Generating train split: 287 examples [03:24,  1.38 examples/s]
Generating train split: 288 examples [03:24,  1.54 examples/s]
Generating train split: 289 examples [03:25,  1.31 examples/s]
Generating train split: 290 examples [03:27,  1.10 examples/s]
Generating train split: 291 examples [03:27,  1.27 examples/s]
Generating train split: 292 examples [03:28,  1.22 examples/s]
Generating train split: 293 examples [03:29,  1.10 examples/s]
Generating train split: 294 examples [03:30,  1.08 examples/s]
Generating train split: 295 examples [03:31,  1.27 examples/s]
Generating train split: 296 examples [03:31,  1.29 examples/s]
Generating train split: 297 examples [03:32,  1.56 examples/s]
Generating train split: 298 examples [03:33,  1.42 examples/s]
Generating train split: 299 examples [03:33,  1.73 examples/s]
Generating train split: 300 examples [03:33,  1.94 examples/s]
Generating train split: 301 examples [03:34,  2.07 examples/s]
Generating train split: 302 examples [03:34,  2.37 examples/s]
Generating train split: 303 examples [03:34,  2.64 examples/s]
Generating train split: 304 examples [03:35,  2.64 examples/s]
Generating train split: 305 examples [03:35,  2.49 examples/s]
Generating train split: 306 examples [03:36,  2.23 examples/s]
Generating train split: 307 examples [03:36,  2.30 examples/s]
Generating train split: 308 examples [03:36,  2.48 examples/s]
Generating train split: 309 examples [03:37,  2.54 examples/s]
Generating train split: 310 examples [03:37,  2.73 examples/s]
Generating train split: 311 examples [03:37,  2.95 examples/s]
Generating train split: 312 examples [03:38,  3.07 examples/s]
Generating train split: 313 examples [03:38,  2.14 examples/s]
Generating train split: 314 examples [03:39,  1.58 examples/s]
Generating train split: 315 examples [03:40,  1.47 examples/s]
Generating train split: 316 examples [03:41,  1.47 examples/s]
Generating train split: 317 examples [03:41,  1.56 examples/s]
Generating train split: 318 examples [03:43,  1.27 examples/s]
Generating train split: 319 examples [03:44,  1.06 examples/s]
Generating train split: 320 examples [03:45,  1.08s/ examples]
Generating train split: 321 examples [03:47,  1.28s/ examples]
Generating train split: 322 examples [03:48,  1.20s/ examples]
Generating train split: 323 examples [03:49,  1.17s/ examples]
Generating train split: 324 examples [03:50,  1.05s/ examples]
Generating train split: 325 examples [03:51,  1.02s/ examples]
Generating train split: 326 examples [03:52,  1.15s/ examples]
Generating train split: 327 examples [03:54,  1.19s/ examples]
Generating train split: 328 examples [03:54,  1.01s/ examples]
Generating train split: 329 examples [03:55,  1.00 examples/s]
Generating train split: 330 examples [03:56,  1.09 examples/s]
Generating train split: 331 examples [03:57,  1.08 examples/s]
Generating train split: 332 examples [03:58,  1.14 examples/s]
Generating train split: 333 examples [03:58,  1.22 examples/s]
Generating train split: 334 examples [04:00,  1.04 examples/s]
Generating train split: 335 examples [04:02,  1.36s/ examples]
Generating train split: 336 examples [04:03,  1.24s/ examples]
Generating train split: 337 examples [04:04,  1.35s/ examples]
Generating train split: 338 examples [04:05,  1.18s/ examples]
Generating train split: 339 examples [04:06,  1.11s/ examples]
Generating train split: 340 examples [04:06,  1.12 examples/s]
Generating train split: 341 examples [04:08,  1.04 examples/s]
Generating train split: 342 examples [04:08,  1.23 examples/s]
Generating train split: 343 examples [04:08,  1.44 examples/s]
Generating train split: 344 examples [04:09,  1.33 examples/s]
Generating train split: 345 examples [04:11,  1.14 examples/s]
Generating train split: 346 examples [04:12,  1.05s/ examples]
Generating train split: 347 examples [04:13,  1.11 examples/s]
Generating train split: 348 examples [04:14,  1.01s/ examples]
Generating train split: 349 examples [04:15,  1.01s/ examples]
Generating train split: 350 examples [04:16,  1.10 examples/s]
Generating train split: 351 examples [04:16,  1.17 examples/s]
Generating train split: 352 examples [04:17,  1.27 examples/s]
Generating train split: 353 examples [04:17,  1.56 examples/s]
Generating train split: 354 examples [04:18,  1.77 examples/s]
Generating train split: 355 examples [04:18,  1.94 examples/s]
Generating train split: 356 examples [04:18,  2.14 examples/s]
Generating train split: 357 examples [04:19,  2.37 examples/s]
Generating train split: 358 examples [04:19,  2.27 examples/s]
Generating train split: 359 examples [04:20,  1.80 examples/s]
Generating train split: 360 examples [04:20,  1.95 examples/s]
Generating train split: 361 examples [04:21,  2.11 examples/s]
Generating train split: 362 examples [04:21,  2.03 examples/s]
Generating train split: 363 examples [04:22,  1.99 examples/s]
Generating train split: 364 examples [04:22,  2.19 examples/s]
Generating train split: 365 examples [04:23,  2.32 examples/s]
Generating train split: 366 examples [04:23,  2.73 examples/s]
Generating train split: 367 examples [04:23,  2.64 examples/s]
Generating train split: 368 examples [04:23,  2.94 examples/s]
Generating train split: 369 examples [04:24,  2.65 examples/s]
Generating train split: 370 examples [04:24,  2.83 examples/s]
Generating train split: 371 examples [04:24,  2.92 examples/s]
Generating train split: 372 examples [04:25,  2.97 examples/s]
Generating train split: 373 examples [04:25,  2.90 examples/s]
Generating train split: 374 examples [04:26,  2.84 examples/s]
Generating train split: 375 examples [04:26,  2.23 examples/s]
Generating train split: 376 examples [04:27,  1.95 examples/s]
Generating train split: 377 examples [04:27,  2.30 examples/s]
Generating train split: 378 examples [04:27,  2.58 examples/s]
Generating train split: 379 examples [04:28,  2.78 examples/s]
Generating train split: 380 examples [04:28,  2.60 examples/s]
Generating train split: 381 examples [04:29,  1.70 examples/s]
Generating train split: 382 examples [04:30,  1.37 examples/s]
Generating train split: 383 examples [04:30,  1.72 examples/s]
Generating train split: 384 examples [04:31,  1.60 examples/s]
Generating train split: 385 examples [04:32,  1.43 examples/s]
Generating train split: 386 examples [04:32,  1.75 examples/s]
Generating train split: 387 examples [04:33,  2.18 examples/s]
Generating train split: 388 examples [04:33,  1.89 examples/s]
Generating train split: 389 examples [04:34,  1.49 examples/s]
Generating train split: 390 examples [04:35,  1.73 examples/s]
Generating train split: 391 examples [04:35,  2.06 examples/s]
Generating train split: 392 examples [04:36,  1.77 examples/s]
Generating train split: 393 examples [04:36,  1.88 examples/s]
Generating train split: 394 examples [04:37,  1.70 examples/s]
Generating train split: 395 examples [04:38,  1.50 examples/s]
Generating train split: 396 examples [04:38,  1.84 examples/s]
Generating train split: 397 examples [04:39,  1.59 examples/s]
Generating train split: 398 examples [04:40,  1.41 examples/s]
Generating train split: 399 examples [04:41,  1.28 examples/s]
Generating train split: 400 examples [04:41,  1.57 examples/s]

Dataset cardiode downloaded and prepared to /dhc/home/ignacio.rodriguez/.cache/huggingface/datasets/cardiode/cardiode_bigbio_kb-c8bb5852e63a7387/1.0.0/4e2e3faad676e674c33287b8e6ccdf407e15c38c8c54bfef90e358667d61bf69. Subsequent calls will reuse this data.

  0%|          | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:00<00:00, 140.65it/s]
INFO:__main__:schema = KB
INFO:__main__:Checking global ID uniqueness
INFO:__main__:Found 117870 unique IDs
INFO:__main__:Gathering dataset statistics
INFO:__main__:Testing schema for: train
INFO:__main__:Checking if referenced IDs are properly mapped
INFO:__main__:KB ONLY: Checking passage offsets
INFO:__main__:KB ONLY: Checking entity offsets
INFO:__main__:KB ONLY: multi-label `db_id`
INFO:__main__:KB ONLY: Checking event offsets
INFO:__main__:KB ONLY: Checking coref offsets
INFO:__main__:KB ONLY: multi-label `type` fields
.
----------------------------------------------------------------------
Ran 1 test in 309.474s

OK
train
==========
id: 400
document_id: 400
passages: 96203
entities: 21267
normalized: 0
events: 0
coreferences: 0
relations: 0