Remove prop_samples from calc_confidence_interval

neulab / ExplainaBoard

Interpretable Evaluation for AI Systems

MIT License

359 stars 36 forks source link

The sampling part of the procedure of Bootstrapping can be summarized at high level if I understand it correctly:

Resample each sample in the given data.
Repeat Step 1 several times

where the number of random samples generated in Step 1 should be same as the number of samples in the data. (The similar description is found in scipy.stats.bootstrap)

However, it seems calc_confidence_interval implements differently, which raises a concern for the correctness of the algorithm. I could be wrong, but it seems better to prefer the correctness over the efficiency of resampling given that computed values with this library could be used in someone's research. With this in mind, this PR removes prop_samples from calc_confidence_interval.

neulab / ExplainaBoard

Remove prop_samples from calc_confidence_interval #502