Open choshin84 opened 4 years ago
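When choosing M data points with replacement from a universe of N data points, the chance that x_i is NOT chosen is ((N-1)/N)^M = (1 - 1/N)^M. When M = N, this converges to (1 - 1/N)^N ~ 1/e = 0.3678... as N grows. When the bootstrap is repeated, so that the total number of draws M >> N, the chance gets close to zero: for B repeats of size N it is (1 - 1/N)^(N*B) ~ e^(-B).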
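A quick way to sanity-check the estimate above is a small simulation. This is only a minimal sketch using the Python standard library; the helper name `oob_fraction`, the seed, and N = 10,000 are illustrative assumptions, not part of the original experiment.

```python
import math
import random


def oob_fraction(n, m, rng):
    """Fraction of n points never drawn in m draws with replacement.

    Hypothetical helper for illustration only.
    """
    chosen = {rng.randrange(n) for _ in range(m)}
    return 1 - len(chosen) / n


rng = random.Random(0)  # fixed seed (assumption) so reruns are repeatable
n = 10_000              # universe size N (assumption)

# One bootstrap sample of size M = N.
single = oob_fraction(n, n, rng)
print(f"simulated, M = N:   {single:.4f}")
print(f"(1 - 1/N)^N:        {(1 - 1 / n) ** n:.4f}")
print(f"1/e:                {math.exp(-1):.4f}")

# B repeated bootstraps of size N are the same as M = N * B total draws.
for b in (1, 2, 5, 10):
    simulated = oob_fraction(n, n * b, rng)
    print(f"B = {b:2d}: simulated {simulated:.5f}, e^-B = {math.exp(-b):.5f}")
```

The simulated fraction for M = N should land near 1/e, and the fraction of points never drawn should shrink roughly like e^(-B) as the number of repeats B grows.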
Tweet summary
On average about 1/e ~ 37% of the data will be OOB (out-of-bag) every time the bootstrap runs, so it is recommended to repeat it multiple times (>10) to make it very likely that nearly all of the data gets used.
Experiment code