rajatsen91 / CCIT

Classifier Conditional Independence Test: A CI test that uses a binary classifier (XGBoost) for CI testing

Bootstrap data is not being used #7

Open gaabrielfranco opened 2 years ago

gaabrielfranco commented 2 years ago

In the function XGBOUT2, you have the following code:

num_samp = len(all_samples)
if bootstrap:
    np.random.seed()
    random.seed()
    I = np.random.choice(num_samp, size=num_samp, replace=True)
    samples = all_samples[I, :]
else:
    samples = all_samples
Xtrain, Ytrain, Xtest, Ytest, CI_data = CI_sampler_conditional_kNN(
    all_samples[:, Xcoords],
    all_samples[:, Ycoords],
    all_samples[:, Zcoords],
    train_samp,
    k,
)

You create the variable samples when bootstrap is True, but when you call CI_sampler_conditional_kNN, you pass all_samples. In my understanding, you should pass samples here, so that the bootstrap resample is actually used. Am I right?
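To make the suggested fix concrete, here is a minimal sketch (my own helper, not the repo's code) of the intended behavior: the resampled array, not the original, should be what flows downstream when bootstrap is on.

```python
import numpy as np

def resample_bootstrap(all_samples, bootstrap=True, seed=None):
    """Return a bootstrap resample of the rows, or the original array.

    The resampled array is what should then be sliced into X/Y/Z
    coordinates and handed to CI_sampler_conditional_kNN.
    """
    num_samp = len(all_samples)
    if bootstrap:
        rng = np.random.default_rng(seed)
        idx = rng.choice(num_samp, size=num_samp, replace=True)
        return all_samples[idx, :]  # rows drawn with replacement
    return all_samples
```

With this, the downstream call would use `samples[:, Xcoords]`, `samples[:, Ycoords]`, `samples[:, Zcoords]` instead of slicing `all_samples` directly.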

BTW, this is an excellent paper!

pshvechikov commented 2 years ago

Second that. The next lines also look strange to me: why does the code use a classifier with custom hyperparameters or with the defaults, depending on the dimension of Xtrain?

    if bootstrap:
        np.random.seed()
        random.seed()
        I = np.random.choice(num_samp,size = num_samp, replace = True)
        samples = all_samples[I,:]
    else:
        samples = all_samples
    Xtrain,Ytrain,Xtest,Ytest,CI_data = CI_sampler_conditional_kNN(all_samples[:,Xcoords],all_samples[:,Ycoords], None,train_samp,k)
    s1,s2 = Xtrain.shape
    if s2 >= 4:
        model = xgb.XGBClassifier(nthread=nthread,learning_rate =0.02, n_estimators=bp['n_estimator'], max_depth=bp['max_depth'],min_child_weight=1, gamma=0, subsample=0.8, colsample_bytree=bp['colsample_bytree'],objective= 'binary:logistic',scale_pos_weight=1, seed=11)
    else:
        model = xgb.XGBClassifier()
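To isolate the branching being questioned, here is a minimal sketch (my own helper, not part of the repo; `make_classifier_params` and `min_tuned_dim` are hypothetical names for the hard-coded threshold of 4 in the quoted code). It shows that below the threshold every tuned parameter, including the `bp` values, is silently dropped in favor of the XGBoost defaults.

```python
def make_classifier_params(n_features, bp, nthread=1, min_tuned_dim=4):
    # Mirrors the quoted branching: the tuned hyperparameters are applied
    # only when the feature dimension reaches `min_tuned_dim`; below that,
    # an empty dict means XGBClassifier() falls back to its defaults.
    if n_features >= min_tuned_dim:
        return dict(
            nthread=nthread,
            learning_rate=0.02,
            n_estimators=bp["n_estimator"],
            max_depth=bp["max_depth"],
            min_child_weight=1,
            gamma=0,
            subsample=0.8,
            colsample_bytree=bp["colsample_bytree"],
            objective="binary:logistic",
            scale_pos_weight=1,
            seed=11,
        )
    return {}  # XGBClassifier(**{}) uses the library defaults

# model = xgb.XGBClassifier(**make_classifier_params(Xtrain.shape[1], bp))
```

If the threshold is intentional (e.g. the tuned settings overfit on very low-dimensional inputs), a comment in the source explaining it would help; otherwise applying the same parameters unconditionally seems more consistent.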