Is it possible to extract the remaining basis function from the algorithm for further calculation?

scikit-learn-contrib / py-earth

A Python implementation of Jerome Friedman's Multivariate Adaptive Regression Splines

BSD 3-Clause "New" or "Revised" License

458 stars 121 forks source link

@kwchau It is possible but you have to do some manipulation of basis_. Something like the below code may get at what you are looking for. Wrote the snippet from memory so it may not get you all the way there but should get you pretty close.

varnames = []
pruned = []
coefs = []
i = 0

for bf in your_model.basis_:
    varnames.append(str(bf))
    pruned.append(bf.is_pruned())

    if bf.is_pruned() is False:
        coefs.append(your_model.coef_[0, i])
        i = i+1

    #Zero fill pruned coefs just to put something
    if bf.is_pruned() is not False:
        coefs.append(0)

#Construct Dataframe
summ = pd.DataFrame({'Variable' : varnames, 'Pruned': pruned, 'Coefficient': coefs})

#Remove Pruned Variables
summ = summ.loc[summ['Pruned'] == False]  

#Clean up variable name by removing h() 
summ['Variable'].replace("h" + r"\(",'', regex = True, inplace = True).replace(r"\)", '', regex = True, inplace = True)

scikit-learn-contrib / py-earth

Is it possible to extract the remaining basis function from the algorithm for further calculation? #209