rhiever / sklearn-benchmarks

A centralized repository to report scikit-learn model performance across a variety of parameter settings and data sets.
MIT License

Code duplication? #15

Closed amueller closed 8 years ago

amueller commented 8 years ago

Maybe a silly question but why do you have a separate .py file for each classifier? The code is the same for all, right?

Also, have you checked out skll?

rhiever commented 8 years ago

It was easier to run them on the HPCC that way. :-)

for dataset in [`datasets`]:
    for model in [`models.py` ...]:
        # benchmark model on dataset

amueller commented 8 years ago

OK, but you could still put all the shared code in a single function and call that from each file ;)
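
A minimal sketch of that suggestion, not the repository's actual code: one shared helper holds the benchmarking loop, and each per-classifier file only builds its inputs and calls it. The names `run_benchmark`, `benchmark_common.py`, and `MajorityClassifier` are hypothetical. The stand-in classifier is there only so the example runs without scikit-learn; any object with `fit(X, y)` and `score(X, y)` would do.

```python
# benchmark_common.py (hypothetical shared module)
from itertools import product


def run_benchmark(make_model, param_grid, datasets):
    """Fit and score one model family on every dataset and parameter setting.

    make_model: callable returning an object with fit(X, y) and score(X, y),
                e.g. a scikit-learn estimator class.
    param_grid: dict mapping parameter name -> list of values to try.
    datasets:   dict mapping dataset name -> (X_train, y_train, X_test, y_test).
    """
    names = sorted(param_grid)
    results = []
    for dataset_name, (X_train, y_train, X_test, y_test) in datasets.items():
        # Try every combination of the listed parameter values.
        for values in product(*(param_grid[n] for n in names)):
            params = dict(zip(names, values))
            model = make_model(**params)
            model.fit(X_train, y_train)
            results.append((dataset_name, params, model.score(X_test, y_test)))
    return results


class MajorityClassifier:
    """Stand-in model (assumption) so the sketch runs without scikit-learn."""

    def __init__(self, fallback=0):
        self.fallback = fallback

    def fit(self, X, y):
        labels = list(y)
        # Predict the most frequent training label, or the fallback if empty.
        self.label_ = max(set(labels), key=labels.count) if labels else self.fallback
        return self

    def score(self, X, y):
        labels = list(y)
        return sum(label == self.label_ for label in labels) / len(labels)


# What a per-classifier file would shrink to: build the inputs, call the helper.
toy_datasets = {"toy": ([[0], [1], [2]], [1, 1, 0], [[3], [4]], [1, 0])}
results = run_benchmark(MajorityClassifier, {"fallback": [0, 1]}, toy_datasets)
```

With this layout, keeping one file per classifier for the HPCC still works: each file would pass its own estimator class (for example `sklearn.ensemble.RandomForestClassifier`) and parameter grid to the shared function instead of repeating the loop.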