Code duplication? - Githubissues

rhiever / sklearn-benchmarks

A centralized repository to report scikit-learn model performance across a variety of parameter settings and data sets.

MIT License

210 stars 54 forks source link

Closed amueller closed 8 years ago

amueller commented 8 years ago

Maybe a silly question but why do you have a separate .py file for each classifier? The code is the same for all, right?

Also, have you checked out skll?

rhiever commented 8 years ago

It was easier to run them on the HPCC that way. :-)

for dataset in [`datasets`]:
    for model in [`models.py` ...]:
        # benchmark model on dataset

amueller commented 8 years ago

ok but you could still just have all the stuff in a single function and call that in each file ;)