Closed by dmpe 8 years ago
https://github.com/paper82/pylometry/tree/master/features (all in Python)
http://scikit-learn.org/stable/auto_examples/hetero_feature_union.html
http://scikit-learn.org/stable/modules/pipeline.html#feature-union
http://scikit-learn.org/stable/modules/generated/sklearn.pipeline.FeatureUnion.html#sklearn.pipeline.FeatureUnion
We use the 20-newsgroups dataset and compute standard bag-of-words features for the subject line and body in separate pipelines as well as ad hoc features on the body. We combine them (with weights) using a FeatureUnion and finally train a classifier on the combined set of features.
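A minimal sketch of that setup follows. It is not the code from the linked example: the four toy posts stand in for 20-newsgroups so the snippet runs offline, the helper classes `KeySelector` and `BodyStats` are hypothetical names made up here, and the weights are arbitrary placeholders rather than tuned values.

```python
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.feature_extraction import DictVectorizer
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import FeatureUnion, Pipeline
from sklearn.svm import LinearSVC

# Toy stand-in for 20-newsgroups posts (assumption: already split into subject/body).
posts = [
    {"subject": "Shuttle launch delayed", "body": "NASA moved the launch window. The orbit plan is unchanged."},
    {"subject": "Orbit question", "body": "How is a geostationary orbit computed. Any references."},
    {"subject": "Goalie stats", "body": "The goalie stopped forty shots. Great hockey game."},
    {"subject": "Playoff schedule", "body": "The hockey playoffs start next week. Tickets are sold out."},
]
labels = ["space", "space", "hockey", "hockey"]


class KeySelector(BaseEstimator, TransformerMixin):
    """Hypothetical helper: pick one text field out of each post dict."""

    def __init__(self, key):
        self.key = key

    def fit(self, X, y=None):
        return self

    def transform(self, X):
        return [doc[self.key] for doc in X]


class BodyStats(BaseEstimator, TransformerMixin):
    """Hypothetical helper: ad hoc per-body features as dicts."""

    def fit(self, X, y=None):
        return self

    def transform(self, X):
        return [{"length": len(text), "num_sentences": text.count(".")} for text in X]


# Each branch is its own pipeline; FeatureUnion stacks their outputs column-wise.
features = FeatureUnion(
    transformer_list=[
        ("subject", Pipeline([("select", KeySelector("subject")), ("tfidf", TfidfVectorizer())])),
        ("body_bow", Pipeline([("select", KeySelector("body")), ("tfidf", TfidfVectorizer())])),
        ("body_stats", Pipeline([("select", KeySelector("body")), ("stats", BodyStats()), ("vect", DictVectorizer())])),
    ],
    # Placeholder weights: each branch's output is multiplied by its weight.
    transformer_weights={"subject": 0.8, "body_bow": 1.0, "body_stats": 0.5},
)

# Train a classifier on the combined feature matrix.
model = Pipeline([("union", features), ("clf", LinearSVC())])
model.fit(posts, labels)
predictions = model.predict(posts)
combined = model.named_steps["union"].transform(posts)
```

`combined` has one row per post and as many columns as the three branches produce together. For real use, the weights and vectorizer settings would normally be searched with cross-validation on held-out data rather than fixed by hand.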
http://scikit-learn.org/stable/auto_examples/model_selection/grid_search_digits.html#example-model-selection-grid-search-digits-py
http://scikit-learn.org/stable/auto_examples/neural_networks/plot_rbm_logistic_classification.html
http://scikit-learn.org/stable/auto_examples/classification/plot_classifier_comparison.html
https://www.quantstart.com/articles/Using-Cross-Validation-to-Optimise-a-Machine-Learning-Method-The-Regression-Setting
http://scikit-learn.org/stable/auto_examples/feature_selection/plot_permutation_test_for_classification.html
https://github.com/jakevdp/sklearn_pycon2015/blob/master/notebooks/03.1-Classification-SVMs.ipynb