alegonz / baikal

A graph-based functional API for building complex scikit-learn pipelines.
https://baikal.readthedocs.io
BSD 3-Clause "New" or "Revised" License
592 stars 30 forks source link

[ENH] fit_compute_func API #22

Closed alegonz closed 4 years ago

alegonz commented 4 years ago

The motivation follows the rationale in Issue #16:

  1. Make custom fitting protocols, such as the common stacking protocol that uses out-of-fold predictions in the first level, possible. (The current stacked classifier example is a naive example that does not use OOF predictions and thus the second level classifier is prone to prefer an overfitted classifier from the first level).
  2. Allow the use of transductive estimators (e.g. sklearn.manifold.TSNE, sklearn.cluster.AgglomerativeClustering).
  3. Leverage estimators that implement a fit_transform more efficient than calling fit and transform separately.
codecov-io commented 4 years ago

Codecov Report

Merging #22 into master will increase coverage by 0.03%. The diff coverage is 98.76%.

Impacted file tree graph

@@            Coverage Diff            @@
##           master     #22      +/-   ##
=========================================
+ Coverage   97.87%   97.9%   +0.03%     
=========================================
  Files          13      13              
  Lines         705     764      +59     
  Branches      130     142      +12     
=========================================
+ Hits          690     748      +58     
  Misses         11      11              
- Partials        4       5       +1
Impacted Files Coverage Δ
baikal/steps/expression.py 100% <ø> (ø) :arrow_up:
baikal/_core/step.py 100% <100%> (ø) :arrow_up:
baikal/steps/factory.py 100% <100%> (ø) :arrow_up:
baikal/_core/model.py 99.59% <96.29%> (-0.41%) :arrow_down:

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update 8b1efd9...1704881. Read the comment docs.