mlr-org / mlr3pipelines

Dataflow Programming for Machine Learning in R
https://mlr3pipelines.mlr-org.com/
GNU Lesser General Public License v3.0
138 stars 25 forks source link

Use DataBackend info to avoid unnecessary data comparison in #271 #387

Open mb706 opened 4 years ago

mb706 commented 4 years ago

271 solution currently needs to compare data to make sure clashing columns are not different. Once https://github.com/mlr-org/mlr3/issues/476 has an answer we can speed up this test.

mb706 commented 3 years ago

Depends on https://github.com/mlr-org/mlr3/pull/703

mb706 commented 2 months ago

note that different hashes could still be the same data if only a subset of rows are selecte