VowpalWabbit / coba

Contextual bandit benchmarking
https://coba-docs.readthedocs.io/
BSD 3-Clause "New" or "Revised" License

OpenML ARFF Handling #4

Closed. anrath closed 3 years ago

anrath commented 3 years ago

This pull request extends OpenmlSource and OpenmlSourceTests, which previously handled CSV data only, so that they also handle ARFF data. To test dense (non-sparse) ARFF files, use test_default_classification from OpenmlArffSource_Tests in test_simulations.py.

To test sparse ARFF data, use test_sparse_classification from OpenmlArffSource_Tests in test_simulations.py. Note that the sparse-data implementation is currently inefficient and will need optimization.
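For reviewers unfamiliar with the difference between the two row layouts, here is a minimal sketch of how a dense and a sparse ARFF data row differ. It is not the code in this pull request: the function name `parse_arff_data_line` and its parameters are hypothetical, and it assumes values contain no quoted commas. In the ARFF format a sparse row is written as `{index value, index value, ...}` with 0-based indexes, and any omitted attribute has the value 0.

```python
from typing import List


def parse_arff_data_line(line: str, n_attributes: int) -> List[str]:
    """Parse one ARFF @data row (dense or sparse) into a dense list of strings.

    Hypothetical helper for illustration only; it does not handle quoted
    values that contain commas or spaces.
    """
    line = line.strip()

    if line.startswith("{") and line.endswith("}"):
        # Sparse row: every attribute not listed is 0.
        row = ["0"] * n_attributes
        body = line[1:-1].strip()
        if body:
            for pair in body.split(","):
                # Each pair is "<0-based index> <value>".
                index, value = pair.strip().split(None, 1)
                row[int(index)] = value.strip()
        return row

    # Dense row: one comma-separated value per attribute.
    return [value.strip() for value in line.split(",")]


dense = parse_arff_data_line("1, 0, 0, cat", 4)
sparse = parse_arff_data_line("{0 1, 3 cat}", 4)
```

Both calls above describe the same row. Expanding every sparse row to a full list like this is simple but allocates memory for every attribute, which is one way a sparse path can end up inefficient; storing only the listed index/value pairs would avoid that.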

Classes created in data/filters.py to handle ARFF data: