EpistasisLab / pmlb

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.
https://epistasislab.github.io/pmlb/
MIT License
805 stars, 135 forks

fetch nearest datasets #100

Closed by lacava 4 years ago

lacava commented 4 years ago

Add the ability for a user to fetch the names of the nearest datasets, where the neighborhood is defined in summary-statistics space:

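def fetch_nearest_dataset_names(X, y, n=1):
    """Return a list of the n closest dataset names, based on summary stats."""
    raise NotImplementedError  # proposed interface only; body still to be written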
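
A minimal sketch of how such a lookup might work, for illustration only. It assumes a small in-memory catalog of summary statistics; the catalog entries, the choice of statistics (`n_samples`, `n_features`, `n_classes`), the log scaling, and the helper names `summarize` and `_to_vector` are all hypothetical and are not taken from PMLB or from the change that closed this issue.

```python
import math

# Hypothetical catalog of per-dataset summary statistics. The names and
# numbers are made up for illustration and do not come from PMLB.
CATALOG = {
    "toy_small_binary": {"n_samples": 100, "n_features": 4, "n_classes": 2},
    "toy_wide_binary": {"n_samples": 120, "n_features": 500, "n_classes": 2},
    "toy_large_multiclass": {"n_samples": 50000, "n_features": 20, "n_classes": 10},
}

# Assumed set of statistics that spans the "summary stats space".
STAT_KEYS = ("n_samples", "n_features", "n_classes")


def summarize(X, y):
    """Compute the same summary statistics for a user-supplied dataset."""
    return {
        "n_samples": len(X),
        "n_features": len(X[0]) if len(X) else 0,
        "n_classes": len(set(y)),
    }


def _to_vector(stats):
    # Log-scale the counts so that n_samples does not dominate the distance.
    return [math.log1p(stats[key]) for key in STAT_KEYS]


def fetch_nearest_dataset_names(X, y, n=1, catalog=CATALOG):
    """Return the names of the n catalog entries closest to (X, y)."""
    query = _to_vector(summarize(X, y))
    # Rank catalog entries by Euclidean distance in summary-stats space.
    ranked = sorted(
        catalog,
        key=lambda name: math.dist(query, _to_vector(catalog[name])),
    )
    return ranked[:n]


# Usage: a 90-row, 4-feature, 2-class dataset sits closest to the small toy entry.
X = [[0.1, 0.2, 0.3, 0.4] for _ in range(90)]
y = [0, 1] * 45
print(fetch_nearest_dataset_names(X, y, n=2))
```

Euclidean distance on log-scaled counts is just one plausible choice here; standardizing each statistic across the catalog, or using a different set of statistics, would change which datasets count as neighbors.
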
trangdata commented 4 years ago

Closed with #106. 🥳