Closed samterfa closed 1 year ago
Perhaps splitting the function into two -
the current load_dataset function should cache any downloaded datasets I think?
Would add that the hf_load_dataset() function was built too much around the 'emotions' dataset, and many datasets have a different structure. Do we try to cater for all datasets stored on The Hub?
This code should ideally work based on this documentation:
hf_load_dataset("csv", data_files = "iris.csv")
. However, it fails on this line of datasets.R:dataset_base <- reticulate::py$load_dataset(dataset)
. Movingdataset_base <- reticulate::py$load_dataset(dataset)
further down in the code to wheredataset_base
is used would fix this issue but I wasn't sure if it's placement was important for something else or if other refactoring would make sense.