Closed alirezag closed 2 months ago
The tutorial here suggest we can set source to 'txt' and use 'data_files' key to load local files. But after I get an error saying the dataset does not exist on hugging face. Still fater creaing a dummy HF dataset it still doesn't work.
Using the component directly in code works:
# Load in tokenizer tokenizer = ... dataset = text_completion_dataset( tokenizer, source="txt", data_files="path/to/my_data.txt", )
Hi @alirezag, can you share the exact error you are getting? And is this from specifying a local dataset in the config or in code?
The tutorial here suggest we can set source to 'txt' and use 'data_files' key to load local files. But after I get an error saying the dataset does not exist on hugging face. Still fater creaing a dummy HF dataset it still doesn't work.
Using the component directly in code works: