Closed flckv closed 1 year ago
Hi @flckv, thanks for raising an issue!
The error messages are telling you what the issues are.
The feature audio
isn't in the csv. The csv has two column names: train
and label
. You should either update the csv to have audio
as a column name, or passing in --audio_column_name train
when you run the script
The dataset created is a DatasetDict
with DatasetDict
objects as its keys rather than the expected Dataset
instance. This should be resolved by doing:
data_files = {'train': 'train/train.csv', 'test': 'test/test.csv', 'valid': 'valid/valid.csv'}
raw_datasets = load_dataset("s/data/s/s", data_files=data_files)
For further questions about how to customise a script, please ask in our forums. We try to reserve the github issues for feature requests and bug reports.
Thank you, @amyeroberts
System Info
transformers
version: 4.30.0.dev0Versions of relevant libraries: [pip3] numpy==1.24.3 [pip3] torch==2.0.1 [pip3] torchaudio==2.0.2 [conda] numpy 1.24.3 pypi_0 pypi [conda] torch 2.0.1 pypi_0 pypi [conda] torchaudio 2.0.2 pypi_0 pypi
Who can help?
@sanchit-gandhi @sgugger @albertvillanova
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
I want to run this model but not on superb dataset: https://github.com/huggingface/transformers/blob/main/examples/pytorch/audio-classification/README.md
I want to load a dataset from local:
with command: I don't specify the superb dataset:
run_audio_classification.py
script to load audio from csv file:3.1 I specify the location of the csv files :
with:
It seems that loading the csv files is successful. I get message: "Dataset csv downloaded and prepared ".
But these are the errors:
I comment out lines 262 -274
Then I receive error:
(I am loading it locally because I have not received a reply on how to load private hub datasets when I raised the issue: https://github.com/huggingface/datasets/issues/5930 ) @albertvillanova
Expected behavior
I want to be able to run the official example script run_audio_classification.py instead of predefined dataset superb, but on my own local dataset to train the model on my dataset.