Closed luisquintanilla closed 4 years ago
I think your error was that you chose col index of 4 when there's only 2 columns in the dataset (but yes that should have friendlier error).
Keep existing: • If a user selects a dataset that doesn’t exist: File or Directory does not exist: dataset.txt • If a user selects a dataset that has the right extension but is not formatted correctly: One or more errors occurred. (Unable to split the file provided into multiple, consistent columns.)
Modify existing: • For --label-col (name), if user selects column “X” that does not exist in the dataset: One or more errors occurred. (Specified label column 'X' does not exist in the dataset.) • For --ignore-cols, if user selects column “X” that does not exist in the dataset: One or more errors occurred. (Specified column “X” does not exist in the dataset.) • For --ignore-cols, if user selects column index X that does not exist in the dataset: One or more errors occurred. (Specified column index X is out of range. Must be non-negative and less than the size of the collection.) • If user selects --output and folder name has incorrect characters / syntax (e.g. --output ?dataset), user gets error before training starts (e.g. training will not start until they fix output folder name): One or more errors occurred. (Specified output folder name syntax is incorrect.)
New: • For classification and regression, if user selects a tabular file that is not a .txt, .tsv, or .csv for their training, testing, or validation dataset. One or more errors occurred. (File type not supported. File must be .csv, .tsv, or .txt format.) • For image classification, if user selects folder without sub-folders of images: One or more errors occurred. (Data is not in correct format. Selected folder should have labelled sub-directories containing images for classification.) • For --label-col, if user selects column index X that is out of bounds in the dataset: One or more errors occurred. (Specified label column index X is out of range. Must be non-negative and less than the size of the collection.) • If user indicates --has-header false, and they try to input a column name (instead of an index): One or more errors occurred. (You specified --has-header false and inputted a column name for --label-col. Please input a column index or change to --has-header true). • If user tries to train on dataset X that is open in Excel, Notepad, etc.: One or more errors occurred. (Your dataset X is being used by another process. Please close and try training again.)
System Information (please complete the following information):
Describe the bug
When trying to run classification on a file with no header, not specifying has-header value and providing the column index displays the whole stack trace. It would be nice to display a more friendly error instead of the stack trace. Add the stack trace to the log and point the user to it if they want more details.
Dataset:
Command:
Stack Trace / Error Message: