jbonda / ml-workflow-pipeline

The workflow of the CS-600 (Professional Software Development) project. Project title: GUI pipeline for ML models.

[BUG] Copies of CSV files raise different name/number error. #37

Open raelewis opened 1 year ago

raelewis commented 1 year ago

Is there an existing issue for this?

Current Behavior

Uploading a zip file containing multiple CSV files (all of which are copies of one another) raises an error stating that the names/number of columns differ between the files.

The error generated is as follows: Names/number of columns in the uploaded files(s) does not match in the ZIP archive.

Expected Behavior

We believe the software should accept these CSV files without raising this error. The files have identical column names and the same number of columns, so the column check should pass.
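For reference, the expected behavior can be sketched as a header comparison across the CSV members of the archive. This is only an illustration and not the project's actual validation code: the function names `read_header` and `headers_match` are hypothetical, and skipping directory entries and non-CSV members is an assumption about what the check should ignore.

```python
import csv
import io
import zipfile


def read_header(archive, member):
    """Return the first row of a CSV member as a list of column names."""
    with archive.open(member) as raw:
        # utf-8-sig drops a leading byte-order mark if one is present
        text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
        return next(csv.reader(text), [])


def headers_match(zip_source):
    """Return True if every CSV in the archive has the same header row."""
    with zipfile.ZipFile(zip_source) as archive:
        # Assumption: directory entries and non-CSV members are ignored
        members = [
            info.filename
            for info in archive.infolist()
            if not info.is_dir() and info.filename.lower().endswith(".csv")
        ]
        headers = [read_header(archive, name) for name in members]
    return all(header == headers[0] for header in headers[1:])
```

Under this sketch, an archive of byte-identical copies always passes, since every member yields the same header row.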

Steps To Reproduce

  1. Select a CSV file that the webpage accepts when uploaded on its own.
  2. Create multiple copies of this CSV file in a directory.
  3. Zip this directory.
  4. Upload the resulting zip archive to the webpage.

I've included a zip archive containing the original and 2 copies of the same CSV file: wine_csv_copies.zip
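For anyone who wants to regenerate a similar archive from their own data, steps 1 to 3 can be scripted roughly as follows. This is a minimal sketch: the helper name `build_copies_zip`, the folder name `csv_copies`, and the `_copyN` file naming are all made up for illustration and are not taken from the attached archive.

```python
import shutil
import tempfile
import zipfile
from pathlib import Path


def build_copies_zip(source_csv, out_zip, copies=2):
    """Zip a directory holding source_csv plus `copies` duplicates of it."""
    source_csv = Path(source_csv)
    with tempfile.TemporaryDirectory() as tmp:
        folder = Path(tmp) / "csv_copies"  # hypothetical folder name
        folder.mkdir()
        # Step 1: the original file
        shutil.copy(source_csv, folder / source_csv.name)
        # Step 2: byte-identical copies under different names
        for i in range(1, copies + 1):
            shutil.copy(source_csv, folder / f"{source_csv.stem}_copy{i}.csv")
        # Step 3: zip the directory, keeping the folder as a path prefix
        with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as archive:
            for path in sorted(folder.iterdir()):
                archive.write(path, arcname=f"{folder.name}/{path.name}")
    return Path(out_zip)
```

Calling `build_copies_zip("wine.csv", "copies.zip")` would produce an archive with three identical CSV files, which can then be uploaded as in step 4.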

Environment

No response

Anything else?

No response

TechSolomon commented 1 year ago

Thanks for reporting this issue and providing the relevant replication dataset – we are currently in the process of clarifying how to handle file-to-file duplication as compared to row-to-row values.