help wanted - GitHub Issues

CYJnrclw commented 3 days ago

Why can't the data processing part of this code run?

TBJr commented 1 day ago

Issue Summary:

The data processing part of the project was failing due to a mismatch between the column names in the dataset and the expected feature names in the code. Specifically, the CSV file used MTML_* column names, while the code expected PSML_* feature names.

Solution:

I have resolved this issue in two ways:

Code Modification: Updated the datagen.py script to automatically map MTML_* columns to PSML_*, ensuring that the existing dataset can be used without modification.
CSV Column Rename: Alternatively, I created a modified version of the CSV file (Suturing_S02_T01_renamed.csv) that renames all MTML_* columns to PSML_*, matching the expected format in the code.
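
For readers who want a picture of the first option, here is a minimal sketch of a prefix mapping. It is not the actual change made to datagen.py, which is not shown in this thread. The function name `rename_prefix` and the sample column names are hypothetical; only the MTML_ and PSML_ prefixes come from the discussion above.

```python
def rename_prefix(columns, old="MTML_", new="PSML_"):
    """Return the column names with a leading `old` prefix swapped for `new`.

    Names that do not start with `old` are returned unchanged.
    """
    return [new + c[len(old):] if c.startswith(old) else c for c in columns]


# Hypothetical column names, for illustration only.
cols = ["MTML_position_x", "MTML_gripper_angle", "label"]
renamed = rename_prefix(cols)
print(renamed)
```

If the data is loaded with pandas, the same list can be assigned back with `df.columns = rename_prefix(df.columns)`, assuming `df` is the DataFrame read from the CSV.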

Steps to Resolve:

Use the Code Modification:
- I added a mapping function in datagen.py that transforms the MTML_* feature names to PSML_*. You can integrate this function to handle datasets with different column name prefixes.
Use the Renamed CSV:
- I generated a renamed CSV file whose column names match the feature names the script expects. This file (Suturing_S02_T01_renamed.csv) is available and can be used for immediate processing without code changes.
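
A renamed CSV like the one described above could be produced along these lines. This is a sketch under assumptions, not the script that was actually used: the function name `write_renamed_csv` is hypothetical, and it assumes the column names sit in the first row of the file.

```python
import csv


def write_renamed_csv(src_path, dst_path, old="MTML_", new="PSML_"):
    """Copy a CSV, replacing a leading `old` prefix in header names with `new`."""
    with open(src_path, newline="") as src, open(dst_path, "w", newline="") as dst:
        reader = csv.reader(src)
        writer = csv.writer(dst)
        header = next(reader, None)  # None if the source file is empty
        if header is None:
            return
        writer.writerow(
            [new + h[len(old):] if h.startswith(old) else h for h in header]
        )
        # Data rows are copied through unchanged.
        writer.writerows(reader)
```

For example, `write_renamed_csv("Suturing_S02_T01.csv", "Suturing_S02_T01_renamed.csv")` would write the renamed copy, assuming the original file carries that name.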

Let me know which approach works best for your setup! I would also be glad to open a pull request if you would like.

keshara2032 commented 23 hours ago

@TBJr Thanks a lot for suggesting this. We understand that our data preprocessing code expects all the raw data to be in a very specific format, which is not always practical. I also made another branch called "gesture", which has some more information and a sample of the CSV structure after preprocessing.
Thanks a lot for this change, and please feel free to create a PR. I will merge it. Thanks!

CYJnrclw commented 18 minutes ago

thanks

UVA-DSA / MTRSAP

help wanted #1
