I am working on a capstone project that fits the item-based kNN on a custom Amazon appliance 100K dataset. I wanted to get the cross-validation metrics for this dataset, however, I am getting wildly incorrect results. To make sure my code wasn't a mistake, I ran the built-in MovieLens 100k dataset into my function and it returned valid results.
Description
I am working on a capstone project that fits the item-based kNN on a custom Amazon appliance 100K dataset. I wanted to get the cross-validation metrics for this dataset, however, I am getting wildly incorrect results. To make sure my code wasn't a mistake, I ran the built-in MovieLens 100k dataset into my function and it returned valid results.
I've attached the datasets for your reference. amazon_appliance_100k.csv ml_100k.csv
Steps/Code to Reproduce
Here is the code to run and cross-validate a custom dataset on google collab:
Expected Results
My expected results should be similar to this:![ML_100k_results](https://user-images.githubusercontent.com/46816574/219120920-5f099a14-20a9-4443-b38a-719e561f4ace.jpg)
Actual Results
Here are my actual results:![amazon_100k_result](https://user-images.githubusercontent.com/46816574/219121140-725d2547-b74d-4c5d-b0de-5c85ddabd6bb.jpg)
Versions