biolab / orange3-associate

🍊 :package: Frequent itemsets and association rules mining for Orange 3.

Worthless Erroneous Results from uploaded CSV file #62

Closed M4x28 closed 6 months ago

M4x28 commented 6 months ago
Associate version

1.3.0

Orange version

3.36.2

Expected behavior

Frequent Itemsets and Association Rules Analyses

Actual behavior

worthless erroneous results

Steps to reproduce the behavior

1) Load data into CSV File Import
2) Connect to Frequent Itemsets and try analysis
3) Connect to Association Rules and try analysis

Additional info (worksheets, data, screenshots, ...)

Orange Schema (image)
Frequent Itemsets (image)
Association Rules (image)
Data Table (image)
CSV data, pivot-like Market Basket dataset: pivot.csv
CSV data, like the Foodmart 2000 dataset: dataset.csv

janezd commented 6 months ago

Frequent Itemsets and Association Rules must be run on basket data. Open the Datasets widget, find Foodmart, and try it.

Data in attribute-value form (like yours) should be used only when we don't want to treat a "0" value as special. On such data, the widgets also find relations like "didn't buy bread -> didn't buy ham", and there are naturally many more of those. This is by design.
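To illustrate the difference, here is a minimal sketch in plain Python (not the add-on's own code). The rows and the helper names `attribute_value_items` and `basket_items` are made up for illustration. In attribute-value form every column contributes an item, zeros included; in basket form only the products that were actually bought do.

```python
# Three hypothetical purchases in a 0/1 pivot layout.
rows = [
    {"bread": 1, "ham": 0, "milk": 0},
    {"bread": 0, "ham": 0, "milk": 1},
    {"bread": 1, "ham": 1, "milk": 0},
]

def attribute_value_items(row):
    # Every column=value pair is an item, so "ham=0" counts too.
    return {f"{name}={value}" for name, value in row.items()}

def basket_items(row):
    # Only products with a non-zero value are items.
    return {name for name, value in row.items() if value != 0}

av = [attribute_value_items(r) for r in rows]
basket = [basket_items(r) for r in rows]

for a, b in zip(av, basket):
    print(sorted(a), "vs", sorted(b))
```

With many product columns, nearly all items in the first encoding are "=0" items, which is where the flood of "didn't buy X" itemsets and rules comes from.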

In your case, 0 is a special value, so the data must be given like this: https://file.biolab.si/datasets/foodmart.basket.
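A pivot-style CSV can be turned into that layout with a few lines of Python. This is only a sketch under assumptions: it assumes each basket line is one transaction written as comma-separated items, optionally as `item=quantity`, so please check the output against the linked foodmart.basket file. It also assumes the CSV has a header row of product names and integer counts in the cells. The function name `pivot_to_basket` and the sample columns are hypothetical.

```python
import csv
import io

def pivot_to_basket(csv_text):
    """Turn pivot-style CSV text into basket-style lines, one per row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    lines = []
    for row in reader:
        items = []
        for name, value in row.items():
            value = (value or "").strip()
            if value in ("", "0"):
                continue  # product not bought: leave it out entirely
            # Assumed convention: bare name for quantity 1, name=quantity otherwise.
            items.append(name if value == "1" else f"{name}={value}")
        lines.append(", ".join(items))
    return "\n".join(lines)

sample = "bread,ham,milk\n1,0,2\n0,0,1\n"
print(pivot_to_basket(sample))
```

Saving the result with a `.basket` extension and loading it with the File widget should then give data that Frequent Itemsets and Association Rules treat as baskets.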