FAU-CS6 / KDD

Lecture and exercise of "Knowledge Discovery in Databases"
GNU General Public License v3.0
22 stars 11 forks source link

Dataset(s) for Exercise #15

Closed melsigl closed 2 years ago

melsigl commented 2 years ago

Motivation Exercises are based on datasets. While every exercise could be based on its own dataset, one dataset used by all exercises groups (e.g. Association Rules, Classification, Clustering, Outlier) is desirable. For instance a sales dataset originated from some ERP/DWH system similar to the one dataset referenced in the book of Han et. al ("Data Mining").

Definition of Done Specify datasets for some lectures as well as one dataset that acts as a common theme throughout the semester.

Tasks

melsigl commented 2 years ago

ERP sample databases:

dominik-probst commented 2 years ago

A first data set has already been deposited in the repository https://github.com/FAU-CS6/KDD-Databases (AdventureWorks). At the moment, it is assumed that this will be sufficient for the time being to design our exercises