FAU-CS6 / KDD

Lecture and exercise of "Knowledge Discovery in Databases"
GNU General Public License v3.0

Fix Wrong Statistical Terminology #66

Closed melsigl closed 1 year ago

melsigl commented 1 year ago

The chapter on preprocessing introduces sampling and various types of sampling. However, the terminology used in the slides differs from both the statistical literature and the book this lecture is based on. More specifically, the slides introduce "sampling with/without repetition", whereas the concept is commonly referred to as "sampling with/without replacement". Han's original slides also use the correct terminology (cf. slide 43 of the PowerPoint slides on preprocessing).
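
To make the distinction concrete, here is a minimal sketch using Python's standard `random` module. The population, the sample sizes, and the seed are made up for illustration and are not taken from the slides.

```python
import random

# Hypothetical toy population, for illustration only.
population = ["a", "b", "c", "d", "e"]
rng = random.Random(0)  # fixed seed, chosen only for reproducibility

# Sampling WITHOUT replacement: a drawn item is not put back,
# so every item appears at most once in the sample.
without_replacement = rng.sample(population, k=3)

# Sampling WITH replacement: a drawn item is put back before the next draw,
# so the same item may appear more than once. Here k exceeds the population
# size, which is only possible when sampling with replacement.
with_replacement = rng.choices(population, k=8)

print(without_replacement)
print(with_replacement)
```

Note that `random.sample` raises a `ValueError` if `k` is larger than the population, whereas `random.choices` accepts any `k`.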

Please also refer to the following literature: