Rotrixx / sptm

1 stars 1 forks source link

Fixing Radius Neighbor #42

Closed Leschu closed 5 years ago

Leschu commented 5 years ago

The confusion matrix for Radius Neighbor Classification shows that almost all the books are classified as Literatur/Unterhaltung. The way this classifier works: It looks at a certain radius around the specific book, that is to be classified. Now when calling the function we have set a fixed radius and we set a "default class" (L/U), which the book is assigned to, in case no other neighbor is found inside of that radius.

PROBLEM:

EITHER: The radius is wayyy too small, so that every time we check for neighbors, no other neighbors are found and we just assign the default class "Literatur und Unterhaltung".

OR: The radius is wayyy too big, so that every time we check for neighbors, hundreds or thousands of neighbors of other classes are inside of that radius. "L/U" being the most represented class in the training set, would always be the majority inside of that big radius.

ERGO: Make the fucking radius bigger or smaller and check if we get other results :)

Maybe I find the motivation to download all of that shit and try it myself. Maybe, but not today :)

In case we already tried that... ignore everything above.