MichaelAquilina / Reddit-Recommender-Bot

Indentifying Interesting Documents for Reddit using Recommender Techniques
7 stars 0 forks source link

No proper way to resolve page disambiguation over Captilisation #76

Closed MichaelAquilina closed 10 years ago

MichaelAquilina commented 10 years ago

There is no way to currently tell which article is being pointed to in a link if there are multiple occurrences that simply differ by case.

Example: GNOME - The linux Desktop Environment Gnome - A Garden gnome

How would these be solved for the following internal links?:

[[GNOME]], [[gnome]], [[gnoME]] etc...

This is especially difficult because finding the link often means that not all possible articles have been found yet, so heuristics like "best fit" wont really work. Some form of complicated repair process will have to be performed which is both complicated and slow!