TLL pdf search: "Dia" leads to "ia"

Dia (proper name) is a headword in the 46th TLL PDF, on page 63, and is bookmarked as such, but its link to TLL leads to "ia" (or "iaceo") in the 26th PDF, page 7. I suspect a bug in the keyword-handling algorithm drops the capital "D".

[edit-again] Line 601 of Perseus.pm, $word =~ s/[^a-z]//g;, is there to remove the troublesome diacritical marks (as the resolution to issue #52), but it overdoes its job. tll-bookmarks.txt contains proper names that start with capital letters, and these all precede the lower-cased words. L-S may contain only a few proper names, but "Dia" is one of the rare exceptions, and it has an entry in TLL too. Line 601 chops off its capital "D" and renders it "ia". So my suggestion is to replace line 601 with $word =~ s/[^A-Za-z]//g;, though a somewhat more sophisticated treatment might be needed.
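To make the difference concrete, here is a minimal standalone sketch comparing the two substitutions. The helper names normalize_current and normalize_proposed are hypothetical and do not appear in Perseus.pm; only the two regexes are taken from the discussion above. Whether the bookmark lookup that follows is case-sensitive is an assumption this sketch does not test.

```perl
use strict;
use warnings;

# Hypothetical helpers for illustration only; these names are not from Perseus.pm.

# The substitution currently on line 601: delete everything except a-z.
sub normalize_current {
    my ($word) = @_;
    $word =~ s/[^a-z]//g;
    return $word;
}

# The suggested replacement: delete everything except ASCII letters of either case.
sub normalize_proposed {
    my ($word) = @_;
    $word =~ s/[^A-Za-z]//g;
    return $word;
}

# Compare both substitutions on a capitalized and a lower-case headword.
for my $w ('Dia', 'iaceo') {
    printf "%-6s current=%-6s proposed=%s\n",
        $w, normalize_current($w), normalize_proposed($w);
}
```

With the current pattern "Dia" is reduced to "ia", while the proposed pattern leaves it as "Dia"; an all-lower-case word such as "iaceo" is unchanged by either.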

P.S. I could fork and commit this one myself, but I know I am prone to goofs, so I am restricting myself to local modifications and tests.

pjheslin / diogenes

TLL pdf search: "Dia" leads to "ia" #61