websterParser / WebsterParser

Convert Webster's Unabridged 1913 dictionary into a more usable format
GNU General Public License v3.0

Some issues #1

Closed adah1972 closed 4 years ago

adah1972 commented 10 years ago

Thanks for the terrific work! I think it is very useful, but I do find some issues.

One issue is the question marks. I see many of them, often in place of pronunciations. E.g. I can see "Ap′ple (ăp"p'l), n.", but only "Word (?), n.".

I also see some U+FFFD symbols. E.g. in "Venal", I can see "[L. venalis, from venus sale; akin to Gr. � price, Skr. vasna: cf. F. vénal.]".

I would also suggest using an alternative font. Other dictionaries seem to be using Garamond, which looks nicer than Times (which I suppose you are using).

Thank you for checking the feedback.

ponychicken commented 10 years ago

Hey, thanks for the feedback.

  1. The missing pronunciations are actually also missing in the source files (Project Gutenberg) I use. I'm not sure why they weren't transcribed, but there is no easy way for me to add them.
  2. The same applies here. &?; is used in the transcription, which I convert to U+FFFD.
  3. I thought I embedded the open source Gentium font, since it has pretty good Unicode support. Maybe it's not loaded on your device? Could you send a screenshot?
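
For reference, the conversion in point 2 could be sketched roughly as below. This is only an illustration in Python, not the parser's actual code: the function name `replace_unknown_entities` and the sample string are assumptions made up for this example.

```python
# Hypothetical sketch: not taken from WebsterParser itself.
REPLACEMENT_CHAR = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER


def replace_unknown_entities(text: str) -> str:
    """Replace every literal `&?;` placeholder with U+FFFD."""
    return text.replace("&?;", REPLACEMENT_CHAR)


def count_unknown(text: str) -> int:
    """Count how many U+FFFD characters a converted string contains."""
    return text.count(REPLACEMENT_CHAR)


if __name__ == "__main__":
    # Assumed sample, modelled on the "Venal" etymology quoted above.
    sample = "akin to Gr. &?; price, Skr. vasna"
    converted = replace_unknown_entities(sample)
    print(converted)
    print(count_unknown(converted))
```

Counting the replacement characters afterwards gives a rough idea of how many gaps the transcription has.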

ponychicken commented 10 years ago

I checked the paper version, and it seems that the Gutenberg transcription is lacking in a lot of places. Unfortunately it is the only free version I know of.

adah1972 commented 10 years ago

Hi DieBuche,

Thanks for the answer.

For 1/2, I noticed that www.dict.org has the information in a transcribed form. Maybe you can have a look. It mentions that the source files are available at http://ftp.gnu.org/gnu/gcide/.

For 3, I installed the Gentium font, and it does show a difference (thanks!). Please compare the two screen shots (I am using OS X Lion). Yup, I like the look with Gentium :-).

[Screenshots: original, gentium]

ponychicken commented 10 years ago

I uploaded a new release which is now based on GCIDE. Thanks for the hint!

[Screenshot: 2014-09-02 14 48 40]

adah1972 commented 10 years ago

I am definitely loving it. Much nicer.

Two minor issues that might be worth fixing:

ponychicken commented 10 years ago

  1. Cosmos has this double bar in my source (also in CIDE).

This precedes words marked with a double vertical bar in the original dictionary, signifying that the word was adopted directly into English without modification of the spelling.

Which makes sense for cosmos, since it's straight-up Greek.

  2. The Vagabond screenshot was taken with the new version. Are you seeing something else? If so, please send a screenshot and your OS X version.

adah1972 commented 10 years ago

Thanks for the explanation on "cosmos".

The screen shot is attached. The system is OS X Lion.

[Screenshot: screen]

I also tried on an old Snow Leopard machine, and the result was good. Surprising! On both machines the Gentium font is no longer needed.

jeffbyrnes commented 4 years ago

Based on the last comment, seems like this can be closed.

adah1972 commented 4 years ago

Yes. Actually it seems only Lion had this problem. It works fine also on Sierra and High Sierra.