[WIP] Add support for Latin dicts

tecosaur / lexic

Mirror of https://git.tecosaur.net/tec/lexic

GNU General Public License v3.0

78 stars 10 forks source link

[WIP] Add support for Latin dicts #7

Closed ymarco closed 3 years ago

ymarco commented 4 years ago

Of the format used in https://nikita-moor.github.io/dictionaries/dictionaries.html . Currently only Lewis 1890 was tested, I'm planning to add a Latin-English one as well

ymarco commented 4 years ago

In the last commit, the following dictionaries are supported:

 | Dictionary's name                              | type          | Word count |
 |------------------------------------------------+---------------+------------|
 | Index verbōrum, Appleton (1914)                | latin latin   |       2183 |
 | An Elementary Latin Dictionary, Lewis (1890)   | latin english |      17582 |
 | Hand-book of Latin Synonymes, Döderlein (1875) | latin english |        556 |
 | A Latin Dictionary, Lewis & Short (1879)       | latin english |      51596 |
 | Glossarium Anglico-Latinum, Redmond (2005)     | english latin |      10867 |

I'll delete Döderlein and Lewis 1890 if I could confirm that they don't new words to add on Lewis & Short.

ymarco commented 4 years ago

The last commits converted from regex-based parsing to using libxml, which is much cleaner