mine-cetinkaya-rundel / dictionary

R package with words from the English dictionary, their types and meanings
1 stars 0 forks source link

Find data source #3

Open mine-cetinkaya-rundel opened 7 years ago

mine-cetinkaya-rundel commented 7 years ago

We need to determine where we're getting the data from

The birdnik package is one option: https://github.com/Ironholds/birdnik

Are there other dictionary APIs we could use? Let's discuss here and make a decision before we go ahead with it. Feel free to post sample code for fetching the data and we can decide based on availability and ease of use.

mine-cetinkaya-rundel commented 7 years ago

The hunspell package has a list of (all?) English words: https://github.com/ropensci/hunspell/blob/666f94c6697f112d9ef4ad99629002f0b0c4e7be/inst/dict/en_US.dic

It's not really a list of the words though, affixes are in a separate file, so we would first need to figure out the logic and create a list of words.

Maybe we could use this as the base list of words, and then pull definitions using the birdnik package.

kathy0305 commented 7 years ago

The Merriam-Webster Dictionary has an API:

https://www.dictionaryapi.com/ The Merriam-Webster Dictionary API is free as long as it is for non-commercial use, usage does not exceed 1000 queries per day per API key, and use is limited to two reference APIs.

mine-cetinkaya-rundel commented 7 years ago

@kathy0305 this looks promising!

kathy0305 commented 7 years ago

Re: Merriam-Webster They issued a Key on the spot , I got 2 keys one for Dictionary and one for Thesaurus