burgern / vocabs

Learn vocabulary in a structured way by leveraging language embeddings.
0 stars 0 forks source link

Dataset #1

Open burgern opened 3 weeks ago

burgern commented 3 weeks ago

Find a capable dictionary dataset.

AC:

Nice to have:

burgern commented 3 weeks ago

OpenAI prompt

I want to create an application for learning vocabulary. To start out, i am looking for a dataset with the following requirements:

do you know of any good datasets i could easily acquire?

OpenAI's response

For your vocabulary learning application, I recommend the following datasets that meet your criteria:

WordNet:

Description: A lexical database for the English language. Features: Includes definitions, synonyms, antonyms, and example sentences for a large number of English words. Coverage: Extensive coverage of the English language. Accessibility: Open-source and free to use. URL: WordNet

Oxford Dictionaries API:

Description: Provides access to a comprehensive dictionary with definitions, translations, synonyms, and example sentences. Features: Detailed explanations and example phrases for each word. Coverage: Extensive and up-to-date coverage of the English language. Accessibility: Free tier available, but may require API key and registration. URL: Oxford Dictionaries API

CMU Wordlist:

Description: A list of English words with definitions and example sentences. Features: Each word is accompanied by a definition and an example sentence. Coverage: Broad coverage of common English words. Accessibility: Open-source and free to use. URL: CMU Wordlist

The Open English Word List (EWL):

Description: A freely available list of English words with definitions and example sentences. Features: Contains explanations and example phrases. Coverage: Extensive coverage and regularly updated. Accessibility: Open-source and free to use. URL: The Open English Word List (EWL) These datasets should provide a solid foundation for your vocabulary learning application. You can choose the one that best fits your specific needs and technical requirements.