Open English Wordnet is a lexical network of the English language grouping words into synsets and linking them according to relationships such as hypernymy, antonymy and meronymy. It is intended to be used in natural language processing applications and provides deep lexical information about the English language as a graph.
Open English Wordnet is a fork of the Princeton WordNet developed under an open source methodology. The quality and veracity of the resource may differ from the Princeton Wordnet and we welcome contributions. Contributions to this wordnet may eventually be incorporated into future releases of Princeton WordNet. Correspondance to previous versions and wordnets in other language is provided through the Collaborative Interlingual Index (CILI). The Open English Wordnet is available as individual files in GWN-LMF format.
Open English Wordnet is released through the Open English Wordnet website. The versions released are
The size of each resource is as follows
Edition | Words | Synsets | Relations |
---|---|---|---|
2024 | 161,705 | 120,630 | 418,168 |
2023 | 161,338 | 120,135 | 415,905 |
2022 | 161,221 | 120,068 | 386,437 |
2021 | 163,161 | 120,039 | 384,505 |
2020 | 163,079 | 120,052 | 385,211 |
2019 | 160,051 | 117,791 | 378,201 |
Princeton 3.1 | 159,015 | 117,791 | 378,203 |
To compile these into a single file please use the following script(s)
python scripts/from-yaml.py
python scripts/merge.py
This will create a file at wn31.xml
that contains the complete wordnet.
Further conversions are available through the converter here.
We welcome changes, to make a change please read our contributing guidelines and make a pull request.
Open English Wordnet is a high-quality resource that acts as a gold-standard for natural language processing, as such we cannot accept any automatically generated results that have not been manually validated.
Please be aware that we use the Global WordNet Association LMF and please read the guidelines for using the format
Open English Wordnet is released under CC-BY 4.0
The canonical citation for English Wordnet is:
More recent papers describing it include:
John Philip McCrae, Alexandre Rademaker, Ewa Rudnicka, Francis Bond (2020) English WordNet 2020: Improving and Extending a WordNet for English using an Open-Source Methodology. In Proceedings of the LREC 2020 Workshop on Multimodal Wordnets (MMW2020), Marseille
John P. McCrae, Michael Wayne Goodman, Francis Bond, Alexandre Rademaker, Ewa Rudnicka, Luis Morgado Da Costa (2020) The GlobalWordNet Formats: Updates for 2020. In Proceedings of the 11th Global Wordnet Conference (GWC2021), University of South Africa (UNISA)
It incorporates material from: