BLKSerene / Wordless

An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
GNU General Public License v3.0
Wordless: An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation




中文 (简体) | 中文 (繁體) | English


Wordless is an integrated corpus tool with multilingual support for the study of language, literature, and translation. It was designed and developed by Ye Lei (叶磊), then an MA student in interpreting studies at Shanghai International Studies University (上海外国语大学).

Download

The latest version (3.5.0) of Wordless supports Windows 7/8/8.1/10/11, macOS 10.11 or later, Ubuntu 18.04 or later, Debian 10 or later, and Arch Linux, all 64-bit only. Both Intel-based and Apple Silicon-based Macs are supported.

For a detailed changelog, please see CHANGELOG.md.

| Release | Remarks |
|---|---|
| Latest Release for Windows | 1. Extract all files 2. Double-click Wordless/Wordless.exe to run |
| Latest Release for macOS | 1. Extract all files 2. Double-click Wordless.app to run |
| Latest Release for Linux | 1. Extract all files 2. Double-click Wordless/Wordless to run 3. [Optional] Double-click Wordless/Wordless - Create Shortcut to create a shortcut in Show Applications |
| Past Releases | Not recommended |
| Baidu Netdisk | For Chinese users with unstable connections to GitHub (PASSWORD: wdls) |

[!IMPORTANT] Note 1: It is recommended that the path to Wordless not contain any non-ASCII characters, such as Chinese characters and letters with diacritics.
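One quick way to check a candidate installation path against Note 1 is sketched below in Python. The function name `has_non_ascii` and the example paths are hypothetical and are not part of Wordless.

```python
def has_non_ascii(path: str) -> bool:
    """Return True if the path contains any character outside the ASCII range."""
    # str.isascii() is available in Python 3.7 and later.
    return not path.isascii()


# Hypothetical example paths, for illustration only.
print(has_non_ascii("C:/Tools/Wordless"))  # False
print(has_non_ascii("C:/工具/Wordless"))    # True (Chinese characters)
print(has_non_ascii("C:/Café/Wordless"))   # True (letter with a diacritic)
```

If the check returns True, moving Wordless to a path made up only of ASCII characters follows the recommendation above.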

Note 2: If your Mac says that “Wordless” is damaged and can’t be opened, please open Terminal (Launchpad → Other) and run:

xattr -rc /Applications/Wordless.app

Remember to replace /Applications/Wordless.app with the actual path of Wordless on your computer (you can drag Wordless.app into the Terminal window to insert its path). Then run Wordless again; any warning printed in Terminal can be ignored if the program opens successfully.
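If the actual path contains spaces, it has to be quoted when typed by hand. The following Python sketch builds the command from Note 2 for an arbitrary path; the helper name `build_xattr_command` and the second example path are assumptions made for illustration.

```python
import shlex


def build_xattr_command(app_path: str) -> str:
    """Build the xattr command from Note 2, quoting the path for the shell."""
    # shlex.quote leaves simple paths unchanged and wraps others in single quotes.
    return f"xattr -rc {shlex.quote(app_path)}"


print(build_xattr_command("/Applications/Wordless.app"))
# xattr -rc /Applications/Wordless.app
print(build_xattr_command("/Users/me/My Apps/Wordless.app"))  # hypothetical path
# xattr -rc '/Users/me/My Apps/Wordless.app'
```

The printed command can then be pasted into Terminal as described above.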

Note 3: While opening corpora in languages other than English in Wordless, extra model files might need to be downloaded from the internet. If you encounter a Network Error dialog while downloading a model, it is most likely a genuine network error, so check your internet connection following the instructions in the error message and try downloading the model again.

Users in China, where connections to GitHub and Hugging Face Hub are unstable, are advised to use a proxy and configure it in Menu Bar → Preferences → Settings → General → Proxy Settings. Alternatively, Chinese users can manually download model files from Baidu Netdisk. The steps for installing models are as follows:

  1. Check the error message displayed in the Network Error dialog. If stanza is found in the error message, you need a Stanza model, otherwise you need a spaCy model.
  2. Download model files for the language of your corpus from the above link and extract all files.
  3. For Windows and Linux users, put spaCy models under Wordless/libs and Stanza models under Wordless/libs/stanza_resources. For macOS users, right click on Wordless.app, select Show Package Contents, then put spaCy models under Contents/Frameworks and Stanza models under Contents/Frameworks/stanza_resources.
  4. If your corpora are in different languages or both spaCy and Stanza models are required, repeat steps 1 to 3 until the Network Error dialog no longer appears.
  5. Try opening the corpora in Wordless again; the model download should now be skipped.
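Steps 1 and 3 above amount to a small decision rule, sketched here in Python. The function names, the platform labels, the case-insensitive match, and the sample error messages are all assumptions made for illustration; Wordless itself may detect and report these things differently.

```python
from pathlib import PurePosixPath


def model_kind(error_message: str) -> str:
    """Step 1: a message mentioning "stanza" calls for a Stanza model, otherwise spaCy."""
    # Case-insensitive matching is an assumption, not stated in the steps above.
    return "stanza" if "stanza" in error_message.lower() else "spacy"


def model_destination(kind: str, platform: str) -> PurePosixPath:
    """Step 3: return the folder, relative to the install location, for the model files."""
    if platform == "macos":
        # Reached via right click on Wordless.app, then Show Package Contents.
        base = PurePosixPath("Wordless.app/Contents/Frameworks")
    elif platform in ("windows", "linux"):
        base = PurePosixPath("Wordless/libs")
    else:
        raise ValueError(f"unsupported platform: {platform}")
    return base / "stanza_resources" if kind == "stanza" else base


# Hypothetical error messages, for illustration only.
kind = model_kind("Failed to download stanza resources")
print(kind, model_destination(kind, "windows"))
kind = model_kind("Failed to download model en_core_web_trf")
print(kind, model_destination(kind, "macos"))
```

The sketch only computes where the extracted files belong; copying them into place is still done by hand as described in step 3.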

If the problem persists or the model you need is missing from the above link, please contact the author for further support.

Need Help?

If you have any questions, find software bugs, need to provide feedback, or want to submit feature requests, you may seek support from the open-source community or contact me directly via any of the support channels listed below.

| Support Channel | Information |
|---|---|
| Official documentation | Stable Version / Development Version |
| Tutorial videos | YouTube / bilibili |
| Bug reports | GitHub Issues |
| Usage questions | GitHub Discussions |
| Email support | blkserene@gmail.com |
| WeChat official account | WeChat official account |

Citing

If you are going to publish a work that uses Wordless, please cite Wordless as a journal article or a piece of computer software.

  1. Cite as a journal article

APA (7th edition):

Ye, L. (2024). Wordless: An integrated corpus tool with multilingual support for the study of language, literature, and translation. SoftwareX, 28, Article 101931. https://doi.org/10.1016/j.softx.2024.101931

Chicago (18th edition):

Ye, Lei. “Wordless: An Integrated Corpus Tool with Multilingual Support for the Study of Language, Literature, and Translation.” SoftwareX 28 (December 2024): 101931. https://doi.org/10.1016/j.softx.2024.101931.

MLA (9th edition):

Ye Lei. “Wordless: An Integrated Corpus Tool with Multilingual Support for the Study of Language, Literature, and Translation.” SoftwareX, vol. 28, Dec. 2024, https://doi.org/10.1016/j.softx.2024.101931.

  2. Cite as a piece of computer software

APA (7th edition):

Ye, L. (2024). Wordless (Version 3.5.0) [Computer software]. GitHub. https://github.com/BLKSerene/Wordless

Chicago (18th edition):

Ye, Lei. Wordless. V. 3.5.0. Released July 1, 2024. PC. https://github.com/BLKSerene/Wordless.

MLA (9th edition):

Ye Lei. Wordless. Version 3.5.0, GitHub, 1 Jul. 2024, https://github.com/BLKSerene/Wordless.

Works Using Wordless

For details, please click HERE.

License

Copyright (C) 2018-2024  Ye Lei (叶磊)

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program.  If not, see <http://www.gnu.org/licenses/>.


Contributing

For details, please click HERE.

Donating

If you would like to support the development of Wordless, you may donate via PayPal, Alipay, or WeChat Pay.

PayPal Alipay WeChat Pay

Acknowledgments

For details, please click HERE.