czcorpus / wag

WaG - install your own word profile generator out of diverse data resources
Apache License 2.0

WaG - Word at a Glance

  1. Use existing corpus/data search and retrieval software as a backend.
  2. Obtain and compile information about:
    1. a single word,
    2. two or more words compared with each other,
    3. a word's translation.
  3. Explore text metadata statistics, time-based trends, word cloud-based data and much more.
  4. Combine statistics from different corpora.
  5. Use the results of one resource as input for another resource.
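
As an illustration of item 4 above, combining statistics from corpora of different sizes usually means normalizing absolute counts first. The following is a minimal TypeScript sketch, not WaG's actual implementation: the names `CorpusFreq`, `CombinedFreq`, `toIpm` and `combineFrequencies` and the sample numbers are hypothetical, and it assumes that frequencies are normalized to instances per million tokens (ipm).

```typescript
// Hypothetical types for illustration; they do not come from the WaG code base.
interface CorpusFreq {
    corpus: string;      // corpus identifier
    count: number;       // absolute number of hits for the searched word
    corpusSize: number;  // corpus size in tokens
}

interface CombinedFreq {
    corpus: string;
    ipm: number;         // instances per million tokens
}

// Convert an absolute frequency to instances per million tokens (ipm).
function toIpm(count: number, corpusSize: number): number {
    if (corpusSize <= 0) {
        throw new Error("corpusSize must be positive");
    }
    return (count * 1000000) / corpusSize;
}

// Put per-corpus results on a common scale and sort them by relative
// frequency (highest first), so corpora of different sizes are comparable.
function combineFrequencies(results: CorpusFreq[]): CombinedFreq[] {
    return results
        .map(r => ({ corpus: r.corpus, ipm: toIpm(r.count, r.corpusSize) }))
        .sort((a, b) => b.ipm - a.ipm);
}

// Made-up sample data: a large written corpus and a small spoken one.
const combined = combineFrequencies([
    { corpus: "written", count: 1200, corpusSize: 100000000 },
    { corpus: "spoken", count: 90, corpusSize: 5000000 },
]);
console.log(combined);
```

In this made-up data the word has more absolute hits in the larger corpus, yet its relative frequency is higher in the smaller one, which is why normalization has to come before any comparison.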

Currently supported resources

WaG KonText MQuery NoSkE Treq Clarin FCS Datamuse ElasticSearch LCC
collocations :star: 🚧 :star: :star:
concFilter :star:
concordance :star: :star: :star: :star:
freqBar :star: :star:
freqComparison :star: :star:
freqPie :star: :star:
geoAreas :star: :star:
multiWordGeoAreas :star: :star:
html :star: :star:
matchingDocuments :star: :star:
mergeCorpFreq :star: :star:
speeches :star:
syntacticColls :star:
timeDistrib :star: :star:
multiWordtimeDistrib :star: :star:
translations :star:
treqSubsets :star:
wordForms :star: :star: 🚧
wordFreq :star: :star: 🚧
wordSim :star: 🚧 :star: :star:

Requirements

WaG can run either as a self-hosted application or embedded within a compatible web page. For details on what the self-hosted variant needs, please refer to INSTALL.md.

How to cite WaG

Tomáš Machálek (2020): Word at a Glance: Modular Word Profile Aggregator. In: Proceedings of LREC 2020, pp. 7011–7016.

@InProceedings{machalek2020lrec,
 author = {Tomáš Machálek},
 title = "{Word at a Glance: Modular Word Profile Aggregator}",
 booktitle = {Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)},
 year = {2020},
 pages = {7011--7016},
 publisher = {European Language Resources Association (ELRA)},
 language = {english}
}