-
A new Chuvash grammar textbook is being prepared on the basis of a 3M+ word corpus and our morphological analysis. The author is asking for a composite output of modes chv-morph and chv-segment in whi…
-
Hi @alfcrisci, hi @valenitna,
I tried to rerun the script and encountered the following error in line 22: codified_df_native=...
`Error in [.data.frame(x$dmeta, tag) : undefined columns selecte…
-
Create a new corpus, to be called "LTS+UNTS GCAs, 1935-1972". This includes the English-language texts of all general cultural agreements (GCAs) deposited with the League of Nations or the United Nation…
-
When I attempt to convert the output from mp_corpus() with coded manifestos into a Quanteda object, the quasi-sentences are not split into separate documents in the Quanteda corpus, as described i…
-
The original WikiConv paper says that conversations in Chinese were collected. Are these available through ConvoKit?
-
I was running a local experiment using the fuzzer `aflplusplus` and the benchmarks `curl_curl_fuzzer_http` and `bloaty_fuzz_target`.
I passed `make presubmit` after installing `qtbase-dev5`, mentioned in this…
-
Hello!
Thank you so much for sharing this beautiful work, and especially for sharing the example applications.
I have tried to modify your example code: semantic_search.py and semantic_s…
-
I'm trying to use your newest model "pl_spacy_model_morfeusz_big" to parse some documents and I run into a memory error when the size of the documents grows too big. One document is about 4000 words b…
-
👋 This dashboard summarizes my activity on the repository, including available improvement opportunities.
## Recommendations
_Last analysis: Feb 09 | Next scheduled analysis: Feb 13_
### Open
- h…
-
I will add the previous year's corpora for the author masking task. These corpora contain 205 problems in English for the author obfuscation task from 2016.
Besides that, I will add the author verification…