-
На странице http://lingconlab.ru/resources.html неверно указан объём корпусов, он значительного меньше актуального (для карельского и Дагестана)
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Is your feature request related to a problem? Please describe.
Currently in order to perform BM25 based text r…
-
I noticed tinyxml2 have missed corpus for a while, causing some issues in the coverage build: https://oss-fuzz-build-logs.storage.googleapis.com/index.html#tinyxml2
Around a month ago some changes …
-
### Describe the bug
I train a NER&NEL model according to the tutorial https://flairnlp.github.io/flair/master/tutorial/tutorial-training/how-to-train-span-classifier.html. However, if in SpanClass…
-
/adverb tool is used to have valency statistics for adverbs. Related parser results can be previously added/edited/deleted . Parser results can be edited on changing source text and/or changing words …
-
On the menu add a tab to view the statistics of the data for a selected domain using bokeh. These could be:
1. Display a summary of queries thus far
2. The domains that were crawled
3. Some statistics…
-
**Ахцәажәара**
The current parallel corpus has been extracted from various sources (ebooks,websites...)
**Ауадаҩрақәа**
The sentences are automatically lined up. We come across these issues…
-
Dear colleagues, thank you for your fantastic work on the long-awaited treebank!
Decided that I should report this to you just in case: one can see from both the `.conllu` files and `stats.xml` tha…
-
We're working on extracting statistics from the entailment graph.
We need to be able to extract:
• The size of the corpus (the number of documents used to build the graph)
• The number of text fragme…
-
A common issue is not knowing whether some state is being reached effectively and wanting greater insight into execution statistics (see this issue https://github.com/crytic/medusa/issues/431 and PR w…