-
A few people now, including Timo Korkiakangas, have requested that we have a view of morphology similar to that which was available in the P4 treebanking interface. This would be helpful when doing di…
-
Right now it's a little Latin-heavy. We're working on Arabic support and on better Greek support. Greek already works, of course, but its morphology settings are still somewhat lacking.
-
This post relates to the effort to harmonize the Ancient Greek treebanks, as per [Issue 7](https://github.com/unipv-larl/UD4HL/issues/7).
One of the first issues to solve is tokenization itself. Th…
-
The @cite attribute is empty when the token is a punctuation mark. If punctuation is part of the Edition, it belongs to the citable passages as much as any word token does.
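
To see how many tokens are affected, something like the following could list every token whose @cite is empty. This is only a rough sketch: the element names `sentence` and `word`, the other attributes, and the sample values (including the URN) are assumptions made for illustration, not taken from the actual data files.

```python
import xml.etree.ElementTree as ET

# Hypothetical two-token sample; element names and attribute values
# are invented for illustration and may not match the real files.
SAMPLE = """<treebank>
  <sentence id="1">
    <word id="1" form="arma" postag="n-p---na-" cite="urn:cts:example:1.1"/>
    <word id="2" form="," postag="u--------" cite=""/>
  </sentence>
</treebank>"""


def tokens_missing_cite(xml_text):
    """Return (sentence id, word id, form) for each token with an empty or absent @cite."""
    root = ET.fromstring(xml_text)
    missing = []
    for sentence in root.iter("sentence"):
        for word in sentence.iter("word"):
            # Treat a missing attribute and a whitespace-only value alike.
            if not (word.get("cite") or "").strip():
                missing.append((sentence.get("id"), word.get("id"), word.get("form")))
    return missing


print(tokens_missing_cite(SAMPLE))
```

If the report contains only punctuation forms, that would confirm the gap is limited to punctuation tokens.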
-
I noticed a few possible issues in the Cicero Cat. 1 data:
regie lemmatized as adv (regius1)
ut as conj (ut1)
3 numerals were lemmatized as NUMERAL1 (maybe this is standard?)
v (quinque1)
xii (duodec…
-
Hello, where can I find the data? The link you gave won't open.
-
Some changes that might be helpful:
- Caching (#31)
- Have a separate TaggedText service that all the Flask workers talk to, built with something like Twisted (https://github.com/twisted/twisted), s…
-
@amir-zeldes
We know how to report a redundant token (`reparandum`). Do we have a way to report a missing one?
I have "במטרה להעלות את נפח דם" instead of "את נפח **ה**דם". We make sure to report typ…
-
The [CoNLL-U specification](https://universaldependencies.org/format.html#paragraph-and-document-boundaries) says
> When a paragraph starts at sentence boundary, the first sentence of the paragraph c…
-
@Marie-ClaireBeaulieu would like to build out functionality for annotating the overlap between meter and morphology/syntax.
Initial User Stories are described here:
https://docs.google.com/spr…