-
Closed issue #45 indicates that udpipe was used, and `__main__.py` suggests that you use the expanded form for CoNLL-U multiword tokens, e.g. the two tokens "de le" instead of "du" in French. The README should…
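To make the distinction concrete, here is a minimal sketch of how the surface form ("du") and the expanded form ("de le") can both be read from a CoNLL-U sentence. In CoNLL-U, a multiword token appears as a range line (ID `1-2`) followed by its syntactic words. The function name `split_forms` and the sample sentence with its annotations are illustrative assumptions, not taken from the project in question.

```python
def split_forms(conllu_block: str):
    """Return (surface_tokens, syntactic_words) for one CoNLL-U sentence."""
    surface, words = [], []
    covered_until = 0  # last word ID covered by the current multiword token
    for line in conllu_block.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        tok_id, form = cols[0], cols[1]
        if "-" in tok_id:
            # Range line such as "1-2": the surface (unexpanded) token.
            covered_until = int(tok_id.split("-")[1])
            surface.append(form)
        elif "." in tok_id:
            continue  # empty nodes (e.g. "3.1") are ignored in this sketch
        else:
            words.append(form)
            if int(tok_id) > covered_until:
                surface.append(form)  # word not part of a multiword token
    return surface, words


# Hypothetical sample sentence; the annotations are for illustration only.
SAMPLE = "\n".join([
    "# text = du pain",
    "1-2\tdu\t_\t_\t_\t_\t_\t_\t_\t_",
    "1\tde\tde\tADP\t_\t_\t3\tcase\t_\t_",
    "2\tle\tle\tDET\t_\t_\t3\tdet\t_\t_",
    "3\tpain\tpain\tNOUN\t_\t_\t0\troot\t_\t_",
])

surface, words = split_forms(SAMPLE)
print(surface)  # surface tokens
print(words)    # expanded syntactic words
```

Whether a tool should consume the surface tokens or the expanded words depends on what it was trained on, which is why documenting the expected form matters.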
-
(First of all, congrats on UDPipe, it's a pleasure to use!)
I've built a morphological generator for an endangered language, and I'm having it save its output in the tab-separated `FORM,LEMMA,UPOS,…
jeanm updated
4 years ago
-
Issues to figure out:
1. What to compare on?
2. Order of comparison? Right now, we plan to look at NER first, then UDPipe, and then doc2vec vector similarity with Jaccard/cosine similarity.
htt…
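For reference, the two similarity measures mentioned above can be sketched in a few lines of plain Python. This is only an illustration: the function names are assumptions, and the plain lists stand in for doc2vec vectors or token sets that would come from the actual pipeline.

```python
import math


def jaccard(a, b):
    """Jaccard similarity of two collections, treated as sets."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0  # convention chosen here: two empty sets are identical
    return len(set_a & set_b) / len(set_a | set_b)


def cosine(u, v):
    """Cosine similarity of two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # convention chosen here: zero vectors match nothing
    return dot / (norm_u * norm_v)


# Jaccard suits discrete items (e.g. sets of named entities or lemmas);
# cosine suits dense vectors (e.g. document embeddings).
print(jaccard(["a", "b", "c"], ["b", "c", "d"]))
print(cosine([1.0, 2.0], [2.0, 4.0]))
```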
-
- [x] stanza ancient greek models (Jan)
- [x] udpipe
- [x] cltk
- [ ] ~~spark-nlp ancient greek models (tokenizer, sentencer, POS + lemmas)~~
- [x] homercy (Marton)
- [x] GreCy (Jan)
- [ ] [Tra…
-
Hello professor Vandeweerd,
I've been working on a pipeline for French analysis for a while now. It currently uses rsyntax to extract noun and verb phrases but I've been looking to replace rsyntax …
-
The author of the udpipe R package referenced this package for network visualizations: https://github.com/iankloo/sigmaNet
It claims to be suited to quickly render large networks as well as provide…
-
***Description:*** An automatic RU -> ISV translator that finds, in parallel text corpora\*, pairs of sentences that are probably translations of each other. The user will need to manually choose the variant…
-
We used the tagger and tokenizer of UDPipe. In some of our files we had the newline character '\u2028' (the Unicode line separator), which wasn't recognized as a newline. This led to further errors in other programs in our pipeline, bu…
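One possible workaround on the caller's side, sketched here as an assumption rather than anything UDPipe itself provides, is to normalize Unicode line separators before the text reaches the tokenizer. The function name is hypothetical, and mapping U+2029 (paragraph separator) to a blank line is a choice made for this sketch.

```python
# Map Unicode line/paragraph separators to ordinary newlines.
# U+2029 -> "\n\n" is an assumption of this sketch, not a standard.
_SEPARATOR_TABLE = str.maketrans({"\u2028": "\n", "\u2029": "\n\n"})


def normalize_line_separators(text: str) -> str:
    """Replace U+2028 and U+2029 so downstream tools see plain newlines."""
    return text.translate(_SEPARATOR_TABLE)


raw = "first line\u2028second line"
clean = normalize_line_separators(raw)
print(clean.split("\n"))
```

Note that Python's own `str.splitlines()` already treats U+2028 as a line boundary, so different stages of a pipeline can disagree about line counts unless the text is normalized once up front.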
-
The heuristic in `split_tokenised_text_into_sentences.py` is too simplistic:
- Full stops in quoted text, such as in `' Is cuid den searmanas é . ' ar sise . `, should not count as split points.
- 3570…
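As a rough sketch of the first point, a splitter over pre-tokenised text can track whether it is inside a quotation and only split at sentence-final punctuation outside of one. This is not the code in `split_tokenised_text_into_sentences.py`; the function name, the single quote character, and the set of sentence enders are all assumptions.

```python
def split_sentences(tokens, quote="'", enders=(".", "?", "!")):
    """Split a token list into sentences, ignoring enders inside quotes."""
    sentences, current, in_quote = [], [], False
    for tok in tokens:
        current.append(tok)
        if tok == quote:
            in_quote = not in_quote  # toggle on every standalone quote token
        elif tok in enders and not in_quote:
            sentences.append(current)
            current = []
    if current:
        sentences.append(current)  # keep any trailing unterminated material
    return sentences


example = "' Is cuid den searmanas é . ' ar sise .".split()
for sentence in split_sentences(example):
    print(" ".join(sentence))
```

The toggle is deliberately naive: a standalone apostrophe used for elision, or an unbalanced quote, would leave the splitter stuck inside a quotation, so a real implementation would likely need a length limit or a fallback.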
-
Spawned off of #513 and UniversalDependencies/UD_English#40.
I proposed:
> Another issue relevant here is abbreviations. For uncommon abbreviations/shortened forms (like *w* for *with*, *btwn* f…