-
ftp://ftp.ncbi.nlm.nih.gov/pub/lu/PubTatorCentral
Add support to replace English text with the concepts using the offsets provided by PubTator.
-
First, thank you for the great work for Japanese dataset.
We are considering to use UD_Japanese-GSD for training the built-in model of the open-source library.
https://github.com/explosion/spaCy…
-
It would be useful to add a sentence splitter, for instance, possibilities could be,
- [Puntk sentence tokenizer](https://www.nltk.org/_modules/nltk/tokenize/punkt.html) from NLTK (needs pre-trained…
-
There is an unexpected behavior in the cooccurrences() and kwic() functions. When using the left/right arguments to adjust the window for calculations and display, only the value for "left" is used. T…
-
Our current coverage counts instructions covered _during non-reverting transactions_. In a sense, that's right -- it's not as useful to cover something if it isn't a potentially-blockchain-affecting …
-
@tsalo
A few questions concerning functions in data_preparation.py
1) process_corpus
Im sure a lot of the term manipulation is built in, but are we sure its doing what we want it to? Taking a l…
-
Here we can discuss any post-classification analyses @mriedel56 will perform for the upcoming paper. Feel free to edit this comment to add new analyses or respond to it with your thoughts.
We will ru…
tsalo updated
4 years ago
-
I have two suggestions re the documentation.
Under CorpusReader: "Most users will want to access words, sentences, paragraphs and even whole documents via a CorpusReader object." I wasn't able to f…
-
@jonathanmetzman
I dont want to spam the announcement thread so I think communicating in a dedicated issue is better?
you wanted to run the next batch today. Will that happen? And if so - when do…
-
## 1. どんなもの?
(タスク)
- Semantic Dependency Parsing (SDP): 意味的関係を acyclic graph で表現
(提案)
- Iterative Predicate Selection (IPS) algorithm を提案
- graph-based および transition-based parsing approach…