-
@ewan-klein is working on a [TwitterCorpusReader](https://github.com/nltk/nltk/blob/twitter/nltk/corpus/reader/twitter.py) that includes:
- `tweets()` – returns a list of strings, one per tweet
- `ful…
-
Attachment contains models & corpus for "Jason" (some experimental data for a project of Fritz Breithaupt).
Prepped and trained model as show below, with high threshold of '50' reporting that "n't" w…
-
Hi,
I am able to train SyntaxNet using the mini-training-set data. But when I train the parser using the Penn Treebank data (in Stanford basic dependencies), the parser stops training at the first ep…
-
We will likely need several different representations of sentences that might be FRDs for various features:
- [ ] Stripped of all punctuation & accents, lower case, target term replaced with `_TERM_`
…
-
Hi,
In the readme, you state that you get a "86% on the Penn treebank". Are you doing parsing in this library? If so, I wasn't able to find any docs on it, could you perhaps guide me to the correct mo…
-
```
You can find attached two json files.
penntb.json is the tagset for the Penn Treebank POS
sd.json is the tagset for the Stanford Dependencies relations
The tagsets contain descriptions/examples…
-
**Issue by [afader](https://github.com/afader)**
_Tue Mar 17 20:56:12 2015_
_Originally opened as https://github.com/allenai/okcorpus/pull/37_
---
- New tab on the interface for Help
- Tells people …
-
The word "attempt" is classified by pos_tag as VBD.
According to http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html this is "Verb, past tense".
But "attempt" is present tense (…
-
I've run Standard Input Parsing with this sentence `I said , 'what 're you ? Crazy ? said Sandowsky. I can't afford to do that`.
SyntaxNet parses `can't` to `ca` and `n't` instead of `can't` or `can …
-
On docs/user-guide/examples/eng-pos-tagger.md are given some instructions that fail:
The following code:
``` lisp
NLP> (let ((words-dist #h(equal))
(map-corpus :ptb-tagged (corpus-file "ptb/T…