-
Hi there,
First up, apologies if this is a stupid question - I'm not an NLP person and some of the language and ideas are brand new to me.
So, as I understand it, collocation is the idea of comm…
-
Should we manually remove some of these as stopwords or is there a more general way to go about it? Fixing some of the other issues here might make this issue less pressing (but not necessarily given …
-
Hello!
Thanks for the great project! Quick question: does gd-tools have support for the English language, especially when it comes to parsing and searching for words/collocations in sentences?
If it…
JayXT updated
8 months ago
-
I want to compute bigram collocations per sentence and tried to output my demo results in a jupyter notebook. Only for those four/five sentence combinations and a window size of larger than 4, the ord…
-
Post questions here for one or more of our fundamentals readings:
Manning and Schütze. 1999. Foundations of Statistical Natural Language Processing. MIT Press:
Chapter 3 (“Linguistic foundati…
-
The main idea is to extract many features from text and perform feature selection to determine what are the most promising ones.
Ideas:
- useful feature - most informative unigram and bigram colloca…
-
Post questions here for:
Manning and Schütze. 1999. Foundations of Statistical Natural Language Processing. MIT Press: selections from Chapter 5 (“Collocations”): 151-163, 172-176, 183-186.
-
I noted that when doing a harp.import_product on a pth file that contains two different paths, but each containing a HARP file with the same basename (but different content), only the content of the H…
-
```Python
(env) [root@localhost information-extraction]# python bin/p_classification/p_train.py --conf_path=./conf/IE_extraction.conf
Traceback (most recent call last):
File "bin/p_classification…
-
Matchups for harmonisation need to be screened appropriately. That may include outlier detection but in particular, there should be no duplicates. For example, if one pixel in sensor N collocates wi…