-
Widgets were added to the search_byWord_main and CoNLL_table_analyzer GUIs below to let users extract 1, 2, 3, ... words before and after a given search word. We need the code to implement the…
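
One possible shape for the extraction logic is sketched below. It is only an illustration: the function name `context_windows`, its parameters, and the regex tokenization are assumptions, not code from search_byWord_main or CoNLL_table_analyzer.

```python
import re


def context_windows(text, search_word, n_before=2, n_after=2):
    """Return (before, match, after) for each occurrence of search_word.

    Hypothetical helper: the name and signature are illustrative only.
    """
    # Naive tokenization on word characters; a real pipeline would likely
    # reuse the tokens already available in the CoNLL table.
    tokens = re.findall(r"\w+", text)
    target = search_word.lower()
    windows = []
    for i, token in enumerate(tokens):
        if token.lower() == target:
            # Clamp the left edge so the slice never wraps around.
            before = tokens[max(0, i - n_before):i]
            after = tokens[i + 1:i + 1 + n_after]
            windows.append((before, token, after))
    return windows


sample = "The quick brown fox jumps over the lazy dog"
result = context_windows(sample, "fox", n_before=2, n_after=1)
print(result)
```

The two widget values would map onto `n_before` and `n_after`; windows near the start or end of the text simply come back shorter.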
-
I want to use the compress text source code to compress models without pip install.
Here's my code:
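```
# Assumption: load_facebook_model comes from gensim
from gensim.models.fasttext import load_facebook_model

org_model_path = "cc.en.300.bin"
# Load the full fastText model and keep only its word vectors
ft = load_facebook_model(org_model_path).wv
ft_fp16 = …
```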
-
As discussed in our last meeting with Pasi, the visualisations do not seem to work for the Norway corpus (for term frequency and n-grams).
-
### testing notes (QA - round two)
In the [QA site document search](https://test-geniza.cdh.princeton.edu/en/documents/):
- [ ] Searching `אלמרכב AND אלצ` should return the results, in correct order…
-
The n-grams search produces wordclouds that include the search word, whereas file_search_byWord_util excludes the search word from the wordcloud. Should the two behaviors be made consistent?
![image](https://…
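
If the two code paths are to be aligned, one option is a single frequency helper with a switch for dropping the search word. The sketch below is an assumption about how that could look; `wordcloud_frequencies` and its `exclude_search_word` flag are hypothetical names, not functions from the existing utilities.

```python
import re
from collections import Counter


def wordcloud_frequencies(text, search_word, exclude_search_word=True):
    """Count word frequencies, optionally dropping the search word itself.

    Hypothetical helper: both the n-grams search and the word search could
    call it so that their wordclouds treat the search word the same way.
    """
    tokens = [token.lower() for token in re.findall(r"\w+", text)]
    if exclude_search_word:
        target = search_word.lower()
        tokens = [token for token in tokens if token != target]
    return Counter(tokens)


freqs = wordcloud_frequencies("war and peace and war", "war")
print(freqs)
```

The resulting counts can then be handed to whatever wordcloud renderer the tool already uses.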
-
**Describe the bug**
When initializing the estimator, it says that instance count is not one of the keywords.
**To reproduce**
Run the default example blazingtext_text_classification_dbpedia.
**Expected beha…
-
When I try to exclude punctuation or stopwords in the N-gram calculation settings, the terminal gives me the error message below.
What should I do to fix it?
-
We already use fielddata on the `_uid` field today in order to implement random sorting. However, given that doc values are disabled on `_uid`, this will use an insane amount of memory in order to loa…
-
**Describe the bug**
UnicodeDecodeError: 'gbk' codec can't decode byte 0xa6 in position 1105: illegal multibyte sequence
**To reproduce**
C:\Users\dell>python ./TextBox/run_textbox.py --model=BART --dataset=sams…
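
This kind of error typically appears when a file is opened without an explicit encoding on a Chinese-locale Windows system, where Python's default text encoding is GBK. The sketch below assumes the file being read is UTF-8. `read_text_utf8` is a hypothetical helper, and which TextBox call actually triggers the error is not known from this report.

```python
import os
import tempfile


def read_text_utf8(path):
    """Read a text file as UTF-8 regardless of the platform default."""
    # Passing encoding explicitly avoids falling back to the locale
    # encoding (GBK/cp936 on a Chinese-locale Windows install).
    with open(path, encoding="utf-8") as handle:
        return handle.read()


original = "summary: café, naïve, 摘要"
with tempfile.TemporaryDirectory() as tmp_dir:
    sample_path = os.path.join(tmp_dir, "sample.txt")
    with open(sample_path, "w", encoding="utf-8") as handle:
        handle.write(original)
    recovered = read_text_utf8(sample_path)

print(recovered == original)
```

When the `open()` call sits inside library code that cannot easily be edited, running Python in UTF-8 mode (`python -X utf8 ...` or setting `PYTHONUTF8=1`, available since Python 3.7) may be a workaround worth trying.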
-
Hi John. Thank you for developing this application. It is very interesting. I have two questions for which I could not find information in the tutorial.
1. Is there a way that I can add an additional…