-
### Missing functionality
Word clouds contain the most common words, and for free text fields, these words are often: 'and', 'to', 'the', 'from' etc. Which provide no meaningful insight into the da…
-
*@kelson42 commented on Mar 12, 2017, 8:21 PM UTC:*
Stopwords are words which should not be indexed (during the FT index process) and also be ignore during the FT search. This stopwords are language …
ghost updated
1 month ago
-
## User Story
As a user it's distracting to see common stopwords (https://github.com/pulibrary/pul_solr/blob/main/solr_configs/catalog-production-v2/conf/stopwords.txt) highlighted in search results. …
-
### Bug Description:
Manticore removes all ngram characters from the stopwords file despite their collocation.
```sql
CREATE TABLE IF NOT EXISTS test (data text indexed) charset_table='non_cjk' s…
-
In [get_widget_data()](https://github.com/htrc/torchlite-backend/blob/a4cd41836cd08d653b4dde98332daa9bd6ed94ce/htrc/torchlite/routers/dashboards.py#L148), call a function that returns a version of `fi…
-
When trying the search on the demo website I'm unable to find an exact result. It only shows fuzzy results.
I typed 'why' too fast the first time and realized what was going on soon after.
Is this …
-
On the stop words API Reference, we provide a link to an external website containing lists of possible stop words for different languages. This website doesn't appear to exist anymore.
Link to the …
-
Through Python 3.6 and scikit-learn, the model will predict the language of new data. Steps include data preprocessing, feature extraction, model training, and evaluation. Techniques like tokenization…
-
Hi have this core dump in the log
```
ct 09 17:06:23 indexer-worker(l*******************): Panic: file hash.c: line 266 (hash_table_insert_node): assertion failed: (opcode == HASH_TABLE_OP_UPDATE)…
-
Remove stopwords from statements and sections.