-
```
Searching for "do" should retrieve entries with the word "doença", but not
"dos", and so on. The list should be localizable and easily changeable.
```
Original issue reported on code.google.com …
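A minimal sketch of the behaviour requested above, assuming a simple whitespace tokenizer and a prefix match on words. The `STOP_WORDS` table and the `prefix_search` function are hypothetical names used only for illustration; in practice the per-locale lists could live in editable text files so they stay easy to change.

```python
# Hypothetical per-locale stop word lists (illustrative, not exhaustive).
STOP_WORDS = {
    "pt": {"a", "o", "as", "os", "da", "das", "de", "do", "dos", "e", "em"},
    "en": {"a", "an", "and", "of", "the", "very", "we"},
}


def prefix_search(query, entries, locale):
    """Return entries that contain a non-stop word starting with `query`."""
    stop = STOP_WORDS.get(locale, set())
    q = query.lower()
    hits = []
    for entry in entries:
        # Assumption: words are separated by whitespace; punctuation is not stripped.
        words = entry.lower().split()
        if any(w.startswith(q) and w not in stop for w in words):
            hits.append(entry)
    return hits


pt_entries = ["doença cardíaca", "livro dos sonhos"]
print(prefix_search("do", pt_entries, "pt"))
```

With these lists, "doença cardíaca" matches the query "do", while "livro dos sonhos" does not, because its only word starting with "do" is the stop word "dos".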
-
Nice job on the word cloud visualization of abstract and title!
Some words, such as pronouns (e.g. "we"), could be removed to leave room for more informative topic words.
-
E.g. a search for "ver" returns three results: two for "vermicelli", which is legitimate, and one for "very", which should be excluded.
-
The stop word list might need extending with "Member" and "Stage".
-
Can the drop_stop_words parameter of turicreate.text_classifier.create be customized? It currently only works for English, so it has no effect when I train on Chinese or other languages.
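One possible workaround, sketched here in plain Python, is to strip a custom stop word list from the text before it reaches the classifier. This does not use any turicreate call; `drop_custom_stop_words` and the sample list are hypothetical, and the sketch assumes the text has already been segmented into space-separated tokens by a separate word segmenter, which matters for Chinese.

```python
def drop_custom_stop_words(text, stop_words):
    """Remove tokens found in `stop_words` from pre-segmented text."""
    # Assumption: tokens are separated by whitespace.
    tokens = text.split()
    return " ".join(t for t in tokens if t not in stop_words)


# Hypothetical, very short Chinese stop word list for illustration only.
zh_stop_words = {"的", "了", "是"}

print(drop_custom_stop_words("我 的 猫 是 黑色 的", zh_stop_words))
```

The filtered strings could then be used as the text column for training. Whether the built-in English stop word removal should also be switched off in that case is left as an open question here.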
-
(This affects both Blacklight and Traject, but since indexing is on the Traject side, I thought I'd put it here)
We have a partly implemented Title Browse which ran into an issue with the indexing …
-
```
Any title that starts with "The" files under the T's - so I found a bunch of "The journal
of.." there when I was trying to look at publication authorities.
We should ignore "The" (and "a" and "an")
```
Original is…
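A minimal sketch of the filing rule suggested above, assuming English titles and that only a single leading "The", "A" or "An" should be skipped. `browse_key` is a hypothetical helper name, not part of any existing code base mentioned here.

```python
LEADING_ARTICLES = ("the", "a", "an")


def browse_key(title):
    """Return a sort key that ignores one leading English article."""
    stripped = title.strip()
    # Split off the first word only; the rest of the title stays intact.
    words = stripped.split(None, 1)
    if len(words) == 2 and words[0].lower() in LEADING_ARTICLES:
        return words[1].lower()
    # Titles consisting solely of an article (e.g. "A") are kept as they are.
    return stripped.lower()


titles = ["The Journal of Botany", "Taxon", "A History of Plants", "An Atlas"]
print(sorted(titles, key=browse_key))
```

With this key, "The Journal of Botany" files under J rather than T, while the displayed title is unchanged because only the sort key is altered.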
-
In RCIT_A1 (private ontology), with screenshots omitted (sorry):
(1). For the classes ending with "L/min", 3 out of 6 are annotated. Why can half of them be annotated but not the rest?
(2)…