-
Most texts correspond to only one jurisdiction ID, but some texts span multiple jurisdiction IDs, to a maximum of six. Only one (corresponding to Jurisdiction_ID field) currently displays in map. Is i…
-
```
Here the steps to reproduce the problem:
1. Take a corpus.
2. build the corpus Lucene index using a windows machine
(org.apache.lucene.demo.IndexFiles)
3. build the vectors .bin using semanticv…
-
Therefore, with the aim of taking advantage of a massive unstructured corpus from textual world knowledge, we augment the training data with passages retrieved from Wikipedia. To be concise, each conc…
-
I’m attempting to use the MockingBird model to make a chatbot only answer questions related to the .PDF document data in the corpus (so a question like “How is the weather?” or “How are you?” should n…
-
## Actual behavior
Updating this client’s cache takes ~41 seconds.
Loading the pages' cache takes `< 1 second`.
The cache for the `--search` argument takes the rest of the time.
### Explaining t…
-
#### Description of the problem
Searching through text documents is a process that requires lots of computational powers. For a given search keyword, the naive searching algorithm would be to pass th…
-
```
Here the steps to reproduce the problem:
1. Take a corpus.
2. build the corpus Lucene index using a windows machine
(org.apache.lucene.demo.IndexFiles)
3. build the vectors .bin using semanticv…
-
-
Users require to 'save' a set of documents (generated by a query), attached to a user account, for later reuse as a 'corpus'.
In the short term, this can be achieved by the saving query function ( …
-
This is a priority. It should be easier to move from browsing a corpus to browsing the entire database. Right now you have to open the _search_ interface and click the "Browse All" button.