-
I tried to run the "search corpus by words" option in the "search by single word or collocations" GUI, but I got an error.
![search words gui](https://github.com/NLP-Suite/NLP-Suite/assets/142953438/b92…
-
See: https://github.com/david-allison/manx-corpus-search/actions/workflows/publish.yml
Improve, or replace with CI from https://github.com/david-allison/manx-search-data/actions/workflows/publish.y…
-
> Add support for text-search REGEX overrides
> A major constraint for this application is its tight coupling to the three methods of searching text by reference (1, 1.1, and 1.1-4). This is also ba…
-
Thanks for the awesome work! I'm not sure what retrieval corpus is used in the DPR results. The paper mentions 2.9M entities and I can see that they seem to be in the intersection of Wikipedia and Wik…
-
Hi, I want to reproduce the results of Visualized BGE, but the zero-shot benchmarks, such as WebQA, are not clear. Can you provide the evaluation datasets and code for the zero-shot benchmarks? Thanks!
-
Steps:
- Develop use case
- Find and prep data
- Load and query
Resources:
- [Caselaw Access Project (CAP)](https://case.law/)
- [US Code](https://uscode.house.gov/)
- [Legal AI Benchmarks](https:…
-
(Probably should be worked on alongside the glossing and additional sign issues, and maybe metadata: #105, #106, #145.)
Allow user to define a set of tags that can be associated with any sign in the s…
-
When right-clicking on a cell in a grid, a context menu should appear giving the options: find all in this corpus, find all in selected corpora, find in all corpora. The second option might ideally …
-
Let's say you search the Globalise corpus for "Amsterdam". The results will be sorted by relevance (which I take to be the frequency of occurrence of the keyword relative to the amount of text on the page). …
-
**To Reproduce**
1. In the advanced search, use the [query `人前で話すことに慣れていないの。` and tag `Tanaka Corpus`](https://tatoeba.org/en/sentences/search?query=%E4%BA%BA%E5%89%8D%E3%81%A7%E8%A9%B1%E3%81%99%E3%8…