-
Since our texts will be hosted in InkVisitor, it would be great if we could use some already existing open corpus manager and hook it into InkVisitor - to ensure queries over the corpus with selection…
-
### Version
1
### DataCap Applicant
FileTech
### Project ID
FileTech-02
### Data Owner Name
Commoncrawl
### Data Owner Country/Region
United States
### Data Owner Industry
Life Science / He…
-
We have a brand new project courts-db that we should start using to identify courts in CL.
-
```
What steps will reproduce the problem?
1. Open text2onto and add a corpus.
2. Select by concept an Algorithm, no matter which one.
3. Click the run button
What is the expected output? What do you…
-
```
What steps will reproduce the problem?
1. Open text2onto and add a corpus.
2. Select by concept an Algorithm, no matter which one.
3. Click the run button
What is the expected output? What do you…
-
```
What steps will reproduce the problem?
1. Open text2onto and add a corpus.
2. Select by concept an Algorithm, no matter which one.
3. Click the run button
What is the expected output? What do you…
-
```
What steps will reproduce the problem?
1. Open text2onto and add a corpus.
2. Select by concept an Algorithm, no matter which one.
3. Click the run button
What is the expected output? What do you…
-
**Is your feature request related to a problem? Please describe.**
cuDF has an extensive fuzzing suite that could benefit the project by running in the CI.
**Describe the solution you'd like**
In…
-
Just a very quick enhancement request, we should discuss options/usefulness.
It'd be great to have the option to create an in-site "corpus homepage" with extended metadata on the selected corpus. C…
-
```
def build_frequency_table(corpus):
frequency_table={}
for word in corpus:
try:
current_value=frequency_table[word]
new_value=current_value+1
frequency_table[word]=…