corpus-statistics Search Results

1000+ results for corpus-statistics


AmenRa/retriv #22

[Feature Request] Add documents to index after initializing?

Hi, I understand that there are reasons why we only want to do indexing once, since there are corpus-level statistics that need to be calculated. But is there any way to index a huge batch of do…

alex2awesome updated 3 months ago
2
Helsinki-NLP/OPUS-API #2

change the update script to use OPUS YAML files

Change the DB update script to use the new YAML files in https://github.com/Helsinki-NLP/OPUS. For example: * https://github.com/Helsinki-NLP/OPUS/blob/main/corpus/RF/v1/info.yaml * https://github…

jorgtied updated 1 year ago
1
common-voice/common-voice #4156

[FR] Show some statistics for text-corpus progress

**Is your feature request related to a problem? Please describe.** In the old Sentence Collector, we could see how many sentences are waiting for us to approve/reject. Now, we don't know how many are…

HarikalarKutusu updated 10 months ago
2
common-voice/common-voice #4196

[FR] Add text-corpus related statistics to the panel

**Is your feature request related to a problem? Please describe.** Text-corpus generation is the most important and troublesome part of the dataset and many language communities are failing to extend…

HarikalarKutusu updated 11 months ago
1
google/clusterfuzz #2946

Web app: empty Testcases, Corpora, Fuzzer Statistics pages

We've set up ClusterFuzz on GCP and ran a few fuzz jobs but Testcases, Corpora, Fuzzer Statistics pages are empty. I've checked logs and there are no errors when accessing these pages. Confirmed cor…

andrei-near updated 1 year ago
1
MontrealCorpusTools/Montreal-Forced-Aligner #385

7 utterances that need a larger beam to align There were…

**Debugging checklist** [ ] Have you updated to latest MFA version? Yes [ ] Have you tried rerunning the command with the `--clean` flag? Yes **Describe the issue** A clear and concise descrip…

wwdok updated 2 years ago
3
runbox/runbox7 #1482

Overview improvements

**Is your feature request related to a problem? Please describe.** Information overflow is ubiquitous in email communication, indicated by the Inbox and other folders containing more messages than ca…

gtandersen updated 1 month ago
2
California-QSO-Party/cabfixer #11

Using column information to improve "column finding"

@VictorDenisov you brought up that using the expected content of each column could benefit the column finding algorithm. It would be nice if the column content was customizable, so this program could…

tepperly updated 3 months ago
2
confluentinc/ksql #2849

Illegal initial character when CSAS to Avro

This is a valid stream created in KSQL from existing JSON data - note the column `3ALPHA`: ``` ksql> DESCRIBE CORPUS_RAW; Name : CORPUS_RAW Field | Type ----------------…

rmoff updated 5 years ago
1
apache/lucene #13251

Can we add configuration on dropping raw vectors from quanti…

### Description Tangentially related to: https://github.com/apache/lucene/issues/13158 But, I have observed, that as the corpus reaches a fairly large size, the actual quantiles aren't changing mu…

benwtrent updated 2 months ago
2
