-
The provided datasets have four variants, each serving a specific purpose, and contain a `text_description` as described below. E.g., gov:
1. **syntheticDocQA_government_reports_test** – **No text_des…
-
## ASR
- [ ] ASR2K: Speech Recognition for Around 2000 Languages without Audio https://arxiv.org/abs/2209.02842
- [x] Whisper: Whisper is a general-purpose speech recognition model. https://github…
-
A couple of comments for readability:
- [x] remove org from model name
- [x] and add reference link #1316
- [x] multiply scores by 100 and keep one decimal, e.g. 78.1 (@orionw not sure if this …
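For illustration, the two formatting items in the checklist above could be sketched as follows. This is only a rough sketch under assumptions not stated in the thread: that model names look like `org/name`, and that raw scores are stored as fractions in [0, 1]. The helper names `display_name` and `format_score` are hypothetical and not taken from the project.

```python
def display_name(model_id: str) -> str:
    """Drop a leading 'org/' prefix, if any (assumes 'org/name' style ids)."""
    return model_id.split("/", 1)[-1]


def format_score(score: float) -> str:
    """Scale a fractional score to a percentage with one decimal place."""
    return f"{score * 100:.1f}"


if __name__ == "__main__":
    # Hypothetical values, for demonstration only.
    print(display_name("some-org/some-model"), format_score(0.781))
```

Returning the score as a string keeps the trailing zero (e.g. `78.0`), which a rounded float would lose when rendered in a table.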
-
Hi, it's me again!
I tried to reproduce the LIBERO benchmark results, but failed.
|Model|Spatial|Object|Goal|Long|Average|
|------|------|------|------|------|------|
|Reported (3trials mean)|…
-
@twolinin Hello, when I tried to evaluate different phasing tools as you did in the LongPhase paper, the results showed large differences between the two benchmark VCF datasets (GIAB _HG002_GRCh3…
-
### Name and Institution (Required)
Name: Melissa Sulprizio
Institution: Harvard
### Description of your issue or question
When generating concentration difference plots for 1-month Transpor…
-
Hi! Awesome work and datasets collection! Is there a way (or plan to release such a script) to launch a model's benchmark evaluation on the full set of data and obtain a comprehensive report on all th…
-
I cannot find the file "irrationally_interface.py".
-
```
2024-11-09 21:39:44.994636: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:485] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already b…
-
Similar to benchmarks created for image classification, we would like to create a benchmark for video datasets.
A few tasks for tackling video datasets:
1. Identify ML training related to video …