-
After training en-hu we noticed a somewhat larger quality gap in 4 BLEU points between the teacher and student models.
It’s 24.8 for the quantized and fine-tuned student vs 30.2 BLEU for the teache…
-
```
RTX 2080 Ti
python 3.7.7 hcff3b4d_5
cuda100 1.0 0 pytorch
pytorch 0.4.1 py37_py…
-
Is it possible to fine-tune Whisper/Distil-Whisper to achieve mixed speech transcription like Hindi+English in a single sentence which is common in casual conversations. Has anyone tried this before? …
-
Hello,
When I try the DETIC model. I do not see any result : It seems labelled images are zero.
person
Loading pretrained CLIP
/media/csverma/M2Disk/Projects/CompVis/ObjectDetection/AutoDis…
-
@intfloat
Hi,
I'm implementing a E5 fine tune task similar to [1066](https://github.com/microsoft/unilm/issues/1066).
I am trying to run a simple E5-large fine-tuning with BM25 hard negatives…
-
# URL
- https://arxiv.org/abs/2305.01649
# Affiliations
- George Cazenavette, N/A
- Tongzhou Wang, N/A
- Antonio Torralba, N/A
- Alexei A. Efros, N/A
- Jun-Yan Zhu, N/A
# Abstract
- Datase…
-
Thanks authors for the insightful work!
I want to understand more details about tuning the visual tokenizer. Could you mind explaining about what kind of dataset used in training your own visual to…
-
Masked Generative Distillation
https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136710053.pdf
https://github.com/yzd-v/MGD
CMX的特征出来后参考MGD的方式做一下mask,然后用重建后的特征与RGB的特征去比较,特征重建模块与CMX一并训练。
…
-
Hi,
I was looking at the zero cost, zero time, zero shot notebook for financial sentiment analysis (i.e., [this one](https://github.com/huggingface/setfit/blob/main/notebooks/zero_cost_zero_time_ze…
-
Hi, thank you very much for your work. Now I want to training openfold, and download the corresponding dataset(RODA dataset), and then use the following script for training.
`python3 train_openfold.p…