-
@wnhsu I am curious about the way to train a multilingual HuBERT as [this](https://github.com/facebookresearch/fairseq/blob/0338cdc3094ca7d29ff4d36d64791f7b4e4b5e6e/examples/speech_to_speech/docs/text…
-
- moco 1x1
- resnet50 with conv1x1
- resnet50
- moco (without 1x1)
- moco (with 1x1 but with random weight)
https://github.com/Berkeley-Data/hpt/blob/taeil/references/model_architectures.md
-
I trained models on Windows and then tried to use them on Linux, but I could not load them because of incorrect path joining. During model loading, I got `learner_path` in the following format `exp…
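For illustration, here is a minimal sketch of one way to normalize a path string saved on Windows so it resolves on Linux. The actual `learner_path` value is truncated above, so the path used here is hypothetical, as is the helper name `to_portable_path`; this is not the library's own fix.

```python
from pathlib import PureWindowsPath


def to_portable_path(saved_path: str) -> str:
    """Return `saved_path` with forward slashes as separators.

    PureWindowsPath treats both "\\" and "/" as separators and does not
    touch the filesystem, so it can parse either style on any OS.
    """
    return PureWindowsPath(saved_path).as_posix()


# Hypothetical path, for illustration only.
saved = "experiments\\run_01\\learner.pkl"
print(to_portable_path(saved))  # experiments/run_01/learner.pkl
```

Storing paths in POSIX form at save time (or as a list of components) avoids the problem entirely, since forward slashes are accepted on both platforms.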
-
I am confused about the loss functions for entities and predicates.
In your paper, Eq. (10) and Eq. (11) use the l2_loss between E^O and E^T.
**However**, in the code, it seems that you calculat…
-
Hello, I am wondering how supervised fine-tuning with the CONTRIQUE encoder performs compared to an ImageNet-pretrained model, but I can't find such an experiment in the paper.
Can you share the results if you have done…
-
This work is part of a master's project, DS Project @ University of Vienna
- Analyze [CheXpert dataset](https://stanfordmlgroup.github.io/competitions/chexpert/) ([paper](https://arxiv.org/pdf/1901.…
-
## 🚀 Feature
The ESC-10/50 dataset is widely used and not yet available in `torchaudio.datasets`.
[More Information](https://dx.doi.org/10.7910/DVN/YDEPUT)
## Motivation
- This dataset is often use…
-
Congrats on the great work! I would appreciate some clarification on a point that confuses me. In line 77 of utils.py, new_t is divided by interval, which is (total_end-total_start)/num_frame, and num_f…
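To make the quantity in question concrete, here is a small sketch of the arithmetic as described: `interval` is the time span divided by the number of frames, and dividing a time offset by it gives a (fractional) frame position. How `new_t` is actually computed in utils.py is not shown above, so treating it as an offset from `total_start` is an assumption, and the function names are hypothetical.

```python
def frame_interval(total_start: float, total_end: float, num_frame: int) -> float:
    """Duration covered by one frame: (total_end - total_start) / num_frame."""
    return (total_end - total_start) / num_frame


def to_frame_position(t: float, total_start: float, total_end: float, num_frame: int) -> float:
    """Map a timestamp to a fractional frame position.

    Assumption: the value being divided is the offset of `t` from `total_start`.
    """
    interval = frame_interval(total_start, total_end, num_frame)
    return (t - total_start) / interval


# A 10-second span split into 20 frames gives 0.5 s per frame,
# so the 5-second mark falls at frame position 10.
print(to_frame_position(5.0, 0.0, 10.0, 20))  # 10.0
```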
-
I'm training a photo swin_unet_2x model with a GAN. I use a cosine lr scheduler with init lr = 1e-5. After some tries, I found that the discriminator loss fluctuated around 0.8 (the threshold for genera…
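For reference, a cosine schedule of the kind mentioned can be sketched as below. Only the initial learning rate of 1e-5 comes from the post; the total step count, the minimum learning rate of 0, and the absence of warmup or restarts are assumptions, and the training framework's own scheduler may differ.

```python
import math


def cosine_lr(step: int, total_steps: int, init_lr: float = 1e-5, min_lr: float = 0.0) -> float:
    """Cosine-annealed learning rate, decaying from init_lr to min_lr."""
    # Clamp so that steps outside [0, total_steps] stay at the endpoints.
    step = min(max(step, 0), total_steps)
    cosine = 0.5 * (1.0 + math.cos(math.pi * step / total_steps))
    return min_lr + (init_lr - min_lr) * cosine


# Hypothetical 100-step run: print the rate at the start, middle, and end.
for s in (0, 50, 100):
    print(s, cosine_lr(s, total_steps=100))
```

With these assumptions the rate is half of its initial value at the midpoint and reaches the minimum at the final step, which is worth keeping in mind when reading loss curves late in training.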
-
Hello, could you provide the trained weights used in your paper so that we can reproduce your predictions? Thank you!