-
## Keyword: sgd
### Doubly Stochastic Models: Learning with Unbiased Label Noises and Inference Stability
- **Authors:** Authors: Haoyi Xiong, Xuhong Li, Boyang Yu, Zhanxing Zhu, Dongrui Wu, Dejin…
-
你好,我使用的是样例测试集,想跑通README. 但是发现,在训练的时候,会卡住,然后超时;
[batch=23/3200]:
Train time/batch: 22
Train time/sample: 198
Train time/batch_in_epoch: 6
Train time/sample_in_e…
-
https://lmsys.org/blog/2023-06-29-longchat/
https://arxiv.org/abs/2305.07185
https://www.reddit.com/r/LocalLLaMA/comments/14fgjqj/a_simple_way_to_extending_context_to_8k/
https://github.com/epfml…
-
## Keyword: detection
### RacketStore: Measurements of ASO Deception in Google Play via Mobile and App Usage
- **Authors:** Authors: Nestor Hernandez, Ruben Recabarren, Bogdan Carbunar, Syed Ishti…
-
# 💻 cs
## 📚 mask (total: 9)
### 📃 Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays
- **Authors:** Xinxu Wei, Haohan Bai, Xianshi …
-
In case anyone is interested in the credibility of the national-level daily confirmed cases, see the Appendix of [ArXiv:2007.11779](https://arXiv.org/abs/2007.11779) = [zenodo.4432080](https://zenodo.…
-
**Context**
We often train multi-modal deep learning models (in our case, various contrasts of MRI, e.g. T1(ce), T2, T2*, FLAIR, etc) to take advantage of the complementary information that these mod…
-
Hi,
Can you please add our recent CD paper with Transformers ("A Transformer-Based Siamese Network for Change Detection") to your collection?
arxiv link: https://arxiv.org/abs/2201.01293
Code: …
-
I am trying to train similar model for action classification on videos, It has around 200 classes.
Following is the code for loss_labels:
```py
def loss_labels(self, outputs, targets, indices, nu…
-
It seems that your mwer loss implementation needs prior beam search for inputs for mwer_loss function.
We can get 'seq_logprobs' during beam search for each hypothesis, however, your implementation…