-
For vision-and-language pretraining, CC3M, MS COCO, SBU Captions, and VG are very relevant datasets. I haven't been able to download SBU Captions or VG. Here are my questions.
1) How to download SBU c…
-
What are sequence-to-sequence language models and how are they related to transformer models?
-
Is there a formula for estimating the model size of mixture-of-experts models?
I was looking at the model variants in the following blog, and wondered if there's a formula to compute the mode…
-
### Week 1 - Get to know the community
- [x] Join the communication channels
- [x] Open a GitHub issue (this one!)
- [x] Install the Ersilia Model Hub and test the simplest model
- [x] Write a motiva…
-
1. In your opinion, is EVA a method of both model scaling and data scaling? Does pretraining with more data (such as the data used in CLIP finetuning) yield better results than using only the 30M data…
-
The [dataset](https://github.com/OFA-Sys/OFA/blob/main/datasets.md) file describes the datasets for [Pretraining](https://github.com/OFA-Sys/OFA/blob/main/datasets.md#pretraining) and [Vision & …
-
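Hi everyone, we are very happy to announce that the seventh Baidu PaddlePaddle Paper Reproduction Challenge has begun. This round offers 100+ classic and cutting-edge papers to reproduce. The PaddlePaddle Featured Model Challenge is also still running; for details, see the [AI Studio link](https://aistudio.baidu.com/aistudio/competition/detail/406/0/introduction). Can't wait to get started?
To help…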
-
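### Checklist
- [X] A legitimate site with no malware planted.
- [X] An HTTPS site with substantive original content that has published at least 5 original articles, on any topic.
- [X] Has its own domain name, not a free domain.
### Site information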
```json
{
"title": "Kaiming He",
"url": "http://kaiminghe.com/",
"avatar": "h…
-
First of all, thank you for releasing the code.
Now I would like to reproduce the model with the same vision dataset, translated into a different language. As for the text dataset, I would like to use filtere…
acul3 updated
2 years ago