baaivision / Emu3

Next-Token Prediction is All You Need
Apache License 2.0
1.84k stars 73 forks source link

Add information about training #7

Open Andrey36652 opened 1 month ago

Andrey36652 commented 1 month ago

Hello, thank you for the research. Please share more info about pre-training process.

Data:

Hardware: what kind of hardware were used, for how long, and maybe pretraining cost estimation.

nunbuzor commented 1 month ago

and why not use natural language synthetic captioning for images?

Molly-3000 commented 1 month ago

Please also include the source of the videos. Additionally, provide the composition ratio of different categories as mentioned in the paper.

yinsong1986 commented 1 month ago

+1 for this question. Thanks!