Yangyi-Chen / SOLO

Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"
Apache License 2.0
95 stars 2 forks source link

GREAT WORK #2

Closed yangluo23 closed 3 weeks ago

yangluo23 commented 1 month ago

Dear authors, Thanks for realeasing the training recipe. Hopefully, it will be helpful enough for beginners

Yangyi-Chen commented 1 month ago

Thanks for your interests. Please let us know if you encounder any difficulty in following the document. We are still adding more details.