HomebrewNLP / Olmax

HomebrewNLP in JAX flavour for maintainable TPU training
BSD 2-Clause "Simplified" License
45 stars, 6 forks

Image Classification #11

Open ClashLuke opened 2 years ago

ClashLuke commented 2 years ago

At the moment, we have a novel architecture that's very powerful in language modelling. However, we don't know whether it will transfer to other domains as well as the transformer does. That's why it'd be interesting to test its versatility by training it on ImageNet.

This issue is about implementing the input projection for image tokens (as in ViT), building the necessary data pipelines, and testing the model on this new modality.
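
For reference, a ViT-style input projection could look roughly like the sketch below. It is not taken from this repository: the function names (`patchify`, `init_projection`, `embed_patches`), the NHWC layout, the patch size of 16 and the feature width of 64 are all assumptions chosen for illustration. It cuts each image into non-overlapping patches, flattens them, and applies one shared linear layer so that every patch becomes a token.

```python
import jax
import jax.numpy as jnp


def patchify(images: jnp.ndarray, patch_size: int) -> jnp.ndarray:
    """Split NHWC images into flattened, non-overlapping square patches."""
    batch, height, width, channels = images.shape
    assert height % patch_size == 0 and width % patch_size == 0
    rows, cols = height // patch_size, width // patch_size
    # (batch, rows, patch, cols, patch, channels)
    x = images.reshape(batch, rows, patch_size, cols, patch_size, channels)
    # Bring both patch-grid axes in front of the within-patch axes.
    x = x.transpose(0, 1, 3, 2, 4, 5)
    # One token per patch: (batch, rows * cols, patch * patch * channels)
    return x.reshape(batch, rows * cols, patch_size * patch_size * channels)


def init_projection(key, patch_size: int, channels: int, features: int):
    """Hypothetical parameter init for the linear patch projection."""
    in_dim = patch_size * patch_size * channels
    kernel = jax.random.normal(key, (in_dim, features)) * in_dim ** -0.5
    return {"kernel": kernel, "bias": jnp.zeros((features,))}


def embed_patches(params, images: jnp.ndarray, patch_size: int) -> jnp.ndarray:
    """Project every flattened patch to a token of width `features`."""
    return patchify(images, patch_size) @ params["kernel"] + params["bias"]


# Assumed ImageNet-like input: 224x224 RGB, 16x16 patches, 64 features.
key = jax.random.PRNGKey(0)
params = init_projection(key, patch_size=16, channels=3, features=64)
images = jnp.zeros((2, 224, 224, 3))
tokens = embed_patches(params, images, patch_size=16)
print(tokens.shape)  # (2, 196, 64)
```

The resulting `(batch, tokens, features)` tensor has the same shape as embedded text tokens, so in principle it could be fed to the existing model body unchanged. Position information and a classification head are left out of this sketch.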