KA for VIT - Githubissues

Provide a comprehensive and robust pipeline and a data preprocessing method that allows training with own datasets. Implement KA on vision transformers to accomplish image classification tasks.(finetuned on several datasets such as Stanford Dogs) Provide a multimodal transfer method to apply KA for audio classification.

zju-vipa / KamalEngine

KA for VIT #43