facebookresearch / ToMe

A method to increase the speed and lower the memory footprint of existing vision transformers.

Candidate fix for a missing distillation token #3

Closed by dbolya 1 year ago

dbolya commented 1 year ago

Adds compatibility with different timm implementations, in which the distillation token may be missing.

Candidate fix for #1.
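
The diff itself is not shown here. As a minimal sketch of the kind of defensive check such a fix implies, the snippet below assumes the distillation token is exposed as a `dist_token` attribute that some timm versions set to `None` and others omit entirely. The helper name `has_distill_token` and the stand-in model objects are hypothetical and only illustrate the idea.

```python
from types import SimpleNamespace


def has_distill_token(model) -> bool:
    """Return True only if the model defines a non-None `dist_token`.

    Using getattr with a default avoids an AttributeError on models
    that do not define the attribute at all (assumed attribute name).
    """
    return getattr(model, "dist_token", None) is not None


# Hypothetical stand-ins for models built by different timm versions.
with_none = SimpleNamespace(cls_token=object(), dist_token=None)      # attribute present but unused
without_attr = SimpleNamespace(cls_token=object())                    # attribute missing entirely
distilled = SimpleNamespace(cls_token=object(), dist_token=object())  # distilled model

for name, model in [("with_none", with_none),
                    ("without_attr", without_attr),
                    ("distilled", distilled)]:
    print(name, has_distill_token(model))
```

Accessing `model.dist_token` directly would raise on the second stand-in, whereas the `getattr` form treats a missing attribute the same as `None`.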