soniajoseph / ViT-Prisma

ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).
Other
165 stars 18 forks source link

Test standard mech interp techniques on Video ViT #82

Open soniajoseph opened 7 months ago

soniajoseph commented 7 months ago

@themachinefan adapted the video vision transformer VivitForVideoClassification to Prisma from transformers (thank you!), but it's not yet clear if the standard mech interp techniques apply or break.

Try running direct logit attribution, etc on the video ViT. Put your results in a jupyter notebook as research code.