themachinefan closed this 7 months ago
- Fix folding to include the final layer (needed for https://github.com/soniajoseph/ViT-Prisma/issues/72).
- Also fix import errors caused by the renaming.
Aside: "head" is a confusing name for the final layer since it conflicts with the attention heads. We should probably rename it.
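For readers unfamiliar with folding, here is a minimal sketch of the idea, assuming "folding" here means absorbing a LayerNorm's learned scale and shift into the linear layer that follows it (such as the final "head"). This is not the code from this PR: the function names, weight layout, and toy values are all hypothetical, and it uses plain Python lists rather than the library's tensors.

```python
# Illustrative sketch only; not ViT-Prisma's implementation.
# Assumed layout: W has d_in rows of d_out values, so the layer computes x @ W + b.

def linear(x, W, b):
    """Compute x @ W + b for a single input vector x."""
    return [sum(x[i] * W[i][j] for i in range(len(x))) + b[j]
            for j in range(len(b))]

def fold_ln_into_linear(gamma, beta, W, b):
    """Absorb the LayerNorm scale (gamma) and shift (beta) into W and b.

    (gamma * x_hat + beta) @ W + b  ==  x_hat @ (gamma[:, None] * W) + (beta @ W + b)
    """
    d_in, d_out = len(gamma), len(b)
    W_folded = [[gamma[i] * W[i][j] for j in range(d_out)] for i in range(d_in)]
    b_folded = [b[j] + sum(beta[i] * W[i][j] for i in range(d_in))
                for j in range(d_out)]
    return W_folded, b_folded

# Hypothetical toy values: x_hat stands for an already centered and normalized input.
x_hat = [0.5, -1.0, 0.5]
gamma = [2.0, 0.5, 1.0]
beta = [0.1, -0.2, 0.3]
W = [[1.0, 2.0], [0.0, -1.0], [3.0, 0.5]]
b = [0.5, -0.5]

# Unfolded path: apply the LayerNorm affine step, then the linear layer.
unfolded = linear([g * x + s for g, x, s in zip(gamma, x_hat, beta)], W, b)

# Folded path: the affine step now lives inside the linear layer's weights.
W_f, b_f = fold_ln_into_linear(gamma, beta, W, b)
folded = linear(x_hat, W_f, b_f)

assert all(abs(u - f) < 1e-9 for u, f in zip(unfolded, folded))
```

If the final layer is left out of folding, its preceding LayerNorm keeps its own scale and shift while every other layer has had them absorbed, so the folded model is only partly simplified.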
Great catch, thank you.