bigcode-project / bigcode-analysis

Repository for analysis and experiments in the BigCode project.
Apache License 2.0

cannot import AttentionType from gpt2 #20

Closed: ocramz closed this issue 1 year ago

ocramz commented 1 year ago

Loading multi_query_experiments.profile_hf_generate results in:

ImportError: cannot import name 'AttentionType' from 'transformers.models.gpt2.modeling_gpt2'

I find this puzzling, since AttentionType is not exported by transformers.models.gpt2.modeling_gpt2 in transformers 4.20.0.
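For anyone who wants to confirm this on their own install, here is a minimal sketch of the check. The helper name exports_name is hypothetical and not part of transformers or this repository; it uses only the standard-library importlib.

```python
import importlib


def exports_name(module_path: str, name: str) -> bool:
    """Return True if `module_path` can be imported and defines `name`.

    Hypothetical helper for illustration; not part of transformers.
    """
    try:
        module = importlib.import_module(module_path)
    except ImportError:
        # The module (or the package that contains it) is not installed.
        return False
    return hasattr(module, name)
```

Calling exports_name("transformers.models.gpt2.modeling_gpt2", "AttentionType") should return False on an install where the import above fails, and True on a build that defines the class.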

Are you using a custom fork of transformers by chance?