Tsinghua-MARS-Lab / StateTransformer

remove attention mask generation in pre-train forward function #92

Status: Closed (closed by Shiduo-zh 1 year ago)