AnswerDotAI / bert24

Apache License 2.0

Setting intermediate size back to its original definition #41

Closed NohTow closed 1 month ago

NohTow commented 1 month ago

As discussed on Discord, the refactor cut the `intermediate_size` of the GLU in half. This PR sets it back to its original definition for consistency. I also took the opportunity to fix some typos in the docstrings.
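For readers outside the Discord discussion, here is a rough sketch of the two conventions as I understand them. It is not the repo's code: `glu_shapes`, the `halved` flag, and the sizes 768 and 1152 are hypothetical and only illustrate the shapes. It assumes the original definition treats `intermediate_size` as the width of each GLU branch (gate and value), while the halved variant treats it as the combined width of both.

```python
def glu_shapes(hidden_size: int, intermediate_size: int, halved: bool = False):
    """Return (in_features, out_features) for the input and output
    projections of a GLU MLP. Hypothetical helper, for illustration only."""
    # Width of each of the two branches (gate and value) after the split.
    branch = intermediate_size // 2 if halved else intermediate_size
    # The input projection produces gate and value in a single matmul.
    wi = (hidden_size, 2 * branch)
    # The output projection consumes the elementwise product gate * value.
    wo = (branch, hidden_size)
    return wi, wo


# Illustrative sizes only, not taken from any config in the repo.
original = glu_shapes(768, 1152)             # assumed original definition
halved = glu_shapes(768, 1152, halved=True)  # assumed post-refactor behavior

print(original)  # ((768, 2304), (1152, 768))
print(halved)    # ((768, 1152), (576, 768))
```

Under these assumptions, the same config value gives an MLP half as wide in the halved convention, which is why the definition needs to be consistent.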

ohallstrom commented 1 month ago

LGTM