explosion / thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
https://thinc.ai
MIT License
2.82k stars 275 forks source link

Add ParametricAttention.v2 #913

Closed danieldk closed 10 months ago

danieldk commented 10 months ago

Description

This layer is an extension of the existing ParametricAttention layer, adding support for transformations (such as a non-linear layer) of the key representation. This brings the model closer to the paper that suggested it (Yang et al, 2016) and gave slightly better results in experiments.

Types of change

Feature

Checklist

netlify[bot] commented 10 months ago

Deploy request for thinc-ai pending review.

Visit the deploys page to approve it

Name Link
Latest commit 80f47b793be92f82d533b5c244ede9be4b62965e