Open: davidryanau opened this issue 3 months ago
Location in document: S3.SS1.p1.6
Selected HTML: For an input token , the computation of the Attention Block is
Under the single-token assumption, we will show that we can approximate these operations as linear transformations.
Let us first
Hello @davidryanau, thanks for the issue report! We are reviewing your report and will address it as soon as possible.
Description
Words not landing inside boxes
(Optional:) Please add any files, screenshots, or other information here.
No response
(Required) What is this issue most closely related to? Select one.
Choose One
Internal issue ID
01ad90f4-ee03-4fe6-816c-582a555a9f6d
Paper URL
https://arxiv.org/html/2407.16826v1
Browser
Chrome/127.0.0.0
Device Type
Desktop