Open rafikg opened 23 hours ago
Hi @rafikg, thank you for reporting this, good catch! I just opened a PR #292 that should fix this issue with the aggregation step, could you verify if it is working for you if you check out that branch?
@gsarti Thanks it is working on my side.
One more question: I annotated some translated sentences by highlighting the error span.
I want to calculate the importance of each translated token
to see if tokens with higher importance correspond to error span
.
In my above example, I calculate the contribution of each source token to generate each translated token. I am not sure how to leverage this to get the importance of each translated token.
Question
This is MRE:
Error:
value_zeroing
: source_length X target_length X n_headsAttention
issues
.