Open Aidenzich opened 5 months ago
NOTE: In Maarten's diagram, $H_n$ should be regarded as $H_n'$ because it is not the final output before the sweep-up
phase. This means that in the complete computation process, H_0
, like other intermediate states, needs to be further processed during the sweep-up
phase.
URL | Description |
---|---|
mamba | The code for selective scan in the original repo is implemented at a low level, making it harder to understand. |
mamba-minimal | Simplified, readable, annotated code for Mamba; not optimized for speed. |
Maarten's blog | The comprehensive introduction by Maarten, the author of BERTopic. The best article I have found 💯 |