kmheckel / Q-S5

A Fully Quantized SSM Implementation
https://arxiv.org/abs/2406.09477
MIT License
2 stars 0 forks source link

Baseline & Long Range Arena #9

Open kmheckel opened 2 months ago

kmheckel commented 2 months ago

General approach:

LRA:

Bonus:

stevenabreu7 commented 2 months ago

Isn't LRA with sequential CIFAR-10 instead of MNIST?

kmheckel commented 2 months ago

You're probably right - I didn't copy down all of the benchmarks and was basing the list off what was in the S5 repo so this may not be a perfect list. Will fix it.

On Wed, May 1, 2024, 22:45 Steven Abreu @.***> wrote:

Isn't LRA with sequential CIFAR-10 instead of MNIST?

— Reply to this email directly, view it on GitHub https://github.com/kmheckel/NeuroSSMs/issues/9#issuecomment-2089186568, or unsubscribe https://github.com/notifications/unsubscribe-auth/AMG7YKBFQEIRUXT527CFPMTZAFPBDAVCNFSM6AAAAABHCGBJQWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDAOBZGE4DMNJWHA . You are receiving this because you authored the thread.Message ID: @.***>

stevenabreu7 commented 2 months ago

We could also just do both (since MNIST is relatively fast)

AlessandroPierro commented 1 month ago

Retrieval

Pathfinder

Path-X