nvtransfer / RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Apache License 2.0
646 stars 43 forks

New Command R 08-2024 and Command R+ 08-2024 models #60

Closed (jukofyork closed 1 month ago)

jukofyork commented 1 month ago

Any chance of testing the new versions of these?

They still claim 128k context, and the smaller Command R 08-2024 now uses GQA, unlike the old version.

hsiehjackson commented 1 month ago

Results are updated :)

jukofyork commented 1 month ago

Thanks!