Closed Mooler0410 closed 1 day ago
https://github.com/NVIDIA/Megatron-LM/tree/main/examples/mamba
I use this script to try mamba-2. And I insert three lines:
from megatron.training import get_tokenizer
tokenizer = get_tokenizer()
import pdb; pdb.set_trace()
before this line: https://github.com/NVIDIA/Megatron-LM/blob/e33c8f78a35765d5aa37475a144da60e8a2349d1/tools/run_mamba_text_generation_server.py#L105
to test the tokenizer.
Hi! I found that the id 24639 and id 7298 will be decoded to the same token 'Yes' for mamba-2-hybrid.
Also:
I always think different ids correspond to different tokens. Is there anything wrong with my understanding?
Thanks!