Open xddun opened 3 weeks ago
I noticed that the vocab.json file in the multimodal model appears to contain garbled text, for example:
vocab.json
"Ġnegot": 11642, "Ġpointers": 27454, "çŃĸçķ¥": 104238, "Ġdelights": 95675, "ĠArrayAdapter": 49640, "Ġfv": 61654, "åIJij社ä¼ļ": 111834, "nowledge": 51186, "çļĦä»»åĬ¡": 108530, "ĠاÙĦØ·ÙĦاب": 140123, "Ġacc": 1029, ".crt": 93869, "(save": 33546,
Could someone explain the reason behind this garbled text and the underlying principles?
https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct/tree/main
I noticed that the
vocab.json
file in the multimodal model appears to contain garbled text, for example:Could someone explain the reason behind this garbled text and the underlying principles?