-
### What features would you like to see added?
Gemini conversations
### More details
It would be great if Gemini conversations can also auto generate titles.
Also, since Gemini is free, it would…
-
Great job for this toolkit .
I'm attempting to merge two models with differing `vocab_size`: `augmxnt/shisa-7b-v1` (base) and `teknium/OpenHermes-2.5-Mistral-7B`. The `augmxnt/shisa-7b-v1` model ha…
-
could you add support for German? very very please
-
You can easily test out sentences like the following:
`The name of that celebrity is 王菲`
everything will be classified as English (you can try any Chinese name or any English prefix sentence, it w…
-
Can the training instructions (https://nerd.readthedocs.io/en/latest/train.html) be used to train models in languages that are currently not supported? Or are they just for retraining with supported l…
-
**Describe the bug**
Unable to input Japanese text using MessageComposer.
Japanese is a language composed of three types of characters: Hiragana, Katakana, and Kanji. When input Japanese text on a…
-
Hi, here is the case.
1. I pretrained a language model on English-only corpus, using BPE tokenization with vocab_size=32000.
2. I want to continue training the model on Japanese corpus.
Since t…
-
If I try to OCR images that are kinda like this.
![0_00_16_916__0_00_20_486_3000000000640010606400120](https://github.com/cloudy-sfu/GUI-for-tesseract-OCR/assets/129892077/c1e21939-50ef-4109-b511-667…
-
The new version was unstable and crashed very often. Switch displayed the message "This software has encountered an error and needs to close."
Additionally, the new version has some connection iss…
-
Curently, it seems just use `str.split` so it didn't work with non-space segmented languages like Japanese.
https://github.com/PAIR-code/lit/blob/3eb824b01e0f72a5486124b16056bf912465debc/lit_nlp/co…