korean-tokenizer Search Results

372 results for korean-tokenizer


google/sentencepiece #902

Model can't load if special characters are in the path

If there are Korean characters in the path of the tokenizer model when it's loaded like this: `sentencepiece::SentencePieceProcessor tokenProcessor;` `tokenProcessor.load(pathtomodel);` Any help…

gianmarcohutter updated 10 months ago
4 comments
bentoml/OpenLLM #553

Llama2 models giving junk output in v100

Hello everyone! I found that Llama models like `beomi/llama-2-ko-7b` are giving junk output like `\n[/INST]\n\n[/INST]...`. I tried multiple Llama2 Korean models and I am getting similar junk results.…

bibekyess updated 10 months ago
1 comment
BlinkDL/RWKV-LM #210

how to pretrain v5 other lang?

Hi, I tried to pretrain v5 on this data (https://huggingface.co/datasets/eaglewatch/Korean_Wikipedia_Dataset_for_GPT2_August_2022), and I am using this script. ``` python train.py --data_file /wor…

HaloKim updated 9 months ago
3 comments
mjpost/sacrebleu #244

Working on tokenized pairs?

I'm trying to calculate the BLEU score for a low-resource language, so I'm using a tokenizer that I've trained myself. Is there a way to pass the tokenizer as a param? For now, when I am passing the …

MostHumble updated 10 months ago
1 comment
turboderp/exllamav2 #262

Inquiring about Calibration Procedures and Issues for Model …

### Context and Issue I'm attempting to quantize the model [alpindale/goliath-120b](https://huggingface.co/alpindale/goliath-120b) using [royallab/PIPPA-cleaned](https://huggingface.co/datasets/royal…

MatrixC7 updated 8 months ago
11 comments
asdf-vm/asdf-ruby #206

Installing 2.6.3 The Ruby openssl extension was not compiled…

Hey guys, I bumped into an issue with openssl. I've already installed 2.7.2 successfully, but no luck with 2.6.3. `libssl-dev` is already installed ``` snake@mothership:~$ uname -a Linux mothershi…

ssnake updated 5 months ago
6 comments
NVIDIA/NeMo #3243

Training conformer_ctc with korean

I am trying to train the conformer-ctc model with the Ksponspeech dataset, which is a Korean speech dataset. Ksponspeech - 1000 hours / 123 GB / 630000 pcm audio files ( fs=16000 / sample_width = 2…

hslee4716 updated 10 months ago
10 comments
huggingface/transformers #17106

Socket Timeout when using DDP

### System Info ```shell - `transformers` version: 4.17.0.dev0 - Platform: Linux-4.15.0-176-generic-x86_64-with-glibc2.17 - Python version: 3.8.13 - PyTorch version (GPU?): 1.8.2 (True) - Tens…

sajastu updated 8 months ago
17 comments
ggerganov/llama.cpp #2865

Converting kfkas Llama-2-ko-7b-Chat to GGUF fails

Hi. I'm trying to convert the 'kfkas/Llama-2-ko-7b-Chat' model, which I got from Hugging Face, into a GGUF file on Windows 11. I tried to convert it with the command below. C:\AI\llama.cpp>python con…

kurugai updated 9 months ago
34 comments
coqui-ai/TTS #1712

[Bug] AssertionError: [!] There are duplicate characters in…

### Describe the bug There is a tutorial for a Korean version of coqui_tts (not written by coqui-ai). Tutorial link: https://colab.research.google.com/drive/1hv37sT7Pq-qKZe9Ihbbp5XZ-A9tsURli?usp=sharin…

newgrit1004 updated 10 months ago
2 comments
