-
The project does not include a vocab.txt file, and the provided dataset does not contain one either:
```
# config_ner.py
self.vocab_file = '../data/vocab.txt'
```
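If the checkpoint you downloaded is missing vocab.txt, one stopgap is to derive a character-level vocab from your own data. This is only a sketch: the one-token-per-line format, the BERT-style special tokens, and the `max_size` default are assumptions, not the project's documented layout — verify them against the checkpoint you actually use.

```python
# Sketch: build a one-token-per-line vocab.txt from raw corpus lines.
# Assumes BERT-style special tokens come first and the model tokenizes
# Chinese at the character level; check both against your checkpoint.
from collections import Counter

def build_vocab(lines, max_size=21128):
    counts = Counter()
    for line in lines:
        counts.update(line.strip())  # character-level counting
    specials = ["[PAD]", "[UNK]", "[CLS]", "[SEP]", "[MASK]"]
    tokens = [tok for tok, _ in counts.most_common(max_size - len(specials))]
    return specials + tokens

vocab = build_vocab(["你好世界", "hello world"])
with open("vocab.txt", "w", encoding="utf-8") as f:
    f.write("\n".join(vocab))
```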
-
Hi, I encountered a problem when running the code in benchmark/bertret and would like to ask for help. It seems that 'chinese_wwm_pytorch' cannot be found, including all related files (/vocab.txt, /ad…
-
##### **Describe the bug**
Unit test test_issue_1959.py fails when run on a system whose UTC offset is nonzero. For example, this test fails on my system, where the time zone is set to Pacific Stan…
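Offset-dependent failures like this usually come from naive datetimes that implicitly use the machine's local time zone. A minimal sketch of the difference (the variable names are illustrative, not taken from test_issue_1959.py):

```python
from datetime import datetime, timezone

ts = 0  # the Unix epoch

naive = datetime.fromtimestamp(ts)                    # interpreted in the *local* zone
aware = datetime.fromtimestamp(ts, tz=timezone.utc)   # the same instant everywhere

# naive.hour varies with the machine's UTC offset, which is exactly what
# makes a unit test pass at UTC and fail under Pacific Standard Time.
print(aware.isoformat())  # 1970-01-01T00:00:00+00:00
```

Comparing only timezone-aware values (or normalizing everything to UTC) makes the assertion independent of the host's offset.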
-
We train on 500 txt files (containing both Chinese and English) with WordPieceTrainer and set vocab_size to 30522, but the vocabulary in the resulting JSON has 32430 entries.
bert_tokenizer = Tokenizer(WordPiece(unk_token="[UNK]",max_input_ch…
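One possible explanation — an assumption, not something confirmed by the snippet — is that WordPiece keeps the full initial alphabet it sees, and a mixed Chinese/English corpus contains far more unique characters than an English-only one, pushing the saved count past the requested vocab_size. The saved JSON can be inspected with the stdlib alone; the `model.vocab` layout below follows the tokenizers serialization format, and the tiny inline dict stands in for a real `tokenizer.json`:

```python
import json

# Sketch: count the entries actually saved by tokenizer.save("tokenizer.json").
# A tiny stand-in JSON is built inline instead of reading a real file.
tokenizer_json = {
    "model": {
        "type": "WordPiece",
        "vocab": {"[UNK]": 0, "[CLS]": 1, "[SEP]": 2, "你": 3, "好": 4, "##llo": 5},
    }
}
data = json.loads(json.dumps(tokenizer_json))  # stands in for json.load(open(path))
saved_vocab_size = len(data["model"]["vocab"])
print(saved_vocab_size)  # 6; compare against the vocab_size you requested
```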
-
# 🐛 Bug
## Information
Model I am using (Bert, XLNet ...): Community Models
Language I am using the model on (English, Chinese ...): Multiple different ones
Quite a few community models ca…
-
When processing large documents, I usually process sentence by sentence, which leaves me with numerous `Doc()` objects per document. It would be great if I could merge those objects into one and then serialize/save t…
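Until merging is supported upstream, one stdlib workaround is to serialize the whole list of per-sentence results into a single blob. This is a sketch only: the dicts below are plain placeholders, not real `Doc()` objects, and whether the real objects are picklable depends on the library.

```python
import pickle

# Stand-ins for the per-sentence Doc() objects (hypothetical placeholders).
sentence_docs = [{"text": "First sentence."}, {"text": "Second sentence."}]

# Serialize the whole list at once instead of one file per sentence.
blob = pickle.dumps(sentence_docs)
restored = pickle.loads(blob)
```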
-
@Steffy-zxf
When fine-tuning with paddle1.8, the automatically downloaded dataset works fine, but switching to a custom dataset for fine-tuning raises the following error:
```powershell
Traceback (most recent call last):
File "sequence_label.py", line 187, in <module>
main()
File "se…
-
Hi, thanks for the repo!
Can I use the code for non-Chinese languages, say for Russian text?
Thanks!
-
I am trying to train a custom BertWordPieceTokenizer for the Ukrainian language.
`tokenizer = ByteLevelBPETokenizer(lowercase = True, unicode_normalizer='nfkc')`
`tokenizer.train(
files=pat…
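The normalization the snippet asks for (`unicode_normalizer='nfkc'` plus `lowercase=True`) can be reproduced with the stdlib to sanity-check what the tokenizer will see. This is a sketch of the same transformation, not the tokenizer's internal code:

```python
import unicodedata

def normalize(text):
    # NFKC folds compatibility characters (e.g. full-width forms),
    # then lowercasing mirrors lowercase=True from the snippet.
    return unicodedata.normalize("NFKC", text).lower()

print(normalize("Привіт Світ"))  # привіт світ
print(normalize("Ｈｅｌｌｏ"))    # hello
```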
-
Hey there!
I am trying to train a tokenizer with `BertWordPieceTokenizer`.
I use an iterator that yields the text and call `tokenizer.train_from_iterator`.
After training the tokenizer I realized tha…
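The iterator pattern described above can be sketched with the stdlib alone. The batched shape is an assumption about what `train_from_iterator` consumes; a plain iterator of strings works too, so adapt the yield to your setup:

```python
def text_iterator(lines, batch_size=2):
    """Lazily yield batches of raw text, one batch per iteration."""
    batch = []
    for line in lines:
        batch.append(line)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # flush the final partial batch
        yield batch

corpus = ["first line", "second line", "third line"]
batches = list(text_iterator(corpus))
print(batches)  # [['first line', 'second line'], ['third line']]
```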