-
This is a "living issue". Editing is appreciated.
### Context:
- Most prominent benchmark for embedding models: https://huggingface.co/spaces/mteb/leaderboard
- We can choose to index the pdf dat…
-
I want to provide an image to BLIP-2, and in return, it should generate a Chinese description. Can anyone guide me on how to do it?
-
Thanks for the great work on reproducing the T-Ptr-$\lambda$ model! I have reproduced the non-BERT result by following your kind instructions. However, when I tried to combine the model with pretrained chines…
-
I tried to apply mtmsn to the DRCD Chinese corpus and found that "bert.tokenization.FullTokenizer" can't handle Chinese word tokenization. Is that why I can't use mtmsn on a Chinese corpus?
But I …
-
The code breaks when using a model other than BERT. I debugged the code and found that it is written with respect to the BERT tokenizer only, while the tokenizers of other transformer models ar…
-
Some Chinese text contains English words, for example: "Apples是苹果的复数形式。". I have questions about how to tokenize such text:
1. Why is Chinese BERT case-sensitive when vocab.txt doesn't contain even 'A'?
…
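
For readers following this question, here is a rough sketch of how BERT-style basic tokenization treats mixed Chinese/English text. It is not the actual `FullTokenizer` code: the helper names `is_cjk`, `is_punct`, and `basic_tokenize` are made up for illustration, only the main CJK Unified Ideographs block is checked, and the WordPiece step that follows is omitted. The `lower` flag is an assumption standing in for a lowercasing option; if lowercasing is on, "Apples" is looked up as "apples", which might be why an uppercase 'A' is not needed in the vocabulary.

```python
import unicodedata


def is_cjk(ch: str) -> bool:
    """Return True for characters in the main CJK Unified Ideographs block.

    Simplification: extension blocks are ignored in this sketch.
    """
    return 0x4E00 <= ord(ch) <= 0x9FFF


def is_punct(ch: str) -> bool:
    """Return True for any Unicode punctuation character (category P*)."""
    return unicodedata.category(ch).startswith("P")


def basic_tokenize(text: str, lower: bool = True) -> list:
    """Split text so that every CJK character and punctuation mark stands alone.

    Runs of other characters (for example English letters) stay together
    until whitespace, a CJK character, or punctuation is reached.
    """
    if lower:
        text = text.lower()
    spaced = []
    for ch in text:
        if is_cjk(ch) or is_punct(ch):
            # Surround the character with spaces so it becomes its own token.
            spaced.extend([" ", ch, " "])
        else:
            spaced.append(ch)
    return "".join(spaced).split()


tokens = basic_tokenize("Apples是苹果的复数形式。")
print(tokens)
```

With `lower=False` the English word keeps its capital letter, so whether 'A' has to appear in vocab.txt depends on how the tokenizer is configured.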
-
The training set is from https://github.com/zjy-ucas/ChineseNER
**Training command**
```
bert-base-ner-train \
-data_dir=/home/bert/BERT-BiLSTM-CRF-NER/data/ \
-bert_config_file=/home/bert/chinese_L-12_H-768_A-12/bert_config.…
```
-
bert-as-service can support bert-wwm-ext model released in https://github.com/ymcui/Chinese-BERT-wwm
-
Hi! Thanks for your contribution. It is an excellent piece of work!
Your idea is great, and I want to test it on my task. But my corpus is in Chinese; do I need to adjust the tokenizer and pre-trai…
-
![image](https://user-images.githubusercontent.com/62530854/121482240-86c78c00-c9ff-11eb-9842-f88550fe93ae.png)
I wrote a sentence for prediction, and running it raises an error inside the cached_path class.