-
४,३२,००० gets tokenized as ४ , ३२ , ०००. This should not happen.
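A minimal sketch (not the project's actual tokenizer) of a pattern that keeps comma-grouped digit runs together as one token; note that in Python, `\d` matches any Unicode decimal digit, Devanagari ०–९ included:

```python
import re

# Hypothetical illustration: treat a comma-grouped number such as
# ४,३२,००० as a single token instead of splitting at each comma.
# \d matches any Unicode decimal digit in Python, including Devanagari.
NUMBER = re.compile(r"\d+(?:,\d+)*")

print(NUMBER.findall("४,३२,०००"))  # one token, not five
```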
-
1. I have searched the related issues but could not find the help I expected.
**Describe the bug**
Running the default workflow pops up the following error:
The relevant directory already contains the file in question, but it still reports that it cannot be found.
!!! Exception during processing!!! Unable to load vocabulary from file. Please check that the provided vocabulary is…
-
1. `12.6` should not split
2. `22थी` should split
3. `थी22` should split
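A hedged sketch of a pattern matching these expectations (hypothetical, not the library's actual rule set): a number with an optional decimal part is one token, and any other run of non-digit, non-space characters is another, so decimals stay whole while digit/letter boundaries split:

```python
import re

# Hypothetical tokenizer rule: keep "12.6" intact, but split at the
# boundary between digits and letters, as in "22थी" and "थी22".
TOKEN = re.compile(r"\d+(?:\.\d+)?|[^\d\s]+")

def tokenize(text):
    return TOKEN.findall(text)

print(tokenize("12.6"))   # stays one token
print(tokenize("22थी"))   # digits, then letters
print(tokenize("थी22"))   # letters, then digits
```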
-
Sphinx and Manticore have never offered per-field tokenization settings (apart from `morphology_skip_fields` and `infix/prefix_fields`), and it seems that there hasn't been much con…
-
бассейну реки -0.12846 -0.0077064 0.049087 -0.059458 ...
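If the point is that an embedding key may itself contain a space (as in the sample line above, where "бассейну реки" is a two-word key), one way to parse such a line is to consume the floats from the right and treat the remainder as the key. A sketch, assuming a plain-text word2vec-style format:

```python
def parse_line(line):
    """Split an embeddings line into (key, vector), allowing the key
    to contain spaces: floats are consumed from the right, the rest
    of the line is the key."""
    parts = line.split()
    vec = []
    while parts:
        try:
            value = float(parts[-1])
        except ValueError:
            break  # reached the (possibly multi-word) key
        vec.insert(0, value)
        parts.pop()
    return " ".join(parts), vec

key, vec = parse_line("бассейну реки -0.12846 -0.0077064 0.049087 -0.059458")
print(key)      # "бассейну реки"
print(len(vec)) # 4
```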
-
### Preliminary Remark
The observations presented here are also relevant for the _polmineR_ repository.
### Some Background
The _Bundestag Protokolle_ often employ spacing to enhance readability …
-
When the original text is missing a space after the end of a sentence, the last word of the previous sentence and the first word of the next sentence are treated as a single token.
Equally wrong is when two …
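For the first case, a minimal normalization sketch (hypothetical, not the project's code) that restores the missing space after sentence-final punctuation whenever an uppercase letter follows immediately:

```python
import re

# Hypothetical fix-up: insert a space after ".", "!" or "?" when the
# next character is an uppercase letter, so adjacent sentences are no
# longer glued together into one token.
def restore_spaces(text):
    return re.sub(r"([.!?])(?=[A-Z])", r"\1 ", text)

print(restore_spaces("End of sentence.Next one starts."))
```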
-
### System Info
Hello developer,
The Llama-3 model was released today.
I want to convert this model to a Hugging Face model, but when I follow the README, the following issue occurs.
` File "/workspace/…
-
Hello,
I believe the corpus and the `word_freqs` output used in the [BPE](https://github.com/huggingface/course/blob/main/chapters/en/chapter6/5.mdx#implementing-bpe) / [WordPiece](https://github.c…
-
I'm hoping that we can get to the point where we fully support the following languages.
- English
- Spanish
- German
- French
- Russian
- Japanese
- Hindi
- Farsi
- Chinese
- Arabic
I s…