split-token Search Results

1000+ results
for split-token

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

xenova/transformers.js #866

compat with transformers >= 4.40 and tokenizers >= 0.19

### Question This is probably a known issue, as I'm aware that this project lags a bit behind the fast changes being made in the python transformers library, but I wanted to document a specific com…

joprice updated 1 month ago
3
TYPO3-Solr/ext-solr #511

mail addresses are split by tokenizer

Mail addresses in content such as my.name@example.org are split by the StandardTokenizerFactory as "my.name" and "example.org" because according to http://unicode.org/reports/tr29/#Word_Boundaries cer…

timohund updated 8 years ago
2
kovapatrik/homebridge-midea-platform #93

Log error "MessageQuery"

Just upgraded to the new version and seeing this error in the log files: [7/15/2024, 8:03:34 AM] [homebridge-midea-platform] [Mini Split] Does not supports the protocol MessageQuerySubtype, ignor…

scaryfast8750 updated 3 weeks ago
12
hats-finance/SeeR-PM-0x899bc13919880db76edf4ccd72bdfa5dfa666fb7 #121

Unlimited Token Approval Vulnerability in Router Contract

**Github username:** -- **Twitter username:** -- **Submission hash (on-chain):** 0xac22b432f475565c9ca28e80fa970c0b30d912cf334aec73ee9b521fe07c2129 **Severity:** medium **Description:** ## Details T…

hats-bug-reporter[bot] updated 1 week ago
2
HKU-MedAI/HERGen #1

Longitudinal Dataset

Hi authors, I am trying to recreate the temporal dataset you used in your paper. I noticed in your preprocessing folder under the 'create_temporal_dataset.ipynb', that you used **'master.csv'** fil…

Olawumi2021 updated 3 weeks ago
1
netease-youdao/QAnything #480

[BUG] <title> python最新版pdf无法解析，已经下载了pdf模型文件

### 是否已有关于该错误的issue或讨论？ | Is there an existing issue / discussion for this? - [X] 我已经搜索过已有的issues和讨论 | I have searched the existing issues / discussions ### 该问题是否在FAQ中有解答？ | Is there an existing ans…

changqingla updated 1 month ago
2
huggingface/transformers #33579

NER workflow improvement

### Feature request 1. [run_ner.py in examples](https://github.com/huggingface/transformers/blob/main/examples/pytorch/token-classification/run_ner.py) are requiring data of pre-tokenized words, like…

ain-soph updated 2 weeks ago
2
SKD-HPC/TSGET #3

请问报告是中文的情况下要怎么训练？

你好，我看了下代码，把代码的tokenizer换成了中文的jieba分词器，但是生成结果非常低，请问要怎么修改代码?需要改哪些内容呢？能否指导一下呢

shenshaowei updated 1 day ago
3
huggingface/tokenizers #1616

BPE trainer ignoring special tokens.

I am trying to train a custom tokenizer. My use case is related to assembly code, so I want merges to be possible across full instructions (potentially multiple "words"). To do this, I am replacing al…

henrycharlesworth updated 1 month ago
3
xenova/transformers.js #959

Failed to encode text with T5's tokenizer

### System Info Node.js v22.9.0. `"@xenova/transformers": "2.17.2"` ### Environment/Platform - [ ] Website/web-app - [ ] Browser extension - [X] Server-side (e.g., Node.js, Deno, Bun) - [ ] Desktop…

zcbenz updated 1 week ago
7

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for split-token

1000+ results
for split-token