-
Right now our LiteralNER is _very_ literal, so in some cases it does not work.
Example: an entry like this
```
takayasu's arteritis
```
is never found because the documents will be tokenized, transf…
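To illustrate the failure mode, here is a toy sketch (the tokenizer below is hypothetical, not LiteralNER's actual pipeline): once the document is tokenized, the surface form no longer equals the dictionary entry, so a literal string match fails.
```python
import re

def toy_tokenize(text):
    # toy word tokenizer that splits off punctuation, including the apostrophe
    return re.findall(r"\w+|[^\w\s]", text)

entry = "takayasu's arteritis"
doc = "Patient was diagnosed with Takayasu's arteritis last year."

tokens = toy_tokenize(doc.lower())
print(tokens)                     # [..., 'takayasu', "'", 's', 'arteritis', ...]
print(entry in " ".join(tokens))  # False: the apostrophe split breaks the literal match
```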
-
I ran the command like this:
```bash
bun x humanifyjs local responsez.js
ggml_vulkan: Found 1 Vulkan devices:
Vulkan0: NVIDIA GeForce GTX 1070 (NVIDIA) | uma: 0 | fp16: 0 | warp size: 32
[nod…
-
Hello Till,
I have reached out to my IT department; however, they were not able to resolve the issue below. Do you have any suggestions on how to resolve it?
Running Python 3.9.5, PIP…
-
Hello, I see that the BERT tokenizer only tokenizes the text. If the tokenizer splits, for example, 1994 into 19 and ##94, while gaz recognizes BMES words for each character (1/9/9/4), won't that cause an input mismatch?
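One common way to avoid such a mismatch is to project character-level labels onto subword tokens via the offset mapping. A minimal sketch, assuming a fast Hugging Face tokenizer (the model name and labels here are illustrative, not from the original project):
```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-chinese")
text = "1994"
char_labels = ["B", "M", "M", "E"]  # character-level BMES labels for 1/9/9/4

enc = tokenizer(text, return_offsets_mapping=True, add_special_tokens=False)
for token, (start, end) in zip(enc.tokens(), enc["offset_mapping"]):
    # a subword covering characters [start, end) inherits the label of its first character
    print(token, char_labels[start])
```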
-
We currently have the field `arxiv_class`, which contains the classification of a paper provided by arXiv and is typically of the form `category.SC` (where SC represents an abbreviation for the s…
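For illustration, a value in that form splits cleanly on the dot (the sample value below is assumed, not taken from the data):
```python
arxiv_class = "cs.CL"  # assumed sample in the category.SC form
category, subject_class = arxiv_class.split(".", 1)
print(category, subject_class)  # -> cs CL
```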
-
Dear Team,
The code below doesn't work: the context is not split into sentence tokens.
if args.tokenizer == "PTB":
    import nltk
    sent_tokenize = nltk.sent_tokenize
    def word_tokeniz…
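One common cause (an assumption; the snippet doesn't show the actual error) is that NLTK's punkt model, which `sent_tokenize` depends on, has not been downloaded. A minimal check:
```python
import nltk
nltk.download("punkt")  # one-time download of the sentence tokenizer model

from nltk.tokenize import sent_tokenize
print(sent_tokenize("First sentence. Second one."))  # ['First sentence.', 'Second one.']
```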
-
Hello,
Please, how can I use this library to save a card and then tokenize it?
It is actually crucial to my development.
Can you help?
-
What if you could issue tokens where each token is worth one hour of your time?
-
Related to: https://github.com/huggingface/transformers/issues/25073
In my current project, I'd like to add a special token that doesn't insert a space before the next token.
Currently, I need to spec…
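For illustration, one way to control whitespace handling around an added token is via the `AddedToken` flags, a sketch assuming the `tokenizers` library (the token string and model are placeholders):
```python
from tokenizers import AddedToken
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
# lstrip/rstrip control whether whitespace around the token gets stripped
tokenizer.add_tokens(AddedToken("<my_token>", lstrip=False, rstrip=False), special_tokens=True)
print(tokenizer.tokenize("<my_token>next"))
```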
-
May I ask how you solved this:
def build_vocabulary(spacy_de, spacy_en):
    def tokenize_de(text):
        return tokenize(text, spacy_de)
    def tokenize_en(text):
        return tokenize(text, spacy_en)
    pr…
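For context, a hypothetical `tokenize` helper matching the calls in the snippet above (the implementation is an assumption, not the original author's code):
```python
import spacy

def tokenize(text, spacy_model):
    # split text into surface-form tokens using the given spaCy pipeline
    return [tok.text for tok in spacy_model.tokenizer(text)]

spacy_en = spacy.load("en_core_web_sm")
print(tokenize("Hello world!", spacy_en))  # ['Hello', 'world', '!']
```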