tokenization Search Results

1000+ results
for tokenization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

unipv-larl/UD4HL #8

Tokenization of Ancient Greek

This post relates to the effort to harmonize the Ancient Greek treebanks, as per [Issue 7](https://github.com/unipv-larl/UD4HL/issues/7). One of the first issues to solve is tokenization itself. Th…

francescomambrini updated 1 year ago
6
elastic/elasticsearch #83660

[ML] investigate wordpiece tokenization

Here are a list of investigations arisen out of https://github.com/elastic/elasticsearch/pull/82870 - How should "strip_accents" in BERT style wordpiece treat umlauts and diaeresis? https://github.…

benwtrent updated 2 years ago
1
mkharibalaji/react-native-adyen-payment #17

Question: support for tokenization?

Hi! First of all thanks for your work in building this library! We're just in the first steps of integrating adyen and have a working version for the web so far. We wanted to integrate this now als…

m3co-code updated 3 years ago
5
matteobaccan/owner #206

Feature Required: DisableFeature.TOKENIZATION

Hi, I have written custom converter which converts json properties into objects specified in reloadable interface. my json convertor ``` public class JsonConvertor implements Converter{ …

shariqislam786 updated 5 years ago
3
keyvank/femtoGPT #8

More efficient tokenization methods

We will need to have more clever methods of tokenization in femtoGPT. Possibly, it's good to have a SentencePiece model reader.

keyvank updated 1 year ago
2
ben0oil1/GPT-SoVITS-Server #11

说明实在是太……

D:\asdasd\AI\GPT-SoVITS-Server-main\GPT-SoVITS-Server-main>python server.py DirectML可用，将使用DirectML进行推理加速。设备名称: NVIDIA GeForce GTX 1650 Traceback (most recent call last): File "D:\asdasd\AI\GPT-…

wycstc353 updated 1 month ago
1
servo/html5ever #25

Speculative parsing and tokenization

Similar to servo/servo#1009 The first step is speculative parsing concurrent with scripts, [similar to what Gecko does](https://developer.mozilla.org/en-US/docs/Mozilla/Gecko/HTML_parser_threading).

kmcallister updated 9 years ago
2
UniversalDependencies/UD_Dutch-Alpino #6

Inconsistent tokenization train/dev

There seems to be some inconsistency in the original tokenization as well as the gold. I mainly found these in sports results: In train for example, the original text looks like: "Na de 2-0 overwin…

robvanderg updated 2 years ago
2
sunzeyeah/RLHF #25

Pangu 2.6b 启动失败。

Traceback (most recent call last): File "/mnt/d/ai/RLHF/test.py", line 3, in tokenizer = AutoTokenizer.from_pretrained("/mnt/d/ai/pretrain_models/pangu", trust_remote_code=True) File "/hom…

Liufeiran123 updated 3 months ago
3
google-research/bert #560

Problem with wordpiece tokenization

I'm doing a NER project and trying to use BERT. For BERT, it uses wordpiece tokenization, which means one word may break into several pieces. Then for NER, how to find the corresponding class label fo…

yexing99 updated 4 years ago
11

上一页 1...18 19 20 21 22 23 24...100 下一页

1000+ results for tokenization

1000+ results
for tokenization