html-tokenizer Search Results

1000+ results
for html-tokenizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

sloria/TextBlob #90

Advanced usage of tokenizer for sentence tokenization

I may have misunderstood the intent with the section under **Advance Usage / Tokenizers** (https://textblob.readthedocs.org/en/dev/advanced_usage.html#advanced) but I cannot get my passed in tokenizer…

nmstoker updated 9 years ago
1
microsoft/onnxruntime #13139

Failed to create CUDAExecutionProvider

### Describe the issue I compared inference on GPU of a native torch Helsinki-NLP/opus-mt-fr-en model with respect to the optimized onnx model thanks to Optimum library. When load testing the mode…

Matthieu-Tinycoaching updated 2 years ago
5
NVIDIA/NeMo #10679

When converting a checkpoint from Hugging Face, the checkpoi…

``` python3 /opt/NeMo/scripts/checkpoint_converters/convert_llava_hf_to_nemo.py \ --input_name_or_path llava-hf/llava-1.5-7b-hf \ --output_path /workspace/checkpoints/llava-7b.nemo \ --tok…

changg10 updated 5 days ago
2
untitaker/html5gum #21

Add tree builder

We should add a real treebuilder and lol-html like api on top of that to this crate. As part of that we need to move the tokenizer into a submodule and rework the readme

untitaker updated 2 years ago
1
dills122/trader-tools #72

Look into a more comprehensive Plan for Tokenization

After digging into a few issues will building out some tests I learned that the tokenizer was working differently than I expected. After a quick look at the docs, it looks like `natural supports a …

dills122 updated 3 years ago
2
autokey/autokey #163

implement a proper tokenizer/scanner for phrase macros and s…

## Summary The macro support for phrases needs some improvement. Most importantly, phrases need a proper regular expression based tokenizer/scanner, The scanner should automatically identify valid,…

luziferius updated 1 year ago
3
jamietre/HtmlParserSharp #2

Exception throw if HTML element contains XMLNS attribute

I'm getting the following exception thrown when the HTML element contains the XMLNS attribute (in XHTML document): ``` Unhandled Exception: System.ArgumentException: The namespace declaration attribu…

cmwoods updated 9 years ago
2
NaturalNode/natural #231

Detokenization not supported

As in another libraries, detokenization is a wanted feature like at https://opennlp.apache.org/documentation/1.5.3/manual/opennlp.html#tools.tokenizer.detokenizing. Do you have plans on supporting th…

farolfo updated 7 years ago
2
netease-youdao/QAnything #357

纯python环境下，启动服务404，Application QAnything cannot handle your …

**Please Describe The Problem To Be Solved** (Replace This Text: Please present a concise description of the problem to be addressed by this feature request. Please be clear what parts of the problem…

Qin-xb updated 3 weeks ago
7
All-Hands-AI/OpenHands #4486

Improve browser agent' scraping/processing web content

**Summary** Currently the generated axtree content for retrieved websites incurs a huge amount of tokens and cost. Maybe below combination of Playwright with BeautifulSoup can save tokens, cost an…

tobitege updated 2 weeks ago
2

上一页 1...9 10 11 12 13 14 15...100 下一页

1000+ results for html-tokenizer

1000+ results
for html-tokenizer