streaming-tokenizer Search Results

1000+ results for streaming-tokenizer


microsoft/vscode #580

Can I get scope / scopeRange at a position?

_From @billti on November 1, 2015 6:10_ The API call `document.getWordRangeAtPosition(position)` appears to use its own definition of a word. For example, my tmLanguage defines `attrib-name` as a tok…

seanmcbreen updated 4 days ago
43
exo-explore/exo #133

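[BUG] Running llama-3.1-70b across multiple Macs with exo+mlx: error found during quantization

While running llama-3.1-70b across multiple Macs with exo+mlx, an error was found during quantization. Error location: quantized.py. Code: def call(self, x): s = x.shape x = x.flatten() out = mx.dequantize( self["weight"][x], scales=self["scales"][x], biases=self["…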

wjwc updated 1 month ago
4
neuml/txtai #371

Change the HFOnnx pipeline to use Hugging Face Optimum rathe…

The HF documentation says that you can now export seq2seq to ONNX with the OnnxSeq2SeqConfigWithPast class. https://huggingface.co/docs/transformers/v4.23.1/en/main_classes/onnx#onnx-configurations …

nickchomey updated 1 year ago
25
huggingface/optimum #1869

ONNX converted Whisper model takes more than twice the VRAM …

### System Info ```shell optimum==1.19.2 torch==2.1.2 transformers==4.39.3 onnxruntime-gpu==1.17.1 CUDA Version: 12.2 GPU: L4 ``` ### Who can help? @michaelbenayoun ### Information - [X] Th…

bruno-hays updated 1 month ago
1
huggingface/parler-tts #93

Using static cache and torch.compile, generating more tokens…

I have tested the static cache inference, but the results are not as expected. I observed that the first two runs are for warming up, torch compiling... The third run is fast as expected, but from the…

dongngm updated 1 week ago
6
QwenLM/Qwen2.5 #905

Using different methods of vLLM, LLM class and AsyncLLMEngin…

Hi team, I'm using Ray and vLLM to serve `Qwen2-72B-Instruct` with 2 different methods: - using `LLM` class this is the recommended method for offline batch inference method described in t…

pengye91 updated 3 weeks ago
1
SciSharp/LLamaSharp #654

AccessViolationException

I only copied the code from the ReadMe, I installed the LLama NuGet package with the CPU-Only backend, and it always returns System.AccessViolationException: "Attempted to read or write protected …

Rabergsel updated 4 months ago
15
netvl/xml-rs #126

Performance is not comparable to other XML parsing libraries

I build and maintain a library for parsing property list files in Rust, [plist-rs](http://github.com/conradev/plist-rs), and I created benchmarks to compare it to the other common plist parsing librar…

conradev updated 1 year ago
8
hiyouga/LLaMA-Factory #5308

Running tokenizer on dataset blocks indefinitely, then subprocesses has abruptl…

### Reminder - [X] I have read the README and searched the existing issues. ### System Info ![Snipaste_2024-08-30_01-20-17](https://github.com/user-attachments/assets/29edc0c4-ac44-4ccf-b8d3-e82d…

zuishusheng updated 2 months ago
18
dottxt-ai/outlines #1233

context-free grammars example does not work with vLLM integr…

### Describe the issue as clearly as possible: When running provided arithmetic grammar example with vLLM, I get an error `TypeError: Error in model execution: argument 'ids': 'list' object cannot …

captify-sivakhno updated 1 day ago
3
