-
I updated Ollama from 0.1.16 to 0.1.18 and encountered the issue.
I am using Python with Ollama and LangChain to run LLM models on a Linux server (4 x A100 GPUs).
There are 5,000 prompts to ask and get…
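For a workload like this (thousands of independent prompts against one server), a common pattern is bounded concurrency. A minimal, self-contained sketch — the real model call (e.g. a LangChain `Ollama(...).invoke(prompt)`) is stubbed out here as a hypothetical `ask` function:

```python
from concurrent.futures import ThreadPoolExecutor

def ask(prompt: str) -> str:
    # Placeholder for the real LLM call (e.g. LangChain + Ollama);
    # stubbed so the sketch is self-contained and runnable.
    return f"answer to: {prompt}"

def ask_all(prompts, max_workers=8):
    # A bounded thread pool avoids flooding the inference server
    # with all 5,000 requests at once; results come back in order.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(ask, prompts))

answers = ask_all([f"prompt {i}" for i in range(10)])
```

The `max_workers` value is a tuning knob: too low underutilizes the GPUs, too high can trigger server-side queuing or timeouts.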
-
Idea by @nyh
Scylla already has a cardinality estimator (see http://www.datastax.com/dev/blog/improving-compaction-in-cassandra-with-cardinality-estimation) which estimates how many partitions the…
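The estimator described in the linked post is HyperLogLog-based: hash each partition key, use a few bits to pick a register, and keep the maximum run of trailing zero bits seen per register. A minimal, framework-free sketch of that idea (illustrative only, not Scylla's actual implementation):

```python
import hashlib

def _hash(x) -> int:
    # Deterministic 160-bit hash of the item (stand-in for the key hash).
    return int(hashlib.sha1(str(x).encode()).hexdigest(), 16)

class HyperLogLog:
    def __init__(self, b: int = 10):
        self.b = b              # register-index bits
        self.m = 1 << b         # number of registers
        self.registers = [0] * self.m

    def add(self, item) -> None:
        h = _hash(item)
        idx = h & (self.m - 1)  # low b bits choose a register
        w = h >> self.b         # remaining bits feed the rank
        # rank = 1 + number of trailing zero bits in w
        rank = 1
        while w & 1 == 0 and rank < 160 - self.b:
            rank += 1
            w >>= 1
        self.registers[idx] = max(self.registers[idx], rank)

    def estimate(self) -> float:
        # Standard HLL estimator (no small/large-range corrections here).
        alpha = 0.7213 / (1 + 1.079 / self.m)
        z = sum(2.0 ** -r for r in self.registers)
        return alpha * self.m * self.m / z
```

With `m = 1024` registers the relative error is roughly `1.04 / sqrt(m)`, i.e. a few percent, at a fixed memory cost of one small integer per register — the property that makes it attractive for per-SSTable partition counts.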
-
Hello, we tested the command you provided, `CUDA_VISIBLE_DEVICES=0 swift infer --model_type minicpm-v-v2_6-chat --model_id_or_path openbmb/MiniCPM-V-2_6`, together with the video test code (below). We found that the results on video appear to depend only on the video's first frame. We tried video OCR extraction several times, and the output was always just the OCR result of the first frame. Could you…
-
Hi
I want to fine-tune "stt_en_fastconformer_hybrid_large_streaming_multi" on custom data.
In my dataset I have things like "Vitamin B12", "Code: c12r5", "hb1ac" etc
For these alphanumeric words:
…
-
I am having an issue where streaming the result from ExLlamaV2DynamicJobAsync causes the stream rate to drop by half; however, when the generation reaches its halfway point, suddenly all the re…
-
The HF documentation says that you can now export seq2seq models to ONNX with the `OnnxSeq2SeqConfigWithPast` class.
https://huggingface.co/docs/transformers/v4.23.1/en/main_classes/onnx#onnx-configurations
…
-
### 🐛 Describe the bug
We are facing issues with loss curves and reproducibility when using `torch.compile()` with our models. Attached below is a graph of train loss with runs with `torch.compile(…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
Out-of-memory error during tokenization. Tried streaming and am facing the same issue with **streaming: true** an…
-
## Describe the bug
I'm encountering an error while running the phi3v example using a local model. Here's my code:
```rust
use either::Either;
use indexmap::IndexMap;
use std::{path::PathBuf,…
-
I'm using `TextIteratorStreamer` for streaming output.
Since the LLM may repeat its output indefinitely, I would like the LLM to stop generating when it receives a cancellation request.
Is …
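One common pattern for this is a shared cancellation flag that the streaming consumer sets and the producer checks. A framework-agnostic sketch using a `threading.Event` (the token iteration here stands in for iterating a `TextIteratorStreamer`; names like `stream_tokens` are hypothetical):

```python
import threading

def stream_tokens(tokens, cancel: threading.Event):
    # Yield tokens until the consumer signals cancellation.
    for tok in tokens:
        if cancel.is_set():
            break
        yield tok

cancel = threading.Event()
received = []
for i, tok in enumerate(stream_tokens(["tok"] * 1000, cancel)):
    received.append(tok)
    if i == 4:          # e.g. the user clicked "stop"
        cancel.set()    # the generator breaks on its next check
```

With transformers, the same event can additionally back a custom `StoppingCriteria` passed to `generate()`, so the generation thread itself halts rather than only the consumer loop.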