-
Hello,
I built a simple langchain app using `ConversationalRetrievalChain` and `langserve`.
It works great with the `invoke` API. However, when it comes to the `stream` API, it returns the entire an…
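For context, the contract the two endpoints implement differs only in *when* results are delivered: `invoke` collects everything into one answer, while `stream` yields chunks as they are produced. A minimal pure-Python sketch of that contrast (a mock token generator stands in for the real chain; nothing here is LangChain API):

```python
# Sketch of the invoke-vs-stream contrast. `mock_chain_tokens` stands in
# for the chain's underlying token generator.

def mock_chain_tokens(question):
    """Stand-in for the LLM's token generator."""
    for token in ["Streaming", " ", "works", "."]:
        yield token

def invoke(question):
    # invoke: collect every token, return one complete answer
    return "".join(mock_chain_tokens(question))

def stream(question):
    # stream: yield each token as soon as it is produced
    yield from mock_chain_tokens(question)

full = invoke("hi")           # one complete string
chunks = list(stream("hi"))   # many partial chunks
```

If the server returns the whole answer at once on `/stream`, it usually means some component in the chain only supports the `invoke`-style path, so the "stream" collapses into a single chunk.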
-
I am using LangServe and LangChain with Hugging Face pipelines and a Streamer object.
If I use the `TextStreamer` object from Hugging Face, I can see the stream in stdout.
I read that I might need to use…
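`TextStreamer` writes tokens to stdout, which is why they appear in the terminal but never reach the API response. Hugging Face's `TextIteratorStreamer` instead exposes tokens as a Python iterator backed by a queue that the generation thread feeds. A minimal sketch of that queue pattern in pure Python (the class below is a stand-in to show the mechanics, not the real `TextIteratorStreamer`):

```python
# Queue-backed streamer mirroring the TextIteratorStreamer pattern:
# one thread produces tokens, the consumer iterates over them.
import threading
import queue

class IteratorStreamer:
    """Minimal queue-backed streamer (stand-in for TextIteratorStreamer)."""
    _END = object()  # sentinel marking end of generation

    def __init__(self):
        self._q = queue.Queue()

    def put(self, text):
        # called by the generating thread for each new token
        self._q.put(text)

    def end(self):
        # called once generation finishes
        self._q.put(self._END)

    def __iter__(self):
        while True:
            item = self._q.get()
            if item is self._END:
                return
            yield item

def fake_generate(streamer):
    # stand-in for model.generate(..., streamer=streamer)
    for tok in ["Hello", ", ", "world"]:
        streamer.put(tok)
    streamer.end()

streamer = IteratorStreamer()
threading.Thread(target=fake_generate, args=(streamer,)).start()
received = list(streamer)  # consume tokens as they arrive
```

With the real classes, generation runs in a background thread and the iterator is what you forward chunk by chunk over the LangServe `/stream` endpoint.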
-
A question:
The training data is fairly large: loading about 3 million rows takes over 3 hours. Loading 5 million rows has now been running for nearly a whole night and still has not finished.
I may ultimately need to load more than 10 million rows for training, and I am worried they will not load at all.
Is there a good way to load and train at the same time, instead of loading the full dataset up front, where the sheer data volume causes an out-of-memory error?
Also, I see a streaming parameter in the code, but from the code I cannot see much difference, other than it dropping the parallel-processing "num_pr…
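One common approach (assuming the loader is Hugging Face `datasets`) is `load_dataset(..., streaming=True)`, which returns an `IterableDataset` that reads rows lazily instead of materialising all of them in memory. The underlying idea is just lazy iteration over fixed-size batches, sketched here without any library (the source and row contents are illustrative):

```python
# Lazy loading sketch: rows are read one at a time and grouped into
# batches, so memory holds at most one batch, never the full dataset.

def read_rows(source):
    # stand-in for reading one row at a time from disk
    for i in range(10):
        yield {"text": f"row {i}"}

def batched(rows, batch_size):
    batch = []
    for row in rows:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch          # train on this batch, then discard it
            batch = []
    if batch:
        yield batch              # final partial batch

batches = list(batched(read_rows("train.jsonl"), batch_size=4))
```

The trade-off is exactly the one noticed in the code: a streamed dataset is consumed sequentially, so features that require the whole dataset up front (like `num_proc`-style parallel preprocessing or random global shuffling) are restricted or replaced by buffer-based variants.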
-
### Question Validation
- [X] I have searched both the documentation and Discord for an answer.
### Question
Hi, I want to know how to count tokens (Embedding Tokens, LLM Prompt Tokens, LLM Complet…
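One way to count these (assuming LlamaIndex, which ships a `TokenCountingHandler` callback for exactly this) is a running tally keyed by event type: embedding calls, LLM prompts, and LLM completions. The bookkeeping can be sketched in plain Python; the whitespace tokenizer below is a stand-in for the model's real tokenizer:

```python
# Running token tally per event type, mirroring what a token-counting
# callback handler does. str.split is a stand-in tokenizer.

class TokenCounter:
    def __init__(self, tokenize=str.split):
        self.tokenize = tokenize
        self.embedding_tokens = 0
        self.prompt_tokens = 0
        self.completion_tokens = 0

    def on_embedding(self, text):
        # fires for every text sent to the embedding model
        self.embedding_tokens += len(self.tokenize(text))

    def on_llm(self, prompt, completion):
        # fires for every LLM call: count prompt and completion separately
        self.prompt_tokens += len(self.tokenize(prompt))
        self.completion_tokens += len(self.tokenize(completion))

counter = TokenCounter()
counter.on_embedding("what is streaming")
counter.on_llm("what is streaming", "streaming sends partial results")
```

For accurate billing, swap the stand-in tokenizer for the one matching the deployed model, since whitespace counts diverge substantially from subword token counts.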
-
Hi,
for the non-streaming method `GetChatMessageContentsAsync`, I get the usage like this:
```
private decimal CalculateCosts(IReadOnlyList<ChatMessageContent> result)
{
    var costs = 0m;
    if (result[0].M…
```
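For streaming responses, many connectors attach usage metadata only to the final chunk, so the cost calculation becomes a scan over the stream rather than a read of one result object. The field names and per-1K prices below are illustrative assumptions, sketched in Python to show the shape of the scan:

```python
# Cost calculation over a token stream where usage metadata typically
# arrives only on the last chunk. Prices are placeholder example rates.

PRICE_PER_1K_PROMPT = 0.003
PRICE_PER_1K_COMPLETION = 0.004

def calculate_cost(chunks):
    prompt_tokens = completion_tokens = 0
    for chunk in chunks:
        usage = chunk.get("usage")
        if usage:  # usually present only on the final chunk
            prompt_tokens += usage["prompt_tokens"]
            completion_tokens += usage["completion_tokens"]
    return (prompt_tokens * PRICE_PER_1K_PROMPT
            + completion_tokens * PRICE_PER_1K_COMPLETION) / 1000

stream = [{"text": "Hel"}, {"text": "lo"},
          {"text": "", "usage": {"prompt_tokens": 10,
                                 "completion_tokens": 5}}]
cost = calculate_cost(stream)
```

Whether usage appears at all during streaming depends on the connector and service settings, so it is worth checking each chunk's metadata rather than only the last one.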
-
https://docs.khoj.dev/get-started/setup
Where is the FastAPI app initialized / called?
Which endpoints can I call from outside?
I want to use its capability as a backend for my other…
-
I'm trying to serve a Llama-2-70b-chat-hf model using Triton Inference Server with a TRT-LLM engine. The script I used is `tools/inflight_batcher_llm/end_to_end_streaming_client.py`:
```
python3 to…
```
-
### Describe the bug
Using `datasets` streaming mode with the Trainer in DDP mode causes a memory leak.
### Steps to reproduce the bug
import os
import time
import datetime
import sys
import numpy as np…
-
**Motivation:** Some HTML parsers (e.g. parse5 and our internal parser) provide a streaming mode in which the tokenizer works as if it were executed together with the tree construction algorithm, and so tokenize…
-
## Environment
- OS: Ubuntu 22.04
## To reproduce
Steps to reproduce the behavior:
When using the `StreamingDataloader` (or the vanilla PyTorch `Dataloader`) with `num_workers>0`, the proces…