-
### Feature request
When using the text-generation pipeline, we would like to be able to export each token as it is generated. Currently, we have to wait for the generation to be completed to view the re…
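A minimal sketch of the desired behavior, using a toy generator in place of the real pipeline (the function and its "model" are purely illustrative; the actual pipeline does not expose such a hook today, which is the point of this request):

```python
from typing import Iterator

def generate_stream(prompt: str, max_new_tokens: int = 5) -> Iterator[str]:
    """Toy stand-in for a generation loop that yields each token as soon
    as it is produced, instead of returning only the completed text."""
    # Hypothetical "model": echoes the prompt's words one at a time.
    for i, word in enumerate(prompt.split()):
        if i >= max_new_tokens:
            break
        yield word  # the caller sees this token immediately

tokens = []
for tok in generate_stream("hello streaming token output"):
    tokens.append(tok)  # e.g. print(tok, end="", flush=True)
```

With a generator-style interface like this, a caller can print or forward tokens incrementally rather than blocking until the full sequence is finished.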
-
If you attempt to run the `server.py` script outside the `text-generation-webui` directory, the `--model` argument will assume you're calling a remote model and attempt to download it from HF. Here's…
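A hedged sketch of the local-first lookup the script could perform before falling back to the Hub (the function name, `models` directory, and return format are all illustrative, not the script's actual code):

```python
import os

def resolve_model(model_arg: str, models_dir: str = "models") -> str:
    """Treat model_arg as a local directory first; only if no matching
    directory exists, interpret it as a remote Hub ID."""
    local = os.path.join(models_dir, model_arg)
    if os.path.isdir(local):
        return f"local:{local}"
    if os.path.isdir(model_arg):
        return f"local:{model_arg}"
    return f"hub:{model_arg}"  # would trigger a download from HF
```

Resolving `models_dir` relative to the script's own location (rather than the current working directory) would make the check independent of where the script is launched from.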
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid, so there are no tagged versions as of…
-
**Describe the bug**
After installing Peak through the latest Statamic CLI, or by first installing Statamic and then adding Peak as a starter kit (clearing the site first), the page remains blank. The…
-
Hi,
I am trying to run the test suite to see if my setup is correct, and I am down to 31 failed, 4852 passed, etc.
However, the tests that failed are strange.
Here is a partial log; the full log is belo…
-
## Problem Statement
Currently, there are cases where cutting and pasting a native command line fails to run as expected in PowerShell. This may be due to incorrect parsing of quotes meant to be p…
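To make the quoting problem concrete, here is a small illustration of POSIX-style command-line parsing using Python's `shlex` (the example command line is hypothetical); PowerShell's current parser treats the same escaped quotes differently, which is exactly what breaks pasted native commands:

```python
import shlex

# A command line as one might paste it: an argument containing
# escaped double quotes, written in POSIX shell syntax.
cmdline = 'printf "%s" "a \\"quoted\\" arg"'

# POSIX-style parsing collapses the escapes into a single argument,
# keeping the inner quotes as literal characters.
args = shlex.split(cmdline)
# args == ['printf', '%s', 'a "quoted" arg']
```

Under POSIX rules the escaped quotes survive as part of one argument; a parser with different quote semantics can split or strip them, so the pasted command no longer runs as intended.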
-
I have trained (another) pruned_transducer_stateless5 model on a transcript without disfluencies, and the results in offline decoding are better (decode.py in Icefall). Then I moved it to Sherpa to try…
-
I notice that you mentioned your goal of creating a drop-in replacement for OpenAI. Awesome job! This is super helpful to have, especially with your demo using FastAPI.
I'm looking at langchain …
-
[GPTQ](https://arxiv.org/abs/2210.17323) is currently the SOTA one-shot quantization method for LLMs.
GPTQ supports remarkably low 3-bit and 4-bit weight quantization, and it can be applied to LLaMa.
…
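GPTQ itself compensates quantization error column by column using second-order (Hessian) information; for contrast, here is a sketch of the naive round-to-nearest baseline it improves on, with grouped int4 scales (function names, group size, and input values are illustrative):

```python
def quantize_rtn_4bit(weights, group_size=4):
    """Naive round-to-nearest 4-bit quantization with one scale per group.
    This is only the RTN baseline; GPTQ reduces the resulting error by
    adjusting not-yet-quantized weights as each column is quantized."""
    quantized, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # Map the group's max magnitude onto the int4 range [-8, 7].
        scale = max(abs(w) for w in group) / 7 or 1.0
        q = [max(-8, min(7, round(w / scale))) for w in group]
        quantized.append(q)
        scales.append(scale)
    return quantized, scales

def dequantize(quantized, scales):
    """Reconstruct approximate float weights from int4 values and scales."""
    return [v * s for group, s in zip(quantized, scales) for v in group]
```

Even this crude scheme shrinks each weight to 4 bits plus a shared per-group scale; GPTQ's contribution is keeping accuracy high at these bit widths in a one-shot (no retraining) setting.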
-
When I set `--model_name_or_path llama33b-lora \`, I get:
`--model_name_or_path: command not found`