-
### Describe the bug
Whenever I try to load the model, an error appears.
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Reproduction
amd is not suppo…
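The trace is cut off, but the reproduction hints at an unsupported AMD GPU. As a first diagnostic (a sketch added here, not part of the original report), it can help to check which backend the installed PyTorch build actually exposes, since many 4-bit loaders assume a CUDA build:
```
# Hedged diagnostic sketch: report which GPU backend this PyTorch build supports.
import torch

print("torch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
# torch.version.hip is a string on ROCm (AMD) builds and None on CUDA builds.
print("ROCm (HIP) build:", getattr(torch.version, "hip", None))
if torch.cuda.is_available():
    print("device:", torch.cuda.get_device_name(0))
```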
-
Hello,
Please help me resolve the following issue.
I built my own recipe based on _egs2/librispeech/asr1_.
I was able to run all the stages successfully, using a GPU for decoding.
However…
-
As per the title, and to be clear: does LLaMA generate EOS tokens? When I increase the max-tokens limit, it keeps generating the user's questions and so on as well, although in generator.py I found logi…
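For reference, LLaMA checkpoints do define an EOS token (`</s>`, id 2); whether decoding stops there depends on the generation loop honoring it. A minimal sketch, assuming the stock Hugging Face `generate` API rather than this repo's generator.py, with a placeholder checkpoint name:
```
# Hedged sketch: stop generation at the model's EOS token with transformers.
# "huggyllama/llama-7b" is a placeholder checkpoint, not from the original issue.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("huggyllama/llama-7b")
model = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b")

inputs = tokenizer("Q: What is the capital of France?\nA:", return_tensors="pt")
output = model.generate(
    **inputs,
    max_new_tokens=64,
    eos_token_id=tokenizer.eos_token_id,  # halt as soon as EOS is sampled
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```
If a base (non-chat-tuned) model never samples EOS, it simply continues the transcript and starts writing the user's next turn, which matches the behavior described.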
-
model: llama-13B-4bit-128g
exllama:
```
(exllama) user@debian:~/AI/exllama$ python test_benchmark_inference.py -d ~/AI/2oobabooga/text-generation-webui/models/llama-13b-4bit-128g/ -p
-- Loadi…
```
-
When I follow `pip install -e ".[gpu]"`, I get an error about mosaicml-streaming:
#-------------------------------------------------------------------------------------------
root@7730f5bd29fa:/hom…
-
Hi,
I used the `--model-control-mode=explicit` option to start the Triton server without loading any models.
```
mpirun --allow-run-as-root -n 1 /opt/tritonserver/bin/tritonserver --model-contro…
```
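In explicit mode the server starts empty and models are loaded on demand through the repository API. A minimal sketch, assuming the official `tritonclient` package and a placeholder model name:
```
# Hedged sketch: load a model at runtime when the server runs with
# --model-control-mode=explicit. "my_model" is a placeholder name.
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")
client.load_model("my_model")             # POST v2/repository/models/my_model/load
print(client.is_model_ready("my_model"))  # True once loading succeeds
```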
-
Hi, I was attempting to use LangChain with a Transformers "text-generation" pipeline as described in the video here (this was in the main README of this repo, so I guess it is somewhat ap…
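For context, the usual wiring (a sketch under my own assumptions, since the video's exact setup is elided; `gpt2` is a placeholder model) wraps the pipeline in LangChain's `HuggingFacePipeline`:
```
# Hedged sketch: expose a transformers "text-generation" pipeline to LangChain.
from transformers import pipeline
from langchain.llms import HuggingFacePipeline

pipe = pipeline("text-generation", model="gpt2", max_new_tokens=64)
llm = HuggingFacePipeline(pipeline=pipe)
print(llm("Explain what a text-generation pipeline does in one sentence:"))
```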
-
### System Info
Running on SageMaker Studio on a g4dn.2xlarge instance.
```
!cat /etc/os-release
PRETTY_NAME="Debian GNU/Linux 10 (buster)"
```
```
!transformers-cli env
- `transformers…
```
-
### Bug Description
I want to reproduce [HuggingFace LLM - StableLM](https://gpt-index.readthedocs.io/en/latest/examples/customization/llms/SimpleIndexDemo-Huggingface_stablelm.html)
`response = q…
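The call is truncated at `response = q…`; in the llama_index versions that notebook targets, querying goes through a query engine. A hedged sketch of the generic query path (the notebook itself builds the index around StableLM; the data directory and question here are placeholders):
```
# Hedged sketch: the generic llama_index query path the truncated line
# likely belongs to. "data" and the question are placeholders.
from llama_index import SimpleDirectoryReader, VectorStoreIndex

documents = SimpleDirectoryReader("data").load_data()
index = VectorStoreIndex.from_documents(documents)

query_engine = index.as_query_engine()
response = query_engine.query("What did the author do growing up?")
print(response)
```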