-
### Python -VV
```shell
Python 3.10.13 (main, Sep 11 2023, 13:44:35) [GCC 11.2.0]
```
### Pip Freeze
```shell
accelerate==0.33.0
addict==2.4.0
annotated-types==0.7.0
apex @ file:///data2/apex
…
-
Hi there,
Using the following code (following the example):
```
import os
import sys
import torch
sys.path.append(os.path.dirname(__file__))
from simul_whisper.transcriber.config import Al…
-
I just copied the code from the README and installed the LLama NuGet package with the CPU-only backend, but it always throws
System.AccessViolationException: "Attempted to read or write protected …
-
Officially, multi-language support is still not implemented in distil-whisper.
But I noticed that the esteemed @sanchit-gandhi uploaded a German model for distil-whisper to HuggingFace, called 'di…
-
How do I run the Stanford Sentiment Treebank (SST-2) task with BERT?
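One common route (a sketch, not from this thread) is the `run_glue.py` example script that ships with Hugging Face Transformers, which supports SST-2 via `--task_name sst2`. The model choice, hyperparameters, and output path below are illustrative:

```shell
# Fine-tune and evaluate BERT on SST-2 with the Transformers GLUE example script.
# Assumes transformers (with its examples) and datasets are installed.
python run_glue.py \
  --model_name_or_path bert-base-cased \
  --task_name sst2 \
  --do_train \
  --do_eval \
  --max_seq_length 128 \
  --per_device_train_batch_size 32 \
  --learning_rate 2e-5 \
  --num_train_epochs 3 \
  --output_dir ./sst2-bert    # hypothetical output directory
```

The script downloads the SST-2 split from the GLUE benchmark automatically and reports validation accuracy after training.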
-
When using the non-streaming interface, I can obtain the number of tokens returned from `chatCompletions.getUsage().getTotalTokens()`. However, how can I determine the number of tokens returned when u…
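One workaround (a sketch, not specific to this SDK) is to accumulate the streamed content deltas yourself and tokenize the completed reply client-side. The chunk format and the whitespace "tokenizer" below are simulated stand-ins; an accurate count would use the model's real tokenizer:

```python
# Sketch: counting tokens of a streamed chat completion client-side.
# The stream below is simulated; real chunks would come from the SDK.
def simulated_stream():
    # Each chunk carries a partial piece of the assistant's reply.
    for delta in ["The quick ", "brown fox ", "jumps over ", "the lazy dog."]:
        yield {"choices": [{"delta": {"content": delta}}]}

def count_streamed_tokens(stream, tokenize):
    """Accumulate streamed content deltas, then tokenize the full reply."""
    parts = []
    for chunk in stream:
        content = chunk["choices"][0]["delta"].get("content")
        if content:
            parts.append(content)
    full_text = "".join(parts)
    return len(tokenize(full_text)), full_text

# Stand-in tokenizer: whitespace split. Swap in the model's tokenizer
# (e.g. tiktoken for OpenAI models) for an accurate count.
tokens, text = count_streamed_tokens(simulated_stream(), str.split)
print(tokens)  # 9 whitespace-delimited "tokens" in this simulated reply
```

Depending on the API version, the service may also be able to attach usage to the final streamed chunk (the OpenAI chat completions API does this when `stream_options` requests `include_usage`); whether the SDK you are using exposes that is worth checking before counting manually.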
-
Ever since upgrading to v3.0.0, I've been seeing pr-tracker-fetcher take a *really* long time to insert landings. For example, here's some info about a current run:
```
[root@clark:~]# ps -efw | g…
jfly updated 5 months ago
-
If I want to work with multimodal LLMs that take in a set of embeddings from vision/audio encoders, what is the proper way of feeding them into an LLM running on exllamav2?
Can I just add a custo…
-
Using the official inference code; the model is the official 33B.
```
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/deepseek-coder-6.7b-base", trust_remote_cod…
-
I updated Ollama from 0.1.16 to 0.1.18 and encountered this issue.
I am using Python to run LLM models with Ollama and Langchain on a Linux server (4 × A100 GPUs).
There are 5,000 prompts to ask and get…
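For that many prompts, one pattern (a sketch under assumptions: the payload shape matches Ollama's REST `/api/generate` endpoint, but the model name and batch size here are illustrative) is to build non-streaming request payloads and send them in bounded batches rather than queueing all 5,000 at once:

```python
import json

def build_ollama_payloads(prompts, model="llama2", batch_size=8):
    """Group prompts into batches of non-streaming /api/generate payloads.

    Sending bounded batches keeps the server from queueing thousands of
    concurrent generations at once."""
    batches = []
    for i in range(0, len(prompts), batch_size):
        batch = [
            {"model": model, "prompt": p, "stream": False}
            for p in prompts[i:i + batch_size]
        ]
        batches.append(batch)
    return batches

# 5,000 prompts -> 625 batches of 8 payloads each.
prompts = [f"Question {n}" for n in range(5000)]
batches = build_ollama_payloads(prompts)
print(len(batches), len(batches[0]))  # 625 8
# Each payload is plain JSON for POST http://localhost:11434/api/generate
print(json.dumps(batches[0][0]))
```

Setting `"stream": False` makes each response arrive as a single JSON object, which simplifies collecting answers for a large prompt set.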