-
### System Info
python 3.11.8
### Running Xinference with Docker?
- [ ] docker
- [X] pip install
- [ ] installation from source
-
### Feature request
2024-09-18 02:22:06,994 xinference.core.worker 68 INFO [request 690d9782-759f-11ef-af77-0242ac110002] Leave launch_builtin_model, elapsed time: 27 s
2024-09-18 02:22:4…
-
### Your current environment
vllm docker image: vllm/vllm-openai:latest
### 🐛 Describe the bug
It works the first time, but then stops generating responses, as shown below.
ChatCompletion(id='c…
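For reference, a minimal reproduction sketch along these lines, assuming the vLLM OpenAI-compatible server from the image above is reachable on localhost:8000; the model name and prompt are placeholders, not values from the original report:
```python
# Hypothetical reproduction sketch: repeated chat completions against a
# vLLM OpenAI-compatible server; model name, port, and prompt are placeholders.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

for i in range(3):
    response = client.chat.completions.create(
        model="my-model",  # placeholder: use the model actually served by vLLM
        messages=[{"role": "user", "content": "Say hello."}],
    )
    # After the first call, subsequent responses reportedly come back empty.
    print(i, response.choices[0].message.content)
```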
-
[lalrpop](https://github.com/nikomatsakis/lalrpop/issues/156)
-
Hi, is there any plan to add a requirements.txt that would allow us to install the needed packages with pip? Thanks.
-
It would be helpful to have a hook to allow custom attribute filtering. I propose something much simpler than #24 that would integrate with the existing builder syntax:
```go
// AttrTransform is …
```
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain.js documentation with the integrated search.
- [X] I used the GitHub search to find a …
-
**Is your feature request related to a problem? Please describe.**
VRAM is a major limitation for running most models locally, and guidance by design requires running models locally to get the most va…
-
It would be great to have instructions for running the 3B model locally on a gaming GPU (e.g. a 3090/4090 with 24 GB of VRAM); a rough loading sketch follows the table below.
### Confirmed GPUs
From this thread
| GPU Model | VRAM (GB) | Tuned-3b | T…
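Not from the confirmed-GPU thread, but a rough sketch of how a ~3B-parameter checkpoint is typically loaded in fp16 so it fits on a 24 GB card; the model id and generation call are placeholders, not this project's actual instructions:
```python
# Hypothetical loading sketch: a ~3B-parameter causal LM in fp16 on a single
# 24 GB GPU. The model id below is a placeholder, not the project's checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "org/model-3b"  # placeholder

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # ~2 bytes/param, so roughly 6 GB of weights for 3B
    device_map="auto",          # place the model on the available GPU
)

inputs = tokenizer("Hello, world", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```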
-
I am trying to fine-tune the bigcode/starcoderbase model on a compute node with 8 A100 GPUs (80 GB VRAM).
My initial steps are to adjust the parameters.
I get the impression that training becomes slow if I increase the batch …
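For context, a minimal sketch of the batch-size-related knobs in a Hugging Face TrainingArguments setup; the values and output path below are assumptions for illustration, not the configuration used in this report:
```python
# Hypothetical sketch: effective batch size = per_device_train_batch_size
#   * gradient_accumulation_steps * number of GPUs (8 here).
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="starcoderbase-finetune",  # placeholder path
    per_device_train_batch_size=1,        # raise until VRAM runs out
    gradient_accumulation_steps=16,       # trades step time for memory
    gradient_checkpointing=True,          # slower per step, much less VRAM
    bf16=True,                            # A100s support bfloat16
    learning_rate=2e-5,
    logging_steps=10,
)
# With 8 GPUs this gives an effective batch size of 1 * 16 * 8 = 128; larger
# per-device batches raise throughput only until activation memory dominates.
```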