-
When I ran the quantize code for llama3-70b-instruct, it was successful, but when I used vLLM to load the quantized model, I got a warning: `awq quantization is not fully optimized yet. The speed can be slower …
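For context, AWQ-style schemes store weights as low-bit integers with a scale and zero point per group. A minimal sketch of that idea in plain Python (function names are illustrative; real AWQ works on tensors and also uses activation statistics to pick scales):

```python
# Hypothetical sketch of group-wise asymmetric quantization, the storage
# format behind AWQ-style 4-bit weights. Not vLLM's or AutoAWQ's actual code.
def quantize_group(weights, n_bits=4):
    """Map a group of floats to ints in [0, 2**n_bits - 1] plus (scale, zero_point)."""
    qmax = (1 << n_bits) - 1
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / qmax or 1.0  # avoid div-by-zero for constant groups
    zero_point = round(-w_min / scale)
    q = [max(0, min(qmax, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point

def dequantize_group(q, scale, zero_point):
    """Recover approximate float weights from the quantized group."""
    return [(qi - zero_point) * scale for qi in q]
```

The round trip loses at most half a quantization step per weight, which is why the model still works but may run through slower, less-optimized kernels.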
-
Feature to automate a variety of tasks associated with training a predictive machine learning model to generate market forecasts given a set of input signals. In general, this aims to be a sandbox …
-
### Motivation
Take internvl as an example: its vision model is 6B. If the vision model could be quantized, inference could run on a single 4090.
May I ask why the vision model doesn't currently support quantization — is it because the feature hasn't yet …
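The VRAM motivation is easy to check with back-of-the-envelope arithmetic: weight memory is roughly parameters × bits ÷ 8. A small sketch (weights only; activations, the KV cache, and the language model itself add more on top):

```python
# Rough weight-only VRAM estimate for a model at a given precision.
# Illustrative helper, not part of any library.
def weight_memory_gib(n_params: float, bits_per_param: int) -> float:
    """Approximate weight memory footprint in GiB."""
    return n_params * bits_per_param / 8 / (1024 ** 3)

# For a 6B-parameter vision tower:
#   fp16 -> ~11.2 GiB, int8 -> ~5.6 GiB, int4 -> ~2.8 GiB
```

At fp16 the vision tower alone takes roughly half of a 4090's 24 GB, which is why 4-bit quantization would make single-GPU inference plausible.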
-
### Motivation
At a high level, we at Neural Magic are writing a custom compiler for Torch Dynamo to define a system within vLLM where we can write graph transformations. The main goal is a separa…
-
Browser & OS (see also https://www.whatismybrowser.com/): N/A; Windows
## Describe the bug
I'm getting `HTTPError: 504 Server Error: Gateway Time-out for url: https://zenodo.org/api/files/38a5a2f8-…
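Transient 504s from a gateway are often worth retrying with exponential backoff before treating the download as failed. A hypothetical sketch (the `fetch` callable and parameter names are illustrative, not part of any library's API):

```python
import time

# Retry a flaky operation with exponential backoff: 1s, 2s, 4s, ...
# `fetch` is any zero-argument callable that raises on failure
# (e.g. a wrapper that calls response.raise_for_status()).
def fetch_with_retry(fetch, attempts=4, base_delay=1.0, sleep=time.sleep):
    for attempt in range(attempts):
        try:
            return fetch()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of retries; surface the last error
            sleep(base_delay * (2 ** attempt))
```

Injecting `sleep` as a parameter keeps the helper testable without real delays.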
-
### Problem
Currently, BlockWALService persists data blocks in parallel and reports success to the upper layer as soon as any data block is persisted, even if the previous data block ha…
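One common way to address this is to decouple persistence order from acknowledgement order: blocks still persist in parallel, but the caller is only acked up to the highest contiguous persisted sequence number. A hypothetical sketch (class and method names are illustrative, not BlockWALService's actual API):

```python
import heapq

class InOrderAcker:
    """Ack completions only once every earlier sequence number has persisted."""

    def __init__(self):
        self._next = 0   # lowest sequence number not yet acked
        self._done = []  # min-heap of persisted-but-not-yet-ackable seqnos

    def persisted(self, seqno):
        """Record that `seqno` persisted; return the seqnos acked as a result."""
        heapq.heappush(self._done, seqno)
        acked = []
        # Drain the contiguous prefix starting at self._next.
        while self._done and self._done[0] == self._next:
            acked.append(heapq.heappop(self._done))
            self._next += 1
        return acked
```

With this structure, an out-of-order completion is buffered and only released once the gap before it closes, so the upper layer never sees success for block N while block N-1 is still in flight.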
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
https://github.com/AUTOMATIC1111/stable-diffusion…
-
Here is my setup, using Ubuntu:
AMD 6800 XT 16GB VRAM
32GB RAM
Python version: 3.10.12
PyTorch version: 2.2.1+rocm5.7
I am getting between 14s and 15s/it with flux1-dev-Q2_K.gguf, also Q4_0 and Q6_…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
-
### Reproduction
-
### Expected behavior
Currently, the contamination-free packaging method is supp…
-
**Is your feature request related to a problem? Please describe.**
Last year I wrote [a long article on how to train controlnets](https://civitai.com/articles/2078) using diffusers, and trained [two]…