quantization-efficient-network Search Results

375 results
for quantization-efficient-network

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ggerganov/llama.cpp #630

Combine large LLM with small LLM for faster inference

So I was thinking about the following idea. It is probably completely bogus, but I would definitely investigate it when and if I had the time to, so maybe someone else would be interested as well. …

ggerganov updated 7 months ago
43
ufs-community/ufs-weather-model #2015

multiple netcdf_parallel tests fail on hercules

## Description The control_wrtGauss_netcdf_parallel_intel fails on hercules (intel) with the following error: ``` 2023-11-29 11:42:48.544357 +0000 ERROR /work/noaa/epic/role-epic/spack-stack/…

DeniseWorthen updated 9 months ago
122
bghira/SimpleTuner #334

No images were discovered by the bucket manager

Hi , now i am trying to train SDXL with images of resolutin 768x768, i set batch size for 2 in env file so i have 16 images on folder. I am using resolution type 'pixel' an resolution '768' both env …

elismasilva updated 7 months ago
17
pytorch/pytorch #93757

TorchInductor missing ops tracker

The following ops are using `ir.FallbackKernel` via `make_fallback()` in [lowering.py](https://github.com/pytorch/torchdynamo/blob/main/torchinductor/lowering.py#L894) and appear in benchmarks. We sh…

jansel updated 11 months ago
50
pytorch/pytorch #59835

[RFC] PyTorch Training and Inference for Sparse Models

## 🚀 Feature --- In this doc we are requesting comments for the implementation of the sparsification flow as part of architecture optimization namespace (`torch.ao`) with: @raghuramank100 @dskh…

z-a-f updated 1 year ago
4
hiyouga/LLaMA-Factory #2371

Keep getting 'Connection reset by peer'

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction myvenv) ubuntu@b12420:~/LLaMA-Factory$ CUDA_VISIBLE_DEVICES=0 python src/train_bash.py --stage sft …

bradenacurtis801 updated 9 months ago
2
meta-llama/llama #79

Post your hardware specs here if you got it to work. 🛠

It might be useful if you get the model to work to write down the model (e.g. 7B) and the hardware you got it to run on. Then people can get an idea of what will be the minimum specs. I'd also be inte…

elephantpanda updated 5 months ago
84
hiyouga/LLaMA-Factory #456

多卡RLHF训练时报错Tensors must be CUDA and denseRuntimeError

您好，我在用多卡RLHF训练时报错Tensors must be CUDA and denseRuntimeError，**用单卡训练在这一步是不会报错的**，但是单卡我的显存不够也跑不了，您知道是什么问题吗？参数如下： accelerate launch src/train_bash.py \ --stage ppo \ --model_name_or_path "…

liangjh2001 updated 1 year ago
4
huggingface/diffusers #5516

Lora not functioning when used with t2i adapters Pipeline

### Describe the bug Hello, Thank you for this useful library. I have a small problem, I managed to use the code to generate images using SDXL with t2i, then an Image with a Lora. But for some …

ilisparrow updated 1 year ago
3
ollama/ollama #4767

Model response corruption and leaking data between session.

### What is the issue? `main` when running a model (specifically `llama3:8b-instruct-fp16` will begin to generate gibberish. It will also leak state between sessions. Swapping out the models will…

MarkWard0110 updated 4 months ago
8

上一页 1...21 22 23 24 25 26 27...38 下一页

375 results for quantization-efficient-network

375 results
for quantization-efficient-network