models-optimized Search Results

1000+ results
for models-optimized

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

espressif/esp-idf #14749

error : scene_recall at esp-idf/components/bt/esp_ble_mesh/m…

### Answers checklist. - [X] I have read the documentation [ESP-IDF Programming Guide](https://docs.espressif.com/projects/esp-idf/en/latest/) and the issue is not addressed there. - [X] I have up…

hipal31 updated 1 week ago
2
unslothai/unsloth #1093

Lora adapter is almost as large as model

```py from unsloth import FastLanguageModel from unsloth import is_bfloat16_supported import torch from unsloth.chat_templates import get_chat_template from trl import SFTTrainer from transform…

kirawi updated 3 weeks ago
5
jinx-vi-0/BlogLog #46

Refactor Backend Folder Structure for Better Maintainability

Title: Refactor Backend Folder Structure for Enhanced Maintainability and Scalability Description: The current backend folder structure can be optimized to improve code maintainability, scalability…

PayalSharma2023 updated 2 weeks ago
1
Kexitor/HPE_Mediapipe #2

Is this model working for two or more person fallen?

wdcs-priyankpatel updated 1 week ago
1
huggingface/speech-to-speech #104

Approach for enabling multi client connection

I'd like to explore the best approach for managing multi-client connections in both single and multi-GPU environments. Often, GPUs are underutilized by a single client, especially when smaller mode…

kdcyberdude updated 1 month ago
4
thunlp/Ouroboros #5

How to Perform Inference with Batch Processing.

I'm currently using this model for inference, and I would like to know how to generate inference results in batch mode. Specifically, I'm trying to avoid processing inputs one by one and instead proce…

Chlience updated 2 weeks ago
1
casper-hansen/AutoAWQ #545

awq quantization is not fully optimized yet. The speed can b…

When i ran quantize code for llama3-70b-instruct. It was successfull, but when i used vllm load quantized model. I got a warning: `awq quantization is not fully optimized yet. The speed can be slower …

jackNhat updated 3 months ago
2
microsoft/onnxscript #1625

Pattern rewriter for contrib ops

Hi Currently the Pattern Rewriter/Matcher does not match contrib ops. e.g: def match(op,x,w,b): x = op.Conv(x,w,b) msft = onnxscript.values.Opset("com.microsoft", 1) x…

vid2022 updated 4 months ago
1
InAnYan/jabref #85

Choose embedding model

This is a "living issue". Editing is appreciated. ### Context: - Most prominent benchmark for embedding models: https://huggingface.co/spaces/mteb/leaderboard - We can choose to index the pdf dat…

ThiloteE updated 1 day ago
11
microsoft/Olive #1223

The float16 unet model of stable-diffusion-2-1 outputs NAN r…

**Describe the bug** After running the command "python stable_diffusion.py --provider cuda --optimize --model_id stabilityai/stable-diffusion-2-1" in Olive/examples/stable_diffusion/ directory. floa…

xhcao updated 3 months ago
3

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for models-optimized

1000+ results
for models-optimized