-
I noticed that the attention computation in the `forward` method of the `KVCompressAttention` class includes a conditional check https://github.com/hpcaitech/Open-Sora/blob/476b6dc7972…
-
Hi,
Thank you **so much** for your great work on this project.
I am a computer science undergraduate working on a school project.
I would like to ask what the easiest way is to run inference with the trained model.
…
-
### System Info
```shell
deepspeed 0.14.4+hpu.synapse.v1.18.0
optimum-habana 1.14.0
docker image: vault.habana.ai/gaudi-docker/1.18.0/ubuntu22.04/habanalabs/pytorch-ins…
```
-
### What is the issue?
Description:
We are experiencing repeated GPU VRAM recovery timeouts while running multiple models on the Ollama platform. The GPUs in use are 2x NVIDIA RTX A5000. The system …
-
From this simple example:
```julia
using JuMP
using HiGHS
using Profile
using PProf
function add_var()
    model = direct_model(HiGHS.Optimizer())
    @variable(model, 1 <= x)
end
```
-
- Paper name: Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
- ArXiv Link: https://arxiv.org/abs/2401.16380
To close this issue, open a PR with a paper report using…
-
Hi,
I noticed that the repository currently lacks support for the InternLM2.5-7B (1.8B, 20B) model, which may cause compatibility issues or missing steps for users trying to implement it. It would …
-
Efficient Streaming Language Models with Attention Sinks [paper](https://arxiv.org/abs/2309.17453)
This repo has already implemented it:
[attention_sinks](https://github.com/tomaarsen/attention_si…
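For context, the core idea from the paper is easy to sketch: when the KV cache exceeds its budget, the first few tokens (the "attention sinks") are always retained, together with a sliding window of the most recent tokens. A minimal, hypothetical Python sketch of that eviction policy (the function name and parameters are illustrative, not taken from either repository):

```python
def sink_cache_keep_indices(cache_len, num_sinks=4, window=1024):
    """Return the cache positions to keep under the attention-sink policy.

    Keeps the first `num_sinks` positions plus the most recent `window`
    positions; everything in between is evicted.
    """
    if cache_len <= num_sinks + window:
        # Cache still fits within budget: keep everything.
        return list(range(cache_len))
    # Sink tokens at the start, plus the trailing sliding window.
    return list(range(num_sinks)) + list(range(cache_len - window, cache_len))
```

For example, with a cache of 10 positions, 2 sinks, and a window of 4, the policy keeps positions `[0, 1, 6, 7, 8, 9]`.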
-
## Model Zoo (we generally first implement USP and then PipeFusion for a new model)
- [ ] [SD3.5](https://huggingface.co/stabilityai/stable-diffusion-3.5-large)
- [ ] mochi (we will wait after it …