efficient-inference Search Results

1000+ results
for efficient-inference

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

aigc-apps/sd-webui-EasyPhoto #151

modelscope issues? Failed to obtain Lora after training, ple…

Hello training fails and I am seeing a lot of "modelscope" related warnings. I pasted below some portions of the log. I have updated Automatic1111 with following: version: [v1.6.0](https://github.com…

k8kiss updated 9 months ago
4
matterport/Mask_RCNN #1087

mask RCNN in production

Hi All, have few questions regarding usage of mask Rcnn for small applications...I am asking these questions because it seems pretty slow and needs lot of memory to process. 1) Is it possible to u…

abhrau updated 5 years ago
6
karpathy/llama2.c #346

why not use key and value caches in model.py?

I was just wondering why you didn't use caches to store the key and value tensors in the Transformer like Meta did Also, Meta uses a different generate function that take advantage of these caches. T…

mvuthegoat updated 1 year ago
2
irthomasthomas/undecidability #778

SELF-RAG: Learning to Retrieve, Generate and Critique throug…

- [ ] [SELF-RAG: Learning to Retrieve, Generate and Critique through Self-reflection](https://github.com/AkariAsai/self-rag/blob/main/README.md?plain=1) # SELF-RAG: Learning to Retrieve, Generate and…

irthomasthomas updated 8 months ago
1
blei-lab/edward #759

Efficient Full Model Sampling

`Distribution.sample()` evaluates all distribution parameters, and then samples from the resulting distribution, this means that if parameters are RVs, only one sample is taken. For 'full model' sampl…

cshenton updated 7 years ago
7
exo-explore/exo #52

[BOUNTY - $200] Share kv cache between nodes for redundancy

https://github.com/exo-explore/exo/issues/23#issuecomment-2241521048 Perhaps after each inference, we synchronise the full kv cache between all nodes. This should be fairly straightforward, we can …

AlexCheema updated 1 month ago
12
QingyongHu/SpinNet #13

Inference RuntimeError: CUDA out of memory

Hi, when running preparation.py for 3DMatch I got the following error RuntimeError: CUDA out of memory. Tried to allocate 5.15 GiB (GPU 0; 10.76 GiB total capacity; 6.27 GiB already allocated; 3…

rui2016 updated 3 years ago
3
peddybeats/hands-down #7

Profile current resource utilization and improve performance

Currently the performance of the application is borderline -- if we do a bit more work per inference, we'll definitely start to slow down some more limited devices. First we have to profile to unde…

peddybeats updated 4 years ago
3
guanfuchen/cvpr_review #2

Physics Inspired Optimization on Semantic Transfer Features:…

|id|title|author|year| |---|---|---|---| |2|Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation|Zhao, Hao and Lu, Ming and Yao, Anbang and Guo…

guanfuchen updated 5 years ago
6
thiagopbueno/tf-plan #3

GPU vs CPU performance

Hi @thiagopbueno, I'm also working with @ramonpereira and @miquelramirez and I have been trying to run tf-plan in a Linux box with GPUs. However, in our experiments (the same domains as in issue #2…

meneguzzi updated 5 years ago
1

上一页 1...92 93 94 95 96 97 98...100 下一页

1000+ results for efficient-inference

1000+ results
for efficient-inference