-
Hello, training fails and I am seeing a lot of "modelscope"-related warnings. I have pasted some portions of the log below. I have updated Automatic1111 to the following: version: [v1.6.0](https://github.com…
-
Hi All,
I have a few questions regarding the use of Mask R-CNN for small applications. I am asking because it seems pretty slow and needs a lot of memory to process.
1) Is it possible to u…
-
I was just wondering why you didn't use caches to store the key and value tensors in the Transformer like Meta did.
Also, Meta uses a different generate function that takes advantage of these caches. T…
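
For readers unfamiliar with the idea, here is a minimal sketch of key/value caching during autoregressive decoding. It is not the code from this repository or from Meta: `KVCache`, `project`, and both `decode_*` functions are hypothetical names, the "projections" are fixed scalings standing in for learned weight matrices, and the query is simply the raw token vector. It only shows that appending each step's key and value to a cache gives the same attention output as recomputing them for the whole prefix.

```python
import math


def attend(q, keys, values):
    """Scaled dot-product attention of one query over all cached keys/values."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    top = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) / total for d in range(dim)]


def project(x, factor):
    """Hypothetical stand-in for a learned key or value projection."""
    return [factor * xi for xi in x]


class KVCache:
    """Stores the keys and values of every token seen so far."""

    def __init__(self):
        self.keys = []
        self.values = []

    def append(self, key, value):
        self.keys.append(key)
        self.values.append(value)


def decode_with_cache(tokens):
    """Each step projects only the new token and reuses the cached prefix."""
    cache = KVCache()
    outputs = []
    for x in tokens:
        cache.append(project(x, 0.5), project(x, 2.0))
        outputs.append(attend(x, cache.keys, cache.values))
    return outputs


def decode_without_cache(tokens):
    """Each step re-projects the whole prefix, which is the redundant work."""
    outputs = []
    for t in range(len(tokens)):
        prefix = tokens[: t + 1]
        keys = [project(x, 0.5) for x in prefix]
        values = [project(x, 2.0) for x in prefix]
        outputs.append(attend(tokens[t], keys, values))
    return outputs
```

With the cache, each step performs one key projection and one value projection instead of one per prefix token, so the projection work over a sequence of length n drops from O(n²) to O(n).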
-
- [ ] [SELF-RAG: Learning to Retrieve, Generate and Critique through Self-reflection](https://github.com/AkariAsai/self-rag/blob/main/README.md?plain=1)
# SELF-RAG: Learning to Retrieve, Generate and…
-
`Distribution.sample()` evaluates all distribution parameters and then samples from the resulting distribution. This means that if parameters are RVs, only one sample is taken. For 'full model' sampl…
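
To make the distinction concrete, here is a small standard-library sketch. It does not use this library's API: `sample_params_once` and `sample_full_model` are hypothetical names, and the model (a normal mean drawn from a wide normal prior) is an assumption chosen only for illustration. The first function evaluates the parameter RV a single time and then draws repeatedly. The second redraws the parameter for every sample.

```python
import random


def sample_params_once(n, rng):
    """Evaluate the parameter RV once, then draw n values from the result."""
    mu = rng.gauss(0.0, 10.0)  # the parameter is sampled a single time
    return [(mu, rng.gauss(mu, 1.0)) for _ in range(n)]


def sample_full_model(n, rng):
    """Redraw the parameter RV for each of the n values."""
    draws = []
    for _ in range(n):
        mu = rng.gauss(0.0, 10.0)  # a fresh parameter value per draw
        draws.append((mu, rng.gauss(mu, 1.0)))
    return draws


rng = random.Random(0)
once = sample_params_once(5, rng)
full = sample_full_model(5, rng)
```

Each returned pair is `(mu, value)`, so the parameter behind each draw can be inspected. All pairs in `once` share one `mu`, while the pairs in `full` each carry their own.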
-
https://github.com/exo-explore/exo/issues/23#issuecomment-2241521048
Perhaps after each inference, we synchronise the full KV cache between all nodes. This should be fairly straightforward; we can …
-
Hi,
When running `preparation.py` for 3DMatch, I got the following error:
RuntimeError: CUDA out of memory. Tried to allocate 5.15 GiB (GPU 0; 10.76 GiB total capacity; 6.27 GiB already allocated; 3…
-
Currently, the performance of the application is borderline -- if we do a bit more work per inference, it will definitely start to slow down on some of the more limited devices.
First we have to profile to unde…
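
As a starting point for that profiling, here is a minimal timing harness using only the standard library. It is a sketch under assumptions: `profile_inference` is a hypothetical helper, and `fake_inference` is a placeholder workload standing in for the application's real inference call, which is not shown here.

```python
import statistics
import time


def profile_inference(run_inference, inputs, warmup=2):
    """Time each call to run_inference and summarise latency in milliseconds."""
    # Warm-up calls are excluded so one-off setup costs do not skew the numbers.
    for x in inputs[:warmup]:
        run_inference(x)

    timings_ms = []
    for x in inputs:
        start = time.perf_counter()
        run_inference(x)
        timings_ms.append((time.perf_counter() - start) * 1000.0)

    return {
        "runs": len(timings_ms),
        "median_ms": statistics.median(timings_ms),
        "max_ms": max(timings_ms),
    }


def fake_inference(x):
    """Placeholder workload; replace with the real inference call."""
    return sum(i * i for i in range(x))


stats = profile_inference(fake_inference, [1000] * 5)
```

The median is less sensitive to outliers than the mean, while the maximum shows the worst stalls, which tend to matter most on limited devices.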
-
|id|title|author|year|
|---|---|---|---|
|2|Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation|Zhao, Hao and Lu, Ming and Yao, Anbang and Guo…
-
Hi @thiagopbueno,
I'm also working with @ramonpereira and @miquelramirez, and I have been trying to run tf-plan on a Linux box with GPUs. However, in our experiments (the same domains as in issue #2…