-
_Originally posted by @edgarriba in https://github.com/kornia/kornia/pull/2315#discussion_r1161797100_
Improve our https://kornia.readthedocs.io/en/latest/contrib.html#kornia.contrib.extract_tenso…
-
Hi,
I'm trying to learn your implementation of VSD loss and have a question. To get the noise with CFG, one should compute both conditioned and unconditioned noise. So why do you use
`encoder_hidde…
-
Paper : [https://arxiv.org/pdf/2406.16860](https://arxiv.org/pdf/2406.16860)
Website : [https://cambrian-mllm.github.io](https://cambrian-mllm.github.io)
Code : [https://github.com/cambrian-mllm/cam…
-
# BLIP
* [paper](https://arxiv.org/abs/2201.12086)
* [code](https://github.com/salesforce/BLIP)
* [blog](https://blog.salesforceairesearch.com/blip-bootstrapping-language-image-pretraining/)
* i…
-
```
1. Write image to a file stream
2. Note that the file is locked by the application until the process exits
The bug appears to be in Image.cpp Image_writeToStream. The stream is attached
to the I…
-
the full error output like following
```
Loads SAM model: E:\DEV\ComfyUI_windows_portable\ComfyUI\models\sams\sam_vit_h_4b8939.pth (device:AUTO)
final text_encoder_type: bert-base-uncased
!!! Exce…
-
Hi Minkai,
Thank you for sharing this work! When I analyze the sampling results of GeoLDM, I found the latent variable `z_x` is almost equal to the decoded atom positions. Below are molecules I rec…
-
# Bug Report
### Which model does this pertain to?
T5
### Describe the bug
Running the script in colab
```
from onnxt5 import GenerativeT5
from onnxt5.api import get_encoder_decoder_tokeniz…
-
In paper, figure 3
![image](https://github.com/xinyu1205/recognize-anything/assets/37361632/10c12b82-dedb-4f4c-a934-490db65720b1)
Can you confirm that I understand the overall system architecture …
-
### What happened?
trying to train cascade tenc only with prodigy, but no matter what I tried to change the lr stays at the initial D value of the optimizer setting and does not move and the model …