-
**Hello Everyone,
Could you please explain the following section of the code in relation to the pipeline. My question is why did you use the conv2d followed by conv1d. What are the benefits and wh…
-
用faster RCNN计算出的框的分类,和用同样的框和参数然后使用Vtranse计算得出的结果不一致。为啥啊。而且你的计算Recall的方式和Lifeifei的有一定的区别,会导致方法的结果高1到2个百分点。
-
Hi,
Is there any plan to release the pretrained models and training code for the Detic model in Table-2 (Open-vocabulary LVIS compared to ViLD)?
Thank you
-
Traceback (most recent call last):
File "rpnet.py", line 405, in
model_conv = train_model(model_conv, criterion, optimizer_conv, num_epochs=epochs)
File "rpnet.py", line 365, in train_mode…
-
[paper](https://arxiv.org/pdf/2204.14198.pdf)
## TL;DR
- **I read this because.. :** #116 읽고 나서 읽고 싶어짐. 예전에 성현님이 소개해주셨는데 디테일 모름. GPT 열풍으로 요즘 다시 많이 언급됨.
- **task :** Vision Language Model in g…
-
I wonder which topic is studied on Facebook.
Let's Go.
![image](https://user-images.githubusercontent.com/11758940/34806498-acc34bde-f6c6-11e7-9a51-b278bfe9f771.png)
https://research.fb.com/
h…
-
I meets the errors when funtune using loar.
ValueError: Target module LlamaDecoderLayer(
(self_attn): LlamaAttention(
(q_proj): Linear(in_features=2048, out_features=2048, bias=False)
(k…
-
Thanks for your solid work. I'm a little confused the usage of the backbone.
In your paper, you said that you have utilized the ViT-L/14 image encoder in your "Our Implementation Details". Is this …
-
hi! @xxradon
i want to convert osnet
osnet. py
```
from __future__ import absolute_import
from __future__ import division
__all__ = [
'osnet_x1_0', 'osnet_x0_75', 'osnet_x0_5', 'os…
-
I'm trying to implement Hough Transform for road lane detection. Currently I have the hough transform filter, however I would like to extract the coordinates of the lines generated by this filter, to …