-
With these code, i got [none,none,none,none] box output
import torch
from PIL import Image
import os
import torch.utils.data as data
from torchvision import transforms
import matplotlib.pyplot…
-
Just that ...
Here's a running list of papers:
- https://arxiv.org/pdf/2201.05047.pdf [TransVOD]
-
# Interesting papers
## Meta의 'An Introduction to Vision-Language Modeling'
- https://ai.meta.com/research/publications/an-introduction-to-vision-language-modeling/
![image](https://github.c…
-
# Vision Transformers are Overrated | Frank’s Ramblings
Attaining ViT/ConvNeXt performance with a couple of simple modifications to ResNet.
[https://frankzliu.com/blog/vision-transformers-are-overra…
-
### Describe the issue
Issue: I'm trying to use llava-1.5-7b-hf and i'm new and clueless in debugging LMMs. Ihave an error when i try to use the simple example of usage:
raise ValueError(
ValueEr…
-
[paper](https://noushineftekhari.github.io/publication/2024-marine-plankton-classification)
[repository](https://github.com/alan-turing-institute/ViT-LASNet)
[model weights](https://drive.google…
-
### Describe the issue as clearly as possible:
When using outlines with the Llama 3.2 Vision model, simple regex pattern generation works, but JSON schema-based generation fails with index out of bou…
-
## タイトル: プリオン-ViT:スペックルグラムを用いた温度予測のためのプリオンに着想を得たビジョン・トランスフォーマー
## リンク: https://arxiv.org/abs/2411.05836
## 概要:
ファイバースペックルグラムセンサー (FSS) は、温度変化に対する高い感度から環境モニタリングで広く利用されていますが、スペックルグラムデータの複雑で非線形な性質は、従…
-
(minimindv) root@autodl-container-651142bf34-3b6ccf84:~/minimind-v# sudo apt-get install git-lfs
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
gi…
-
# Transformer 在视觉方面的应用
## Reference
- 2021-01 A Survey on Visual Transformer [[Paper](https://arxiv.org/pdf/2012.12556.pdf)] [[Note](https://github.com/junxnone/tech-io/issues/926)]
- 2021-01 Tr…