vision-language-transformer Search Results

989 results
for vision-language-transformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

vespa-engine/vespa #28090

[FR] Support of more HuggingFace embedders for multimodality

My goal is to build a unique multimodal WooCommerce search experience with Vespa multivectors and an hybrid ranking on text-BM25, text-vectors, and image-vectors. For instance, E-commerce can use: …

eostis updated 2 days ago
10
dandelin/ViLT #23

FLOPS calculation

hi when you compute the FLOPS in table 6 for baseline models such as ViLBERT, do you also include the FLOPS computation of feature extraction models?

junchen14 updated 2 years ago
1
huggingface/transformers #23846

Add LaVIN model

### Model description LaVIN is a vision-language instructed model that is affordable to train (it was trained in a few hours on 8 A100 GPUs) with good performance on ScienceQA. I'd like to add …

tensorpro updated 1 year ago
1
huggingface/transformers #17224

ALBEF: Align Before Fuse

### Model description Align Before Fuse (ALBEF) is a vision-language (VL) model that showed competitive results in numerous VL tasks such as image-text retrieval, visual question answering, visual …

ggoggam updated 1 month ago
7
epogrebnyak/mlmw #6

Reorganize beginner section

Updates from: - https://github.com/jacobhilton/deep_learning_curriculum (focus on transformers) - Raschka book 1. Math prerequisites Taking a derivative to find a point of minimum or maxim…

epogrebnyak updated 1 month ago
3
RupertLuo/Valley #12

Encounting error when loading Valley2 7b with transformers 4…

I encountered the following error when loading Valley2 7b with transformers Code: ============================================================ ”from transformers import AutoModelForCausalLM mode…

BinZhu-ece updated 10 months ago
1
ramkrishna2910/onnx-models #5

Determine the list of onnx models that will be saved in Azur…

Our available storage capacity for ONNX model zoo on GitHub LFS is currently at 100GB. We need to decide on the list of models to be stored here, with the objective of maximizing the usage of the spac…

ramkrishna2910 updated 1 year ago
2
AutoGPTQ/AutoGPTQ #450

Trying to adapt the cogvlm model, but encountering errors.

url:https://github.com/Minami-su/AutoGPTQ_cogvlm I attempted to quantize it, and it seemed to be effective. However, when I used it for inference, it resulted in errors.: ![image](https://github.com…

Minami-su updated 7 months ago
6
awesome-davian/awesome-reviews-kaist #496

[2022 Spring] ICCV 2021 Emerging Properties in Self-Supervi…

좋은 리뷰 감사합니다. 몇 가지 추가하면 좋겠다고 생각한 것은 1. BYOL architecture 그림을 추가하면 선행연구를 이해하기 더 좋을 것 같습니다 2. 본 논문의 중요한 idea 가 collapsing 을 막는 centering 과 sharpening 인 것 같은데, 그 방법에 대해서 조금 더 자세히 설명해주시면 더 좋을 것 같습니다.…

nooppi18 updated 2 years ago
2
huggingface/transformers #30809

[Llava] Phi text model produces `ValueError: Attention mask …

### System Info - `transformers` version: 4.38.2 - Platform: Linux-6.1.58+-x86_64-with-glibc2.35 - Python version: 3.10.12 - Huggingface_hub version: 0.23.0 - Safetensors version: 0.4.3 - Accele…

xenova updated 3 days ago
10

上一页 1...4 5 6 7 8 9 10...99 下一页

989 results for vision-language-transformer

989 results
for vision-language-transformer