lxmert Search Results - Githubissues

240 results
for lxmert


e4exp/paper_manager_abstract #686

Unifying Multimodal Transformer for Bi-directional Image and…

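- https://arxiv.org/abs/2110.09753 - ACM MM 2021 This work studies joint learning of the natural bi-directional tasks of image-to-text and text-to-image generation. Existing work designs two separate task-specific models, one for each task, which raises the design cost. In this work, based on a single multimodal model, the bi-directional tasks are jointly lear…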

e4exp updated 3 years ago
5
JayYip/m3tl #68

When I use the new version, there's some problems.

''' WARNING:root:bert_config not exists. will load model from huggingface checkpoint. Traceback (most recent call last): File "run_weibo_ner_cws.py", line 31, in train_bert_multitask(proble…

AIikai updated 4 years ago
11
airsplay/py-bottom-up-attention #3

A plan to reveal a Batch-based RoI feature extractor?

Thanks for your great job. I am trying to use the demo tools you have released to extract RoI and box features. Since it is too slow to extract features by inputting a single image, would you plan to …

yanan1989 updated 4 years ago
3
hammoudhasan/SynthCLIP #8

TextGen missing setup steps?

``` $ python captions_generator.py --save_path synthetic_captions --generation_idx 0 --concept_bank_size -1 --me…

escorciav updated 8 hours ago
2
long8v/PTIR #159

[147] Generic Attention-model Explainability for Interpretin…

[paper](https://arxiv.org/pdf/2103.15679.pdf), [code](https://github.com/hila-chefer/Transformer-MM-Explainability) ## TL;DR - **I read this because.. :** aka. CheferCAM. explainable CLIP scor…

long8v updated 3 months ago
2
j-min/VL-T5 #1

Inference on my own data?

Hello! First of all thank you so much for your work. I have read your paper and I want to carry out some open-ended VQA/answer generation VQA experiments with the model you proposed (VL-T5). However I…

puzzlecollector updated 2 years ago
21
Unipisa/diaparser #1

DiaParser destroys Icelandic tokenizer

I've tried to install `DiaParser` via `pip install diaparser`, and it destroyed [Icelandic tokenizer](https://pypi.org/project/tokenizer/). I'm vague why they conflicted in installing...

KoichiYasuoka updated 3 years ago
4
airsplay/lxmert #58

The Hyper parameters for the VizWiz datasets

Dear Pro: I read about the Vizwiz Leaderboard for ECCV 2018. The results shown are 55.40 for no model ensemble. But I trained the Vizwiz datasets and the results are only 51.96. So I want to know…

runzeer updated 4 years ago
10
e4exp/paper_manager_abstract #424

Playing Lottery Tickets with Vision and Language

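- https://arxiv.org/abs/2104.11832 - 2021 Large-scale transformer-based pre-training has recently revolutionized vision-and-language (V+L) research. Models such as LXMERT, ViLBERT, and UNITER have significantly advanced the state of the art across a wide range of V+L tasks. However, the large number of parameters makes such models impractical to apply in practice. In parallel, …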

e4exp updated 3 years ago
3
michaelfeil/infinity #451

Support for Model2Vec Embedding models

### Model description Do we support Model2Vec embedding models? E.g: https://huggingface.co/minishlab/potion-base-8M https://minishlab.github.io/tokenlearn_blogpost/ ### Open source status - […

S1LV3RJ1NX updated 3 days ago
3
