-
Hitting this issue when decoding: any thoughts?
```
  File "/home/ubuntu/wbc/captioning/InternLM-XComposer/projects/ShareGPT4V/run_captioning.py", line 102, in gen_json
    captions = eval_mode…
```
-
Hi all,
I have an issue regarding the inputs to the `attribute` method of the Integrated Gradients algorithm. I am using the GIT model for image captioning and have defined the forward function to return one token_id of…
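For reference, a minimal sketch of how such a forward function is usually wired into Captum's `IntegratedGradients`; the `microsoft/git-base-coco` checkpoint, the dummy inputs, and the target token id below are illustrative assumptions, not taken from the original setup:
```
import torch
from captum.attr import IntegratedGradients
from transformers import AutoProcessor, AutoModelForCausalLM

processor = AutoProcessor.from_pretrained("microsoft/git-base-coco")
model = AutoModelForCausalLM.from_pretrained("microsoft/git-base-coco")
model.eval()

def forward_fn(pixel_values, input_ids, target_token_id):
    # Return the logit of one target token at the last position, so the
    # output is a single scalar per batch element, as Captum expects.
    logits = model(pixel_values=pixel_values, input_ids=input_ids).logits
    return logits[:, -1, target_token_id]

ig = IntegratedGradients(forward_fn)

pixel_values = torch.randn(1, 3, 224, 224)  # stand-in for a processed image
input_ids = torch.tensor([[processor.tokenizer.cls_token_id]])
# input_ids and the target id are not differentiable, so they are passed
# through additional_forward_args instead of as attribution inputs.
attributions = ig.attribute(
    pixel_values,
    additional_forward_args=(input_ids, 1037),  # 1037: hypothetical token id
    n_steps=32,
)
```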
-
Hello!
I have a question about extracting region features for image captioning:
- in the VinVL paper, it states that the 2048-d region features are stacked with 6 positionally encoded features (bbox, its h…
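For what it's worth, here is how I read that part of the paper, as a minimal sketch: each 2048-d detector feature is concatenated with 6 normalized box features, giving 2054-d region inputs. The exact normalization (box corners plus width and height, each divided by image size) is my assumption from the paper text, so check the released feature-extraction code to confirm:
```
import torch

def build_region_features(feats, boxes, img_w, img_h):
    """feats: (N, 2048) detector features; boxes: (N, 4) as x1, y1, x2, y2."""
    x1, y1, x2, y2 = boxes.unbind(dim=-1)
    pos = torch.stack(
        [x1 / img_w, y1 / img_h, x2 / img_w, y2 / img_h,
         (x2 - x1) / img_w, (y2 - y1) / img_h],
        dim=-1,
    )  # (N, 6) position features
    return torch.cat([feats, pos], dim=-1)  # (N, 2054) region inputs
```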
-
Hi, glad to see and use this cool project, thank you.
I have a question: is it possible to batch predictions for the image captioning task?
I see https://github.com/salesforce/BLIP/issues/48 but it's no…
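In case it helps, a minimal sketch of batched captioning with the Hugging Face port of BLIP; the linked issue is about the original Salesforce repo, but the same idea (stacking images into one batch before `generate()`) should carry over, and the image paths below are placeholders:
```
import torch
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-base")
model.eval()

images = [Image.open(p).convert("RGB") for p in ["a.jpg", "b.jpg"]]  # placeholder paths
inputs = processor(images=images, return_tensors="pt")  # stacks pixel_values into a batch

with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=30)
captions = processor.batch_decode(out, skip_special_tokens=True)
```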
-
Add audio transcription (using Whisper) and image captioning (using GPT-4V) functions, and implement them in the `gradio ui` notebook.
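A minimal sketch of what those two functions could look like, assuming the OpenAI API is used for both Whisper and GPT-4V; the model names and the Gradio wiring are assumptions, since the notebook itself is not shown here:
```
import base64
import gradio as gr
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def transcribe(audio_path):
    # Audio transcription with Whisper via the OpenAI API.
    with open(audio_path, "rb") as f:
        return client.audio.transcriptions.create(model="whisper-1", file=f).text

def caption(image_path):
    # Image captioning with GPT-4V: send the image inline as base64.
    with open(image_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode()
    resp = client.chat.completions.create(
        model="gpt-4-vision-preview",  # assumed vision-capable model name
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in one sentence."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
            ],
        }],
        max_tokens=100,
    )
    return resp.choices[0].message.content

with gr.Blocks() as demo:
    with gr.Tab("Transcription"):
        audio_in, text_out = gr.Audio(type="filepath"), gr.Textbox()
        gr.Button("Transcribe").click(transcribe, audio_in, text_out)
    with gr.Tab("Captioning"):
        image_in, cap_out = gr.Image(type="filepath"), gr.Textbox()
        gr.Button("Caption").click(caption, image_in, cap_out)

demo.launch()
```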
-
Hi, thanks for your amazing work. I'm enjoying using BLIP, which demonstrates impressive results :)
Now I have a question: how can I fine-tune BLIP for the image captioning task on a custom dataset?
My dat…
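Not an official answer, but a minimal sketch of one common fine-tuning setup with the Hugging Face port of BLIP; the dataset layout (image path, caption pairs) and the hyperparameters are assumptions, since the description of the data is cut off:
```
import torch
from PIL import Image
from torch.utils.data import DataLoader, Dataset
from transformers import BlipProcessor, BlipForConditionalGeneration

processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-base")

class CaptionDataset(Dataset):
    def __init__(self, pairs):  # pairs: list of (image_path, caption) tuples
        self.pairs = pairs
    def __len__(self):
        return len(self.pairs)
    def __getitem__(self, idx):
        path, caption = self.pairs[idx]
        enc = processor(images=Image.open(path).convert("RGB"), text=caption,
                        padding="max_length", truncation=True, return_tensors="pt")
        return {k: v.squeeze(0) for k, v in enc.items()}

loader = DataLoader(CaptionDataset([("img.jpg", "a caption")]),  # placeholder pair
                    batch_size=8, shuffle=True)
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

model.train()
for batch in loader:
    # BLIP computes the captioning LM loss when labels are provided.
    out = model(pixel_values=batch["pixel_values"],
                input_ids=batch["input_ids"],
                attention_mask=batch["attention_mask"],
                labels=batch["input_ids"])
    out.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```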
-
I'm curious what prompts DynamiCrafter responds to. I see some in the examples, but is there a resource with more info on what types of prompts it responds best to?
-
`from transformers import AutoProcessor, BlipForConditionalGeneration`
`processor = AutoProcessor.from_pretrained("Salesforce/blip-image-captioning-base")`
`model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-base")`
Dataset[datatset3…
-
Hello, I wondered if there was a way to train the model on new data. I am sorry if I missed the documentation somewhere.
-
Hi,
I want to fine-tune the model on my own dataset. How should I prepare the stage 1 and stage 2 training data? What is the difference? The description of caption_stage1_train.tsv and caption_stage2_t…