Text generation - Githubissues

CERC-AAI / multimodal

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Apache License 2.0

8 stars 3 forks source link

support text generation with image as input.

Use instruction:

put the image_path and text prompt to jsonl file, set the config of text-generation.yml eg: sample_input.jsonl {'image':'image_path', 'text':'your_prompt'}
set up the environment just like the one for training.
run with python deepy.py generate.py -d your_train_configs(eg: 410M.yml summit-setup.yml) text-generation.yml

To do: support the use_cache=True, which shall speed up the inference.

CERC-AAI / multimodal