Making custom inferences

gicheonkang / dan-visdial

✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"

MIT License

45 stars 10 forks source link

Hi @puneet-kr, thank you for your interest.

General procedures for inference are as follows:

load the pre-trained model
embed inputs to vector (image, query, dialog history, answer candidates)
feed the embeddings to the model
transform the model output to human readable output

If you need to get custom inferences, pre-processing steps for embedding vector are required ! Embedding for image inputs --> Faster R-CNN Embedding for text inputs --> word tokens to pre-defined numbers using word to index dictionary

gicheonkang / dan-visdial

Making custom inferences #7