This paper proposes a novel recurrent, attention- and memory-based neural architecture, the "Memory, Attention and Composition (MAC) cell", for visual question answering (VQA), featuring multi-step reasoning.
The MAC cell comprises three operational units:
Control unit: At each iteration, updates the control state to attend to some aspect of the given question.
Read unit: Extracts information out of a knowledge base, guided by the control and memory states.
Write unit: Integrates the retrieved information into the memory state, iteratively computing the answer.
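
The interplay of the three units can be illustrated with a minimal NumPy sketch. This is not the paper's exact formulation: the class name `MACCellSketch`, the weight names, the random initialization, and the simplified attention and write rules are all assumptions made for illustration. It only shows the data flow of one reasoning step: the control state attends over question words, the read unit attends over the knowledge base guided by control and memory, and the write unit merges the retrieved information into the memory.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - x.max())
    return e / e.sum()


class MACCellSketch:
    """Simplified, hypothetical MAC-style cell (illustrative only)."""

    def __init__(self, dim, seed=0):
        rng = np.random.default_rng(seed)
        s = 1.0 / np.sqrt(dim)
        # All weights are random placeholders; a real model would learn them.
        self.W_control = rng.normal(0, s, (dim, 2 * dim))
        self.w_cattn = rng.normal(0, s, dim)
        self.W_read = rng.normal(0, s, (dim, 2 * dim))
        self.w_rattn = rng.normal(0, s, dim)
        self.W_write = rng.normal(0, s, (dim, 2 * dim))

    def control(self, control, question, words):
        # Build a query from the previous control state and the question,
        # then attend over the question words (shape: [T, dim]).
        query = self.W_control @ np.concatenate([control, question])
        attn = softmax((words * query) @ self.w_cattn)
        return attn @ words

    def read(self, control, memory, kb):
        # Relate each knowledge-base item (shape: [N, dim]) to the memory,
        # then attend over the items, guided by the control state.
        interactions = kb * memory
        combined = np.concatenate([interactions, kb], axis=1) @ self.W_read.T
        attn = softmax((combined * control) @ self.w_rattn)
        return attn @ kb

    def write(self, memory, retrieved):
        # Integrate the retrieved information into the memory state.
        return self.W_write @ np.concatenate([retrieved, memory])

    def step(self, control, memory, question, words, kb):
        control = self.control(control, question, words)
        retrieved = self.read(control, memory, kb)
        memory = self.write(memory, retrieved)
        return control, memory


def run_reasoning(cell, question, words, kb, steps=4):
    """Apply the cell for a fixed number of reasoning steps."""
    dim = question.shape[0]
    control = np.zeros(dim)
    memory = np.zeros(dim)
    for _ in range(steps):
        control, memory = cell.step(control, memory, question, words, kb)
    return control, memory


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    dim = 8
    question = rng.normal(size=dim)      # whole-question embedding (made up)
    words = rng.normal(size=(5, dim))    # 5 question-word embeddings (made up)
    kb = rng.normal(size=(10, dim))      # 10 knowledge-base items (made up)
    control, memory = run_reasoning(MACCellSketch(dim), question, words, kb)
    print(control.shape, memory.shape)
```

In a VQA setting, the knowledge base would typically hold image-region features and the final memory state would feed an answer classifier; both are omitted here to keep the sketch short.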
Metadata