With the aim of building next-generation virtual assistants that can handle multimodal inputs and perform multimodal actions, we introduce two new datasets (both in the virtual shopping domain), the annotation schema, the core technical tasks, and the baseline models. The code for the baselines and the datasets will be open-sourced.
Hi,
I was checking the tables showing the baseline results for subtask #1 (mm_action_prediction/README.md) and noticed that both tables report results for the furniture dataset only. I think this is an error, and one of the tables should refer to the fashion dataset instead.