henryhungle / MTN

Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
MIT License
98 stars 25 forks source link

MTN Baseline for SIMMC 2.0 Dataset #11

Closed satwikkottur closed 3 years ago

satwikkottur commented 3 years ago

Baselines for Multimodal Dialog State Tracking (Subtask 3) and Assistant Response Generation (Subtask 4).