Abstract: Visual Dialog is a challenging multimodal task requiring models to answer questions about images through multi-turn conversations. Despite significant progress, research has predominantly ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results