Building Goal-Oriented Dialogue Systems with Situated Visual Context - 42Papers