A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions - 42Papers