您的位置：首页 > 其它

Beyond Caption To Narrative: Video Captioning With Multiple Sentences

2017-06-11 10:15 435 查看

Beyond Caption To Narrative: Video Captioning With Multiple Sentences

Andrew Shin, Katsunori Ohnishi, Tatsuya
Harada

(Submitted on 18 May 2016)

Recent advances in image captioning task have led to increasing interests in video captioning task. However, most works on video captioning are focused on generating single input of aggregated features, which hardly deviates from image captioning process and
does not fully take advantage of dynamic contents present in videos. We attempt to generate video captions that convey richer contents by temporally segmenting the video with action localization, generating multiple captions from multiple frames, and connecting
them with natural language processing techniques, in order to generate a story-like caption. We show that our proposed method can generate captions that are richer in contents and can compete with state-of-the-art method without explicitly using video-level
features as input.

Comments:	accepted to ICIP 2016
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1605.05440 [cs.CV]
	(or arXiv:1605.05440v1 [cs.CV] for this version)

Submission history

From: Andrew Shin [view email]
[v1] Wed, 18 May 2016 05:00:12 GMT (1186kb,D)

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

新的分享

章节导航