Automatic cinematography and multilingual NLG for generating video documentaries

被引:18
作者
Callaway, C [1 ]
Not, E [1 ]
Novello, A [1 ]
Rocchi, C [1 ]
Stock, O [1 ]
Zancanaro, M [1 ]
机构
[1] Ctr Ric Sci & Tecnol, ITC Irst, Trento, Italy
关键词
automatic cinematography; natural language generation; multimedia presentations;
D O I
10.1016/j.artint.2005.02.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Automatically constructing a complete documentary or educational film from scattered pieces of images and knowledge is a significant challenge. Even when this information is provided in an annotated format, the problems of ordering, structuring and animating sequences of images, and producing natural language descriptions that correspond to those images within multiple constraints, are each individually difficult tasks. This paper describes an approach for tackling these problems through a combination of rhetorical structures with narrative and film theory to produce movie-like visual animations from still images along with natural language generation techniques needed to produce text descriptions of what is being seen in the animations. The use of rhetorical structures from NLG is used to integrate separate components for video creation and script generation. We further describe an implementation, named GLAMOUR, that produces actual, short video documentaries, focusing on a cultural heritage domain, and that have been evaluated by professional filmmakers. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:57 / 89
页数:33
相关论文
共 49 条
[1]
Andre E., 2000, HDB NATURAL LANGUAGE, P305
[2]
ANDROUTSOPOULOS I, 2002, P 2002 CLASS WORKSH
[3]
Arijon D., 1976, Grammar of the film language
[4]
Bares W. H., 2000, Smart Graphics. Papers from the 2000 AAAI Symposium, P84
[5]
Bares WH, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P1101
[6]
BATEMAN J, 1998, ECAI WORKSH MULT LEX, V2, P1
[7]
Bateman J., 2000, P 2 INT C LANG RES E, P1763
[8]
BLACK A, 2002, 4 SPEECH SYNTH WORKS
[9]
BUTZ A, 1997, P 9 INN APPL ART INT, P957
[10]
CAHILL L, 2000, REFERENCE ARCHITECTU