Toward speech as a knowledge resource

被引:9
作者
Brown, EW
Srinivasan, S
Coden, A
Ponceleon, D
Cooper, JW
Amir, A
机构
[1] IBM Corp, Almaden Res Ctr, Div Res, San Jose, CA 95120 USA
[2] IBM Corp, Div Res, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
D O I
10.1147/sj.404.0985
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
Speech is a tantalizing mode of human communication. On the one hand, humans understand speech with ease and use speech to express complex ideas, information, and knowledge. On the other hand, automatic speech recognition with computers is very hard, and extracting knowledge from speech is even harder. Nevertheless, the potential reward for solving this problem drives us to pursue it. Before we can exploit speech as a knowledge resource, however, we must understand the current state of the art in speech recognition and the relevant, successful applications of speech recognition in the related areas of multimedia indexing and search. In this paper we advocate the study of speech as a knowledge resource, provide a brief introduction to the state of the art in speech recognition, describe a number of systems that use speech recognition to enable multimedia analysis, indexing, and search, and present a number of exploratory applications of speech recognition that move toward the goal of exploiting speech as a knowledge resource.
引用
收藏
页码:985 / 1001
页数:17
相关论文
共 38 条
[1]
Content-based representation and retrieval of visual media: A state-of-the-art review [J].
Aigrain, P ;
Zhang, HJ ;
Petkovic, D .
MULTIMEDIA TOOLS AND APPLICATIONS, 1996, 3 (03) :179-202
[2]
AMIR A, 2000, P HAW INT C MULT HIC
[3]
[Anonymous], P 1988 ACM C COMP SU
[4]
[Anonymous], 1996, P 19 ANN INT ACM SIG, DOI DOI 10.1145/243199.243202
[5]
BACH JR, 1996, P STOR RETR STILL IM
[6]
VideoQ: An automated content based video search system using visual cues [J].
Chang, SF ;
Chen, W ;
Meng, HJ ;
Sundaram, H ;
Zhong, D .
ACM MULTIMEDIA 97, PROCEEDINGS, 1997, :313-324
[7]
CHEN SS, 2001, IN PRESS SPEECH COMM
[8]
Christel M. G., 1998, CHI 98. Human Factors in Computing Systems. CHI 98 Conference Proceedings, P171, DOI 10.1145/274644.274670
[9]
CODEN A, 2001, P HAW INT C SYST SCI
[10]
Cooper JW, 1997, ACM DIGITAL LIBRARIES '97, P237