Two decades of statistical language modeling: Where do we go from here?

Cited by: 281
Authors
Rosenfeld, R [1]
Affiliation
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
Keywords
natural language processing; natural language technologies; statistical language modeling
DOI
10.1109/5.880083
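Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract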
Statistical language models estimate the distribution of various natural language phenomena for the purpose of speech recognition and other language technologies. Since the first significant model was proposed in 1980, many attempts have been made to improve the state of the art. We review them here, point to a few promising directions, and argue for a Bayesian approach to integration of linguistic theories with data.
Pages: 1270-1278
Page count: 9