Hybrid modeling, HMM/NN architectures, and protein applications

被引:29
作者
Baldi, P [1 ]
Chauvin, Y [1 ]
机构
[1] NET ID INC,SAN FRANCISCO,CA 94107
关键词
D O I
10.1162/neco.1996.8.7.1541
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a hybrid modeling approach where the parameters of a model are calculated and modulated by another model, typically a neural network (NN), to avoid both overfitting and underfitting. We develop the approach for the case of Hidden Markov Models (HMMs), by deriving a class of hybrid HMM/NN architectures. These architectures can be trained with unified algorithms that blend HMM dynamic programming with NN backpropagation. In the case of complex data, mixtures of HMMs or modulated HMMs must be used. NNs can then be applied both to the parameters of each single HMM, and to the switching or modulation of the models, as a function of input or context. Hybrid HMM/NN architectures provide a flexible NN parameterization for the control of model structure and complexity. At the same time, they can capture distributions that, in practice, are inaccessible to single HMMs. The HMM/NN hybrid approach is tested, in its simplest form, by constructing a model of the immunoglobulin protein family. A hybrid model is trained, and a multiple alignment derived, with less than a fourth of the number of parameters used with previous single HMMs.
引用
收藏
页码:1541 / 1565
页数:25
相关论文
共 23 条
[1]  
ALTSCHUL SF, 1991, J MOL BIOL, V219, P1
[2]  
Baldi P, 1994, J Comput Biol, V1, P311, DOI 10.1089/cmb.1994.1.311
[3]   HIDDEN MARKOV-MODELS OF BIOLOGICAL PRIMARY SEQUENCE INFORMATION [J].
BALDI, P ;
CHAUVIN, Y ;
HUNKAPILLER, T ;
MCCLURE, MA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (03) :1059-1063
[4]  
BALDI P, 1994, ADV NEURAL INFORMATI, V6, P761
[5]  
BALDI P, 1994, NEURAL COMPUT, V6, P305
[6]  
BENGIO Y, 1995, ADV NEURAL INFORMATI, V7
[7]  
BENGIO Y, 1995, ADV NEURAL INFORMATI, V6
[8]  
Bourlard H. A., 1994, Connectionist speech recognition: a hybrid approach
[9]   AN HMM/MLP ARCHITECTURE FOR SEQUENCE RECOGNITION [J].
CHO, SB ;
KIM, JH .
NEURAL COMPUTATION, 1995, 7 (02) :358-369
[10]   THE HELMHOLTZ MACHINE [J].
DAYAN, P ;
HINTON, GE ;
NEAL, RM ;
ZEMEL, RS .
NEURAL COMPUTATION, 1995, 7 (05) :889-904