Stable encoding of finite-state machines in discrete-time recurrent neural nets with sigmoid units

被引：19

作者：

Carrasco, RC ^{[1
]}

Forcada, ML

Valdés-Muñoz, MA

Ñeco, RP

机构：

[1] Univ Alacant, Dept Llenguatges & Sistemes Informat, E-03071 Alacant, Spain

[2] Univ Miguel Hernandez, Dept Ciencias Expt & Tecnol, E-03202 Elx, Spain

来源：

NEURAL COMPUTATION | 2000年 / 12卷 / 09期

关键词：

D O I：

10.1162/089976600300015097

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There has been a lot of interest in the use of discrete-time recurrent neural nets (DTRNN) to learn finite-state tasks, with interesting results regarding the induction of simple finite-state machines from input-output strings. Parallel work has studied the computational power of DTRNN in connection with finite-state computation. This article describes a simple strategy to devise stable encodings of finite-state machines in computationally capable discrete-time recurrent neural architectures with sigmoid units and gives a detailed presentation on how this strategy may be applied to encode a general class of finite- state machines in a variety of commonly used first- and second-order recurrent neural networks. Unlike previous work that either imposed some restrictions to state values or used a detailed analysis based on fixed-point attractors, our approach applies to any positive, bounded, strictly growing, continuous activation function and uses simple bounding criteria based on a study of the conditions under which a proposed encoding scheme guarantees that the DTRNN is actually behaving as a finite-state machine.

引用

页码：2129 / 2174

页数：46

共 56 条

[1] EFFICIENT SIMULATION OF FINITE AUTOMATA BY NEURAL NETS
ALON, N
DEWDNEY, AK
OTT, TJ
[J]. JOURNAL OF THE ACM, 1991, 38 (02) : 495 - 514
[2] AN ALGEBRAIC FRAMEWORK TO REPRESENT FINITE-STATE MACHINES IN SINGLE-LAYER RECURRENT NEURAL NETWORKS
ALQUEZAR, R
SANFELIU, A
[J]. NEURAL COMPUTATION, 1995, 7 (05) : 931 - 949
[3] [Anonymous], 1994, P C ADV NEUR INF PRO
[4] [Anonymous], 2001, NEURAL NETWORKS COMP
[5] ARAI K, 1996, P ICANN, P519
[6] ARAI KI, 1997, PROGR CONNECTIONIST, V1, P351
[7] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
BENGIO, Y
SIMARD, P
FRASCONI, P
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
[8] Analysis of dynamical recognizers
Blair, AD
Pollack, JB
[J]. NEURAL COMPUTATION, 1997, 9 (05) : 1127 - 1142
[9] Carrasco R. C., 1996, Grammatical Inference: Learning Syntax from Sentences. Third International Colloquium, ICGI-96 Proceedings, P274, DOI 10.1007/BFb0033361
[10] Carrasco RC, 1999, IEE CONF PUBL, P673, DOI 10.1049/cp:19991188

← 1 2 3 4 5 6 →