Stable encoding of large finite-state automata in recurrent neural networks with sigmoid discriminants

被引：21

作者：

Omlin, CW ^{[1
]}

Giles, CL ^{[1
]}

机构：

[1] UNIV MARYLAND,UMIACS,COLLEGE PK,MD 20742

来源：

NEURAL COMPUTATION | 1996年 / 8卷 / 04期

关键词：

D O I：

10.1162/neco.1996.8.4.675

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose an algorithm for encoding deterministic finite-state automata (DFAs) in second-order recurrent neural networks with sigmoidal discriminant function and we prove that the languages accepted by the constructed network and the DFA are identical. The desired finite-state network dynamics is achieved by programming a small subset of all weights. A worst case analysis reveals a relationship between the weight strength and the maximum allowed network size, which guarantees finite-state behavior of the constructed network. We illustrate the method by encoding random DFAs with 10, 100, and 1000 states. While the theory predicts that the weight strength scales with the DFA size, we find empirically the weight strength to be almost constant for all the random DFAs. These results can be explained by noting that the generated DFAs represent average cases. We empirically demonstrate the existence of extreme DFAs for which the weight strength scales with DFA size.

引用

页码：675 / 696

页数：22

共 17 条

[1] EFFICIENT SIMULATION OF FINITE AUTOMATA BY NEURAL NETS
ALON, N
DEWDNEY, AK
OTT, TJ
[J]. JOURNAL OF THE ACM, 1991, 38 (02) : 495 - 514
[2] AN ALGEBRAIC FRAMEWORK TO REPRESENT FINITE-STATE MACHINES IN SINGLE-LAYER RECURRENT NEURAL NETWORKS
ALQUEZAR, R
SANFELIU, A
[J]. NEURAL COMPUTATION, 1995, 7 (05) : 931 - 949
[3] FINDING STRUCTURE IN TIME
ELMAN, JL
[J]. COGNITIVE SCIENCE, 1990, 14 (02) : 179 - 211
[4] FRASCONI P, 1993, INJECTING NONDETERMI
[5] FRASCONI P, 1991, P INT JOINT C NEURAL, V1, P811
[6] GILES CL, 1993, CONNECT SCI, V5, P307
[7] GILES CL, 1992, NEURAL COMPUT, V4, P380
[8] GORI M, 1996, IN PRESS MACHINE LEA
[9] Hopcroft J. E., 2007, Introduction to Automata Theory, Languages and Computation
[10] HORNE BG, 1994, ADV NEURAL INFORMATI, V6, P359

← 1 2 →