One-unambiguous regular languages

被引:106
作者
Bruggemann-Klein, A
Wood, D
机构
[1] Tech Univ Munich, Inst Informat, D-80290 Munich, Germany
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Kowloon, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1006/inco.1997.2695
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The ISO standard for the Standard Generalized Markup Language (SGML) provides a syntactic meta-language for the definition of textual markup systems. In the standard, the right-hand sides of productions are based on regular expressions, although only regular expressions that denote words unambiguously, in the sense of the ISO standard, are allowed. In general, a word that is denoted by a regular expression is witnessed by a sequence of occurrences of symbols in the regular expression that match the word. In an unambiguous regular expression as defined by Book at al. (1971, IEEE Trans. Comput. C-20(2), 149-153) each word has at most one witness. But the SGML standard also requires that a witness be computed incrementally from the word with a one-symbol lookahead; we call such regular expressions I-unambiguous. A regular language is a I-unambiguous language if it is denoted by some 1-unambiguous regular expression. We give a Kleene theorem for 1-unambiguous languages and characterize 1-unambiguous regular language in terms of structural properties of the minimal deterministic automate that recognize them. As a result we are able to prove the decidability of whether a given regular expression denotes a 1-unambiguous language; if it does, then we can construct an equivalent 1-unambiguous regular expression in worst-case optimal time. (C) 1998 Academic Press.
引用
收藏
页码:182 / 206
页数:25
相关论文
共 32 条
[1]  
Aho A. V., 1974, ADDISON WESLEY SERIE
[2]  
Ahonen H, 1997, LECT NOTES COMPUT SC, V1293, P27
[3]  
[Anonymous], 1990, SGML HDB
[4]  
[Anonymous], 1986, SER ADDISON WESLEY S
[5]   FROM REGULAR EXPRESSIONS TO DETERMINISTIC AUTOMATA [J].
BERRY, G ;
SETHI, R .
THEORETICAL COMPUTER SCIENCE, 1986, 48 (01) :117-126
[6]   Local languages and the Berry-Sethi algorithm [J].
Berstel, J ;
Pin, JE .
THEORETICAL COMPUTER SCIENCE, 1996, 155 (02) :439-446
[7]   AMBIGUITY IN GRAPHS AND EXPRESSIONS [J].
BOOK, R ;
EVEN, S ;
GREIBACH, S ;
OTT, G .
IEEE TRANSACTIONS ON COMPUTERS, 1971, C 20 (02) :149-+
[8]  
Bruggemann-Klein A., 1993, Algorithms - ESA'93. First Annual European Symposium Proceedings, P73
[9]   REGULAR EXPRESSIONS INTO FINITE AUTOMATA [J].
BRUGGEMANNKLEIN, A .
THEORETICAL COMPUTER SCIENCE, 1993, 120 (02) :197-213
[10]  
BRUGGEMANNKLEIN A, 1992, LECT NOTES COMPUT SC, V583, P87