全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

基于与或树的正则表达式有害二义性检查算法*

, PP. 173-178

Keywords: 正则表达式(RE),匹配,二义性,与或树

Full-Text   Cite this paper   Add to My Lib

Abstract:

在构造面向应用的正则表达式(RE)过程中,引入有益二义性可简化RE构造,而将有害二义性遗留在RE中会危害匹配结果的正确性.为区别对待这两种二义性,基于与或树提出一种检查和定位RE中有害二义性的算法,该算法可减轻RE调试的工作量.实验表明,该算法在时间性能、空间性能和实用性等方面优于现有基于自动机的二义性检查算法.基于此算法的可视化RE编辑调试环境已用于构建国内第一个整合的生物数据仓库.

References

[1]  Chan C Y, Garofalakis M, Rastogi R. RE-Tree: An Efficient Index Structure for Regular Expressions. International Journal on Very Large Data Bases, 2003, 12(2): 102-119
[2]  Embley D W, Campbell D M, Smith R D, et al. Ontology-Based Extraction and Structuring of Information from Data-Rich Unstructured Documents. In: Proc of the 7th International Conference on Information and Knowledge Management. Bethesda, USA, 1998, 52-59
[3]  Laender A H F, Ribeiro-Neto B, da Silva A S. DEByE: Data Extraction by Example. Data and Knowledge Engineering, 2002, 40(2): 121-154
[4]  Wu T H, Pottenger W M. A Semi-Supervised Algorithm for Pattern Discovery in Information Extraction from Textual Data. In: Proc of the 7th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Seoul, Korea, 2003, 117-123
[5]  Heddad A, Brameier M, MacCallum R M. Evolving Regular Expression-Based Sequence Classifiers for Protein Nuclear Localisation. In: Proc of the 2nd European Workshop on Evolutionary Bioinformatics. Coimbra, Portugal, 2004, 31-40
[6]  Vansummeren S. Unique Pattern Matching in Strings. 2003. http://arXiv.org/abs/cs/0302004
[7]  Frisch A, Cardelli L. Greedy Regular Expression Matching. In: Proc of the 31st International Colloquium on Automata, Languages and Programming. Turku, Finland, 2004, 618-629
[8]  Hosoya H. Regular Expression Pattern Matching: A Simpler Design. 2003. http://arbre.is.s.u-tokyo.ac.jp/~hahosoya/papers/ambig.ps

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133