全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

A Tree Pattern Matching Algorithm for XML Queries with Structural Preferences

DOI: 10.4236/jcc.2019.71006, PP. 61-83

Keywords: Semi-Structured Documents, Preference Queries, Tree Pattern Matching, TreeMatch Algorithm, XML, The Skyline Operator

Full-Text   Cite this paper   Add to My Lib

Abstract:

In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly complex model, the lack or the ignorance of the explicit document model (DTD—Document Type Definition, Schema, etc.) increases the risk of obtaining an empty result set when the query is too specific, or, too large result set when it is too vague (e.g. it contains wildcards such as “*”). The reason is that in both cases, users write queries according to the document model they have in mind; this can be very far from the one that can actually be extracted from the document. Opposed to exact queries, preference queries are more flexible and can be relaxed to expand the search space during their evaluations. Indeed, during their evaluation, certain constraints (the preferences they contain) can be relaxed if necessary to avoid precisely empty results; moreover, the returned answers can be filtered to retain only the best ones. This paper presents an algorithm for evaluating such queries inspired by the TreeMatch algorithm proposed by Yao et al. for exact queries. In the proposed algorithm, the best answers are obtained by using an adaptation of the Skyline operator (defined in relational databases) in the context of documents (trees) to incrementally filter into the partial solutions set, those which satisfy the maximum of preferential constraints. The only restriction imposed on documents is No-Self-Containment.

References

[1]  Tchendji, M.T. and Tadonfouet, L. (2014) Evaluation des requêtes avec préférences structurelles sur les documents XML. In: Sellami, M., Badouel, E. and Lo, M., Eds., Actes du CARI 2014, Saint-Louis, 333-341.
[2]  Cho, S.R. and Balke, W.-T. (2009) Building an Efficient Preference XML Query Processor. Proceedings of the 2009 ACM Symposium on Applied Computing, Honolulu, 8-12 March 2009, 1585-1586.
[3]  Abiteboul, S. (1999) On Views and XML. Proceedings of the 18th ACM Symposium on Principles of Database Systems, Philadelphia, 31 May-3 June 1999, 1-9.
https://doi.org/10.1145/303976.303977
[4]  Bosc, P. and Pivert, O. (1995) SQLf: A Relational Database Language for Fuzzy Querying. IEEE Transactions on Fuzzy Systems, 3, 1-17.
[5]  Kiebling, W. (2002) Foundations of Preferences in Database Systems. Proceedings of the 28th International Conference on Very Large Databases (VLDB), Hong Kong 20-23 August 2002, 311-322.
https://doi.org/10.1016/B978-155860869-6/50035-4
[6]  Chomicki, J. (2003) Preference Formulas in Relational Queries. ACM Transactions on Database Systems (TODS), 28, 427-466.
[7]  Agrawal, R., Kiernan, J., Srikant, R. and Xu, Y.R. (2003) An XPath-Based Preference Language for P3P. Proceedings of the 12th international conference on World Wide Web, Budapest, 20-24 May 2003, 629-639.
[8]  Kieβling, W., Hafenrichter, B., Fischer, S. and Holland, S. (2001) Preference XPATH: A Query Language for E-Commerce. Proc. 5th Intern. Konferenz für Wirtschaftsinformatik, Augsburg, September 2001, 425-440.
[9]  Cohen, S. and Shiloach, M. (2009) Flexible XML Querying Using Skyline Semantics. Proceedings of the 2009 IEEE International Conference on Data Engineering, Shanghai, 29 March-2 April 2009, 30-37.
https://doi.org/10.1109/ICDE.2009.24
[10]  Yao, J.T. and Zhang, M. (2004) A Fast Tree Pattern Matching Algorithm for XML Query. Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence, Beijing, 20-24 September 2004, 235-241.
https://doi.org/10.1109/WI.2004.10048
[11]  Börzsönyi, S., Kossmann, D. and Stocker, K. (2001) The Skyline Operator. Proceedings of the 17th International Conference on Data Engineering, Hiedelberg, 2-6 April 2001, 421-430.
[12]  Zhang, C., Naughton, J.F., DeWitt, D.J., Luo, Q. and Lohman, G.M. (2001) On Supporting Containment Queries in Relational Database Management Systems. Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, Santa Barbara, CA, 21-24 May 2001, 425-436.
https://doi.org/10.1145/375663.375722
[13]  Al-Khalifa, S., Jagadish, H.V., Koudas, N., Patel, J.M., Srivastava, D. and Wu, Y. (2002) Structural Joins: A Primitive for Efficient xml Query Pattern Matching. Proceedings of the 18th International Conference on Data Engineering, San Jose, 26 February-1 March 2002, 141-152.
https://doi.org/10.1109/ICDE.2002.994704
[14]  Lu, J., Chen, T. and Ling, T.W. (2004) Efficient Processing of xml Twig Patterns with Parent Child Edges: A Look-Ahead Approach. Proceedings of the 13th ACM International Conference on Information and Knowledge Management, New York, NY, 8-13 November 2004, 533-542.
https://doi.org/10.1145/1031171.1031272
[15]  Chen, S., Li, H.-G., Tatemura, J., Hsiung, W.-P., Agrawal, D. and Candan, K.S. (2006) Twig2stack: Bottom-Up Processing of Generalized-Tree-Pattern Queries over xml Documents. Proceedings of the 32nd International Conference on Very Large Data Bases, Seoul, 12-15 September 2006, 283-294.
[16]  Jiang, Z., Luo, C., Hou, W., Zhu, Q. and Che, D. (2007) Efficient Processing of XML Twig Pattern: A Novel One-Phase Holistic Solution. 18th International Conference Database and Expert Systems Applications, Regensburg, 3-7 September 2007, 87-97.
https://doi.org/10.1007/978-3-540-74469-6_10
[17]  Li, Q. and Moon, B. (2001) Indexing and Querying XML Data for Regular Path Expressions. Proceedings of the 27th VLDB Conference, Roma, 11-14 September 2001, 361-370.
[18]  Tan, K.-L., Eng, P.-K. and Ooi, B.C. (2001) Efficient Progressive Skyline Computation. Proceedings of the 27th VLDB Conference, Roma, 11-14 September 2001, 301-310.
[19]  Bouadi, T., Bringay, S., Poncelet, P. and Teisseire, M. (2010) Requêtes skyline avec prise en compte des préférences utilisateurs pour des données volumineuses. Extraction et gestion des connaissances, Hammamet, 26 au 29 janvier 2010, 399-404.
[20]  Béatrice, B., Ferrari Mirian Halfeld Lima Maria Adriana Vidigal (2011) Attribute Grammar for XML Integrity Constraint Validation. Proceedings of the 22nd International Conference on Database and Expert Systems Applications, Toulouse, 29 August-2 September 2011, 94-109.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133