
Resolving XML Semantic Ambiguity

Abstract : Semantic-aware XML processing has become an important challenge in Web data management, data processing, and information retrieval. Although XML data is semi-structured, it remains prone to lexical ambiguity, and thus requires dedicated semantic analysis and sense disambiguation to assign well-defined meanings to XML elements and attributes. This is crucial in applications ranging from semantic-aware query rewriting, semantic document clustering and classification, and schema matching to blog analysis and event detection in social networks and tweets. Most existing approaches in this context: i) ignore the problem of identifying ambiguous XML nodes, ii) only partially consider their structural relations/context, iii) use syntactic information in processing XML data regardless of the semantics involved, and iv) are static, adopting fixed disambiguation constraints and thus limiting user involvement. In this paper, we present a new XML Semantic Disambiguation Framework, XSDF, designed to address each of these limitations. It takes as input an XML document and a general-purpose semantic network, and produces as output a semantically augmented XML tree made of unambiguous semantic concepts. Experiments demonstrate the effectiveness of our approach in comparison with alternative methods.
Document type :
Conference papers

Cited literature [62 references]
Contributor : Julien Rabaud
Submitted on : Wednesday, April 3, 2019 - 10:03:40 AM
Last modification on : Monday, February 14, 2022 - 11:32:03 AM






Nathalie Charbel, Joe Tekli, Richard Chbeir, Gilbert Tekli. Resolving XML Semantic Ambiguity. 18th International Conference on Extending Database Technology, EDBT 2015, Mar 2015, Brussels, Belgium. pp.277-288, ⟨10.5441/002/edbt.2015.25⟩. ⟨hal-01909107⟩


