论文标题

内容和结构的动态交织,用于半结构层次数据的鲁棒索引(扩展版本)

Dynamic Interleaving of Content and Structure for Robust Indexing of Semi-Structured Hierarchical Data (Extended Version)

论文作者

Wellenzohn, Kevin, Böhlen, Michael H., Helmer, Sven

论文摘要

我们为半结构化层次数据提出了一个可靠的索引,该数据支持由路径和值谓词指定的内容和结构的查询(CAS)查询。我们方法的核心是一种新型的动态交织方案,该方案以平衡的方式融合了复合键的路径和价值维度。我们将这些钥匙存储在基于TRIE的鲁棒内容和结构指数中,这些键有效地支持了广泛的CAS查询,包括带有通配符和后代轴的查询。此外,我们显示了我们方案的重要特性,例如针对不同选择性的鲁棒性,并在我们的实验评估中证明了比现有方法的最高两个数量级的改善。

We propose a robust index for semi-structured hierarchical data that supports content-and-structure (CAS) queries specified by path and value predicates. At the heart of our approach is a novel dynamic interleaving scheme that merges the path and value dimensions of composite keys in a balanced way. We store these keys in our trie-based Robust Content-And-Structure index, which efficiently supports a wide range of CAS queries, including queries with wildcards and descendant axes. Additionally, we show important properties of our scheme, such as robustness against varying selectivities, and demonstrate improvements of up to two orders of magnitude over existing approaches in our experimental evaluation.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源