Loading...
Please wait, while we are loading the content...
Similar Documents
Schema-independent Querying for Heterogeneous Collections in NoSQL Document Stores
| Content Provider | Hyper Articles en Ligne (HAL) |
|---|---|
| Author | Ben Hamadou, Hamdi Ghozzi, Faiza Péninou, André Teste, Olivier |
| Copyright Year | 2019 |
| Abstract | NoSQL document stores are well-tailored to efficiently load and manage massive collections of heterogeneous documents without any prior structural validation. However, this flexibility becomes a serious challenge when querying heterogeneous documents, and hence the user has to build complex queries or reformulate existing queries whenever new schemas are introduced in a collection. In this paper we propose a novel approach, based on formal foundations, for building schema-independent queries which are designed to query multi-structured documents. We present a query enrichment mechanism that consults a pre-constructed dictionary. This dictionary binds each possible path in the documents to all its corresponding absolute paths in all the documents. We automate the process of query reformulation via a set of rules that reformulate most document store operators, such as select, project, unnest, aggregate and lookup. We then produce queries across multi-structured documents which are compatible with the native query engine of the underlying document store. To evaluate our approach, we conducted experiments on synthetic datasets. Our results show that the induced overhead can be acceptable when compared to the efforts needed to restructure the data or the time required to execute several queries corresponding to the different schemas inside the collection. |
| Related Links | https://hal.science/hal-02976616/file/S0306437918302990.pdf |
| ISSN | 03064379 |
| DOI | 10.1016/j.is.2019.04.005 |
| Journal | Information Systems |
| Volume Number | 85 |
| Language | English |
| Publisher | HAL CCSD Elsevier |
| Publisher Date | 2019-11-01 |
| Access Restriction | Open |
| Subject Keyword | Information Systems Structural Heterogeneity Document Stores Schema-independent Querying Computer Science [cs] |
| Content Type | Text |
| Resource Type | Article |
| Subject | Medicine Information Systems Hardware and Architecture Software |