Next Article in Journal
A Novel Multi-Criteria Decision-Making Model: Interval Rough SAW Method for Sustainable Supplier Selection
Previous Article in Journal
A Feature Selection and Classification Method for Activity Recognition Based on an Inertial Sensing Unit
Open AccessArticle

Blind Queries Applied to JSON Document Stores

1
Department of Management, Information and Production Engineering, University of Bergamo, 24129 Bergamo, Italy
2
Tabulaex - A Burning-Glass Company, 20126 Milano, Italy
*
Author to whom correspondence should be addressed.
Information 2019, 10(10), 291; https://doi.org/10.3390/info10100291
Received: 28 August 2019 / Revised: 17 September 2019 / Accepted: 19 September 2019 / Published: 21 September 2019
Social Media, Web Portals and, in general, information systems offer their own Application Programming Interfaces (APIs), used to provide large data sets concerning every aspect of day-by-day life. APIs usually provide data sets as collections of JSON documents. The heterogeneous structure of JSON documents returned by different APIs constitutes a barrier to effectively query and analyze these data sets. The adoption of NoSQL document stores, such as MongoDB, is useful for gathering these data sets, but does not solve the problem of querying the final heterogeneous repository. The aim of this paper is to provide analysts with a tool, named HammerJDB, that allows for blind querying collections of JSON documents within a NoSQL document database. The idea below is that users may know the application domain but it may be that they are not aware of the real structures of the documents stored in the database—the tool for blind querying tries to bridge the gap, by adopting a query rewriting mechanism. This paper is an evolution of a technique for blind querying Open Data portals and of its implementation within the Hammer framework, presented in some previous work. In this paper, we evolve that approach in order to query a NoSQL document database by evolving the Hammer framework into the HammerJDB framework, which is able to work on MongoDB databases. The effectiveness of the new approach is evaluated on a data set (derived from a real-life one), containing job-vacancy ads collected from European job portals. View Full-Text
Keywords: retrieval from NoSql Databases; JSON documents; blind querying; single document extraction retrieval from NoSql Databases; JSON documents; blind querying; single document extraction
Show Figures

Figure 1

MDPI and ACS Style

Marrara, S.; Pelucchi, M.; Psaila, G. Blind Queries Applied to JSON Document Stores. Information 2019, 10, 291.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop