RDF(S) Store in Object-Relational Databases

RDF(S) Store in Object-Relational Databases

Zongmin Ma, Daiyi Li, Jiawen Lu, Ruizhe Ma, Li Yan
Copyright: © 2024 |Pages: 32
DOI: 10.4018/JDM.334710
Article PDF Download
Open access articles are freely available for download

Abstract

The Resource Description Framework (RDF) and RDF Schema (RDFS) recommended by World Wide Web Consortium (W3C) provide a flexible model for semantically representing data on the web. With the widespread acceptance of RDF(S) (RDF and RDFS for short), a large number of RDF(S) is available. Databases play an important role in managing RDF(S). However, there are few studies on using object-relational databases to store RDF(S). In this paper, the authors propose the formal definitions of RDF(S) model and object-relational databases model. Then they introduce the approach for storing RDF(S) in object-relational databases based on the formal definitions. They implement a prototype system to demonstrate the feasibility of the approach and test the performance and semantic retention ability of this prototype system with the benchmark dataset.
Article Preview
Top

1. Introduction

The Semantic Web has been proposed by Tim Berners-Lee to provide a common framework for information sharing across multiple domains (Crasso et al., 2012). With the Semantic Web, data are provided with data semantic meaning (through metadata), and concepts and entities in the real world can be represented in a machine-readable and structured form. The Resource Description Framework (RDF) proposed by the World Wide Web Consortium (W3C) is a model of representing metadata of resources on the Web. RDF Schema (RDF(S)) as well as Web Ontology Languages (OWL) are the description of vocabulary semantics used in RDF datasets. RDF and RDF Schema (collectively known as RDF(S)) are the core of the Semantic Web. Nowadays, RDF(S) have been increasingly applied in a wide range of Web-based application scenarios, such as semantic data integration (Arsic et al., 2019), semantic search (Xiong, Power and Callan, 2017; Zheng et al., 2019), semantic analysis of Big Data (Smiatacz, 2018; Shen, Hu and Tzeng, 2017), decision making (Rubio-Largo et al., 2017; Zhou et al., 2017) and so on. Currently, RDF(S) has become the de-facto standard of representing and handling data semantics. In particular, knowledge graphs (KGs) mostly adopt RDF mode to represent massive instances, and now are widely investigated and applied in diverse domains for the semantic and intelligent processing of massive data (Song et al., 2019).

With the rapid increase in the number of RDF(S) on the Web, it has become increasingly important to efficiently store massive amounts of RDF(S). The storage of RDF(S) (Ma, Capretz and Yan, 2016) often supports efficient queries of RDF data, mainly because the storage structure of RDF(S) not only directly determines the integrity of storage semantics, but also greatly affects its query efficiency (Ma et al., 2016; Ma, et al, 2018). At present, there have been many studies on RDF(S) storage methods, which can be roughly divided into the following three categories:

  • 1)

    Memory-based storage (e.g., Sesame (Broekstra, et al., 2002) and BitMat (Atre, et al., 2008)). With this category of methods, memory space is directly allocated for RDF data and indexing technology is generally utilized for quick data process. Note that these methods are limited by the size of computer memory and are only suitable for storing a small number of RDF datasets;

  • 2)

    Disk-based storage (e.g., YARS2 (Harth, et al., 2007) and System II (Wu, et al., 2009)). With this category of methods, the storage location is transferred from memory to hard disk. These methods meet the storage requirements of large-scale RDF datasets in space, but frequent reads and writes to disks greatly reduce storage performance;

  • 3)

    Database-based storage (e.g., Jena-TDB (Wilkinson, et al., 2003), 4Store (Harris, et al., 2009), Virtuoso (Erling and Mikhailov, 2007), BigOWLIM/OWLIM-SE (Bishop et al., 2011), SPARQLcity/SPARQLverse1, MarkLogic2, and Clark and Parsia3). This category of methods uses database technology to store RDF data. In addition to some commercial systems, there are some developed prototypes such as RDF-3X (Neumann and Weikum, 2010), SW-Store (Abadi, et al., 2009) and RDFox4.

Complete Article List

Search this Journal:
Reset
Volume 35: 1 Issue (2024)
Volume 34: 3 Issues (2023)
Volume 33: 5 Issues (2022): 4 Released, 1 Forthcoming
Volume 32: 4 Issues (2021)
Volume 31: 4 Issues (2020)
Volume 30: 4 Issues (2019)
Volume 29: 4 Issues (2018)
Volume 28: 4 Issues (2017)
Volume 27: 4 Issues (2016)
Volume 26: 4 Issues (2015)
Volume 25: 4 Issues (2014)
Volume 24: 4 Issues (2013)
Volume 23: 4 Issues (2012)
Volume 22: 4 Issues (2011)
Volume 21: 4 Issues (2010)
Volume 20: 4 Issues (2009)
Volume 19: 4 Issues (2008)
Volume 18: 4 Issues (2007)
Volume 17: 4 Issues (2006)
Volume 16: 4 Issues (2005)
Volume 15: 4 Issues (2004)
Volume 14: 4 Issues (2003)
Volume 13: 4 Issues (2002)
Volume 12: 4 Issues (2001)
Volume 11: 4 Issues (2000)
Volume 10: 4 Issues (1999)
Volume 9: 4 Issues (1998)
Volume 8: 4 Issues (1997)
Volume 7: 4 Issues (1996)
Volume 6: 4 Issues (1995)
Volume 5: 4 Issues (1994)
Volume 4: 4 Issues (1993)
Volume 3: 4 Issues (1992)
Volume 2: 4 Issues (1991)
Volume 1: 2 Issues (1990)
View Complete Journal Contents Listing