Hybrid Method for Semantic Similarity Computation Using Weighted Components in Ontology

Hybrid Method for Semantic Similarity Computation Using Weighted Components in Ontology

Kanishka N. Kamble, Suresh K. Shirgave
Copyright: © 2022 |Pages: 12
DOI: 10.4018/IJSI.309734
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

In this paper, the researchers propose an approach to measure the semantic similarity between two concepts in an ontology like WordNet and DBpedia. Some earlier semantic similarity approaches proposed concentrated on the ontology structure between concepts and some concentrated only on the information content of concepts. This paper proposes a semantic similarity approach with path length, information content, and semantic depth (i.e., PLICD) to combine both path length as well as information content-based approaches. This proposed approach uses weighted shortest path length and information content calculated using semantic depth and hyponyms of the concepts to measure semantic similarity between two concepts. Through experimentations performed on WordNet and DBpedia, the researchers note that the PLICD semantic similarity approach has delivered a statistically meaningful enhancement as compared to the other semantic similarity approaches concerning accuracy and F score.
Article Preview
Top

The semantic similarity approaches are developed to measure the extent to which two concepts are similar using the structural information collected from concept taxonomy or information content. The input to the semantic similarity approach is a concept pair and output is a numerical value showing semantic similarity between the concepts. There are many applications using the semantic similarity value to rank the similarity between different concept pairs.

The knowledge-based approaches (Rada et al., 1989),(Leacock & Chodorow, 1998),(Wu & Palmer, 1994),(Li et al., 2003), compute the semantic similarity between the two ontology concepts using semantic information contained in an ontology. The knowledge-based approaches can be further categorized based on how the semantic similarity between concepts is assessed, as edge-counting approaches, information content-based approaches, feature-based approaches, and hybrid knowledge-based approaches.

The edge counting approaches count the edges in the path connecting two concepts in ontology to compute the similarity between them. If the distance between the two concepts is large then the semantic similarity between them is less. In contrast, if the distance between the two concepts is small then the semantic similarity between them is more.

Rada et al. (1989) stated path-based approach that calculates the semantic similarity between two concepts Conc1 and Conc2 by using the shortest path length represented as SPL (Conc1, Conc2) as (Rada et al., 1989)

IJSI.309734.m01
(1)

Leacock, Claudia, and Martin Chodorow (1998) stated lch approach that computes the semantic similarity between two concepts Conc1and Conc2 using a non-linear function illustrated as (Leacock & Chodorow, 1998)

IJSI.309734.m02
(2) where SPL (Conc1, Conc2) is the shortest path length and MAX_DEPTH is the maximum depth of the ontology.

The disadvantage of edge-counting approaches is that all concept pairs having same path length gives the same semantic similarity value.

Figure 1.

WordNet “is-a” hierarchical taxonomy fragment

IJSI.309734.f01

Complete Article List

Search this Journal:
Reset
Volume 12: 1 Issue (2024)
Volume 11: 1 Issue (2023)
Volume 10: 4 Issues (2022): 2 Released, 2 Forthcoming
Volume 9: 4 Issues (2021)
Volume 8: 4 Issues (2020)
Volume 7: 4 Issues (2019)
Volume 6: 4 Issues (2018)
Volume 5: 4 Issues (2017)
Volume 4: 4 Issues (2016)
Volume 3: 4 Issues (2015)
Volume 2: 4 Issues (2014)
Volume 1: 4 Issues (2013)
View Complete Journal Contents Listing