Learning Pattern Relation-Based Hyperbolic Embedding for Adverse Drug Reaction Extraction

Learning Pattern Relation-Based Hyperbolic Embedding for Adverse Drug Reaction Extraction

Siriwon Taewijit, Thanaruk Theeramunkong
Copyright: © 2021 |Pages: 19
DOI: 10.4018/IJKSS.2021040105
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Hyperbolic embedding has been recently developed to allow us to embed words in a Cartesian product of hyperbolic spaces, and its efficiency has been proved in several works of literature since the hierarchical structure is the natural form of texts. Such a hierarchical structure exhibits not only the syntactic structure but also semantic representation. This paper presents an approach to learn meaningful patterns by hyperbolic embedding and then extract adverse drug reactions from electronic medical records. In the experiments, the public source of data from MIMIC-III (Medical Information Mart for Intensive Care III) with over 58,000 observed hospital admissions of the brief hospital course section is used, and the result shows that the approach can construct a set of efficient word embeddings and also retrieve texts of the same relation type with the input. With the Poincaré embeddings model and its vector sum (PC-S), the authors obtain up to 82.3% in the precision at ten, 85.7% in the mean average precision, and 93.6% in the normalized discounted cumulative gain.
Article Preview
Top

Introduction

Emerging technology supports the successful transition of paper-based medical records to electronic form. These electronic medical records (EMRs) benefit from fast data retrieval, time reduction during patient visiting, data sharing among medical departments, and high data security and privacy due to limitable user access. There are many research works in recent year utilize EMRs for knowledge extraction (Menaouer et al., 2020). The main advantage of such a data source is that EMRs repository contains tacit knowledge and explicit knowledge. The know-how and professional’s experience are usually narrated during medical treatment, including laboratory results and the diagnosis procedure. Among research works on EMRs analysis, the automated adverse drug reaction (ADR) extraction is a highlight. The ADR terminology is an unpleasant event (e.g., symptom, disease, and finding) associsated with a medication given at recommended dosages (Lortie, 1986).

In the earlier research to extract ADR from unstructured texts, the statistical co-occurrence analysis has been widely deployed (Wang et al., 2009; White et al., 2016; Nikfarjam et al., 2019) due to less complexity, straightforward, and highly significant results. However, the major drawback of the co-occurrence approach is disregard relation context. Drug and event entities are expected to appear together over a chance frequently regardless of the considering on clinical relation meaning between two entities; for example, a drug treats a medical event, or a drug may cause an adverse event. The pattern-based method is proposed to overcome the co-occurrence limitation. Xu & Wang (2014a, 2014b) proposes a pattern-ranking method. Similarly, Taewijit et al. (2017) incorporates the distant supervision approach with pattern-based for ADR identification from EMRs. Bollegala (2018) deploys lexical patterns from social media.

Regarding the success of many medical applications using deep learning, Zhang et al. (2018) construct word sequence dependency and relation sequence dependency from the dependency graph of a given medical sentence, then learn medical relation using a hybrid model of recurrent neural networks (RNNs) and convolutional neural networks (CNNs). Similar to the work of Gupta et al. (2018) and Cocos et al. (2019), they propose RNN to discover ADR relations. Although embedding methods have been applied to texts such as Word2vec (Mikolov et al., 2013), Glove (Pennington et al., 2014), Fast Text (Bojanowski et al., 2017), and graphs such as Node2vec (Grover et al., 2016), DeepWalk (Perozziea al., 2014), LINE (Tang et al., 2015), Metapath2vec (Dong et al., 2017), since the modeling of intricate patterns using embedding methods require a large dimensionality, it is fundamentally hard to compute the embeddings of large graph-structure such as social network, knowledge graphs or taxonomies without loss of information (Nickel and Kiela, 2017).

Unlike the above deep learning methods, they try to learn distributional semantic vectors on labeled instances that require massive manual data annotation to achieve good performance. In this work, we extend our previous work (Taewijit & Theeramunkong, 2016). We deploy distant supervision settings and integrate the pattern-based method with pattern expansion by learning hierarchical representations to examine entity pairs relation through the relation triple <entity1>, <phrase>, <entity2> embedding for ADR extraction from EMRs. All triples in a corpus are learned the semantic on hyperbolic space. Moreover, we add the evaluation results of our pattern-based relation by two domain experts. For the remainder of this paper, we organize it into three sections; the background of our study, the material and method, and the experimental results.

Complete Article List

Search this Journal:
Reset
Volume 15: 1 Issue (2024)
Volume 14: 1 Issue (2023)
Volume 13: 4 Issues (2022): 2 Released, 2 Forthcoming
Volume 12: 4 Issues (2021)
Volume 11: 4 Issues (2020)
Volume 10: 4 Issues (2019)
Volume 9: 4 Issues (2018)
Volume 8: 4 Issues (2017)
Volume 7: 4 Issues (2016)
Volume 6: 4 Issues (2015)
Volume 5: 4 Issues (2014)
Volume 4: 4 Issues (2013)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing