Article Preview
Top1. Introduction
Our present modern lives are blessed with the Internet. Among several others, most importantly, Internet helps us to acquire / enhance our knowledge. Due to ever increasing volume of text documents, it is very hard to read every single line of every single document. For this reason, text summarization plays a crucial role towards knowledge acquisition from available text documents (Pokojski et al., 2018).
Text summarization incorporates the uses of keywords. Keywords provide a compact representation about contents of a document. Keyword extraction is considered as primary task towards the automatic summarization of documents. Several text mining applications e.g., ‘just-in-time (JIT)’ based information retrieval, automatic classification, summarization, and filtering etc. were presented which uses keywords (Zhang, 2008; and Reddivari et al., 2018). Manual keyword extraction from any text document is time consuming, costly and tedious task. Furthermore, the ever-increasing number of the online documents makes the situation more critical towards manual processing. For this reason, automated text summarization and keyword extraction have attracted the attention of investigators over the past years (Beliga et al., 2015).
Automatic text summarization is a text mining mission that facilitates quick grasp of the overall perception for a text document (Thakkar et al., 2010; and Bharti et al., 2017). Text summarization may be achieved in the form of an abstractive summary or, as an extractive summary. Abstractive Summaries are often achieved after learning the internal representation of the article and the quality of summary is similar to the quality as produced by human being (https://rare-technologies.com/text-summarization-in-python-extractive-vs-abstractive-techniques-revisited/). On the other hand, extractive summary extracts detail from the input article and presents the result to the user (Bharti et al., 2017). In our studies, we found that extractive summarization (based on keywords extraction) is mostly popular. For this reason, in our present research, we have focused towards the keywords-based extractive summarization. In our study, we find that graph-based approach is popular towards text summarization (Thakkar et al., 2010). Hence, a brief description on graph as presented in (Ruohonen, 2013), is presented next.
Graph is a pair, containing set of vertices , set of edges , and a relation associated with each edge. Mathematically graph (G) as found in (Ruohonen, 2013) is presented once again in following Equation 1 (Ruohonen, 2013):
(1) where,
denotes graph,
denotes set of vertices and
denotes edges formed by pair of vertices i.e, an arc or, edge between vertex
and vertex
is described as
.