Recommender Systems in Digital Libraries Using Artificial Intelligence and Machine Learning: A Proposal to Create Automated Links Between Different Articles Dealing With Similar Topics

Recommender Systems in Digital Libraries Using Artificial Intelligence and Machine Learning: A Proposal to Create Automated Links Between Different Articles Dealing With Similar Topics

Namik Delilovic
DOI: 10.4018/978-1-7998-7156-9.ch012
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Searching for contents in present digital libraries is still very primitive; most websites provide a search field where users can enter information such as book title, author name, or terms they expect to be found in the book. Some platforms provide advanced search options, which allow the users to narrow the search results by specific parameters such as year, author name, publisher, and similar. Currently, when users find a book which might be of interest to them, this search process ends; only a full-text search or references at the end of the book may provide some additional pointers. In this chapter, the author is going to give an example of how a user could permanently get recommendations for additional contents even while reading the article, using present machine learning and artificial intelligence techniques.
Chapter Preview
Top

Introduction

Natural Language Processing (NLP) is not a term invented recently. As noted by (Liddy, 2001), research already started in the late 40s in the form of machine translation, which was used to encipher enemy messages. Early machine translation used primitive dictionary look-ups and some word reordering techniques to fit the grammar of the target language. However, after Chomsky's publication Syntactic Structures, which made the fields of linguistic and machine translations closer to each other, other application emerged, such as speech recognition. Today when we speak about NLP, we usually refer to it as a discipline of artificial intelligence, and indeed the final goal of NLP is Natural Language Understanding (NLU). The objectives of NLU, as stated by (Liddy, 2001), are:

  • 1.

    Paraphrase an input text

  • 2.

    Translate the text into another language

  • 3.

    Answer questions about the contents of the text

  • 4.

    Draw inferences from the text

While the increase of memory and processing power led to the rapid development of machine learning (ML) supported Natural Language Processing (NLP) techniques, the ever-increasing amount of data found on the web and technologies which provide faster internet speed (Jelena, 2020) are crucial for the advance of such techniques. This is also the reason the Austria-Forum platform (Austria-Forum) which holds over 1.2 million objects in the form of web-books (digital books), documents, images and other multimedia objects is essential for the author’s experiments and research. After analysing the usage behaviour of the Austria-Forum users, the author realised that most of the readers (87.34%) come directly from the Google search engine. The different channels and their values are shown in the pie chart of Figure 1. Combining this information with the average number of users from the last 30 days (right plot in Figure 1) the author concludes that out of 7000 daily users, 6000 come from the Google search engine. Even though the platform providers are happy for any channel which leads the user to the content they offer, the fact that the user is utilising the limited Google search option is not satisfying. In section CURRENT STATE the author is going to examine the current state of the Austria-Forum platform and how the users find new contents using essential tools such as navigation, search and browsing, in section FIRST ATTEMPTS the author presents the first version of the linker tool which automatically suggests links for words found in articles. This tool is simple in its implementation and therefore, in section FUTURE IMPLEMENTATIONS the author analyses some current machine learning techniques and proposes a new implementation for the linker tool and finally, in section EVALUATION the author examines the evaluation possibilities for the results which the new linker implementation will provide.

Figure 1.

Austria-Forum top channels (left pie chart), number of users per day (right plot) and the average number of users from the last month (right, dashed line)

978-1-7998-7156-9.ch012.f01
Top

Current State

The book Digitale Bibliotheken (Endres & Fellner, 2000), presents three different information search and retrieval techniques: navigation, search and browsing. There is also a potential fourth option called Software-Agents and which is related to Artificial Intelligence and Machine Learning. The current state of Austria-Forum also follows this categorisation and provides the first three options which the author is going to describe in the following subsections. The fourth option (Software-Agents) is especially interesting in combination with the Browsing option (see subsection Browsing), which the author partially implemented in Austria-Forum (see section FIRST ATTEMPTS).

Complete Chapter List

Search this Book:
Reset