Developing a Graphical User Interface for an Artificial Intelligence-Based Voice Assistant

Developing a Graphical User Interface for an Artificial Intelligence-Based Voice Assistant

Subhash S., Siddesh S., Prajwal N. Srivatsa, Ullas A., Santhosh B.
Copyright: © 2021 |Pages: 19
DOI: 10.4018/IJOCI.2021070104
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Artificial intelligence machineries have been extensively active in human life in recent times. Self-governing devices are enhancing their way of interacting with both human and devices. Contemporary vision in this topic can pave the way for a new process of human-machine interaction in which users will get to know how people can understand human language, adapting and communicating through it. One such tool is voice assistant, which can be incorporated into many other brilliant devices. In this article, the voice assistant will receive the audio from the microphone and then convert that into text, later with the help of ‘pyttsx3', and then the text response will be converted into an audio file; then the audio file will be played. The audio is processed using the voice user interface (VUI). This article develops a functional intelligent personal assistant (IPA) and integrates it with a graphical user interface that can perform mental tasks such as ON/OFF of smart applications based on the user commands.
Article Preview
Top

Introduction

An Intelligent Personal Assistant (IPA) is a computer program with Artificial Intelligence (Chowdury, 2018) (AI) created to help the users in their task using Natural language processing. It has access to the large amount of information on a device or on an online platform, the helps voice assistants to perform the tasks easily. They communicate easily with users even while answering their queries and perform actions to satisfy the requests. Recently developed Voice Assistants performs wide variety of tasks from setting alarms to more complex things like taking notes or making phone calls. (Chowdury, 2018). There are three stages in which the voice assistant work: text to speech, text to intention, and intention to action. The basic six components of any voice assistant is Speech Recognition Engine, language apperception, Dialog Manager, Natural Language generator and knowledge based.

The Voice assistants provide wide variety of services such as giving information and facts from Google, Play music from streaming services such as Spotify, Wynk Music, videos and TV shows or movies on television. User can ask questions to control home automation etc. (Veena, 2018). According to Ni (2020) Intent Detection and Slot Filling joint task, as one of the core sub-tasks in NLU, has been widely used in different human–computer interaction scenarios (Ni, 2020). The Automatic Speech Recognition used by the authors is Pyttsx3, it is the text to speech conversion library in python. It also includes the TTS engines such as Sapi5, NSSS and espeak. Windows supports only Sapi5 as its driver, NSSS for Mac OS and espeak support other operating systems. Voice Assistant need not only communicate through voice but the authors have taken a step ahead by introducing Graphical User Interface (GUI). (Figure 1) The authors have integrated the GUI application using Tkinter which is a Python Toolkit, where user can interact with the AI-based voice assistant using visual components.

The initial voice assistants could do a very few tasks, but with the better NLP algorithms and more data available they can even fix appointments with the doctor or shop nearby. They can also send messages to the people as both SMS using other applications; also, more apps will be supporting integration with voice assistants in the OS without opening apps – it can make a tweet, read a recent message or send it to someone. Helping the people with disabilities either guiding them or speaking on their behalf, it becomes easier for them. The main aim of this proposal is to develop a functional IPA which performs mental tasks such as turning ON/OFF smart phone applications with the help of voice user interface (VUI) which is used to listen and process audio commands and integrating it with the Graphical User Interface.

Figure 1.

Artificial Intelligence-based Voice Assistant Integrated with Graphical User Interface

IJOCI.2021070104.f01
Top

Literature Survey

Vashistha et al., (2019) used a technology called Zero UI that uses movement, statement and also introspections which make a system react to users according to the situations. Instead of depending on typing, clicking and knocking, the users have to enter the data through voice. Interactions will be changed from landlines and personal computers to the electronic gadgets which users can interact with. All these can happen with the robotics and IoT fields.

Burbach et al., (2019) in this study the author explains about the approval of voice assistant, Performance of Natural Language Processing (NLP) and Price etc., Privacy is the important factor for the acknowledgement of voice assistants. Many techniques were used to develop a voice assistant for their explored factors for the likely users, but according to this study, the author developed voice assistant for particular user groups. The acceptance of voice assistant and decision to utilize the voice assistant, relies upon the various elements of privacy which are most important other than four different groups of virtual assistants with different inclinations could be identified.

Complete Article List

Search this Journal:
Reset
Volume 14: 1 Issue (2024): Forthcoming, Available for Pre-Order
Volume 13: 1 Issue (2023)
Volume 12: 4 Issues (2022)
Volume 11: 4 Issues (2021)
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 4 Issues (2016)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing