Search the World's Largest Database of Information Science & Technology Terms & Definitions
InfInfoScipedia LogoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is Q-learning

Handbook of Research on Novel Soft Computing Intelligent Algorithms: Theory and Practical Applications
Is a reinforcement learning technique that works by learning an action-value function.
Published in Chapter:
Distributed Learning Algorithm Applications to the Scheduling of Wireless Sensor Networks
Fatemeh Daneshfar (University of Kurdistan, Iran) and Vafa Maihami (University of Kurdistan, Iran)
DOI: 10.4018/978-1-4666-4450-2.ch028
Abstract
Wireless Sensor Network (WSN) is a network of devices denoted as nodes that can sense the environment and communicate gathered data, through wireless medium to a sink node. It is a wireless network with low power consumption, small size, and reasonable price which has a variety of applications in monitoring and tracking. However, WSN is characterized by constrained energy because its nodes are battery-powered and energy recharging is difficult in most of applications. Also the reduction of energy consumption often introduces additional latency of data delivery. To address this, many scheduling approaches have been proposed. In this paper, the authors discuss the applicability of Reinforcement Learning (RL) towards multiple access design in order to reduce energy consumption and to achieve low latency in WSNs. In this learning strategy, an agent would become knowledgeable in making actions through interacting with the environment. As a result of rewards in response to the actions, the agent asymptotically reaches the optimal policy. This policy maximizes the long-term expected return value of the agent.
Full Text Chapter Download: US $37.50 Add to Cart
More Results
Reinforcement Learning for Business Modeling
in this reinforcement learning algorithm the agent evaluates each action (in the possible states of the world) taking into account its possible repercussions in the future behavior of the system. Q-learning is a temporal-differences off-policy algorithm . It is an algorithm for on-line control problems in which values of the states are updated after having observed the outcomes of the actions chosen in a given sequence.
Full Text Chapter Download: US $37.50 Add to Cart
Applications of Reinforcement Learning and Bayesian Networks Algorithms to the Load-Frequency Control Problem
Is a model-free reinforcement learning technique that works by learning an action-value function that gives the expected utility of taking a given action in a given state and following a fixed policy. Q-learning uses temporal differences to estimate the value of In Q-learning, the agent maintains a table of where is the set of states and is the set of actions. represents its current estimate of The learned action-value function, directly approximates the optimal action-value function, independent of the policy being followed. This dramatically simplifies the analysis of the algorithm and enabled early convergence proofs.
Full Text Chapter Download: US $37.50 Add to Cart
Machine Learning in Personalized Anemia Treatment
Reinforcement Learning technique that works by learning an action-value function that gives the expected utility of taking a given action in a given state and following a fixed policy thereafter.
Full Text Chapter Download: US $37.50 Add to Cart
Protect Nature and Reduce the Carbon Footprint With an Application of Blockchain for IIoT
Q-learning is a model-free reinforcement learning algorithm to learn the value of an action in a particular state.
Full Text Chapter Download: US $37.50 Add to Cart
eContent Pro Discount Banner
InfoSci OnDemandECP Editorial ServicesAGOSR