An Introduction to Reinforcement Learning and Its Application in Various Domains

DOI: 10.4018/979-8-3693-1738-9.ch001

Abstract

Reinforcement learning (RL) is a dynamic and evolving subfield of machine learning that focuses on training intelligent agents to learn and adapt through interactions with their environment. This introductory chapter provides an overview of the fundamental concepts and principles of RL, elucidating its core components: the agent, the environment, actions, and rewards. The aim is to give readers an in-depth introduction to RL and to illustrate its uses across a range of domains. RL's ability to let agents learn purely through interaction with an environment has attracted enormous interest. The chapter first covers the core ideas of RL and its essential elements, then turns to applications in industries including robotics, gaming, finance, healthcare, and more. Readers will gain a clearer grasp of the fundamental ideas of RL and an appreciation of how transformative it can be when applied to challenging decision-making problems. These applications demonstrate the versatility and significance of RL in shaping the future of technology and automation.
Chapter Preview

Introduction/Preliminaries

RL is a specialized field within machine learning that is primarily oriented towards solving control problems. It combines the benefits of dynamic programming with a trial-and-error approach. RL adopts an agent-based control paradigm in which the agent learns by interacting with the controlled environment. RL draws inspiration from the natural learning process that occurs through interaction with the environment, mirroring how biological systems learn (Amin et al., 2023; Sutton & Barto, 2018). Like other forms of learning, it revolves around establishing connections between states and actions so as to maximize specific rewards. The primary challenge, however, is that unlike conventional machine learning approaches, the learner must autonomously discover which actions are optimal in specific situations. Consequently, a learning agent needs to comprehend the environment, select actions that maximize rewards, and adapt its behavior accordingly, even in the face of environmental uncertainty. RL systems are well suited to unsupervised, real-time implementation, as they build their understanding of the environment through exploration. Figure 1 illustrates a general RL framework. RL is formalized as a Markov decision process (MDP), a discrete-time stochastic control process (Amin et al., 2023). The MDP provides a structured mathematical foundation for modeling decision making within this framework.
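The interaction loop described above can be made concrete with a short sketch. The toy corridor environment, its state encoding, and its reward below are illustrative assumptions introduced for this example; they are not drawn from the chapter.

import random

# Minimal sketch of the agent-environment loop (illustrative assumption,
# not the chapter's own example): a toy corridor of 5 cells in which the
# agent is rewarded for reaching the rightmost cell.
class SimpleCorridorEnv:
    def __init__(self, size=5):
        self.size = size
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state            # S(t): current cell index

    def step(self, action):
        # action: 0 = move left, 1 = move right
        move = 1 if action == 1 else -1
        self.state = max(0, min(self.size - 1, self.state + move))
        reward = 1.0 if self.state == self.size - 1 else 0.0   # R(t+1)
        done = self.state == self.size - 1
        return self.state, reward, done                        # S(t+1), R(t+1), terminal flag

env = SimpleCorridorEnv()
state, done, total_reward = env.reset(), False, 0.0
while not done:
    action = random.choice([0, 1])            # A(t): here a purely random policy
    state, reward, done = env.step(action)    # environment transition
    total_reward += reward
print(total_reward)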

Figure 1.

Representation of RL structure

  • Agent

The learner or decision-maker that interacts with the environment. The agent observes the current state, selects actions, and receives feedback from the environment.

  • Environment

The environment's behavior can be described by the state it assumes at a given time, denoted as S(t), which is characterized by a set of attributes or values. Each state is associated with a reward or immediate cost, represented as R(t), that is generated upon entering that state. At each time step, the agent has a choice of taking one of several possible actions, denoted as A(t), which influence the subsequent state of the system, S(t + 1), and consequently the rewards or costs experienced, with probabilities governing these transitions. The agent's decision-making process, considering the current state, is shaped by its past experiences. In this manner, an RL system utilizes its history of actions in specific states and the corresponding rewards to update its strategy for future actions. Over time, the agent evolves a policy for selecting actions based on the state of the system during its interactions with the environment (Sutton & Barto, 2018).

  • State

A representation of the environment at a specific time. States can be as simple as raw sensory data or abstract representations, depending on the problem. In RL, the agent typically makes decisions based on the current state.

  • Action

The set of possible choices or decisions the agent can make at each state. Actions can be discrete or continuous, depending on the problem. The agent selects actions to transition from one state to another.

  • Reward

A scalar value that provides feedback to the agent after each action. The reward signal quantifies the immediate desirability or quality of an action taken in a particular state. The agent's objective is to maximize the cumulative reward over time.

  • Policy (π)

The agent's strategy for choosing actions: a mapping (deterministic or stochastic) from states to actions that the agent follows and gradually improves as it learns from rewards. A minimal sketch showing how these components fit together appears after this list.
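To show how state, action, reward, and policy fit together, here is a minimal tabular Q-learning sketch on the same kind of toy corridor used earlier. The environment dynamics, hyperparameter values, and variable names are illustrative assumptions and do not come from the chapter.

import random
from collections import defaultdict

# Illustrative sketch only: tabular Q-learning on a 5-cell corridor.
N_STATES = 5
ACTIONS = [0, 1]                       # 0 = move left, 1 = move right
alpha, gamma, epsilon = 0.1, 0.9, 0.1  # learning rate, discount factor, exploration rate
Q = defaultdict(float)                 # Q[(state, action)] -> estimated return

def step(state, action):
    # Deterministic toy dynamics: reward 1 on reaching the rightmost cell.
    next_state = max(0, min(N_STATES - 1, state + (1 if action == 1 else -1)))
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward, next_state == N_STATES - 1

for episode in range(200):
    state, done = 0, False
    while not done:
        # Epsilon-greedy policy: mostly exploit current Q estimates, sometimes explore.
        if random.random() < epsilon:
            action = random.choice(ACTIONS)
        else:
            # Greedy action, with ties broken randomly.
            action = max(ACTIONS, key=lambda a: (Q[(state, a)], random.random()))
        next_state, reward, done = step(state, action)
        # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
        best_next = max(Q[(next_state, a)] for a in ACTIONS)
        Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
        state = next_state

# The learned greedy policy pi(s) = argmax_a Q(s, a) should choose "right" (1) in every state.
print([max(ACTIONS, key=lambda a: Q[(s, a)]) for s in range(N_STATES)])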
