Norm Augmented Reinforcement Learning Agents With Synthesized Normative Rules: A Proposed Normative Agent Framework

Norm Augmented Reinforcement Learning Agents With Synthesized Normative Rules: A Proposed Normative Agent Framework

Mohd Rashdan Abdul Kadir, Ali Selamat, Ondrej Krejcar
Copyright: © 2024 |Pages: 34
DOI: 10.4018/JCIT.345650
Article PDF Download
Open access articles are freely available for download

Abstract

The dynamic deontic (DD) is a norm synthesis framework that extracts normative rules from reinforcement learning (RL), however it was not designed to be applied in agent coordination. This study proposes a norm augmented reinforcement learning framework (NARLF) that extends said model to include a norm deliberation mechanism for learned norms re-imputation for norm biased decision-making RL agents. This study aims to test the effects of synthesized norms applied on-line and off-line on agent learning performance. The framework consists of the DD framework extended with a pre-processing and deliberation component to allow re-imputation of normative rules. A deliberation model, the Norm Augmented Q-Table (NAugQT), is proposed to map normative rules into RL agents via q-values weight updates. Results show that the framework is able to map and improve RL agent's performance but only when synthesized off-line edited absolute norm salience value norms are used. This shows limitations when unstable salience norms are applied. Improvement in norm extraction and pre-processing are required.
Article Preview
Top

Background

Norms are defined as either rules of expected behavior in a society or as behavior that is common in a society (Morris-Martin et al., 2019). Norms provide a means to regulate agent behavior, which requires some consideration of how norms impact agent reasoning and behavior (Finnemore & Sikkink, 1998). In open, dynamic societies, agents must work with others who do not necessarily have the same set of objectives. If left unchecked, self-interested agents will try to accomplish their individual goals without regard for others. The concept of the normative life cycle (NLC) (Finnemore & Sikkink, 1998; Frantz & Pigozzi, 2018; Hollander & Wu, 2011; Mahmoud et al., 2014; Savarimuthu & Cranefield, 2011) has been used to explain the mechanics at a conceptual level, involved or should involve in designing norm aware agents. The concept of norm synthesis—the generation of normative rules—is derived from most of the NLC processes (Frantz & Pigozzi, 2018). Specifically in regard to the aspect of generating normative rules at run time, the formalization of the NLC into a workable and practical framework using machine-learning techniques is best described by Morales et al. (2015b) as the IRON framework. Their automated normative rule-generation method requires the designer to explicitly specify the conflict state for the framework to synthesize normative rules to avoid said conflict state via case-based reasoning. An alternative approach that reduces designer-defined conflict states proves beneficial in designing adaptable NorMAS, and one of those approaches is using RL.

Norm Synthesis

Norm synthesis is designing the emergence of applicable normative rules or policies (Shoham & Tennenholtz, 1992). Norm synthesis can be described as the process of generating rules that are not hard-coded recipes presenting reactive behaviors, such as those in static expert systems, but rather describe consequences arising from observations for reasoning about the current context, resulting in situation-specific norms (Lee et al., 2014). Alternatively, in an abstract state transition system, norm synthesis generates rules for avoiding undesirable conflict states, leading to conflict-free states (Christelis & Rovatsos, 2009) while also aiming at developing a stable set of norms (Morales et al., 2015a). Adopting the viewpoint of Frantz & Pigozzi (2018), we interpret the concept of norm synthesis as an important norm identification/detection approach, which is further supported by the viewpoint that norm emergence can be used as a source for norm synthesis in NorMAS (Morris-Martin et al., 2019). We believe that the current understanding of norm synthesis constitutes the cyclic process of norm emergence (or convention emergence) and norm detection and representation, as shown in Fig. 1. The goal of norm synthesis, then, lies in prescribing and converting implicit norms (social) into explicit norms (legal) represented in a structured format, i.e., rule-based or logical representation (Morris-Martin et al., 2019), making the transfer and interpretability of learned norms possible.

Figure 1.

Current Understanding of Norm-Synthesis Framework

JCIT.345650.f01

Complete Article List

Search this Journal:
Reset
Volume 26: 1 Issue (2024)
Volume 25: 1 Issue (2023)
Volume 24: 5 Issues (2022)
Volume 23: 4 Issues (2021)
Volume 22: 4 Issues (2020)
Volume 21: 4 Issues (2019)
Volume 20: 4 Issues (2018)
Volume 19: 4 Issues (2017)
Volume 18: 4 Issues (2016)
Volume 17: 4 Issues (2015)
Volume 16: 4 Issues (2014)
Volume 15: 4 Issues (2013)
Volume 14: 4 Issues (2012)
Volume 13: 4 Issues (2011)
Volume 12: 4 Issues (2010)
Volume 11: 4 Issues (2009)
Volume 10: 4 Issues (2008)
Volume 9: 4 Issues (2007)
Volume 8: 4 Issues (2006)
Volume 7: 4 Issues (2005)
Volume 6: 1 Issue (2004)
Volume 5: 1 Issue (2003)
Volume 4: 1 Issue (2002)
Volume 3: 1 Issue (2001)
Volume 2: 1 Issue (2000)
Volume 1: 1 Issue (1999)
View Complete Journal Contents Listing