Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

Fahong Yu, Meijia Chen, Xiaoyun Xia, Dongping Zhu, Qiang Peng, Kuibiao Deng
DOI: 10.4018/IJITSA.342084

Abstract

The multi-depot vehicle routing problem with time windows (MDVRPTW) is a practically valuable problem in urban logistics. However, heuristic methods may fail to generate high-quality solutions for massive problem instances instantly. This article therefore presents a novel reinforcement learning algorithm that integrates a multi-head attention mechanism and a local search strategy to solve the problem efficiently. Route optimization was regarded as a vehicle tour generation process, and an encoder-decoder architecture was used to iteratively generate routes for vehicles departing from different depots. In the encoder, a multi-head attention strategy was employed to mine the complex spatiotemporal correlations within time windows. A multi-agent decoder was then designed to generate solutions by optimizing the reward and observing state transitions. Meanwhile, a local search strategy was employed to improve the quality of the solutions. The experimental results demonstrate that the proposed method significantly outperforms traditional methods in effectiveness and robustness.
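As a rough illustration of the encoder sketched in the abstract, the following minimal PyTorch example (all feature choices and dimensions are assumptions for illustration, not taken from the article) embeds customer coordinates, demand, and time-window bounds and applies multi-head self-attention:

```python
# Minimal sketch of a multi-head self-attention encoder over customer
# features (x, y, demand, tw_start, tw_end). All sizes are illustrative
# assumptions; the article's actual architecture may differ.
import torch
import torch.nn as nn

class TWEncoder(nn.Module):
    def __init__(self, feat_dim=5, embed_dim=128, n_heads=8, n_layers=3):
        super().__init__()
        self.embed = nn.Linear(feat_dim, embed_dim)  # project raw features
        layer = nn.TransformerEncoderLayer(
            d_model=embed_dim, nhead=n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)

    def forward(self, feats):  # feats: (batch, n_nodes, feat_dim)
        return self.encoder(self.embed(feats))  # (batch, n_nodes, embed_dim)

# Example: 2 instances, 10 customers each, 5 features per customer
h = TWEncoder()(torch.rand(2, 10, 5))
print(h.shape)  # torch.Size([2, 10, 128])
```

In practice the raw features would be normalized and depot nodes embedded separately, but the attention machinery itself is unchanged.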

Introduction

With the rapid development of the transportation industry, increasingly stringent requirements on vehicle routing have become an emerging issue in transportation services, challenging the intelligence of vehicle management. Vehicle routing problems (VRPs) have been a subject of extensive research worldwide since their inception. To generalize the problem to a wider range of use cases, VRPs have been extended to more complicated, real-life scenarios. Imposing a fixed fleet size and tighter customer time windows transforms the traditional VRP into the vehicle routing problem with time windows (VRPTW); when vehicles may depart from multiple depots, the problem becomes the multi-depot vehicle routing problem with time windows (MDVRPTW). Even the much simpler multi-depot vehicle routing problem (MDVRP) is itself NP-hard, implying that it is unrealistic to generate optimal solutions for large-scale instances unless P = NP (Braekers & Nieuwenhuyse, 2020).
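For concreteness, a standard textbook-style MDVRPTW formulation (not reproduced from this article; the notation is assumed) minimizes total travel cost over binary arc variables subject to time-window and capacity constraints:

```latex
% Standard MDVRPTW objective (textbook form; notation assumed, not taken
% from this article). x_{ijk} = 1 iff vehicle k traverses arc (i, j);
% c_{ij} is the travel cost; t_i is the service start time at customer i
% with window [e_i, l_i]; q_i is the demand; Q is the vehicle capacity;
% R_k is the set of customers served by vehicle k.
\min \sum_{k \in K} \sum_{i \in V} \sum_{j \in V} c_{ij}\, x_{ijk}
\quad \text{s.t.} \quad
e_i \le t_i \le l_i \;\; \forall i \in C,
\qquad
\sum_{i \in R_k} q_i \le Q \;\; \forall k \in K
```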

For such complicated problems, heuristic methods are conventionally considered the viable solution tools. However, over the past few decades the logistics industry has faced a new challenge: serving massive numbers of requests instantly. Although many researchers have proposed diverse heuristics for VRPs, providing reliable solutions for city-scale problems within an acceptable amount of time remains challenging. Artificial intelligence methods have gradually evolved to tackle vehicle routing problems. Deep reinforcement learning (DRL) has become increasingly prominent in solving complex sequential decision problems, such as dynamic route choice, automated vehicle control, and emergency evacuation. DRL is the fusion of reinforcement learning (RL) and deep learning (DL), which can effectively address extremely large state and action spaces. Subsequently, several studies attempted to solve VRPs using DRL, with the encoder-decoder architecture being a popular choice for neural network design. Among these studies, an improved pointer network that simplifies the recurrent neural network (RNN) based encoder was proposed, yielding more efficient solutions to VRPs (James et al., 2019). With respect to graph representations, an improved DRL model incorporating a graph embedding network was proposed to solve VRPs (Luis et al., 2019); in this model, routing was treated as a route decoding process in which actions were chosen according to the output of the graph embedding network. Moreover, a multi-agent attention model for the multiple-vehicle routing problem with time windows was proposed (Zhang et al., 2020), which achieved superior performance over several classical heuristic baselines with negligible computation time.
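The pointer-style decoding these studies share can be sketched as follows (PyTorch; the scaled dot-product scoring and masking rule follow common practice and are assumptions for illustration, not any one cited model's exact design):

```python
# Sketch of one attention-based decoding step (pointer-network style):
# score each unvisited node against a context query and sample the next
# customer to serve.
import math
import torch

def decode_step(query, node_emb, visited):
    # query:    (batch, embed_dim)    current decoder context
    # node_emb: (batch, n, embed_dim) encoder outputs
    # visited:  (batch, n) bool mask of already-served customers
    scores = torch.einsum('bd,bnd->bn', query, node_emb)
    scores = scores / math.sqrt(node_emb.size(-1))
    scores = scores.masked_fill(visited, float('-inf'))  # forbid revisits
    probs = torch.softmax(scores, dim=-1)
    return torch.multinomial(probs, 1).squeeze(-1)       # sampled next node

q = torch.rand(2, 128)
emb = torch.rand(2, 10, 128)
mask = torch.zeros(2, 10, dtype=torch.bool)
print(decode_step(q, emb, mask))  # e.g. tensor([3, 7])
```

Masking infeasible customers (already visited, or unreachable within their time windows) before the softmax is what lets the same decoding step accommodate different constraint sets.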

Even though the proposed approaches have demonstrated performance superior to conventional methods in solving VRPs, most studies focus on addressing straightforward routing issues that are essentially linear programming problems. The MDVRPTW, with its various constraints, is considerably more challenging to solve than traditional VRPs, for three main reasons:

1. The quality of heuristic methods is often determined by the quality of the groupings, and devising grouping rules requires a substantial amount of expert domain knowledge, making it difficult to achieve optimal results.

2. Current research on DRL methods for combinatorial optimization mainly focuses on using a single agent that interacts with the environment to solve problems such as the TSP and VRPs, while research on solving the MDVRPTW is relatively scarce.

3. Compared with single-depot vehicle routing problems, the search efficiency of reinforcement learning is greatly compromised by the much larger solution space of the MDVRP. Furthermore, the transformer-style decoder fixes the order of vehicles, which restricts the agents' exploration and is no longer effective for handling vehicles that originate from different depots (see the sketch after this list).
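To make the third challenge concrete, the sketch below (PyTorch; the function name `joint_step`, the scoring rule, and all dimensions are illustrative assumptions, not the article's actual decoder) picks the acting vehicle and its next customer jointly at every step, rather than decoding vehicles in a fixed order:

```python
# Hypothetical illustration of challenge (3): instead of decoding with a
# fixed vehicle order, select the (vehicle, customer) pair jointly at
# each step, so no depot's vehicle is privileged by decoding position.
import math
import torch

def joint_step(vehicle_ctx, node_emb, visited):
    # vehicle_ctx: (batch, n_vehicles, d)  one context per depot/vehicle
    # node_emb:    (batch, n_nodes, d)     encoder outputs
    # visited:     (batch, n_nodes) bool   already-served customers
    scores = torch.einsum('bvd,bnd->bvn', vehicle_ctx, node_emb)
    scores = scores / math.sqrt(node_emb.size(-1))
    scores = scores.masked_fill(visited.unsqueeze(1), float('-inf'))
    b, v, n = scores.shape
    flat = torch.softmax(scores.reshape(b, v * n), dim=-1)
    pick = torch.multinomial(flat, 1).squeeze(-1)  # joint sample
    return pick // n, pick % n                     # which vehicle, which node

veh = torch.rand(2, 3, 128)   # 3 vehicles from different depots
emb = torch.rand(2, 10, 128)
mask = torch.zeros(2, 10, dtype=torch.bool)
print(joint_step(veh, emb, mask))
```

Because the softmax runs over all vehicle-customer pairs at once, vehicles from different depots compete on equal footing at every decoding step.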
