Hyeoksoo Lee, Jiwoo Hong and Jongpil Jeong
Simple, labor-intensive tasks performed by workers on the job site are rapidly being digitized. In the work environment of logistics warehouses and manufacturing plants, moving goods to a designated place is a typical labor-intensive task for workers. Thes...
Simone Parisi, Davide Tateo, Maximilian Hensel, Carlo D'Eramo, Jan Peters and Joni Pajarinen
Reinforcement learning with sparse rewards is still an open challenge. Classic methods rely on getting feedback via extrinsic rewards to train the agent, and in situations where this occurs very rarely, the agent learns slowly or cannot learn at all. Simi...
Xiaoping Zhang, Yuanpeng Zheng, Li Wang, Arsen Abdulali and Fumiya Iida
Multi-agent collaborative target search is one of the main challenges in the multi-agent field, and deep reinforcement learning (DRL) is a good way to learn such a task. However, DRL always faces the problem of sparse reward, which to some extent reduces...
Sheng Yu, Wei Zhu and Yong Wang
Wargames are essential simulators for various war scenarios. However, the increasing pace of warfare has rendered traditional wargame decision-making methods inadequate. To address this challenge, wargame-assisted decision-making methods that leverage ar...
Yu Chen, Qi Dong, Xiaozhou Shang, Zhenyu Wu and Jinyu Wang
Unmanned aerial vehicles (UAVs) are important in reconnaissance missions because of their flexibility and convenience. Vitally, UAVs are capable of autonomous navigation, which means they can be used to plan safe paths to target positions in dangerous su...
Linfeng Su, Jinbo Wang and Hongbo Chen
The mission of hypersonic vehicles faces the problem of highly nonlinear dynamics and complex environments, which presents challenges to the intelligence level and real-time performance of onboard guidance algorithms. In this paper, inverse reinforcement ...
Xiaoxiong Liu, Yi Yin, Yuzhan Su and Ruichen Ming
To solve the problems of autonomous decision making and the cooperative operation of multiple unmanned combat aerial vehicles (UCAVs) in beyond-visual-range air combat, this paper proposes an air combat decision-making method that is based on a multi-age...
Jiabao Yu, Jiawei Chen, Ying Chen, Zhiguo Zhou and Junwei Duan
Although broad reinforcement learning (BRL) provides a more intelligent autonomous decision-making method for the collision avoidance problem of unmanned surface vehicles (USVs), the algorithm still has the problem of over-estimation and has difficulty c...
Yubing Mao, Farong Gao, Qizhong Zhang and Zhangyi Yang
This study aims to solve the problem of sparse reward and local convergence when using a reinforcement learning algorithm as the controller of an AUV. Based on the generative adversarial imitation learning (GAIL) algorithm combined with a multi-agent, a multi-age...
Xi Lyu, Yushan Sun, Lifeng Wang, Jiehui Tan and Liwen Zhang
This study aims to solve the problems of sparse reward, single policy, and poor environmental adaptability in the local motion planning task of autonomous underwater vehicles (AUVs). We propose a two-layer deep deterministic policy gradient algorithm-bas...
Junfang Fan, Denghui Dou and Yi Ji
In this study, two different impact-angle-constrained guidance and control strategies using deep reinforcement learning (DRL) are proposed. The proposed strategies are based on the dual-loop and integrated guidance and control types. To address comprehen...
Jiangyi Yao, Xiongwei Li, Yang Zhang, Jingyu Ji, Yanchao Wang, Danyang Zhang and Yicen Liu
Unmanned helicopter (UH) is often utilized for raid missions because it can evade radar detection by flying at ultra-low altitudes. Path planning is the key technology to realizing the autonomous action of UH. On the one hand, the dynamically changing ra...
Junjie Zeng, Long Qin, Yue Hu, Quanjun Yin and Cong Hu
Since an individual approach can hardly navigate robots through complex environments, we present a novel two-level hierarchical framework called JPS-IA3C (Jump Point Search improved Asynchronous Advantage Actor-Critic) in this paper for robot navigation ...