Dugan Um, Prasad Nethala and Hocheol Shin
In this paper, a hierarchical reinforcement learning (HRL) architecture, namely a "Hierarchical Deep Deterministic Policy Gradient (HDDPG)", is proposed and studied. An HDDPG uses a manager-worker formation similar to other HRL structures. Howe...