Hierarchical Relative Entropy Policy Search

Hierarchical Relative Entropy Policy Search

An Information Theoretic Learning Algorithm in Multimodal Solution Spaces for Real Robots

Versandfertig in 6-10 Tagen
21,99 €
inkl. MwSt.
PAYBACK Punkte
11 °P sammeln!
Many real-world problems are inherently hierarchically structured. The use of this structure in an agent's policy may well be the key to improved scalability and higher performance on motor skill tasks. However, such hierarchical structures cannot be exploited by current policy search algorithms. We concentrate on a basic, but highly relevant hierarchy - the `mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. Using a hierarchical setup for our learning method allows us to learn not only one solu...