hrl-q3
Questions to HRL Paper 3
Q1: How does the update rule for the Manager find subgoals?
Q2: Why is the architecture not trained end-to-end?
Q3: What would you consider the most important contribution? What the most important hyperparameter?