1. Watch, try, learn: Meta-learning from demonstrations and reward;zhou,2019
2. Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning;yu;Conference on Robot Learning,2020
3. Learning to Learn Using Gradient Descent
4. Gradient Theory of Optimal Flight Paths
5. Hallucinative topological memory for zero-shot visual planning;liu,2020