1. Neural production systems;Goyal;Advances in Neural Information Processing Systems, 2021
2. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor;Haarnoja
3. Truly proximal policy optimization;Wang
4. NLtoPDDL: One-shot learning of PDDL models from natural language process manuals;Miglani
5. HDDL: An Extension to PDDL for Expressing Hierarchical Planning Problems