Exploration and Exploitation
VIME: Variational Information Maximizing Exploration
Curiosity-driven Exploration by Self-supervised Prediction
Unifying Count-Based Exploration and Intrinsic Motivation
Text-based RL
Deep Reinforcement Learning with a Natural Language Action Space
Keep CALM and Explore: Language Models for Action Generation in Text-based Games