Multi-task Deep Reinforcement Learning with PopArt
부제: impala에 popart를 싸서 드셔보세요
More …부제: impala에 popart를 싸서 드셔보세요
More …尹授老, 遠藤靖典, 木下尚彦, “許容範囲付きデータに対する多項式回帰モデル”, 筑波大学, 2016
More …IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures https://arxiv.org/abs/1802.01561
More …https://arxiv.org/abs/1805.11604
More …