화학공학소재연구정보센터
검색결과 : 5건
No. Article
1 Mastering Atari, Go, chess and shogi by planning with a learned model
Schrittwieser J, Antonoglou I, Hubert T, Simonyan K, Sifre L, Schmitt S, Guez A, Lockhart E, Hassabis D, Graepel T, Lillicrap T, Silver D
Nature, 588(7839), 604, 2020
2 Human-level performance in 3D multiplayer games with population-based reinforcement learning
Jaderberg M, Czarnecki WM, Dunning I, Marris L, Lever G, Castaneda AG, Beattie C, Rabinowitz NC, Morcos AS, Ruderman A, Sonnerat N, Green T, Deason L, Leibo JZ, Silver D, Hassabis D, Kavukcuoglu K, Graepel T
Science, 364(6443), 859, 2019
3 A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, Lillicrap T, Simonyan K, Hassabis D
Science, 362(6419), 1140, 2018
4 Mastering the game of Go without human knowledge
Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A, Chen YT, Lillicrap T, Hui F, Sifre L, van den Driessche G, Graepel T, Hassabis D
Nature, 550(7676), 354, 2017
5 Mastering the game of Go with deep neural networks and tree search
Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D
Nature, 529(7587), 484, 2016