검색결과 : 5건
No. | Article |
---|---|
1 |
Mastering Atari, Go, chess and shogi by planning with a learned model Schrittwieser J, Antonoglou I, Hubert T, Simonyan K, Sifre L, Schmitt S, Guez A, Lockhart E, Hassabis D, Graepel T, Lillicrap T, Silver D Nature, 588(7839), 604, 2020 |
2 |
Human-level performance in 3D multiplayer games with population-based reinforcement learning Jaderberg M, Czarnecki WM, Dunning I, Marris L, Lever G, Castaneda AG, Beattie C, Rabinowitz NC, Morcos AS, Ruderman A, Sonnerat N, Green T, Deason L, Leibo JZ, Silver D, Hassabis D, Kavukcuoglu K, Graepel T Science, 364(6443), 859, 2019 |
3 |
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, Lillicrap T, Simonyan K, Hassabis D Science, 362(6419), 1140, 2018 |
4 |
Mastering the game of Go without human knowledge Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A, Chen YT, Lillicrap T, Hui F, Sifre L, van den Driessche G, Graepel T, Hassabis D Nature, 550(7676), 354, 2017 |
5 |
Mastering the game of Go with deep neural networks and tree search Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, Lillicrap T, Leach M, Kavukcuoglu K, Graepel T, Hassabis D Nature, 529(7587), 484, 2016 |