화학공학소재연구정보센터
검색결과 : 4건
No. Article
1 A distributional code for value in dopamine-based reinforcement learning
Dabney W, Kurth-Nelson Z, Uchida N, Starkweather CK, Hassabis D, Munos R, Botvinick M
Nature, 577(7792), 671, 2020
2 Continuous-action planning for discounted infinite-horizon nonlinear optimal control with Lipschitz values
Busoniu L, Pall B, Munos R
Automatica, 92, 100, 2018
3 Performance bounds in L-p-norm for approximate value iteration
Munos R
SIAM Journal on Control and Optimization, 46(2), 541, 2007
4 Sensitivity analysis using Ito-Malliavin calculus and martingales, and application to stochastic optimal control
Gobet E, Munos R
SIAM Journal on Control and Optimization, 43(5), 1676, 2005