Search results: 4
No. | Article |
---|---|
1 | A distributional code for value in dopamine-based reinforcement learning. Dabney W, Kurth-Nelson Z, Uchida N, Starkweather CK, Hassabis D, Munos R, Botvinick M. Nature, 577(7792), 671, 2020 |
2 | Continuous-action planning for discounted infinite-horizon nonlinear optimal control with Lipschitz values. Busoniu L, Pall B, Munos R. Automatica, 92, 100, 2018 |
3 | Performance bounds in L-p-norm for approximate value iteration. Munos R. SIAM Journal on Control and Optimization, 46(2), 541, 2007 |
4 | Sensitivity analysis using Ito-Malliavin calculus and martingales, and application to stochastic optimal control. Gobet E, Munos R. SIAM Journal on Control and Optimization, 43(5), 1676, 2005 |