화학공학소재연구정보센터(CHERIC) | 연구정보 | 문헌DB

검색결과 : 4건

No.	Article
1	A distributional code for value in dopamine-based reinforcement learning Dabney W, Kurth-Nelson Z, Uchida N, Starkweather CK, Hassabis D, Munos R, Botvinick M Nature, 577(7792), 671, 2020
2	Continuous-action planning for discounted infinite-horizon nonlinear optimal control with Lipschitz values Busoniu L, Pall B, Munos R Automatica, 92, 100, 2018
3	Performance bounds in L-p-norm for approximate value iteration Munos R SIAM Journal on Control and Optimization, 46(2), 541, 2007
4	Sensitivity analysis using Ito-Malliavin calculus and martingales, and application to stochastic optimal control Gobet E, Munos R SIAM Journal on Control and Optimization, 43(5), 1676, 2005