화학공학소재연구정보센터
IEEE Transactions on Automatic Control, Vol.46, No.4, 644-647, 2001
Death and discounting
Markov decision processes (MDPs) with discounted cost are equivalent to processes with a finite-random duration, and, hence, the discount factor models a (random) time horizon for the life of the process. We elaborate on this idea, but show that an objective function which is a Linear combination of several discounted costs (each with a different discount factor) does not, in general, model professes with several time scales, but rather professes with partial information.