Death and discounting

Shwartz A

IEEE Transactions on Automatic Control, Vol.46, No.4, 644-647, 2001

Markov decision processes (MDPs) with discounted cost are equivalent to processes with a finite-random duration, and, hence, the discount factor models a (random) time horizon for the life of the process. We elaborate on this idea, but show that an objective function which is a Linear combination of several discounted costs (each with a different discount factor) does not, in general, model professes with several time scales, but rather professes with partial information.

Keywords:discounted cost;Markov decision processes (MDPs);multiple time scales