Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
This paper describes sufficient conditions for the existence of optimal policies for partially observable Markov decision processes (POMDPs) with Borel state, observation, and action sets, when the goal is to minimize the expected total costs over finite or infinite horizons. For infinite-horizon problems, one-step costs are either discounted or assumed …