Ask a Question

Prefer a chat interface with context about you and your work?

Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

This paper describes sufficient conditions for the existence of optimal policies for partially observable Markov decision processes (POMDPs) with Borel state, observation, and action sets, when the goal is to minimize the expected total costs over finite or infinite horizons. For infinite-horizon problems, one-step costs are either discounted or assumed …