Performance Guarantees for Homomorphisms beyond Markov Decision Processes
Most real-world problems have huge state and/or action spaces, so a naive application of existing tabular solution methods is intractable on such problems. Nonetheless, these solution methods are quite useful if an agent has access to a relatively small state-action space homomorphism of the true environment and near-optimal performance …