Ask a Question

Prefer a chat interface with context about you and your work?

Linear Fitted-Q Iteration with Multiple Reward Functions

Linear Fitted-Q Iteration with Multiple Reward Functions

We present a general and detailed development of an algorithm for finite-horizon fitted-Q iteration with an arbitrary number of reward signals and linear value function approximation using an arbitrary number of state features. This includes a detailed treatment of the 3-reward function case using triangulation primitives from computational geometry and …