Markov Assumption

We assume that the state used by the agent is a sufficient statistic of the history: to predict the future, you only need to know the current state of the environment, not how it got there.

This implies that the future is independent of the past given the present, provided the present state contains the right aggregate statistics.

  • Information state: a sufficient statistic of the history
  • State s_t is Markov iff p(s_{t+1} | s_t, a_t) = p(s_{t+1} | h_t, a_t), where h_t is the history up to time t
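
To make the definition concrete, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the toy environment (a ball bouncing between positions 0 and 2), the function names, and the simplification of dropping actions. It compares the empirical p(s_{t+1} | s_t) with p(s_{t+1} | s_{t-1}, s_t) on a single trajectory. Looking back one extra step is only a necessary check, not a proof that a state is Markov.

```python
from collections import Counter, defaultdict


def bounce_trajectory(n_steps):
    """Hypothetical toy environment: a ball bouncing between positions 0 and 2.

    Returns a list of (position, direction) pairs. There are no actions here,
    so a_t is omitted from the conditionals.
    """
    pos, direction = 0, 1
    out = []
    for _ in range(n_steps):
        out.append((pos, direction))
        # Reverse direction when the next move would leave the range [0, 2].
        if pos + direction < 0 or pos + direction > 2:
            direction = -direction
        pos += direction
    return out


def next_state_distributions(states):
    """Empirical p(next | current) estimated from one trajectory."""
    counts = defaultdict(Counter)
    for s, s_next in zip(states, states[1:]):
        counts[s][s_next] += 1
    return {
        s: {n: c / sum(ctr.values()) for n, c in ctr.items()}
        for s, ctr in counts.items()
    }


def looks_markov(states, tol=1e-9):
    """Check whether knowing s_{t-1} changes the prediction of s_{t+1}."""
    one_step = next_state_distributions(states)
    # Treat consecutive pairs (s_{t-1}, s_t) as the conditioning context.
    pairs = list(zip(states, states[1:]))
    two_step = next_state_distributions(pairs)
    for (_prev, cur), dist in two_step.items():
        for (_, nxt), p in dist.items():
            if abs(p - one_step[cur].get(nxt, 0.0)) > tol:
                return False
    return True


full_states = bounce_trajectory(50)           # (position, direction)
positions_only = [pos for pos, _ in full_states]

print(looks_markov(full_states))     # True: direction is part of the state
print(looks_markov(positions_only))  # False: from position 1 the ball may go either way
```

Position alone is not a sufficient statistic of the history: at position 1 the next position depends on where the ball came from. Adding the direction to the state restores the Markov property in this toy example.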
