Thursday, October 9, 2014

What is a state?

From time to time I will add a post to this blog which I hope will add to your appreciation of the course. This post relates to today's lecture, in which we had the idea of a state-structured model. The notion of a state is intuitive, or is it?

"What is a state?"

Warren B. Powell (a professor at Princeton and researcher in optimization) has mused upon this question. You can read his blog about this here: http://www.castlelab.princeton.edu/cso.htm

I think he makes a nice definition of a state, in these words:

A state variable is a minimally dimensioned function of history that is necessary and sufficient to compute the decision function, cost/reward/contribution function, and the transition function.


By "decision function" he means the $F(x,t)$ that I was today calling the "value function". For the other terms he is thinking of $c(x,u,t)$ and $a(x,u,t)$.

He discusses the fact that "state" can be a physical state, an information state, or a belief state.

Please be aware that a state can be a vector of more that one component. For example, consider the uncontrolled autoregressive plant equation,

$x_t = a_1 x_{t-1} + a_2 x_{t-2}$.

Here we would take the state to be $y_t=(x_t,x_{t-1})$, because $y_{t-1}$ is needed to compute $y_t$.

Warren is particularly interested reconciling the many communities of stochastic optimization, amongst which he includes
  • Dynamic programming (including Markov decision processes)
  • Stochastic programming (two-stage, multistage, chance constrained)
  • Stochastic search (including simulation optimization, infinitessimal perturbation analysis, and ranking and selection)
  • Optimal control (primarily engineering, but also economics and finance)
  • Approximate dynamic programming and reinforcement learning
  • ...
The first and fourth of these are the ones which our courses addresses. You may find it interesting to read his essay on what is a state?