11 In this footnote, I propose a formal definition of threat. Denote by Xπ(s) the random variable giving the total discounted future reward starting from initial state s, following policy π, and using discount factor λ. That is, Xπ(s) = t=0λtr t | π,s(0) = s. The expectation of this quantity is nothing else than the state-value function, and that will be used as the baseline. Thus, subtracting the baseline we obtain the random variable ˜Xπ(s) = Xπ(s) - V π(s) = t=0λt(r t - E{rt}) | π,s(0) = s. Some measure of the downside risk (negative tail) of X˜π(s) is now defined a measure of threat for state s. We could use, for example, the conditional value-at-risk (expected shortfall), see footnote 7 in this chapter for discussion of various downside risk measures.