Proof: Induction on the structure of . There are several base
cases, all fairly trivial. If
or
there is
nothing to prove, as these progress to themselves and hold
everywhere and nowhere respectively. If
then if
holds in
then it progresses to
which holds in
while if
does not hold in
then it
progresses to
which does not hold in
. The
case
is similar. In the last base case,
.
Then the following are equivalent:
Induction case 1:
. The following are equivalent:
and
and
(by induction
hypothesis)
Induction case 2: . Analogous to case 1.
Induction case 3:
. Trivial by inspection of the definitions.
Induction case 4:
. Then
is logically equivalent
to
which by cases 1, 2 and 3 holds at
stage
of
for behaviour
iff
holds at stage i+1.
Theorem 1
Let be reward-normal, and let
be the result of progressing it through the successive
states of a sequence
. Then, provided no
is
,
for all
iff
.
Proof: First, by the definition of
reward-normality, if is reward-normal then
iff for all
, if
then
.
Next, if
then progressing
through
according to
(that is, letting each
be true iff
) cannot lead to a contradiction because by
Property 1, progression is truth-preserving.
It remains, then, to show that if
then
progressing
through
according to
must lead
eventually to
. The proof
of this is by induction on the structure of
and as usual the
base case in which
is a literal (an atom, a negated atom or
,
or
) is trivial.
Case
. Suppose
. Then
either
or
, so
by the induction hypothesis either
or
progresses
eventually to
, and hence so does their conjunction.
Case
. Suppose
. Then both
and
, so by the
induction hypothesis each of
and
progresses eventually to
. Suppose without loss of generality that
does not
progress to
before
does. Then at some point
has
progressed to some formula
and
has progressed to
which simplifies to
. Since
also progresses to
eventually, so does
.
Case
. Suppose
. Let
and let
. Then
, so by the induction
hypothesis
progressed through
according to
eventually reaches
. But The progression of
through
according to
is exactly the same after the first
step, so that too leads to
.
Case
. Suppose
. Then
there is some
such that
and for
all
,
. We proceed by
induction on
. In the base case
, and both
and
whence by the main
induction hypothesis both
and
will eventually progress to
. Thus
progresses eventually to
for any
, and in particular for
,
establishing the base case. For the induction case, suppose
(and of course
).
Since
is equivalent to
and
,
and
, clearly
. Where
and
are as in the previous case, therefore,
and the failure occurs at stage
of
. Therefore the hypothesis of the induction on
applies, and
progressed through
according to
goes eventually to
, and so
progressed through
according to
goes similarly to
.
Theorem 3
Let be the set of e-states in an equivalent MDP
for
.
is minimal iff every e-state
in
is reachable and
contains no two distinct e-states
and
with
and
.
Proof: Proof is by construction of the canonical equivalent MDP
. Let the set of finite prefixes of state sequences in
be partitioned into equivalence classes, where
iff
and for all
such that
,
. Let
denote the equivalence
class of
. Let
be the set of these equivalence
classes. Let
be the function that takes each
in
to
. For each
and
and for each
, let
be
if
. Otherwise let
. Let
be
. Then note the following four facts:
For allSuppose, if there is
such that
, then for all
such that
, there exists a unique
,
, such that for all
,
.
To establish existence, we need that ifis the unique element
of
with
such that for all
,
.
Theorem 4
Let be the translation of
as in
Definition 5.
is a blind minimal
equivalent MDP for
.
Proof: Reachability of all the e-states is obvious, as they are
constructed only when reached. Each e-state is a pair
where
is a state of
and
is a reward
function specification. In fact,
and
determines a distribution of rewards over all
continuations of the sequences that reach
.
That is, for all
in
such that
, the
reward for
is
. If
is not blind minimal, then there exist distinct
e-states
and
for which this sum is the same for all
. But this makes
and
semantically equivalent, contradicting the
supposition that they are distinct.