Conditional expectation w.r.t. random variable and w.r.t. $\sigma$-algebra, equivalence

2,411

If a probability space is given then the conditional expectation does not depend on the values taken by the random variable in the condition; it depends only on the sets on which this random variable takes these values as constants. You can experience this if you calculate a very simple example when the random variables involved are discrete.

Since only the sets count, one can say that only the indicator functions count.

So, if you have a $(\sigma$-)algebra in the condition then you can replace this $\ \ (\sigma-)$algebra by any random variable which generates the same $(\sigma$-)algebra. Or the other way around, if you have a random variable in the condition then you can replace it by the $(\sigma$-)algebra generated by that random variable.

To answer your question briefly: If a random variable is in the condition of a conditional expectation then it can be considered as a system of indicator functions of sets on which the random variable is constant.

Share:
2,411

Related videos on Youtube

StefanH
Author by

StefanH

Updated on August 01, 2022

Comments

  • StefanH
    StefanH over 1 year

    Let $\Omega = \{ \omega_i \}$ be a countable set, and consider some probability space $(\omega, \mathcal F, P)$ with $p_i := P(\{ w_i \})$. Let $X : \Omega \to \mathbb R$ be a random variable, then it's expected value is $$ E(X) := \sum_i X(\omega_i) \cdot p_i. $$ Now it is possible to define the conditional expectation $E(X | Y)$ of $X$ dependend on a second random variable, on Wikipedia it is written that

    [...] the conditional expectation of $X$ given the event $Y = y$ is a function of $y$ over the range of $Y$ [...]

    This is the conditional expectation with respect (conditioned) to another random variable $Y$. But as is also written on wikipedia, we can condition on a sub-$\sigma$-algebra, see Wikipedia too.

    The conditional expectation w.r.t. a random variable is a function on the range of $Y$, i.e. $E(X | Y) : Y(\Omega) \to \mathbb R$, the condittional expectation w.r.t. a sub-$\sigma$-algebra $\mathcal H \subseteq \mathcal F$ is a function $E(X | \mathcal H) : \Omega \to \mathbb R$, so has another domain.

    Now my question, as they both capture in some sense the same concept, how to convert between them? On the german Wikipedia I found an explanation, there it is said (in a very free translation)

    ... to get from $E(X|Y)$ to $E(X|\mathcal H)$ set $\mathcal H := \sigma(Y)$, then $E(X|\sigma(Y))(\omega) = E(X | \{ \omega' : Y(\omega') = Y(\omega) \} = E(X|Y=y)$ with $y = Y(\omega)$ and $E(X|Y=y) = E(X|\sigma(Y))(\omega)$ for some $\omega$ with $y = Y(\omega)$. For the other conversion given $E(X|\mathcal H)$ let $Y$ be the family $(1_{B})_{B\in \mathcal H}$.

    The conversion from $E(X|\mathcal H)$ to the form $E(X|Y)$ for some random variable $Y$ I do not understand, how could $Y$ be a family of indicator functions as said above (I made the part I do not understand bold)? Can someone please explain, thank you!

    • Did
      Did almost 9 years
      "The conditional expectation w.r.t. a random variable is a function on the range of $Y$, i.e. $E(X | Y) : Y(\Omega) \to \mathbb R$" Not at all (what is your source?). Actually, if $X$ and $Y$ are random variables defined on $\Omega$ then $E(X|Y)$ is also a (class of) random variable(s) defined on $\Omega$. The function $y\mapsto E(X|Y=y)$, defined on $Y(\Omega)$, is a different beast, which is best defined (up to null sets) using the random variable $E(X|Y)$.
    • StefanH
      StefanH almost 9 years
      @Did: This is written on wikipedia: "Let $X$ and $Y$ be discrete random variables, then the conditional expectation of $X$ given the event $Y=y$ is a function of $y$ over the range of $Y$" (see en.wikipedia.org/wiki/Conditional_expectation) and it is also written here: en.wikipedia.org/wiki/Expected_value#Iterated_expectation. But according to the general definition this is just one version, equal a.s. to other version.
    • Did
      Did almost 9 years
      Exactly what my previous comment explains. You seem to be confusing the random variable $\omega\mapsto E(X|Y)(\omega)$ and the function $y\mapsto E(X|Y=y)$.
    • StefanH
      StefanH almost 9 years
      The function $y \mapsto E(X|Y = y)$ could be made to a random variable by setting $\omega \mapsto E(X|Y = Y(\omega))$, and for this modified version it has to hold that $E(X|Y=Y(\omega)) = E(X|\mathcal H)$ almost everywhere.
    • Did
      Did almost 9 years
      Except that the valid construction goes in the other way: one deduces the function $y\mapsto E(X|Y=y)$ from the random variable $E(X|Y)$. May I suggest to get a reliable textbook?
    • StefanH
      StefanH almost 9 years
      What textbook do you think of?
    • Did
      Did almost 9 years
      This very much depends on your background, about which you say nothing, hence the question in your comment is absurd. Nevertheless I might mention that the book Probability with martingales by David Williams is excellent and congenial.
  • StefanH
    StefanH almost 9 years
    "if you have a $\sigma$-algebra [...] then you can replace this $\sigma$-algebra by any random variable which generates [it]". But what if the $\sigma$-algebra ist not generated by any random variable: math.stackexchange.com/questions/267584/…?
  • StefanH
    StefanH almost 9 years
    And also, how is a system of indicator functions related to a random variable, for example consider $\{ \emptyset, \{1,2\},\{3\}, \{4\}, \{3,4\},\{1,2,3\}, \{1,2,4\}, \{1,2,3,4\}\}$ which is a ($\sigma$)-algebra over $Y = \{1,2,3,4\}$. Then for example $X(1)=X(2)=0, X(3)=1, X(4)=2$ generates this ($\sigma$)-algebra. But the system of indicator functions is $\{ I_{\emptyset}, I_{\{1,2\}}, I_{\{3\}}, I_{\{4\}}, I_{\{1,2,3\}}, I_{\{1,2,4\}}, I_Y \}$, but $X$ could be written in terms of indicator just as $X = 0\cdot I_{\{1,2\}} + I_{\{3\}} + I_{\{4\}}$, is there any construction to get there?
  • zoli
    zoli almost 9 years
    But what is wrong? If X is in the condition then only the indicator functions count. Only those, of course, that play role in the construction.
  • zoli
    zoli almost 9 years
    As far as the first question: The conditional expectation and every random variables are defined only a.s. Or as eq. classses of measurable functions that are equal a.s. So it is meaningless to say that a random variable is only Borel measurable and not Lebesgue measurable.
  • StefanH
    StefanH almost 9 years
    If I give you some family of indicator functions $\{ I_B \}_{B \in \mathbb B}$ how do you construct a random variable out of it? What sets (or indicator functions) do you choose where your random variable is constant?
  • zoli
    zoli almost 9 years
    The indicators will have describe dijoint sets whose union is Omega. Then I will construct many r.v.'s.
  • StefanH
    StefanH almost 9 years
    Yes, then it is clear, but what if a $\sigma$-Algebra could not be decomposed in such a way? Or is it always possible... sorry maybe you already addressed this in your forementioned comment, but I do not understand...
  • zoli
    zoli almost 9 years
    If the union of a sequence of sets cover $\Omega$ and these sets belong to the actual $\sigma-$algebra then you can always construct a partitioning of $\Omega$ such that the new covering sets will belong to the same $\sigma-$ algebra.
  • StefanH
    StefanH almost 9 years
    How? Can you please explain? And by the way, the new partitioning must be finer I guess, this seems non-trivial to me...
  • zoli
    zoli almost 9 years
    Dearest Stefan, please give a specific example: (1) a discrete random variable and a set of indicators for the condition.