Difference between revisions of "Aufgaben:Exercise 1.5: Binary Markov Source"

Latest revision as of 14:03, 10 August 2021

Binary Markov diagram

Exercise 1.4 has shown that calculating the entropy of a source with memory can be very time-consuming. One must first calculate (very many) entropy approximations $H_k$ for $k$–tuples, and only then can the source entropy be determined by taking the limit $k \to \infty$:

$$H = \lim_{k \rightarrow \infty } H_k \hspace{0.05cm}.$$

Often, $H_k$ tends only very slowly towards the limit $H$.

The computational effort is drastically reduced if the message source has Markov properties. The graphic shows the transition diagram for a binary Markov source with the two states (symbols) $\rm A$ and $\rm B$.

The source is uniquely determined by the two conditional probabilities $p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B} = p$ and $p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A} = q$.
The other conditional probabilities $p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A}$ and $p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B}$ as well as the (unconditional) symbol probabilities $p_{\rm A}$ and $p_{\rm B}$ can be determined from these.

The entropy of the binary Markov chain (with the unit "bit/symbol") is then:

$$H = H_{\rm M} = p_{\rm AA} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A}} + p_{\rm AB} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}} + p_{\rm BA} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B}} + p_{\rm BB} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B}} \hspace{0.05cm}.$$

Note that the conditional probabilities $p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A}$, $p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}$, ... appear in the argument of the binary logarithm, while the joint probabilities $p_{\rm AA}$, $p_{\rm AB}$, ... are used for the weighting.
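The role of the weighting can be made explicit with a short rearrangement. The following is a sketch that assumes $p_{\rm AA} = p_{\rm A} \cdot p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A}$ (and likewise for the other symbol pairs) and uses the binary entropy function $H_{\rm bin}(x) = x \cdot {\rm log}_2\hspace{0.1cm}(1/x) + (1-x) \cdot {\rm log}_2\hspace{0.1cm}[1/(1-x)]$. Grouping the four terms by the preceding symbol gives:

$$H_{\rm M} = p_{\rm A} \cdot \left [ (1-q) \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{1-q} + q \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{q} \right ] + p_{\rm B} \cdot \left [ p \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{p} + (1-p) \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{1-p} \right ] = p_{\rm A} \cdot H_{\rm bin}(q) + p_{\rm B} \cdot H_{\rm bin}(p) \hspace{0.05cm}.$$

In this form, $H_{\rm M}$ is the average of the uncertainty about the next symbol, taken over the two possible preceding symbols.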

Using the first order entropy approximation,

$$H_1 = p_{\rm A} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm A}} + p_{\rm B} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm B}} \hspace{0.5cm}({\rm unit\hspace{-0.1cm}: \hspace{0.1cm}bit/symbol})\hspace{0.05cm},$$

as well as the (actual) entropy $H = H_{\rm M}$ given above, all further entropy approximations $(k = 2, 3, \text{...})$ can also be given directly for a Markov source:

$$H_k = \frac{1}{k} \cdot \big [ H_{\rm 1} + (k-1) \cdot H_{\rm M} \big ] \hspace{0.05cm}.$$
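A brief sketch of why this relation plausibly holds: let $X_1, \text{...}\hspace{0.05cm}, X_k$ denote $k$ consecutive source symbols (a notation introduced here only for illustration) and assume the chain is stationary. The chain rule for the joint entropy, combined with the first-order Markov property $H(X_i\hspace{0.05cm}|\hspace{0.05cm}X_1, \text{...}\hspace{0.05cm}, X_{i-1}) = H(X_i\hspace{0.05cm}|\hspace{0.05cm}X_{i-1})$, then gives:

$$H_k = \frac{1}{k} \cdot H(X_1, \text{...}\hspace{0.05cm}, X_k) = \frac{1}{k} \cdot \Big [ H(X_1) + \sum_{i=2}^{k} H(X_i\hspace{0.05cm}|\hspace{0.05cm}X_{i-1}) \Big ] = \frac{1}{k} \cdot \big [ H_{\rm 1} + (k-1) \cdot H_{\rm M} \big ] \hspace{0.05cm}.$$

Each of the $k-1$ conditional terms equals $H_{\rm M}$, so $H_k$ approaches $H_{\rm M}$ as $k$ grows, with a remaining offset of $(H_1 - H_{\rm M})/k$.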

Hints:

The exercise belongs to the chapter Discrete Sources with Memory.
Reference is also made in particular to the two pages "Intersection" and "Conditional Probability".
Except in subtask (6), $p = 1/4$ and $q = 1/2$ always apply.

The (ergodic) symbol probabilities of a first-order Markov chain are:

$$ p_{\rm A} = \frac {p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B}} { p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B} + p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}} \hspace{0.05cm}, \hspace{0.3cm} p_{\rm B} = \frac {p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}} { p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B} + p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}} \hspace{0.05cm}.$$
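The two formulas for the symbol probabilities can be checked numerically. The following is a minimal Python sketch; the function name `stationary_probabilities` is a hypothetical helper chosen for illustration and does not come from the exercise. It evaluates both formulas and verifies that $p_{\rm A}$ reproduces itself after one transition step, which is what stationarity requires.

```python
def stationary_probabilities(p, q):
    """Return (p_A, p_B) for a binary Markov chain.

    p = Pr(A | B) and q = Pr(B | A), as in the exercise.
    (Hypothetical helper, named here only for illustration.)
    """
    if not (0.0 <= p <= 1.0 and 0.0 <= q <= 1.0) or p + q == 0.0:
        raise ValueError("need 0 <= p, q <= 1 and p + q > 0")
    return p / (p + q), q / (p + q)


p, q = 0.25, 0.5
p_a, p_b = stationary_probabilities(p, q)

# Stationarity check: the probability of A after one more transition,
# Pr(A) * Pr(A|A) + Pr(B) * Pr(A|B), must equal p_A again.
p_a_next = p_a * (1.0 - q) + p_b * p

print(p_a, p_b, p_a_next)
```

For $p = 1/4$ and $q = 1/2$ this should give $p_{\rm A} = 1/3$ and $p_{\rm B} = 2/3$, with `p_a_next` equal to `p_a` up to floating-point precision.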

Questions

$p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A} \ = \ $

$p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B} \ = \ $

$p_{\rm A} \ = \ $

$p_{\rm B} \ = \ $

$H_1 \ = \ $

$\ \rm bit/symbol$

$H \ = \ $

$\ \rm bit/symbol$

$H_2 \ = \ $

$\ \rm bit/symbol$

$H_3 \ = \ $

$\ \rm bit/symbol$

$H_4 \ = \ $

$\ \rm bit/symbol$

$H \ = \ $

$\ \rm bit/symbol$

Solution

Markov diagram for tasks (1), ... , (5)

(1) After $\rm A$, the symbols $\rm A$ and $\rm B$ are equally probable. After $\rm B$, the symbol $\rm B$ occurs much more frequently than $\rm A$. The transition probabilities are:

$$p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A} \hspace{0.1cm} = \hspace{0.1cm} 1 - p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}= 1 - q \hspace{0.15cm} \underline {= 0.5} \hspace{0.05cm},$$

$$ p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B} \hspace{0.1cm} = \hspace{0.1cm} 1 - p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B}= 1 - p \hspace{0.15cm} \underline {= 0.75} \hspace{0.05cm}.$$

(2) According to the equations given:

$$p_{\rm A}= \frac{p}{p+q} = \frac{0.25}{0.25 + 0.50} \hspace{0.15cm} \underline {= 0.333} \hspace{0.05cm}, \hspace{0.5cm} p_{\rm B} = \frac{q}{p+q} = \frac{0.50}{0.25 + 0.50} \hspace{0.15cm} \underline {= 0.667} \hspace{0.05cm}.$$

(3) With the probabilities calculated in the last sub-task:

$$H_{\rm 1} = H_{\rm bin}(p_{\rm A}) = 1/3 \cdot {\rm log}_2\hspace{0.01cm} (3) + 2/3 \cdot {\rm log}_2\hspace{0.01cm} (1.5) = 1.585 - 2/3\hspace{0.15cm} \underline {= 0.918 \,{\rm bit/symbol}} \hspace{0.05cm}.$$

(4) According to the problem statement, the entropy of the Markov source is:

$$H = p_{\rm AA} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A}} + p_{\rm AB} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}} + p_{\rm BA} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B}} + p_{\rm BB} \cdot {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B}} \hspace{0.05cm}.$$

The joint probabilities are:

$$p_{\rm AA} = p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A} \cdot p_{\rm A} = (1-q) \cdot \frac{p}{p+q} = \frac{1/2 \cdot 1/4}{3/4} = {1}/{6} \hspace{0.05cm},$$

$$ p_{\rm AB} = p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A} \cdot p_{\rm A} = q \cdot \frac{p}{p+q} = \frac{1/2 \cdot 1/4}{3/4} = {1}/{6} \hspace{0.05cm},$$

$$ p_{\rm BA} = p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B} \cdot p_{\rm B} = p \cdot \frac{q}{p+q} = p_{\rm AB} = {1}/{6} \hspace{0.05cm},$$

$$ p_{\rm BB} = p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B} \cdot p_{\rm B} = (1-p) \cdot \frac{q}{p+q} = \frac{3/4 \cdot 1/2}{3/4} = {1}/{2} $$

$$\Rightarrow\hspace{0.3cm} H = 1/6 \cdot {\rm log}_2\hspace{0.01cm} (2) + 1/6 \cdot {\rm log}_2\hspace{0.01cm} (2) + 1/6 \cdot {\rm log}_2\hspace{0.01cm} (4) + 1/2 \cdot {\rm log}_2\hspace{0.1cm} (4/3) = 10/6 - 1/2 \cdot {\rm log}_2\hspace{0.01cm} (3) \hspace{0.15cm} \underline {= 0.875 \,{\rm bit/symbol}} \hspace{0.05cm}.$$
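The calculation in subtask (4) can be reproduced with a few lines of Python. This is only a sketch: the function name `markov_entropy` and the dictionary layout are assumptions made for illustration. Each joint probability is formed as symbol probability times conditional probability, and each term is weighted as in the equation for $H$.

```python
from math import log2


def markov_entropy(p, q):
    """Entropy (bit/symbol) of the binary Markov source with
    p = Pr(A | B) and q = Pr(B | A).
    (Hypothetical helper, named here only for illustration.)
    """
    # Ergodic symbol probabilities.
    prior = {"A": p / (p + q), "B": q / (p + q)}
    # Conditional probabilities keyed by (previous symbol, next symbol).
    cond = {
        ("A", "A"): 1.0 - q,
        ("A", "B"): q,
        ("B", "A"): p,
        ("B", "B"): 1.0 - p,
    }
    h = 0.0
    for (prev, _nxt), c in cond.items():
        joint = prior[prev] * c  # e.g. p_AB = p_A * p_{B|A}
        if joint > 0.0:          # skip impossible transitions
            h += joint * log2(1.0 / c)
    return h


h_m = markov_entropy(0.25, 0.5)
print(round(h_m, 3))
```

The printed value should agree with the underlined result above to within about $0.001$ bit/symbol; small differences in the last digit come from rounding.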

(5) In general, with $H_{\rm M} = H$, the $k$–th entropy approximation is:

$$H_k = {1}/{k} \cdot [ H_{\rm 1} + (k-1) \cdot H_{\rm M}] \hspace{0.05cm}.$$

It follows that:

$$H_2 = {1}/{2} \cdot [ 0.918 + 1 \cdot 0.875] \hspace{0.15cm} \underline {= 0.897 \,{\rm bit/symbol}} \hspace{0.05cm},$$

$$ H_3 = {1}/{3} \cdot [ 0.918 + 2 \cdot 0.875] \hspace{0.15cm} \underline {= 0.889 \,{\rm bit/symbol}} \hspace{0.05cm},$$

$$ H_4 = {1}/{4} \cdot [ 0.918 + 3 \cdot 0.875] \hspace{0.15cm} \underline {= 0.886 \,{\rm bit/symbol}} \hspace{0.05cm}.$$
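The slow approach of $H_k$ towards $H$ can be illustrated numerically. The following Python sketch uses hypothetical helper names (`h_bin`, `entropy_approximation`) and computes $H_{\rm M}$ in the grouped form $p_{\rm A} \cdot H_{\rm bin}(q) + p_{\rm B} \cdot H_{\rm bin}(p)$, which is the four-term sum with the terms collected by the preceding symbol.

```python
from math import log2


def h_bin(x):
    """Binary entropy function in bit; defined as 0 at x = 0 and x = 1."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return x * log2(1.0 / x) + (1.0 - x) * log2(1.0 / (1.0 - x))


def entropy_approximation(k, h1, hm):
    """k-th entropy approximation of a Markov source: [H_1 + (k-1) H_M] / k."""
    return (h1 + (k - 1) * hm) / k


p, q = 0.25, 0.5
p_a, p_b = p / (p + q), q / (p + q)

h1 = h_bin(p_a)                          # first-order approximation
hm = p_a * h_bin(q) + p_b * h_bin(p)     # Markov entropy, grouped by previous symbol

approximations = {k: entropy_approximation(k, h1, hm) for k in (1, 2, 3, 4, 10, 100)}
for k, value in approximations.items():
    print(k, round(value, 3))
```

The values for $k = 2, 3, 4$ should match the underlined results to within about $0.001$ bit/symbol (the results above are computed from already rounded inputs), and the sequence decreases monotonically towards $H_{\rm M}$.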

Markov diagram for subtask (6)

(6) With the new set of parameters $(p = 1/4, q = 3/4)$, the symbol probabilities are:

$$ p_{\rm A} = 1/4, \ p_{\rm B} = 3/4.$$

This special case thus leads to statistically independent symbols:

$$ p_{\rm A} = p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A} = p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B} \hspace{0.05cm}, \hspace{0.2cm} p_{\rm B} = p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A} = p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B} \hspace{0.05cm}.$$

Thus the entropy $H$ is identical with the entropy approximation $H_1$:

$$H = H_{\rm 1} = 1/4 \cdot {\rm log}_2\hspace{0.01cm} (4) + 3/4 \cdot {\rm log}_2\hspace{0.01cm} (4/3) = 2 - 0.75 \cdot {\rm log}_2\hspace{0.01cm} (3) \hspace{0.15cm} \underline {= 0.811 \,{\rm bit/symbol}} \hspace{0.05cm}.$$

The entropy approximations $H_2$, $H_3$, $H_4$, ... also yield the result $0.811 \, \rm bit/symbol$.
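The special case of subtask (6) can also be checked numerically. In this Python sketch (the names `h_bin` and `entropies` are hypothetical helpers) the parameters satisfy $p + q = 1$, so the next symbol has the same distribution after $\rm A$ as after $\rm B$, and $H_1$ and $H_{\rm M}$ should coincide.

```python
from math import log2


def h_bin(x):
    """Binary entropy function in bit; defined as 0 at x = 0 and x = 1."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return x * log2(1.0 / x) + (1.0 - x) * log2(1.0 / (1.0 - x))


def entropies(p, q):
    """Return (H_1, H_M) for the binary Markov source with
    p = Pr(A | B) and q = Pr(B | A)."""
    p_a, p_b = p / (p + q), q / (p + q)
    h1 = h_bin(p_a)
    hm = p_a * h_bin(q) + p_b * h_bin(p)
    return h1, hm


h1_indep, hm_indep = entropies(0.25, 0.75)   # subtask (6): p + q = 1
h1_mem, hm_mem = entropies(0.25, 0.5)        # subtasks (1) to (5)

print(round(h1_indep, 3), round(hm_indep, 3))
print(round(h1_mem, 3), round(hm_mem, 3))
```

For the parameters of subtasks (1) to (5) the two values differ, which reflects the memory of the source; for subtask (6) they agree, so every $H_k$ equals the same value.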

@@ Line 1: / Line 1: @@
-{{quiz-Header|Buchseite=Informationstheorie/Nachrichtenquellen mit Gedächtnis
+{{quiz-Header|Buchseite=Information_Theory/Discrete_Sources_with_Memory
 }}
 [[File:P_ID2250__Inf_A_1_5.png|right|frame|Binary Markov diagram]]
-&nbsp; [[Aufgaben:1.4_Entropienäherungen_für_den_AMI-Code|Task 1.4]]&nbsp; has shown that the calculation of the entropy for a memory-containing source can be very time-consuming.&nbsp; One must then first calculate (very many) entropy approximations&nbsp; $H_k$&nbsp; for&nbsp; $k$&ndash;tuples and only then can the source entropy be determined with the boundary transition&nbsp; $k \to \infty$&nbsp;:
+[[Aufgaben:Exercise_1.4:_Entropy_Approximations_for_the_AMI_Code|Exercise 1.4]]&nbsp; has shown that the calculation of the entropy for a source with memory can be very time-consuming.&nbsp; One must then first calculate (very many) entropy approximations&nbsp; $H_k$&nbsp; for&nbsp; $k$&ndash;tuples and only then the source entropy can be determined with the boundary transition&nbsp; $k \to \infty$&nbsp;:
 :$$H  =  \lim_{k \rightarrow \infty } H_k  \hspace{0.05cm}.$$
 Often,&nbsp; $H_k$&nbsp; tends only very slowly towards the limit&nbsp; $H$.
-The calculation process is drastically reduced if the message source has Markov properties.&nbsp; The diagram shows the transition diagram for a binary Markov source with the two states (symbols)&nbsp; $\rm A$&nbsp; and&nbsp; $\rm B$. This is determined by the two conditional Markov sources.
+The calculation process is drastically reduced if the message source has Markov properties.&nbsp; The graphic shows the transition diagram for a binary Markov source with the two states (symbols)&nbsp; $\rm A$&nbsp; and&nbsp; $\rm B$.
 *This is clearly determined by the two conditional probabilities&nbsp; $p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B} = p$ &nbsp;and&nbsp; $p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A} = q$&nbsp;.
@@ Line 18: / Line 18: @@
 {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}B}}
   \hspace{0.05cm}.$$
-In this equation, it should be noted that in the argument of the <i>binary logarithm</i>&nbsp;, the&nbsp; [[Theory_of_Stochastic_Signals/Statistische_Abhängigkeit_und_Unabhängigkeit#Bedingte_Wahrscheinlichkeit|conditional probabilities]]&nbsp; $p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A}$,&nbsp; $p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}$, ... &nbsp; are to be used, while the&nbsp; [[Theory_of_Stochastic_Signals/Mengentheoretische_Grundlagen#Schnittmenge|joint probabilities]]&nbsp; $p_{\rm AA}$,&nbsp; $p_{\rm AB}$, ... &nbsp; are to be used for the weighting.
+It should be noted that in the argument of the&nbsp; "binary logarithm", the&nbsp; [[Theory_of_Stochastic_Signals/Statistical_Dependence_and_Independence#Conditional_Probability|conditional probabilities]]&nbsp; $p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A}$,&nbsp; $p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}$, ... &nbsp; are to be used, while the&nbsp; [[Theory_of_Stochastic_Signals/Statistical_Dependence_and_Independence#Conditional_Probability|"Conditional Probabilities"]]&nbsp; $p_{\rm AA}$,&nbsp; $p_{\rm AB}$, ... &nbsp; are to be used for the weighting.
 Using the first order entropy approximation,
@@ Line 24: / Line 24: @@
 {\rm log}_2\hspace{0.1cm}\frac {1}{ p_{\rm B}}
   \hspace{0.5cm}({\rm unit\hspace{-0.1cm}: \hspace{0.1cm}bit/symbol})\hspace{0.05cm},$$
-as well as the (actual) entropy&nbsp; $H = H_{\rm M}$&nbsp; given above, all further entropy approximations&nbsp; $(k = 2,, 3, \text{...})$&nbsp; can also be given directly for a Markov source:
+as well as the (actual) entropy&nbsp; $H = H_{\rm M}$&nbsp; given above, all further entropy approximations&nbsp; $(k = 2, 3, \text{...})$&nbsp; can also be given directly for a Markov source:
 :$$H_k =  \frac{1}{k} \cdot \big [ H_{\rm 1} + (k-1) \cdot H_{\rm M} \big ]
   \hspace{0.05cm}.$$
@@ Line 36: / Line 32: @@
 ''Hints:''
-*The task belongs to the chapter&nbsp; [[Information_Theory/Nachrichtenquellen_mit_Gedächtnis|Sources with Memory]].
+*The exercise belongs to the chapter&nbsp; [[Information_Theory/Discrete_Sources_with_Memory|Discrete Sources with Memory]].
-*Reference is also made in particular to the two pages&nbsp;   [[Theory_of_Stochastic_Signals/Mengentheoretische_Grundlagen#Schnittmenge|Schnittmenge]]&nbsp; und&nbsp; [[Theory_of_Stochastic_Signals/Statistische_Abhängigkeit_und_Unabhängigkeit#Bedingte_Wahrscheinlichkeit|Bedingte Wahrscheinlichkeit]].
+*Reference is also made in particular to the two pages&nbsp;   [[Theory_of_Stochastic_Signals/Set_Theory_Basics#Intersection|"Intersection"]]&nbsp; and&nbsp; [[Theory_of_Stochastic_Signals/Statistical_Dependence_and_Independence#Conditional_Probability|"Conditional Probability"]].
 *With the exception of subtask&nbsp; '''(6)'''&nbsp; &nbsp;  $p = 1/4$ &nbsp;and&nbsp; $q = 1/2$ always apply.
-*For the (ergodic) symbol probabilities of a first order Markov chain applies:
+*For the&nbsp; (ergodic)&nbsp; symbol probabilities of a first order Markov chain applies:
 :$$ p_{\rm A}  = \frac {p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B}}
 { p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}B} + p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}}  \hspace{0.05cm},  \hspace{0.3cm}
@@ Line 58: / Line 54: @@
-{What are the (unconditional) symbol probabilities?&nbsp; Let &nbsp;$p = 1/4$ &nbsp;and&nbsp; $q = 1/2$ still hold.
+{What are the&nbsp; (unconditional)&nbsp; symbol probabilities?&nbsp; Let &nbsp;$p = 1/4$ &nbsp;and&nbsp; $q = 1/2$ still hold.
 |type="{}"}
 $p_{\rm A} \ =  \  $  { 0.333 3% }
@@ Line 64: / Line 60: @@
-{Give the corresponding first-order entropy approximation.
+{Give the corresponding first order entropy approximation.
 |type="{}"}
 $H_1  \ =  \ $ { 0.918 1% } $\ \rm bit/symbol$
@@ Line 91: / Line 87: @@
 ===Solution===
 {{ML-Kopf}}
-[[File:Inf_A_1_5a_vers2.png|right|frame|Markov diagram for the subtasks&nbsp; '''(1)''', ... ,&nbsp; '''(5)''']]
+[[File:Inf_A_1_5a_vers2.png|right|frame|Markov diagram for tasks&nbsp; '''(1)''', ... ,&nbsp; '''(5)''']]
 After&nbsp; $\rm A$&nbsp;, &nbsp; $\rm A$&nbsp; and&nbsp; $\rm B$&nbsp; are equally probable.&nbsp; After&nbsp; $\rm B$&nbsp;, &nbsp; $\rm B$&nbsp; occurs much more frequently than&nbsp; $\rm A$&nbsp;.&nbsp; The following applies to the transition probabilities
 :$$p_{\rm A\hspace{0.01cm}|\hspace{0.01cm}A} \hspace{0.1cm} =  \hspace{0.1cm} 1 - p_{\rm B\hspace{0.01cm}|\hspace{0.01cm}A}= 1 - q \hspace{0.15cm} \underline {= 0.5} \hspace{0.05cm},$$
@@ Line 143: / Line 139: @@
 [[File:Inf_A_1_5f_vers2.png|right|frame|Markov diagram for subtask&nbsp; '''(6)''']]
-'''(6)'''&nbsp; With the new set of parameters&nbsp; $(p = 1/4, q = 3/4)$&nbsp; , we obtain for the symbol probabilities:
+'''(6)'''&nbsp; With the new set of parameters&nbsp; $(p = 1/4, q = 3/4)$,&nbsp; we obtain for the symbol probabilities:
 :$$ p_{\rm A} =  1/4, \ p_{\rm B} =  3/4.$$