Difference between revisions of "Aufgaben:Exercise 3.10Z: BSC Channel Capacity"

Latest revision as of 16:07, 27 September 2021

Entropies of "BC" and "BSC"

The channel capacity $C$ was defined by Claude E. Shannon as the maximum mutual information, whereby the maximization refers solely to the source statistics:

$$ C = \max_{P_X(X)} \hspace{0.15cm} I(X;Y) \hspace{0.05cm}$$

In the binary channel with the probability function $P_X(X) = \big [p_0, \ p_1 \big]$ only one parameter is optimizable, for example $p_0$. The probability for a $1$ is thus also fixed: $p_1 = 1 - p_0.$

The upper graph (reddish background) summarises the results for the asymmetric binary channel with $ε_0 = 0.01$ and $ε_1 = 0.2$ , which was also considered in the theory section.

The maximization leads to the result $p_0 = 0.55$ ⇒ $p_1 = 0.45$, and one obtains for the channel capacity:

$$C_{\rm BC} = \hspace{-0.05cm} \max_{P_X(X)} \hspace{0.1cm} I(X;Y) \big |_{p_0 \hspace{0.05cm} = \hspace{0.05cm}0.55} \hspace{0.05cm}=\hspace{0.05cm} 0.5779\,{\rm bit} \hspace{0.05cm}.$$

In the lower graph (blue background), the same information-theoretical quantities are given for the Binary Symmetric Channel $\rm (BSC)$ with the falsification probabilities $ε_0 = ε_1 = ε = 0.1$, which was also assumed for Exercise 3.10.

In the present exercise you are

to analyze the entropies $H(X)$, $H(Y)$, $H(X|Y)$ and $H(Y|X)$,
to optimize the source parameter $p_0$ with respect to maximum mutual information $I(X; Y)$ ,
to determine the channel capacity $C(ε)$ ,
to give a closed equation for $C(ε)$ by generalization for the BSC channel model $($initially for $ε = 0.1)$.

Hints:

The exercise belongs to the chapter Application to digital signal transmission.
Reference is made in particular to the page Channel capacity of a binary channel.

Questions

	The equivocation results in $H(X\|Y) = H_{\rm bin}(ε)$.
	The irrelevance results in $H(Y\|X) = H_{\rm bin}(ε)$.
	The irrelevance results in $H(Y\|X) = H_{\rm bin}(p_0)$.

	The channel capacity is equal to the maximum mutual information.
	For the BSC, maximization leads to the result $p_0 = p_1 = 0.5$.
	For $p_0 = p_1 = 0.5$ , $H(X) = H(Y) = 1 \ \rm bit$.

$ε = 0.0\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $

$\ \rm bit$

$ε = 0.1\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $

$\ \rm bit$

$ε = 0.5\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $

$\ \rm bit$

$ε = 1.0\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $

$\ \rm bit$

$ε = 0.9\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $

$\ \rm bit$

Solution

(1) The proposed solution 2 is correct, as the following calculation shows:

$$H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X) = p_0 \cdot (1 - \varepsilon) \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{1 - \varepsilon} + p_0 \cdot \varepsilon \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{\varepsilon} +p_1 \cdot \varepsilon \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{\varepsilon} + p_1 \cdot (1 - \varepsilon) \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{1 - \varepsilon} $$

$$\Rightarrow \hspace{0.3cm} H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X) = (p_0 + p_1) \cdot \left [ \varepsilon \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{\varepsilon} + (1 - \varepsilon) \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{1 - \varepsilon} \right ] \hspace{0.05cm}.$$

With $p_0 + p_1 = 1$ and the binary entropy function $H_{\rm bin}$, one obtains the proposed result: $H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X) = H_{\rm bin}(\varepsilon)\hspace{0.05cm}.$
For $ε = 0.1$ we get $H(Y|X) = 0.4690 \ \rm bit$. The same value stands for $p_0=0.50$ in the given table.
From the table one can also see that for the BSC model (blue background) as well as for the more general (asymmetric) BC model (red background)
the equivocation $H(X|Y)$ depends on the source symbol probabilities $p_0$ and $p_1$. It follows that proposed solution 1 cannot be correct.
The irrelevance $H(Y|X)$ is independent of the source statistics, so that solution proposal 3 can also be excluded.

(2) All given alternative solutions are correct:

The channel capacity is defined as the maximum mutual information, where the maximization has to be done with respect to $P_X = (p_0, p_1)$ :

$$C = \max_{P_X(X)} \hspace{0.15cm} I(X;Y) \hspace{0.05cm}.$$

The equation is generally valid, i.e. also for the unbalanced binary channel highlighted in red.
The mutual information can be calculated, for example, as $I(X;Y) = H(Y) - H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X)\hspace{0.05cm}$,
where according to subtask (1) the term $H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X)\hspace{0.05cm}$ is independent of $p_0$ or $p_1 = 1- p_0$ .
The maximum mutual information thus results exactly when the sink entropy $H(Y)$ is maximum. This is the case for $p_0 = p_1 = 0.5$:

$$H(X) = H(Y) = 1 \ \rm bit.$$

Binary entropy function and BSC channel capacity

(3) According to the subtasks (1) and (2) one obtains for the BSC channel capacity:

$$C = \max_{P_X(X)} \hspace{0.15cm} I(X;Y) \hspace{0.05cm}.$$

The graph shows the binary entropy function on the left and the channel capacity on the right. One obtains:

for $ε = 0.0$ (error-free channel):
$C = 1\ \rm (bit)$ ⇒ dot with yellow filling,
for $ε = 0.1$ (considered so far):
$C = 0.531\ \rm (bit)$ ⇒ dot with green filling,
for $ε = 0.5$ (completely disturbed):
$C = 0\ \rm (bit)$ ⇒ dot with grey filling.

(4) From the graph one can see that from an information-theoretical point of view $ε = 1$ is identical to $ε = 0$ :

$$C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}1} \hspace{0.05cm}= C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}0} \hspace{0.15cm} \underline {=1\,{\rm (bit)}} \hspace{0.05cm}.$$

The channel only carries out a renaming here. This is called "mapping".
Every $0$ becomes a $1$ and every $1$ becomes a $0$. Accordingly:

$$C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}0.9} \hspace{0.05cm}= C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}0.1} \hspace{0.15cm} \underline {=0.531\,{\rm (bit)}} \hspace{0.05cm}$$

@@ Line 3: / Line 3: @@
 }}
-[[File:P_ID2789__Inf_Z_3_9.png|right|]]
+[[File:EN_Inf_Z_3_9_A.png|right|frame|Entropies of&nbsp; "BC"&nbsp; and&nbsp; "BSC"]]
-Die Kanalkapazität $C$ wurde von Claude $E$. Shannon als die maximale Transinformation definiert, wobei sich die Maximierung allein auf die Quellenstatistik bezieh
+The channel capacity&nbsp; $C$&nbsp; was defined by&nbsp; [https://en.wikipedia.org/wiki/Claude_Shannon Claude E. Shannon]&nbsp; as the maximum mutual information, whereby the maximization refers solely to the source statistics:
-$$ C = \max_{P_X(X)} \hspace{0.15cm} I(X;Y) \hspace{0.05cm}$$
+:$$ C = \max_{P_X(X)} \hspace{0.15cm} I(X;Y) \hspace{0.05cm}$$
-Beim Binärkanal mit der Wahrscheinlichkeitsfunktion $P_X(X) = [p_0, p:1]$ ist nur ein Parameter optimierbar, beispielsweise $p_0$. Die Wahrscheinlichkeit für eine $„1”$ ist damit ebenfalls festgelegt:   $p_1 = 1 – p_0$
+In the binary channel with the probability function&nbsp; $P_X(X) = \big [p_0, \ p_1 \big]$&nbsp; only one parameter is optimizable, for example &nbsp; $p_0$.&nbsp; The probability for a&nbsp; $1$&nbsp; is thus also fixed: &nbsp;  $p_1 = 1 - p_0.$
-Die obere Grafik (rot hinterlegt) fasst die Ergebnisse für den [http://en.lntwww.de/Informationstheorie/Anwendung_auf_die_Digitalsignal%C3%BCbertragung#Kanalkapazit.C3.A4t_eines_Bin.C3.A4rkanals unsymmetrischen Binärkanal] mit $ε_0 = 0.01$ und $ε_1 = 0.2$ zusammen, der im Theorieteil betrachtet wurde. Die Maximierung führt zum Ergebnis $p_0 = 0.55$
+The upper graph (reddish background) summarises the results for the&nbsp; [[Information_Theory/Anwendung_auf_die_Digitalsignalübertragung#Channel_capacity_of_a_binary_channel|asymmetric binary channel]]&nbsp; with&nbsp; $ε_0 = 0.01$&nbsp; and&nbsp; $ε_1 = 0.2$&nbsp;, which was also considered in the theory section.
-$\Rightarrow p_1 = 0.45$,  und man erhält für die Kanalkapazität:
+The maximization leads to the result&nbsp; $p_0 = 0.55$ &nbsp; &rArr; &nbsp; $p_1 = 0.45$,&nbsp; and one obtains for the channel capacity:
-$$C_{\rm BC} = \hspace{-0.05cm} \max_{P_X(X)} \hspace{0.1cm} I(X;Y) \big |_{p_0 \hspace{0.05cm} = \hspace{0.05cm}0.55} \hspace{0.05cm}=\hspace{0.05cm} 0.5779\,{\rm bit} \hspace{0.05cm}$$
+:$$C_{\rm BC} = \hspace{-0.05cm} \max_{P_X(X)} \hspace{0.1cm} I(X;Y) \big |_{p_0 \hspace{0.05cm} = \hspace{0.05cm}0.55} \hspace{0.05cm}=\hspace{0.05cm} 0.5779\,{\rm bit} \hspace{0.05cm}.$$
-In der unteren Grafik (blaue Hinterlegung) sind die gleichen informationstheoretischen Größen für den symmetrischen Kanal $\Rightarrow$ [http://en.lntwww.de/index.php?title=Digitalsignal%C3%BCbertragung/Binary_Symmetric_Channel_(BSC)&action=edit&redlink=1 Binary Symmetric Channel] (BSC) mit den Verfälschungswahrscheinlichkeiten $ε1 = ε2 = ε = 0.1$ angegeben, der auch für die [http://en.lntwww.de/Aufgaben:3.09_Transinformation_beim_BSC Aufgabe A3.9] vorausgesetzt wurde.
+In the lower graph (blue background), the same information-theoretical quantities are given for the&nbsp; [[Channel_Coding/Kanalmodelle_und_Entscheiderstrukturen#Binary_Symmetric_Channel_.E2.80.93_BSC|Binary Symmetric Channel]]&nbsp;  $\rm (BSC)$&nbsp; with the falsification probabilities&nbsp; $ε_0 = ε_1 = ε = 0.1$,&nbsp; which was also assumed for&nbsp; [[Aufgaben:Exercise_3.10:_Mutual_Information_at_the_BSC| Exercise 3.10]].
-In der vorliegenden Aufgabe sollen Sie für das BSC–Kanalmodell (zunächst für $ε = 0\ cdot 1$)
+In the present exercise you are
-:* die Entropien $H(X)$, $H(Y)$, $H(X|Y)$, $H(Y|X)$ analysieren,
+:* to analyze the entropies&nbsp; $H(X)$,&nbsp; $H(Y)$,&nbsp; $H(X|Y)$&nbsp; and&nbsp; $H(Y|X)$,
-:* den Quellenparameter $p_0$ hinsichtlich maximaler Transinformation $I(X; Y)$ optimieren,
+:* to optimize the source parameter &nbsp;$p_0$&nbsp; with respect to maximum mutual information &nbsp;$I(X; Y)$&nbsp;,
-:* somit die Kanalkapazität $C(ε)$  bestimmen, sowie
+:* to determine the channel capacity &nbsp;$C(ε)$&nbsp;,
-:* durch Verallgemeinerung eine geschlossene Gleichung für $C(ε)$ angeben.
+:* to give a closed equation for &nbsp;$C(ε)$&nbsp; by generalization for the BSC channel model&nbsp; $($initially for&nbsp; $ε = 0.1)$.
-'''Hinweis:'''  Die Aufgabe bezieht sich auf die Thematik von [http://en.lntwww.de/Informationstheorie/Anwendung_auf_die_Digitalsignal%C3%BCbertragung Kapitel 3.3]
-===Fragebogen===
+Hints:
+*The exercise belongs to the chapter&nbsp; [[Information_Theory/Anwendung_auf_die_Digitalsignalübertragung|Application to digital signal transmission]].
+*Reference is made in particular to the page&nbsp;    [[Information_Theory/Application_to_Digital_Signal_Transmission#Channel_capacity_of_a_binary_channel|Channel capacity of a binary channel]].
-<quiz display=simple>
-{Berechnen Sie die Quellenentropie allgemein und für $p_0 = 0.4.$
-|type="{}"}
-$p_0 = 0.4:   H(X)$ = { 0.971 3% } $bit$
-{Berechnen Sie die Sinkenentropie allgemein und für $p_0 = 0.4.$
+===Questions===
-|type="{}"}
-$p_0 = 0 \cdot 4:   H(Y)$ = { 0.881 3% }  $bit$
-{Berechnen Sie die Verbundentropie allgemein und für $p_0 = 0.4.$
+<quiz display=simple>
-|type="{}"}
+{ Which statements are true for the conditional entropies in the BSC model?
-$p_0 = 0\cdot 4:   H(XY)$ = { 1.571 3% } $bit$
+|type="[]"}
+- The equivocation results in&nbsp; $H(X|Y) = H_{\rm bin}(ε)$.
++ The irrelevance results in&nbsp; $H(Y|X) = H_{\rm bin}(ε)$.
+- The irrelevance results in&nbsp; $H(Y|X) = H_{\rm bin}(p_0)$.
-{Berechnen Sie die Transinformation allgemein und für $p_0 = 0.4.$
+{Which statements are true for the channel capacity&nbsp; $C_{\rm BSC}$&nbsp; of the BSC model?
-|type="{}"}
+|type="[]"}
-$p_0 = 0\cdot 4:   I(X; Y)$ = { 0.281 3% } $bit$
++ The channel capacity is equal to the maximum mutual information.
++ For the BSC, maximization leads to the result &nbsp;$p_0 = p_1 = 0.5$.
++ For&nbsp; $p_0 = p_1 = 0.5$&nbsp; &nbsp;,&nbsp; $H(X) = H(Y) = 1 \ \rm bit$.
-{Welche Wahrscheinlichkeit $p_0$ führt zur Kanalkapazität $C$?
+{Which channel capacity&nbsp; $C_{\rm BSC}$&nbsp; results depending on the BSC parameter&nbsp; $ε$?
 |type="{}"}
-$ Maximierung:   p_0$ = { 0.6 3% }
+$ε = 0.0\text{:} \hspace{0.5cm}  C_{\rm BSC} \ = \ $ { 1 1% } $\ \rm bit$
+$ε = 0.1\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $ { 0.531 1% } $\ \rm bit$
+$ε = 0.5\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $ { 0. } $\ \rm bit$
-{Wie groß ist die Kanalkapazität des vorliegenden Kanals?
+{Which channel capacity&nbsp; $C_{\rm BSC}$&nbsp; results depending on the BSC parameter &nbsp; $ε$?
 |type="{}"}
-$C$ = { 0.322 3% }   $bit$
+$ε = 1.0\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $ { 1 1% } $\ \rm bit$
+$ε = 0.9\text{:} \hspace{0.5cm} C_{\rm BSC} \ = \ $ { 0.531 1% } $\ \rm bit$
-{Wie groß sind die bedingten Entropien?
-|type="{}"}
-$p_0 gemäß (e):   H(X|Y)$ = { 0.649 3% }   $bit$
-$H(Y|X)$ = { 0.4 3% } $bit$
 </quiz>
-===Musterlösung===
+===Solution===
 {{ML-Kopf}}
-'''1.'''
+'''(1)'''&nbsp; The&nbsp; <u>proposed solution 2</u> is correct, as the following calculation shows:
-'''2.'''
+:$$H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X) = p_0 \cdot (1 - \varepsilon) \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{1 - \varepsilon} + p_0 \cdot \varepsilon \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{\varepsilon} +p_1 \cdot \varepsilon \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{\varepsilon} + p_1 \cdot (1 - \varepsilon) \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{1 - \varepsilon} $$
-'''3.'''
+:$$\Rightarrow \hspace{0.3cm} H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X) =  (p_0 + p_1) \cdot \left [ \varepsilon \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{\varepsilon} + (1 - \varepsilon) \cdot {\rm log}_2 \hspace{0.1cm} \frac{1}{1 - \varepsilon} \right ] \hspace{0.05cm}.$$
-'''4.'''
+*With&nbsp; $p_0 + p_1 = 1$&nbsp;  and the binary entropy function&nbsp; $H_{\rm bin}$,&nbsp;  one obtains the proposed result: &nbsp; $H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X) = H_{\rm bin}(\varepsilon)\hspace{0.05cm}.$
-'''5.'''
+*For&nbsp; $ε = 0.1$&nbsp; we get&nbsp; $H(Y|X) = 0.4690 \ \rm bit$.&nbsp; The same value stands for&nbsp; $p_0=0.50$&nbsp; in the given table.
-'''6.'''
+*From the table one can also see that for the BSC model (blue background) as well as for the more general (asymmetric) BC model (red background) <br>the equivocation&nbsp; $H(X|Y)$&nbsp; depends on the source symbol probabilities&nbsp; $p_0$&nbsp; and&nbsp; $p_1$.&nbsp; It follows that proposed solution 1 cannot be correct.
-'''7.'''
+*The irrelevance&nbsp; $H(Y|X)$&nbsp; is independent &nbsp; of the source statistics, so that solution proposal 3 can also be excluded.
+'''(2)'''&nbsp;  <u>All given alternative solutions</u> are correct:
+*The channel capacity is defined as the maximum mutual information, where the maximization has to be done with respect to&nbsp; $P_X = (p_0, p_1)$&nbsp;:
+:$$C = \max_{P_X(X)} \hspace{0.15cm} I(X;Y) \hspace{0.05cm}.$$
+*The equation is generally valid, i.e. also for the unbalanced binary channel highlighted in red.
+*The mutual information can be calculated, for example, as&nbsp; $I(X;Y) = H(Y) - H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X)\hspace{0.05cm}$,&nbsp; <br>where according to subtask&nbsp; '''(1)'''&nbsp; the term&nbsp; $H(Y \hspace{-0.1cm}\mid \hspace{-0.1cm} X)\hspace{0.05cm}$&nbsp; is independent of&nbsp; $p_0$&nbsp; or&nbsp; $p_1 = 1- p_0$&nbsp;.
+*The maximum mutual information thus results exactly when the sink entropy&nbsp; $H(Y)$&nbsp; is maximum.&nbsp; This is the case for&nbsp; $p_0 = p_1 = 0.5$:
+:$$H(X) = H(Y) =  1 \ \rm bit.$$
+[[File:P_ID2790__Inf_Z_3_9_B.png|frame|Binary entropy function and BSC channel capacity]]
+'''(3)'''&nbsp;  According to the subtasks&nbsp; '''(1)'''&nbsp; and&nbsp; '''(2)'''&nbsp; one obtains for the BSC channel capacity:
+:$$C = \max_{P_X(X)} \hspace{0.15cm}  I(X;Y)  \hspace{0.05cm}.$$
+The graph shows the binary entropy function on the left and the channel capacity on the right.&nbsp; One obtains:
+* for&nbsp; $ε = 0.0$&nbsp; (error-free channel): <br> &nbsp; &nbsp; $C = 1\ \rm  (bit)$  &nbsp; &rArr; &nbsp; dot with yellow filling,
+* for&nbsp; $ε = 0.1$&nbsp; (considered so far): <br> &nbsp; &nbsp; $C = 0.531\ \rm  (bit)$  &nbsp; &rArr; &nbsp;   dot with green filling,
+* for&nbsp; $ε = 0.5$&nbsp; (completely disturbed): <br> &nbsp; &nbsp; $C = 0\ \rm  (bit)$   &nbsp; &rArr; &nbsp;  dot with grey filling.
+'''(4)'''&nbsp;  From the graph one can see that from an information-theoretical point of view&nbsp; $ε = 1$&nbsp; is identical to&nbsp; $ε = 0$&nbsp;:
+:$$C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}1} \hspace{0.05cm}= C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}0} \hspace{0.15cm} \underline {=1\,{\rm (bit)}} \hspace{0.05cm}.$$
+*The channel only carries out a renaming here.&nbsp; This is called&nbsp; "mapping".
+*Every&nbsp; $0$&nbsp; becomes a&nbsp; $1$&nbsp; and every&nbsp; $1$&nbsp; becomes a&nbsp; $0$.&nbsp; Accordingly:
+:$$C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}0.9} \hspace{0.05cm}= C_{\rm BSC} \big |_{\hspace{0.05cm}\varepsilon \hspace{0.05cm} = \hspace{0.05cm}0.1} \hspace{0.15cm} \underline {=0.531\,{\rm (bit)}} \hspace{0.05cm}$$
 {{ML-Fuß}}
-[[Category:Aufgaben zu Informationstheorie|^3.3 Anwendung auf die Digitalsignalübertragung^]]
+[[Category:Information Theory: Exercises|^3.3 Application to Digital Signal Transmission^]]