Difference between revisions of "Aufgaben:Exercise 2.7Z: Huffman Coding for Two-Tuples of a Ternary Source"

Latest revision as of 16:57, 1 November 2022

Huffman tree for
a ternary source

We consider the same situation as in Exercise A2.7:

The Huffman algorithm leads to a better result, i.e. to a smaller average code word length $L_{\rm M}$, if one does not apply it to individual symbols but forms $k$–tuples beforehand.
This increases the symbol set size from $M$ to $M\hspace{0.03cm}' = M^k$.

For the message source considered here, the following applies:

Symbol set size: $M = 3$,
Symbol set: $\{$ $\rm X$, $\rm Y$, $\rm Z$ $\}$,
Probabilities: $p_{\rm X} = 0.7$, $p_{\rm Y} = 0.2$, $p_{\rm Z} = 0.1$,
Entropy: $H = 1.157 \ \rm bit/ternary\hspace{0.12cm}symbol$.

The graph shows the Huffman tree when the Huffman algorithm is applied to single symbols $(k= 1)$.
In subtask (2) you are to give the corresponding Huffman code when two-tuples are formed beforehand $(k=2)$.

Hints:

The exercise belongs to the chapter Entropy Coding according to Huffman.
In particular, reference is made to the page Application of Huffman coding to $k$-tuples.
A comparable task with binary input symbols is dealt with in Exercise 2.7 .
Designate the possible two-tuples with $\rm XX = A$, $\rm XY = B$, $\rm XZ = C$, $\rm YX = D$, $\rm YY = E$, $\rm YZ = F$, $\rm ZX = G$, $\rm ZY = H$, $\rm ZZ = I$.

Questions

$\underline{k=1}\text{:} \hspace{0.25cm}L_{\rm M} \ = \ $

$\ \rm bit/source\hspace{0.12cm}symbol$

$p_{\rm A} = \rm Pr(XX)\ = \ $

$p_{\rm B} = \rm Pr(XY)\ = \ $

$p_{\rm C} = \rm Pr(XZ)\ = \ $

$\underline{k=2}\text{:} \hspace{0.25cm}L_{\rm M} \ = \ $

$\ \rm bit/source\hspace{0.12cm}symbol$

	$L_{\rm M}$ decreases monotonically with increasing $k$.
	$L_{\rm M}$ does not change when $k$ is increased.
	Für $k= 3$ you get $L_{\rm M} = 1.05 \ \rm bit/source\hspace{0.12cm}symbol$.

Solution

(1) The average code word length is with $p_{\rm X} = 0.7$, $L_{\rm X} = 1$, $p_{\rm Y} = 0.2$, $L_{\rm Y} = 2$, $p_{\rm Z} = 0.1$, $L_{\rm Z} = 2$:

$$L_{\rm M} = p_{\rm X} \cdot 1 + (p_{\rm Y} + p_{\rm Z}) \cdot 2 \hspace{0.15cm}\underline{= 1.3\,\,{\rm bit/source\:symbol}}\hspace{0.05cm}. $$

This value is greater than the source entropy $H = 1.157$ bit/source symbol.

(2) There are $M\hspace{0.03cm}' = M^k = 3^2 = 9$ two-tuples with the following probabilities:

Huffman tree for ternary source and two-tuples.

$$p_{\rm A} = \rm Pr(XX) = 0.7 \cdot 0.7\hspace{0.15cm}\underline{= 0.49},$$

$$p_{\rm B} = \rm Pr(XY) = 0.7 \cdot 0.2\hspace{0.15cm}\underline{= 0.14},$$

$$p_{\rm C} = \rm Pr(XZ) = 0.7 \cdot 0.1\hspace{0.15cm}\underline{= 0.07},$$

$$p_{\rm D} = \rm Pr(YX) = 0.2 \cdot 0.7 = 0.14,$$

$$p_{\rm E} = \rm Pr(YY) = 0.2 \cdot 0.2 = 0.04,$$

$$p_{\rm F} = \rm Pr(YZ) = 0.2 \cdot 0.1 = 0.02,$$

$$p_{\rm G} = \rm Pr(ZX) = 0.1 \cdot 0.7 = 0.07,$$

$$p_{\rm H} = \rm Pr(ZY) = 0.1 \cdot 0.2 = 0.02,$$

$$p_{\rm I} = \rm Pr(ZZ) = 0.1 \cdot 0.1 = 0.01.$$

(3) The graph shows the Huffman tree for the application with $k = 2$. Thus we obtain

for the individual two-tuples the following binary codings:

$\rm XX = A$ → 0, $\rm XY = B$ → 111, $\rm XZ = C$ → 1011,

$\rm YX = D$ → 110, $\rm YY = E$ → 1000, $\rm YZ = F$ → 10010,

$\rm ZX = G$ → 1010, $\rm ZY = H$ → 100111, $\rm ZZ =I$ → 100110;

for the average code word length:

$$L_{\rm M}\hspace{0.01cm}' =0.49 \cdot 1 + (0.14 + 0.14) \cdot 3 + (0.07 + 0.04 + 0.07) \cdot 4 + 0.02 \cdot 5 + (0.02 + 0.01) \cdot 6 = 2.33\,\,{\rm bit/two tuples}$$

$$\Rightarrow\hspace{0.3cm}L_{\rm M} = {L_{\rm M}\hspace{0.01cm}'}/{2}\hspace{0.15cm}\underline{ = 1.165\,\,{\rm bit/source\hspace{0.12cm}symbol}}\hspace{0.05cm}.$$

(4) Statement 1 is correct, even if $L_{\rm M}$ decreases very slowly as $k$ increases.

The last statement is false because $L_{\rm M}$ cannot be smaller than $H = 1.157$ bit/source symbol even for $k → ∞$ .
But the second statement is not necessarily correct either: Since $L_{\rm M} > H$ still applies with $k = 2$ , $k = 3$ can lead to a further improvement.

@@ Line 1: / Line 1: @@
-{{quiz-Header|Buchseite=Informationstheorie/Entropiecodierung nach Huffman
+{{quiz-Header|Buchseite=Information_Theory/Entropy_Coding_According_to_Huffman
 }}
-[[File:P_ID2458__Inf_Z_2_7.png|right|frame|Huffman–Baum für <br>eine Ternärquelle]]
+[[File:P_ID2458__Inf_Z_2_7.png|right|frame|Huffman tree for <br>a ternary source]]
-Wir betrachten den gleichen Sachverhalt wie in der&nbsp; [[Aufgaben:2.7_Huffman-Anwendung_für_binäre_Zweiertupel|Aufgabe A2.7]]: &nbsp;
+We consider the same situation as in&nbsp; [[Aufgaben:Exercise_2.7:_Huffman_Application_for_Binary_Two-Tuples|Exercise A2.7]]: &nbsp;
-*Der Huffman&ndash;Algorithmus führt zu einem besseren Ergebnis, das heißt zu einer kleineren mittleren Codewortlänge&nbsp; $L_{\rm M}$, wenn man ihn nicht auf einzelne Symbole anwendet, sondern vorher&nbsp; $k$&ndash;Tupel bildet.&nbsp;
+*The Huffman algorithm leads to a better result, i.e. to a smaller average code word length&nbsp; $L_{\rm M}$, if one does not apply it to individual symbols but forms&nbsp; $k$&ndash;tuples beforehand. &nbsp;
-*Dadurch erhöht man den Symbolumfang von&nbsp; $M$&nbsp; auf $M\hspace{0.03cm}' = M^k$.
+*This increases the symbol set size from&nbsp; $M$&nbsp; to&nbsp; $M\hspace{0.03cm}' = M^k$.
-Für die hier betrachtete Nachrichtenquelle gilt:
+For the message source considered here, the following applies:
-* Symbolumfang: &nbsp; $M = 3$,
+* Symbol set size: &nbsp; $M = 3$,
-* Symbolvorrat: &nbsp; $\{$ $\rm X$,&nbsp; $\rm Y$,&nbsp; $\rm Z$ $\}$,
+* Symbol set: &nbsp; $\{$ $\rm X$,&nbsp; $\rm Y$,&nbsp; $\rm Z$ $\}$,
-* Wahrscheinlichkeiten: &nbsp;  $p_{\rm X} = 0.7$,&nbsp; $p_{\rm Y} = 0.2$,&nbsp; $p_{\rm Z} = 0.1$,
+* Probabilities: &nbsp;  $p_{\rm X} = 0.7$,&nbsp; $p_{\rm Y} = 0.2$,&nbsp; $p_{\rm Z} = 0.1$,
-* Entropie: &nbsp; $H = 1.157 \ \rm  bit/Ternärsymbol$.
+* Entropy: &nbsp; $H = 1.157 \ \rm  bit/ternary\hspace{0.12cm}symbol$.
-Die Grafik zeigt den Huffman&ndash;Baum, wenn man den Huffman&ndash;Algorithmus auf Einzelsymbole anwendet, also den Fall&nbsp; $k= 1$. <br>In der Teilaufgabe&nbsp; '''(2)'''&nbsp; sollen Sie den entsprechenden Huffman&ndash;Code angeben, wenn vorher Zweiertupel gebildet werden&nbsp; $(k=2)$.
+The graph shows the Huffman tree when the Huffman algorithm is applied to single symbols&nbsp; $(k= 1)$. <br>In subtask&nbsp; '''(2)'''&nbsp; you are to give the corresponding Huffman code when two-tuples are formed beforehand&nbsp; $(k=2)$.
+<u>Hints:</u>
+*The exercise belongs to the chapter&nbsp; [[Information_Theory/Entropiecodierung_nach_Huffman|Entropy Coding according to Huffman]].
+*In particular, reference is made to the page&nbsp; [[Information_Theory/Entropiecodierung_nach_Huffman#Application_of_Huffman_coding_to_.7F.27.22.60UNIQ-MathJax168-QINU.60.22.27.7F.E2.80.93tuples|Application of Huffman coding to&nbsp; $k$-tuples]].
+*A comparable task with binary input symbols is dealt with in&nbsp;   [[Aufgaben:Exercise_2.7:_Huffman_Application_for_Binary_Two-Tuples|Exercise 2.7]]&nbsp;.
-''Hinweise:''
+*Designate the possible two-tuples with &nbsp; &nbsp; $\rm XX = A$,&nbsp; &nbsp;$\rm XY = B$,&nbsp; &nbsp;$\rm XZ = C$,&nbsp;&nbsp; $\rm YX = D$,&nbsp; &nbsp;$\rm YY = E$,&nbsp; &nbsp;$\rm YZ = F$,&nbsp; &nbsp;$\rm ZX = G$,&nbsp; &nbsp;$\rm ZY = H$,&nbsp; &nbsp;$\rm ZZ = I$.
-*Die Aufgabe gehört zum  Kapitel&nbsp; [[Information_Theory/Entropiecodierung_nach_Huffman|Entropiecodierung nach Huffman]].
-*Insbesondere wird auf die Seite&nbsp; [[Information_Theory/Entropiecodierung_nach_Huffman#Anwendung_der_Huffman.E2.80.93Codierung_auf_.7F.27.22.60UNIQ-MathJax164-QINU.60.22.27.7F.E2.80.93Tupel|Anwendung der Huffman-Codierung auf&nbsp; $k$-Tupel]]&nbsp; Bezug genommen.
-*Eine vergleichbare Aufgabenstellung mit binären Eingangssymbolen wird in der&nbsp;   [[Aufgaben:2.7_Huffman-Anwendung_für_binäre_Zweiertupel|Aufgabe 2.7]]&nbsp; behandelt.
-*Bezeichnen Sie die möglichen Zweiertupel mit &nbsp; &nbsp; $\rm XX = A$,&nbsp; &nbsp;$\rm XY = B$,&nbsp; &nbsp;$\rm XZ = C$,&nbsp;&nbsp; $\rm YX = D$,&nbsp; &nbsp;$\rm YY = E$,&nbsp; &nbsp;$\rm YZ = F$,&nbsp; &nbsp;$\rm ZX = G$,&nbsp; &nbsp;$\rm ZY = H$,&nbsp; &nbsp;$\rm ZZ = I$.
-===Fragebogen===
+===Questions===
 <quiz display=simple>
-{Wie groß ist die mittlere Codewortlänge, wenn der Huffman&ndash;Algorithmus direkt auf die ternären Quellensymbole&nbsp; $\rm X$,&nbsp; $\rm Y$&nbsp; und&nbsp; $\rm Z$&nbsp; angewendet wird?
+{What is the average code word length when the Huffman algorithm is applied directly to the ternary source symbols&nbsp; $\rm X$,&nbsp; $\rm Y$&nbsp; und&nbsp; $\rm Z$&nbsp;?
 |type="{}"}
-$\underline{k=1}\text{:} \hspace{0.25cm}L_{\rm M} \ = \ $ { 1.3 3% } $\ \rm bit/Quellensymbol$
+$\underline{k=1}\text{:} \hspace{0.25cm}L_{\rm M} \ = \ $ { 1.3 3% } $\ \rm bit/source\hspace{0.12cm}symbol$
-{Wie groß sind hier die Tupel&ndash;Wahrscheinlichkeiten? Insbesondere:
+{What are the tuple probabilities here?&nbsp; In particular:
 |type="{}"}
 $p_{\rm A} = \rm Pr(XX)\ = \ $ { 0.49 3% }
@@ Line 48: / Line 44: @@
-{Wie groß ist die mittlere Codewortlänge, wenn man zuerst Zweiertupel bildet und darauf den Huffman&ndash;Algorithmus  anwendet?
+{What is the average code word length if you first form two-tuples and apply the Huffman algorithm to them?
 |type="{}"}
-$\underline{k=2}\text{:} \hspace{0.25cm}L_{\rm M} \ = \ $ { 1.165 3% } $\ \rm bit/Quellensymbol$
+$\underline{k=2}\text{:} \hspace{0.25cm}L_{\rm M} \ = \ $ { 1.165 3% } $\ \rm bit/source\hspace{0.12cm}symbol$
-{Welche der folgenden Aussagen sind zutreffend, wenn man mehr als zwei Ternärzeichen zusammenfasst&nbsp; $(k>2)$?
+{Which of the following statements are true when more than two ternary symbols are combined&nbsp; $(k>2)$?
 |type="[]"}
-+ $L_{\rm M}$&nbsp; fällt monoton mit steigendem &nbsp;$k$&nbsp; ab.
++ $L_{\rm M}$&nbsp; decreases monotonically with increasing &nbsp;$k$.
-- $L_{\rm M}$&nbsp; ändert sich nicht, wenn man &nbsp;$k$&nbsp; erhöht.
+- $L_{\rm M}$&nbsp; does not change when &nbsp;$k$&nbsp; is increased.
-- Für &nbsp;$k= 3$&nbsp; erhält man &nbsp;$L_{\rm M} = 1.05 \ \rm bit/Quellensymbol$.
+- Für &nbsp;$k= 3$&nbsp; you get &nbsp;$L_{\rm M} = 1.05 \ \rm bit/source\hspace{0.12cm}symbol$.
@@ Line 63: / Line 59: @@
 </quiz>
-===Musterlösung===
+===Solution===
 {{ML-Kopf}}
-'''(1)'''&nbsp; Die mittlere Codewortlänge ergibt sich mit &nbsp;$p_{\rm X} = 0.7$, &nbsp;$L_{\rm X} = 1$, &nbsp;$p_{\rm Y} = 0.2$, &nbsp;$L_{\rm Y} = 2$, &nbsp;$p_{\rm Z} = 0.1$, &nbsp;$L_{\rm Z} = 2$ zu
+'''(1)'''&nbsp; The average code word length is with &nbsp;$p_{\rm X} = 0.7$, &nbsp;$L_{\rm X} = 1$, &nbsp;$p_{\rm Y} = 0.2$, &nbsp;$L_{\rm Y} = 2$, &nbsp;$p_{\rm Z} = 0.1$, &nbsp;$L_{\rm Z} = 2$:
-:$$L_{\rm M} = p_{\rm X} \cdot 1 + (p_{\rm Y} + p_{\rm Z}) \cdot 2 \hspace{0.15cm}\underline{= 1.3\,\,{\rm bit/Quellensymbol}}\hspace{0.05cm}. $$
+:$$L_{\rm M} = p_{\rm X} \cdot 1 + (p_{\rm Y} + p_{\rm Z}) \cdot 2 \hspace{0.15cm}\underline{= 1.3\,\,{\rm bit/source\:symbol}}\hspace{0.05cm}. $$
-*Dieser Wert liegt noch deutlich über der Quellenentropie&nbsp; $H = 1.157$&nbsp; bit/Quellensymbol.
+*This value is greater than the source entropy&nbsp; $H = 1.157$&nbsp; bit/source&nbsp; symbol.
-'''(2)'''&nbsp; Es gibt&nbsp; $M\hspace{0.03cm}' = M^k = 3^2 = 9$&nbsp; Zweiertupel mit folgenden Wahrscheinlichkeiten:
+'''(2)'''&nbsp; There are&nbsp; $M\hspace{0.03cm}' = M^k = 3^2 = 9$&nbsp; two-tuples with the following probabilities:
-[[File:P_ID2459__Inf_Z_2_7c.png|right|frame|Huffman–Baum für Ternärquelle und Zweiertupel]]
+[[File:P_ID2459__Inf_Z_2_7c.png|right|frame|Huffman tree for ternary source and two-tuples.]]
 :$$p_{\rm A} = \rm Pr(XX) = 0.7 \cdot 0.7\hspace{0.15cm}\underline{= 0.49},$$
 :$$p_{\rm B} = \rm Pr(XY) = 0.7 \cdot 0.2\hspace{0.15cm}\underline{= 0.14},$$
@@ Line 85: / Line 81: @@
 <br clear=all>
-'''(3)'''&nbsp; Die Grafik zeigt den Huffman&ndash;Baum für die Anwendung mit $k = 2$.&nbsp; Damit erhält man
+'''(3)'''&nbsp; The graph shows the Huffman tree for the application with $k = 2$.&nbsp; Thus we obtain
-* für die einzelnen Zweiertupels folgende Binärcodierungen: <br>
+* for the individual two-tuples the following binary codings: <br>
 : &nbsp; &nbsp;  $\rm XX = A$ &nbsp; &#8594; &nbsp; '''0''', &nbsp; &nbsp;  $\rm XY = B$ &nbsp; &#8594; &nbsp; '''111''', &nbsp; &nbsp;  $\rm XZ = C$ &nbsp; &#8594; &nbsp; <b>1011</b>,
 : &nbsp; &nbsp;  $\rm YX = D$ &nbsp; &#8594; &nbsp; <b>110</b>, &nbsp; &nbsp;  $\rm YY = E$ &nbsp; &#8594; &nbsp; <b>1000</b>, &nbsp; &nbsp;  $\rm YZ = F$ &nbsp; &#8594; &nbsp; <b>10010</b>,
 : &nbsp; &nbsp;  $\rm ZX = G$ &nbsp; &#8594; &nbsp; <b>1010</b>, &nbsp; &nbsp;  $\rm ZY = H$ &nbsp; &#8594; &nbsp; <b>100111</b>, &nbsp; &nbsp;  $\rm ZZ =I$ &nbsp; &#8594; &nbsp; <b>100110</b>;
-* für die mittlere Codewortlänge:
+* for the average code word length:
-:$$L_{\rm M}\hspace{0.01cm}' =0.49 \cdot 1 + (0.14 + 0.14) \cdot 3 + (0.07 + 0.04 + 0.07) \cdot 4 + 0.02 \cdot 5 + (0.02 + 0.01) \cdot 6 = 2.33\,\,{\rm bit/Zweiertupel}$$
+:$$L_{\rm M}\hspace{0.01cm}' =0.49 \cdot 1 + (0.14 + 0.14) \cdot 3 + (0.07 + 0.04 + 0.07) \cdot 4 + 0.02 \cdot 5 + (0.02 + 0.01) \cdot 6 = 2.33\,\,{\rm bit/two tuples}$$
-:$$\Rightarrow\hspace{0.3cm}L_{\rm M} = {L_{\rm M}\hspace{0.01cm}'}/{2}\hspace{0.15cm}\underline{  = 1.165\,\,{\rm bit/Quellensymbol}}\hspace{0.05cm}.$$
+:$$\Rightarrow\hspace{0.3cm}L_{\rm M} = {L_{\rm M}\hspace{0.01cm}'}/{2}\hspace{0.15cm}\underline{  = 1.165\,\,{\rm bit/source\hspace{0.12cm}symbol}}\hspace{0.05cm}.$$
-'''(4)'''&nbsp; Richtig ist die <u>Aussage 1</u>, auch wenn&nbsp; $L_{\rm M}$&nbsp; mit wachsendem&nbsp; $k$&nbsp; nur sehr langsam abfällt.
+'''(4)'''&nbsp;<u>Statement 1</u> is correct,&nbsp; even if&nbsp; $L_{\rm M}$&nbsp;  decreases very slowly as&nbsp; $k$&nbsp; increases.
-* Die letzte Aussage ist falsch, da&nbsp; $L_{\rm M}$&nbsp; auch für&nbsp; $k &#8594; &#8734;$&nbsp; nicht kleiner sein kann als&nbsp; $H = 1.157$&nbsp; bit/Quellensymbol.
+* The last statement is false because&nbsp; $L_{\rm M}$&nbsp; cannot be smaller than&nbsp; $H = 1.157$&nbsp; bit/source&nbsp; symbol even for &nbsp; $k &#8594; &#8734;$&nbsp; .
-* Aber auch die zweite Aussage ist nicht unbedingt richtig: &nbsp; Da mit&nbsp; $k = 2$&nbsp; weiterhin&nbsp; $L_{\rm M} > H$&nbsp; gilt, kann&nbsp; $k = 3$&nbsp; zu einer weiteren Verbesserung führen.
+* But the second statement is not necessarily correct either: &nbsp; Since&nbsp; $L_{\rm M} > H$&nbsp; still applies with&nbsp; $k = 2$&nbsp;,&nbsp; $k = 3$&nbsp; can lead to a further improvement.
 {{ML-Fuß}}
-[[Category:Information Theory: Exercises|^2.3 Entropiecodierung nach Huffman^]]
+[[Category:Information Theory: Exercises|^2.3 Entropy Coding according to Huffman^]]