Difference between revisions of "Aufgaben:Exercise 2.2: Kraft–McMillan Inequality"

Four examples of binary codes and in addition three ternary codes

In the figure some exemplary binary and ternary codes are given.

In binary code $\rm B1$ all possible source symbols $q_\mu$ $($with run index $\mu = 1$, ... , $8)$ are each represented by a code symbol sequence $\langle c_\mu \rangle $ of uniform length $L_\mu = 3$ .
This code is unsuitable for data compression for this reason.

The possibility of data compression arises only when

the $M$ Quellensymbole nicht gleichwahrscheinlich, und
die Länge $L_\mu$ der Codeworte unterschiedlich sind.

For example, the binary code $\rm B2$ has this property:

Here, one codeword each has the length $1$, $2$ and $3$, respectively $(N_1 = N_2 = N_3 = 1)$.
Two code words have the length $L_\mu = 4$ $(N_4 = N_5 = 2)$.

A prerequisite for the decodability of such a code is that the code is prefix-free .

That is, no codeword may be the prefix (i.e., the beginning) of a longer codeword.

A necessary condition for a code to be prefix-free for data compression was stated by Leon Kraft in 1949, called Kraft's inequality:

$$\sum_{\mu=1}^{M} \hspace{0.2cm} D^{-L_{\mu}} \le 1 \hspace{0.05cm}.$$

Here denote

$M$ the number of possible source symbols $q_\mu$,
$L_\mu$ the length of the codeword $q_\mu$ associated with the source symbol $c_\mu$,
$D = 2$ denotes a binary code $(\rm 0$ or $\rm 1)$ and $D = 3$ denotes a ternary code $(\rm 0$, $\rm 1$, $\rm 2)$.

A code can be prefix-free only if Kraft's inequality is satisfied. The converse does not hold: if Kraft's inequality is satisfied, it does not mean that this code is actually prefix-free.

Hint:

The task belongs to the chapter General description of source coding.

Question

Musterlösung

Solution

(1) Richtig sind die Lösungsvorschläge 1, 2 und 3. Für die angegebenen Binärcodes gilt:

$\rm B1$: $8 \cdot 2^{-3} = 1$ ⇒ Bedingung erfüllt,
$\rm B2$: $1 \cdot 2^{-1} + 1 \cdot 2^{-2} + 1 \cdot 2^{-3} + 2 \cdot 2^{-4}= 1$ ⇒ Bedingung erfüllt,
$\rm B3$: $1 \cdot 2^{-1} + 1 \cdot 2^{-2} + 1 \cdot 2^{-3} + 2 \cdot 2^{-4}= 1$ ⇒ Bedingung erfüllt,
$\rm B4$: $1 \cdot 2^{-1} + 1 \cdot 2^{-2} + 2 \cdot 2^{-3} + 1 \cdot 2^{-4}= 17/16$ ⇒ Bedingung nicht erfüllt.

(2) Richtig sind die Lösungsvorschläge 1 und 2:

Der Code $\rm B4$, der die Kraftsche Ungleichung nicht erfüllt, ist mit Sicherheit auch nicht präfixfrei.
Aber bei Erfüllung der Kraftschen Ungleichung ist noch nicht sicher, dass dieser Code auch präfixfrei ist.
Beim Code $\rm B3$ ist "10" der Beginn des Codewortes "1011".
Dagegen sind die Codes $\rm B1$ und $\rm B2$ tatsächlich präfixfrei.

(3) Richtig sind die Antworten 1 und 3:

Die Kraftsche Ungleichung wird von allen drei Codes erfüllt.
Wie aus der Tabelle hervorgeht, sind die Codes $\rm T1$ und $\rm T3$ tatsächlich präfixfrei.
Der Code $\rm T2$ ist dagegen nicht präfixfrei, da "1" der Beginn des Codewortes "10" ist.

(4) $N_i$ gibt an, wieviele Codeworte mit $i$ Symbolen es im Code gibt. Für den Code $\rm T1$ gilt:

$$N_1 \hspace{0.15cm}\underline{= 1}\hspace{0.05cm}, \hspace{0.2cm}N_2 \hspace{0.15cm}\underline{= 2}\hspace{0.05cm}, \hspace{0.2cm}N_3 \hspace{0.15cm}\underline{= 6}\hspace{0.05cm}.$$

(5) Nach der Kraftschen Ungleichung muss gelten

$$N_1 \cdot 3^{-1} + N_2 \cdot 3^{-2} + N_3 \cdot 3^{-3 } \le 1\hspace{0.05cm}.$$

Bei gegebenem $N_1 = 1$ und $N_2 = 2$ wird dies erfüllt, solange gilt:

$$N_3 \cdot 3^{-3 } \le 1 - 1/3 - 2/9 = 4/9 \hspace{0.3cm} \Rightarrow \hspace{0.3cm}N_3 \le 12 \hspace{0.3cm} \Rightarrow \hspace{0.3cm} {\rm \Delta}\,N_3 \hspace{0.15cm}\underline{= 6}\hspace{0.05cm}.$$

Die zusätzlichen Codeworte sind $\rm 210, \,211, \,212, \,220, \,221, \,222$.

(6) Für den Code $\rm T3$ gilt:

$S({\rm T3})= 2 \cdot 3^{-1} + 2 \cdot 3^{-2} + 1 \cdot 3^{-3 } = {25}/{27}\hspace{0.05cm}.$
Wegen $S({\rm T3}) \le 1$ erfüllt der Ternärcode $\rm T3$ die Kraftsche Ungleichung und er ist zudem auch präfixfrei.

Betrachten wir nun die vorgeschlagenen neuen Codes.

Code $\rm T4$ $(N_1 = 2, \ N_2 = 2, \ N_3 = 5)$:

$$S({\rm T4})= S({\rm T3}) + 4 \cdot 3^{-3 } = {29}/{27}\hspace{0.1cm} > \hspace{0.1cm}1\hspace{0.3cm} \Rightarrow \hspace{0.3cm} {\rm T4 \hspace{0.15cm}ist\hspace{0.15cm} ungeeignet}\hspace{0.05cm},$$

Code $\rm T5$ $(N_1 = 2, \ N_2 = 2, \ N_3 = 1, \ N_4 = 4)$:

$$S({\rm T5})= S({\rm T3}) + 4 \cdot 3^{-4 } = {79}/{81}\hspace{0.1cm} < \hspace{0.1cm}1\hspace{0.3cm} \Rightarrow \hspace{0.3cm} {\rm T5 \hspace{0.15cm}ist\hspace{0.15cm} geeignet}\hspace{0.05cm},$$

Code $\rm T6$ $(N_1 = 2, \ N_2 = 2, \ N_3 = 2, \ N_4 = 3)$:

$$S({\rm T6})= S({\rm T3}) + 1 \cdot 3^{-3 } + 3 \cdot 3^{-4 } = \frac{75 + 3 + 3}{81}\hspace{0.1cm} = \hspace{0.1cm}1\hspace{0.3cm} \Rightarrow \hspace{0.3cm} {\rm T6 \hspace{0.15cm}ist\hspace{0.15cm} geeignet}\hspace{0.05cm}.$$

Richtig sind also die zwei letzten Lösungsvorschläge. Beispielsweise lauten die insgesamt $N = 9$ Codeworte des präfixfreien Codes $\rm T6$:

$$\rm 0, \, 1, \, 20, \,21, \,220, \,221, \,2220, \, 2221 , \,2222.$$

@@ Line 3: / Line 3: @@
 }}
-[[File:P_ID2416__Inf_A_2_2.png|right|frame|Vier Beispiele für Binärcodes und dazu drei Ternärcodes]]
+[[File:P_ID2416__Inf_A_2_2.png|right|frame|Four examples of binary codes and in addition three ternary codes]]
-In der Abbildung sind einige beispielhafte Binär&ndash; und Ternärcodes angegeben.
+In the figure some exemplary binary and ternary codes are given.
-*Beim Binärcode&nbsp; $\rm B1$&nbsp; werden alle möglichen Quellensymbole&nbsp; $q_\mu$&nbsp; $($mit Laufindex&nbsp; $\mu = 1$, ... , $8)$&nbsp; durch jeweils eine Codesymbolfolge&nbsp; $\langle c_\mu \rangle $&nbsp; einheitlicher Länge&nbsp; $L_\mu = 3$&nbsp; dargestellt.
+*In binary code&nbsp; $\rm B1$&nbsp; all possible source symbols&nbsp; $q_\mu$&nbsp; $($with run index&nbsp; $\mu = 1$, ... , $8)$&nbsp; are each represented by a code symbol sequence&nbsp; $\langle c_\mu \rangle $&nbsp; of uniform length&nbsp; $L_\mu = 3$&nbsp;.
-*Dieser Code ist aus diesem Grund zur Datenkomprimierung ungeeignet.
+*This code is unsuitable for data compression for this reason.
-Die Möglichkeit zur Datenkomprimierung ergibt sich erst dann, wenn
+The possibility of data compression arises only when
-* die&nbsp; $M$&nbsp; Quellensymbole nicht gleichwahrscheinlich, und
+* the&nbsp; $M$&nbsp; Quellensymbole nicht gleichwahrscheinlich, und
 *die Länge&nbsp; $L_\mu$&nbsp; der Codeworte unterschiedlich sind.
-Diese Eigenschaft weist zum Beispiel der Binärcode&nbsp;  $\rm B2$&nbsp; auf: &nbsp;
+For example, the binary code&nbsp;  $\rm B2$&nbsp; has this property:   &nbsp;
-*Je ein Codewort hat hier die Länge&nbsp; $1$,&nbsp; $2$&nbsp; bzw.&nbsp; $3$&nbsp; $(N_1 = N_2 = N_3 = 1)$.
+*Here, one codeword each has the length &nbsp; $1$,&nbsp; $2$&nbsp; and&nbsp; $3$, respectively&nbsp; $(N_1 = N_2 = N_3 = 1)$.
-*Zwei Codeworte haben die Länge&nbsp; $L_\mu = 4$&nbsp; $(N_4 = N_5 = 2)$.
+*Two code words have the length&nbsp; $L_\mu = 4$&nbsp; $(N_4 = N_5 = 2)$.
-Voraussetzung für die Decodierbarkeit eines solchen Codes ist, dass der Code&nbsp; '''präfixfrei'''&nbsp; ist.&nbsp;
+A prerequisite for the decodability of such a code is that the code is&nbsp; '''prefix-free'''&nbsp;.&nbsp;
-*Das heißt, dass kein Codewort der Präfix (also der Beginn) eines längeren Codewortes sein darf.
+*That is, no codeword may be the prefix (i.e., the beginning) of a longer codeword.
-*Eine notwendige Bedingung dafür, dass ein Code zur Datenkomprimierung  präfixfrei sein kann, wurde 1949 von Leon Kraft angegeben, die so genannte&nbsp;  '''Kraftsche Ungleichung''':
+*A necessary condition for a code to be prefix-free for data compression was stated by Leon Kraft in 1949, called&nbsp;  '''Kraft's inequality''':
 :$$\sum_{\mu=1}^{M} \hspace{0.2cm} D^{-L_{\mu}} \le 1 \hspace{0.05cm}.$$
-Hierbei bezeichnen
+Here denote
-* $M$&nbsp; die Anzahl der möglichen Quellensymbole&nbsp; $q_\mu$,
+* $M$&nbsp; the number of possible source symbols&nbsp; $q_\mu$,
-* $L_\mu$&nbsp; die Länge des zum Quellensymbol&nbsp; $q_\mu$&nbsp; gehörigen Codewortes&nbsp;  $c_\mu$,
+* $L_\mu$&nbsp; the length of the codeword&nbsp; $q_\mu$&nbsp; associated with the source symbol&nbsp;  $c_\mu$,
-* $D = 2$&nbsp; kennzeichnet einen Binärcode&nbsp; $(\rm 0$  oder&nbsp; $\rm 1)$&nbsp; und&nbsp; $D = 3$&nbsp; einen Ternärcode&nbsp; $(\rm 0$,&nbsp; $\rm 1$,&nbsp; $\rm 2)$.
+* $D = 2$&nbsp; denotes a binary code&nbsp; $(\rm 0$  or&nbsp; $\rm 1)$&nbsp; and&nbsp; $D = 3$&nbsp; denotes a ternary code&nbsp; $(\rm 0$,&nbsp; $\rm 1$,&nbsp; $\rm 2)$.
-Ein Code kann nur dann präfixfrei sein, wenn die Kraftsche Ungleichung erfüllt ist.&nbsp; Die Umkehrung gilt nicht: &nbsp; Wird die Kraftsche Ungleichung erfüllt, so bedeutet das noch lange nicht, dass dieser Code tatsächlich präfixfrei ist.
+A code can be prefix-free only if Kraft's inequality is satisfied.&nbsp; The converse does not hold: &nbsp; if Kraft's inequality is satisfied, it does not mean that this code is actually prefix-free.
+''Hint:''
-''Hinweis:''
+*The task belongs to the chapter&nbsp; [[Information_Theory/Allgemeine_Beschreibung|General description of source coding]].
-*Die Aufgabe gehört zum  Kapitel&nbsp; [[Information_Theory/Allgemeine_Beschreibung|Allgemeine Beschreibung der Quellencodierung]].
-===Fragebogen===
+===Question===
 <quiz display=simple>
-{Welche der Binärcodes erfüllen die Kraftsche Ungleichung?
+{Which of the binary codes satisfy Kraft's inequality?
 |type="[]"}
 + $\rm B1$,
@@ Line 54: / Line 53: @@
-{Welche der vorgegebenen Binärcodes sind präfixfrei?
+{Which of the given binary codes are prefix-free?
 |type="[]"}
 + $\rm B1$,
@@ Line 62: / Line 61: @@
-{Welche der vorgegebenen Ternärcodes sind präfixfrei?
+{Which of the given ternary codes are prefix-free?
 |type="[]"}
 + $\rm T1$,
@@ Line 69: / Line 68: @@
-{Wie lauten die Kenngrößen des Ternärecodes&nbsp; $\rm T1$?
+{What are the characteristics of the ternary code&nbsp; $\rm T1$?
 |type="{}"}
 $ N_1 \ = \ $ { 1 }
@@ Line 77: / Line 76: @@
-{Wieviel dreiwertige Codeworte&nbsp; $(L_\mu = 3)$&nbsp; könnte man dem Code&nbsp; $\rm T1$&nbsp; hinzufügen, ohne dass sich an der Präfixfreiheit etwas ändert?
+{How many trivalent codewords&nbsp; $(L_\mu = 3)$&nbsp; could be added to the&nbsp; $\rm T1$&nbsp; code without changing the prefix freedom?
 |type="{}"}
 $\Delta N_3 \ = \ $ { 6 }
-{Der Ternärcode&nbsp; $\rm T3$&nbsp; soll auf insgesamt&nbsp; $N = 9$&nbsp; Codeworte erweitert werden.&nbsp; Wie erreicht man das ohne Verletzung der Präfixfreiheit?
+{The ternary code&nbsp; $\rm T3$&nbsp; is to be expanded to a total of&nbsp; $N = 9$&nbsp; codewords.&nbsp; How to achieve this without violating the prefix freedom?
 |type="[]"}
-- Ergänzung um vier dreiwertige Codeworte.
+- Addition of four three-valued codewords.
-+ Ergänzung um vier vierwertige Codeworte.
++ Addition of four four-valued codewords.
-+ Ergänzung um ein dreiwertiges und drei vierwertige Codeworte.
++ Addition of one trivalent and three tetravalent codewords.

	$\rm B1$,
	$\rm B2$,
	$\rm B3$,
	$\rm B4$.

	$\rm B1$,
	$\rm B2$,
	$\rm B3$,
	$\rm B4$.

	Addition of four three-valued codewords.
	Addition of four four-valued codewords.
	Addition of one trivalent and three tetravalent codewords.

Difference between revisions of "Aufgaben:Exercise 2.2: Kraft–McMillan Inequality"

Revision as of 21:26, 29 June 2021

Question

Musterlösung

Solution