
Zemor's decoding algorithm

From Wikipedia, the free encyclopedia

Coding theory algorithm

In coding theory, Zemor's algorithm, designed and developed by Gilles Zémor,^[1] is a recursive low-complexity approach to code construction. It is an improvement over the algorithm of Sipser and Spielman.

Zemor considered a typical class of the Sipser–Spielman construction of expander codes, where the underlying graph is a bipartite graph. Sipser and Spielman introduced a constructive family of asymptotically good linear error-correcting codes together with a simple parallel algorithm that will always remove a constant fraction of errors. The article is based on Dr. Venkatesan Guruswami's course notes.^[2]

Code construction


Zemor's algorithm is based on a type of expander graph called a Tanner graph. The construction of the code was first proposed by Tanner.^[3] The codes are based on a double cover, a $d$-regular expander $G$, which is a bipartite graph. $G=(V,E)$, where $V$ is the set of vertices and $E$ is the set of edges, with $V=A\cup B$ and $A\cap B=\emptyset$, where $A$ and $B$ denote the two sets of vertices. Let $n$ be the number of vertices in each group, i.e., $|A|=|B|=n$. The edge set $E$ has size $N=nd$, and every edge in $E$ has one endpoint in $A$ and one in $B$. $E(v)$ denotes the set of edges containing $v$.

Assume an ordering on $V$; this induces an ordering on the edges of $E(v)$ for every $v\in V$. Let the finite field be $\mathbb{F}=GF(2)$, and for a word $x=(x_{e}),e\in E$ in $\mathbb{F}^{N}$, let $(x)_{v}$ denote the subword of $x$ indexed by $E(v)$. Each of the vertex subsets $A$ and $B$ induces on every word $x\in \mathbb{F}^{N}$ a partition into $n$ non-overlapping subwords $(x)_{v}\in \mathbb{F}^{d}$, where $v$ ranges over the elements of $A$ (respectively $B$). To construct a code $C$, consider a linear subcode $C_{o}$, which is a $[d,r_{o}d,\delta ]$ code over an alphabet of size $q=2$. For any vertex $v\in V$, let $v(1),v(2),\ldots ,v(d)$ be some ordering of the $d$ edges of $E$ incident to $v$. In this code, each bit $x_{e}$ is associated with an edge $e$ of $E$.

We can define the code $C$ to be the set of binary vectors $x=\left(x_{1},x_{2},\ldots ,x_{N}\right)$ of $\{0,1\}^{N}$ such that, for every vertex $v$ of $V$, $\left(x_{v(1)},x_{v(2)},\ldots ,x_{v(d)}\right)$ is a codeword of $C_{o}$. Here we consider the special case in which every edge of $E$ is incident to exactly $2$ vertices of $V$. This means that $V$ and $E$ make up, respectively, the vertex set and edge set of a $d$-regular graph $G$.

Let us call a code $C$ constructed in this way a $\left(G,C_{o}\right)$ code. For a given graph $G$ and a given code $C_{o}$, there are several $\left(G,C_{o}\right)$ codes, since there are different ways of ordering the edges incident to a given vertex $v$, i.e., $v(1),v(2),\ldots ,v(d)$. In fact, our code $C$ consists of all words $x$ such that $(x)_{v}\in C_{o}$ for all $v\in A\cup B$. The code $C$ is a linear $[N,K,D]$ code over $\mathbb{F}$, since it is defined through the subcode $C_{o}$, which is linear. The code $C$ is defined as $C=\{c\in \mathbb{F}^{N}:(c)_{v}\in C_{o}{\text{ for every }}v\in V\}$.

In this figure, $(x)_{v}=\left(x_{e1},x_{e2},x_{e3},x_{e4}\right)\in C_{o}$. It shows the graph $G$ and code $C$.

For the graph $G$, let $\lambda$ be the second largest eigenvalue of the adjacency matrix of $G$. Here the largest eigenvalue is $d$. Two important claims are made:
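The definition above can be checked by brute force on a very small instance. The following is a minimal sketch, not taken from the source: it assumes $G$ is the complete bipartite graph with $n=d=3$ and takes $C_{o}$ to be the $[3,2]$ even-parity code, and the names `is_codeword`, `edges` and `even_parity` are hypothetical.

```python
from itertools import product


def is_codeword(x, edges, in_local_code):
    """Check that (x)_v lies in C_o for every vertex v; x[i] is the bit on edges[i]."""
    incident = {}
    for bit, (a, b) in zip(x, edges):
        # The order of `edges` fixes the ordering v(1), ..., v(d) at each vertex.
        incident.setdefault(("A", a), []).append(bit)
        incident.setdefault(("B", b), []).append(bit)
    return all(in_local_code(tuple(sub)) for sub in incident.values())


def even_parity(sub):
    """Membership test for the assumed subcode C_o: the [3,2] even-parity code."""
    return sum(sub) % 2 == 0


# Assumed toy graph: complete bipartite graph with n = d = 3, so N = nd = 9 edges.
n = d = 3
edges = [(a, b) for a in range(n) for b in range(n)]

# Enumerate all 2^9 words and keep those whose every local subword is in C_o.
code = [x for x in product([0, 1], repeat=len(edges))
        if is_codeword(x, edges, even_parity)]

print(len(code))                                # number of codewords
print(min(sum(x) for x in code if any(x)))      # minimum nonzero weight D
```

In this toy case the code has 16 codewords, so its rate is $4/9$, which is above the value $2r_{o}-1=1/3$ obtained from $r_{o}=2/3$.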

$D\geq N\left({\dfrac {\delta -{\dfrac {\lambda }{d}}}{1-{\dfrac {\lambda }{d}}}}\right)^{2}=N\left(\delta ^{2}-O\left({\dfrac {\lambda }{d}}\right)\right)\rightarrow (1)$

If $S$ is a linear code of rate $r$, block length $d$, and minimum relative distance $\delta$, and if $B$ is the edge-vertex incidence graph of a $d$-regular graph with second largest eigenvalue $\lambda$, then the code $C(B,S)$ has rate at least $2r-1$ and minimum relative distance at least $\left({\dfrac {\delta -{\dfrac {\lambda }{d}}}{1-{\dfrac {\lambda }{d}}}}\right)^{2}$.

Proof


Let $B$ be derived from the $d$-regular graph $G$. So, the number of variables of $C(B,S)$ is ${\dfrac {dn}{2}}$ and the number of constraints is $n$. According to Alon and Chung,^[4] if $X$ is a subset of vertices of $G$ of size $\gamma n$, then the number of edges contained in the subgraph induced by $X$ in $G$ is at most $\left({\dfrac {dn}{2}}\right)\left(\gamma ^{2}+{\dfrac {\lambda }{d}}\gamma \left(1-\gamma \right)\right)$.

As a result, any set of $\left({\dfrac {dn}{2}}\right)\left(\gamma ^{2}+{\dfrac {\lambda }{d}}\gamma \left(1-\gamma \right)\right)$ variables will have at least $\gamma n$ constraints as neighbours. Since each variable is a neighbour of exactly two constraints, the average number of variables per constraint is: ${\dfrac {2\left({\dfrac {nd}{2}}\right)\left(\gamma ^{2}+{\dfrac {\lambda }{d}}\gamma \left(1-\gamma \right)\right)}{\gamma n}}=d\left(\gamma +{\dfrac {\lambda }{d}}\left(1-\gamma \right)\right)\rightarrow (2)$

So if $d\left(\gamma +{\dfrac {\lambda }{d}}\left(1-\gamma \right)\right)<\delta d$, then some constraint has fewer than $\delta d$ of these variables as neighbours, and a word of relative weight $\gamma ^{2}+{\dfrac {\lambda }{d}}\gamma \left(1-\gamma \right)$ cannot be a codeword of $C(B,S)$. Rearranging $\gamma \left(1-{\dfrac {\lambda }{d}}\right)<\delta -{\dfrac {\lambda }{d}}$ shows that this condition on expression $(2)$ is satisfied for $\gamma <{\dfrac {\delta -{\dfrac {\lambda }{d}}}{1-{\dfrac {\lambda }{d}}}}$. Therefore, $C(B,S)$ cannot have a nonzero codeword of relative weight $\left({\dfrac {\delta -{\dfrac {\lambda }{d}}}{1-{\dfrac {\lambda }{d}}}}\right)^{2}$ or less.

For the graph $G$, we can assume that $\lambda /d$ is bounded away from $1$. For those values of $d$ for which $d-1$ is an odd prime, there are explicit constructions of sequences of $d$-regular bipartite graphs with arbitrarily large numbers of vertices such that each graph $G$ in the sequence is a Ramanujan graph. It is called a Ramanujan graph as it satisfies the inequality $\lambda (G)\leq 2{\sqrt {d-1}}$. Certain expansion properties are visible in the graph $G$ as the separation between the eigenvalues $d$ and $\lambda$. If the graph $G$ is a Ramanujan graph, then the term $\lambda /d$ in expression $(1)$ tends to $0$ as $d$ becomes large, so the bound approaches $N\delta ^{2}$.
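To get a feel for how the relative distance bound behaves for Ramanujan graphs, the following sketch evaluates $\left((\delta -\lambda /d)/(1-\lambda /d)\right)^{2}$ with $\lambda$ set to its largest allowed value $2{\sqrt {d-1}}$. The function name `distance_bound` and the sample values of $\delta$ and $d$ are illustrative assumptions, and the degrees are not restricted to those with $d-1$ an odd prime.

```python
import math


def distance_bound(delta, d):
    """Lower bound ((delta - lam/d) / (1 - lam/d))**2 on relative distance,
    assuming the worst Ramanujan case lam = 2*sqrt(d - 1)."""
    ratio = 2 * math.sqrt(d - 1) / d
    if ratio >= delta:
        return 0.0  # the bound says nothing unless lam/d < delta
    return ((delta - ratio) / (1 - ratio)) ** 2


# Assumed subcode relative distance delta = 0.5; the limit delta**2 is 0.25.
for d in (4, 100, 10_000, 1_000_000):
    print(d, round(distance_bound(0.5, d), 4))
```

The printed values increase towards $\delta ^{2}$ as $d$ grows, and the bound is vacuous for small degrees where $\lambda /d\geq \delta$.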

Zemor's algorithm


The iterative decoding algorithm below alternates between the vertex sets $A$ and $B$ in $G$: it corrects the codewords of $C_{o}$ at the vertices of $A$, and then switches to correct the codewords of $C_{o}$ at the vertices of $B$. Here the edges associated with a vertex on one side of the graph are not incident to any other vertex on that side. In fact, the order in which the vertex sets $A$ and $B$ are processed does not matter. The vertex processing can also be done in parallel.

The decoder $\mathbb {D} :\mathbb {F} ^{d}\rightarrow C_{o}$ stands for a decoder for $C_{o}$ that correctly recovers any codeword corrupted by fewer than ${\dfrac {\delta d}{2}}$ errors, that is, fewer than half the minimum distance of $C_{o}$.

Decoder algorithm


Received word: $w=(w_{e}),e\in E$
$z\leftarrow w$
For $t\leftarrow 1$ to $m$ do // $m$ is the number of iterations
{
    if ($t$ is odd) // Here the algorithm alternates between its two vertex sets.
        $X\leftarrow A$
    else
        $X\leftarrow B$
    Iteration $t$: For every $v\in X$, let $(z)_{v}\leftarrow \mathbb {D} ((z)_{v})$ // Decoding $(z)_{v}$ to its nearest codeword.
}
Output: $z$
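The alternating loop can be sketched in Python on a toy instance. This is only an illustration under assumptions that are not in the source: $G$ is taken to be the complete bipartite graph with $n=d=7$, so a word is a $7\times 7$ bit matrix whose rows are the subwords at $A$ and whose columns are the subwords at $B$, and $C_{o}$ is taken to be a $[7,4,3]$ Hamming code decoded by brute force. The names `GEN`, `local_decode` and `zemor_decode` are hypothetical.

```python
from itertools import product

D = 7  # degree d, which equals n for the assumed complete bipartite graph

# Generator matrix of a [7,4,3] Hamming code, used here as the subcode C_o.
GEN = [
    [1, 0, 0, 0, 1, 1, 0],
    [0, 1, 0, 0, 1, 0, 1],
    [0, 0, 1, 0, 0, 1, 1],
    [0, 0, 0, 1, 1, 1, 1],
]
CODEWORDS = [
    tuple(sum(m[i] * GEN[i][j] for i in range(4)) % 2 for j in range(D))
    for m in product([0, 1], repeat=4)
]


def local_decode(sub):
    """Brute-force nearest-codeword decoder for C_o (plays the role of the decoder D)."""
    return min(CODEWORDS, key=lambda c: sum(x != y for x, y in zip(c, sub)))


def zemor_decode(w, m):
    """Run m rounds, decoding A-side subwords (rows) on odd t and B-side (columns) on even t."""
    z = [list(row) for row in w]
    for t in range(1, m + 1):
        if t % 2 == 1:  # X = A: the subword at vertex a is row a
            z = [list(local_decode(row)) for row in z]
        else:           # X = B: the subword at vertex b is column b
            cols = [local_decode([z[a][b] for a in range(D)]) for b in range(D)]
            z = [[cols[b][a] for b in range(D)] for a in range(D)]
    return z


# All-zeros codeword with two errors on edges at the same A-vertex.
w = [[0] * D for _ in range(D)]
w[0][0] = w[0][1] = 1
print(zemor_decode(w, 1)[0])  # row 0 after the A-side round
print(sum(map(sum, zemor_decode(w, 2))))  # remaining errors after the B-side round
```

In this example the first round miscorrects row 0 to a weight-3 codeword, because two errors exceed what the subcode can correct locally, and the second round removes the three remaining errors one column at a time.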

Explanation of the algorithm


Since $G$ is bipartite, the set $A$ of vertices induces the partition of the edge set $E=\cup _{v\in A}E_{v}$. The set $B$ induces another partition, $E=\cup _{v\in B}E_{v}$.

Let $w\in \{0,1\}^{N}$ be the received vector, and recall that $N=dn$. The first iteration of the algorithm consists of applying complete decoding for the code induced by $E_{v}$ for every $v\in A$. This means replacing, for every $v\in A$, the vector $\left(w_{v(1)},w_{v(2)},\ldots ,w_{v(d)}\right)$ by one of the closest codewords of $C_{o}$. Since the subsets of edges $E_{v}$ are disjoint for $v\in A$, the decoding of these $n$ subvectors of $w$ may be done in parallel.

The iteration will yield a new vector $z$. The next iteration consists of applying the preceding procedure to $z$ but with $A$ replaced by $B$. In other words, it consists of decoding all the subvectors induced by the vertices of $B$. The subsequent iterations repeat those two steps, alternately applying parallel decoding to the subvectors induced by the vertices of $A$ and to the subvectors induced by the vertices of $B$.
Note: If $d=n$ and $G$ is the complete bipartite graph, then $C$ is a product code of $C_{o}$ with itself, and the above algorithm reduces to the natural hard iterative decoding of product codes.

Here, the number of iterations, $m$, is ${\dfrac {\log {n}}{\log(2-\alpha )}}$. In general, the above algorithm can correct an error pattern whose Hamming weight is no more than ${\dfrac {1}{2}}\alpha N\delta \left({\dfrac {\delta }{2}}-{\dfrac {\lambda }{d}}\right)={\dfrac {1}{4}}\alpha N\left(\delta ^{2}-O\left({\dfrac {\lambda }{d}}\right)\right)$ for values of $\alpha <1$. Here, the decoding algorithm is implemented as a circuit of size $O(N\log {N})$ and depth $O(\log {N})$ that returns the codeword, given that the error vector has weight less than $\alpha N\delta ^{2}(1-\varepsilon )/4$.

Theorem


If $G$ is a Ramanujan graph of sufficiently high degree, then for any $\alpha <1$, the decoding algorithm can correct ${\dfrac {\alpha \delta ^{2}}{4}}(1-\varepsilon )N$ errors, where $\varepsilon$ tends to 0 when $\lambda /d$ tends to 0, in $O(\log {n})$ rounds (where the big-$O$ notation hides a dependence on $\alpha$). This can be implemented in linear time on a single processor; on $n$ processors each round can be implemented in constant time.

Proof


Since the decoding algorithm is insensitive to the values on the edges, and by linearity, we can assume that the transmitted codeword is the all-zeros vector. Let the received word be $w$. Consider the set of edges that hold an incorrect value during decoding; here an incorrect value means a $1$ in that bit. Let $w=w^{0}$ be the initial value of the word, and let $w^{1},w^{2},\ldots ,w^{t}$ be its values after the first, second, ..., $t$-th stages of decoding. Define $X^{i}=\{e\in E\mid w_{e}^{i}=1\}$ and $S^{i}=\{v\in V^{i}\mid E_{v}\cap X^{i+1}\neq \emptyset \}$, where $V^{i}$ is the side of the graph ($A$ or $B$) processed in round $i$. Here $S^{i}$ is the set of vertices that were not able to decode their subword successfully in the $i$-th round. From the above algorithm, $|S^{1}|<|S^{0}|$, as the number of unsuccessful vertices is reduced in every iteration. We can prove that $|S^{0}|>|S^{1}|>|S^{2}|>\cdots$ is a decreasing sequence. In fact, $|S^{i+1}|\leq {\dfrac {1}{2-\alpha }}|S^{i}|$. Since we are assuming $\alpha <1$, the sizes $|S^{i}|$ form a geometrically decreasing sequence. So, since $|S^{0}|\leq n$, no more than $\log _{2-\alpha }n$ rounds are necessary. Furthermore, $\sum |S^{i}|\leq n\sum {\dfrac {1}{(2-\alpha )^{i}}}=O(n)$, and if we implement the $i$-th round in $O(|S^{i}|)$ time, then the total sequential running time will be linear.
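The round count and the linear total work can be illustrated numerically. The sketch below is an assumption-laden illustration rather than part of the proof: it takes the worst case in which the number of unsuccessful vertices starts at $n$ and shrinks by exactly the factor $2-\alpha$ each round, and the function names `rounds_needed` and `total_work_bound` are hypothetical.

```python
import math


def rounds_needed(n, alpha):
    """Smallest integer m with (2 - alpha)**m >= n, i.e. ceil(log n / log(2 - alpha))."""
    return math.ceil(math.log(n) / math.log(2 - alpha))


def total_work_bound(n, alpha):
    """Sum of n / (2 - alpha)**i over the rounds, a stand-in for the sum of the set sizes."""
    m = rounds_needed(n, alpha)
    return sum(n / (2 - alpha) ** i for i in range(m + 1))


n, alpha = 1000, 0.5
print(rounds_needed(n, alpha))             # number of rounds for these sample values
print(total_work_bound(n, alpha))          # partial geometric sum
print(n * (2 - alpha) / (1 - alpha))       # limit of the full geometric series
```

The partial sum stays below $n(2-\alpha )/(1-\alpha )$, the value of the full geometric series, which is linear in $n$ for any fixed $\alpha <1$.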

Drawbacks of Zemor's algorithm


It is a lengthy process, as the number of iterations $m$ that the decoder algorithm takes is $(\log {n})/(\log(2-\alpha ))$.
Zemor's decoding algorithm finds it difficult to decode erasures. A detailed description of how the algorithm can be improved is given in the cited reference.^[5]

References


^ "Gilles Zémor". www.math.u-bordeaux.fr. Retrieved 9 April 2023.
^ Guruswami, Venkatesan; Cary, Matt (January 27, 2003). "Lecture 5". CSE590G: Codes and Pseudorandom Objects. University of Washington. Archived from the original on 2014-02-24.
^ "Lecture notes" (PDF). washington.edu. Retrieved 9 April 2023.
^ N. Alon; F.R.K. Chung (December 1988). "Explicit construction of linear sized tolerant networks". Discrete Mathematics. 72 (1–3): 15–19. CiteSeerX 10.1.1.300.7495. doi:10.1016/0012-365X(88)90189-6.
^ "Archived copy". Archived from the original on September 14, 2004. Retrieved May 1, 2012.