![]() | This article has multiple issues. Please helpimprove it or discuss these issues on thetalk page.(Learn how and when to remove these messages) (Learn how and when to remove this message)
|
LZWL is a syllable-based variant of theLZW (Lempel-Ziv-Welch) compression algorithm, designed to work with syllables derived from any syllable decomposition algorithm. This approach allows LZWL to efficiently process both syllables and words, offering a nuanced method for data compression.
The LZWL algorithm initializes by populating a dictionary with all characters from the alphabet. It then searches for the longest string,S, that exists in both the dictionary and as a prefix of the unencoded portion of the input. The algorithm outputs the identifier ofS and augments the dictionary with a new phrase, which combinesS with the subsequent character in the input. The input position advances by the length ofS. During decoding, LZWL addresses scenarios where the received phrase identifier does not exist in the dictionary by constructing the missing phrase from the concatenation of the last added phrase and its initial character.
In its syllable-based adaptation, LZWL employs a list of syllables as its alphabet. The initialization step includes the empty syllable and integrates small, frequently occurring syllables into the dictionary. IdentifyingS and encoding its identifier mirrors the original algorithm, with the distinction thatS represents a syllable string. IfS is an empty syllable, the algorithm extracts a syllableK from the input and encodesK using methods for new syllables before addingK to the dictionary and advancing the input position accordingly.
A notable variation in the syllable-based LZWL involves dictionary expansion. When bothS and the subsequent stringS1 are non-empty syllables, a new phrase is added to the dictionary by concatenatingS1 withS’s initial syllable. This method prevents the formation of strings from syllables that appear only once and ensures the decoder does not encounter undefined phrase identifiers.