Efficient algorithms for Lempel-Ziv encoding

Leszek Gasieniec, Marek Karpinski, Wojciech Plandowski, Wojciech Rytter

Research output: Chapter in Book/Report/Conference proceedingConference contribution

36 Scopus citations

Abstract

We consider several basic problems for texts and show that if the input texts are given by their Lempel-Ziv codes then the problems can be solved deterministically in polynomial time in the case when the original (uncompressed) texts are of exponential size. The growing importance of massively stored information requires new approaches to algorithms for compressed texts without decompressing. Denote by LZ(w) the version of a string w produced by Lempel-Ziv encoding algorithm. For given compressed strings LZ(T), LZ(P) we give the first known deterministic polynomial time algorithms to compute compressed representations of the set of all occurrences of the pattern P in T, all periods of T, all palindromes of T, and all squares of T. Then we consider several classical language recognition problems: •regular language recognition: given LZ(T) and a language L described by a regular expression, test if T ∈ L, •extended regular language recognition: given LZ(T) and a language L described by a L.Z-compressed regular expression, test if T ∈ L, the alphabet is unary, •context-free language recognition: given LZ(T) and a language L described by a context-free grammar, test if T ∈ L, the alphabet is unary. We show that the first recognition problem has a polynomial time algorithm and the other two problems are MV-hard. We show also that the LZ encoding can be computed on-line in polynomial time delay and small space (i.e. proportional to the size of the compressed text). Also the compressed representation of a pattern-matching automaton for the compressed pattern is computed in polynomial time.

Original languageEnglish (US)
Title of host publicationAlgorithm Theory - SWAT 1996 - 5th Scandinavian Workshop on Algorithm Theory, Proceedings
EditorsRolf Karlsson, Andrzej Lingas
PublisherSpringer Verlag
Pages404-415
Number of pages12
ISBN (Print)3540614222, 9783540614227
StatePublished - 1996
Externally publishedYes
Event5th Scandinavian Workshop on Algorithm Theory, SWAT 1996 - Reykjavik, Iceland
Duration: Jul 3 1996Jul 5 1996

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume1097
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th Scandinavian Workshop on Algorithm Theory, SWAT 1996
Country/TerritoryIceland
CityReykjavik
Period7/3/967/5/96

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Efficient algorithms for Lempel-Ziv encoding'. Together they form a unique fingerprint.

Cite this