Boyer-Moore String Matching over Ziv-Lempel Compressed Text
Gonzalo Navarro and Jorma Tarhio
We present a Boyer-Moore approach to string matching over LZ78 and LZW
compressed text. The key idea is that, despite that we cannot exactly
choose which text characters to inspect, we can still use the characters
explicitly represented in those formats to shift the pattern in the text.
We present a basic and a more advanced approach. Despite that the
theoretical average complexity does not improve because still all the
symbols in the compressed text have to be scanned, we show experimentally
that speedups of up to 30% over the fastest previous approaches are
obtained.