Lempel–Ziv factorization; Repetition-aware data structures; Repetitive text collections; Run-length encoded BWT; Computer Science (all); Computer Science Applications1707 Computer Vision and Pattern Recognition; Applied Mathematics

LZ77 Computation Based on the Run-Length Encoded BWT

Abstract ComputingtheLZ77factorizationisafundamentaltaskintextcompression and indexing, being the size z of this compressed representation closely related to the self-repetitiveness of the text. A long-standing problem is to compute LZ77 using small …