Ask a Question

Prefer a chat interface with context about you and your work?

A codebook generation algorithm for document image compression

A codebook generation algorithm for document image compression

Pattern-matching based document compression systems rely on finding a small set of patterns that can be used to represent all of the ink in the document. Finding an optimal set of patterns is NP-hard; previous compression schemes have resorted to heuristics. We extend the cross-entropy approach, used previously for measuring …