Ask a Question

Prefer a chat interface with context about you and your work?

Clustering by Compression

Clustering by Compression

We present a new method for clustering based on compression. The method does not use subject-specific features or background knowledge, and works as follows: First, we determine a parameter-free, universal, similarity distance, the normalized compression distance or NCD, computed from the lengths of compressed data files (singly and in pairwise …