首页|On the Approximation of the Kolmogorov Complexity for DNA Sequences

On the Approximation of the Kolmogorov Complexity for DNA Sequences

扫码查看
The Kolmogorov complexity furnishes several ways for studying different natural processes that can be expressed using sequences of symbols from a finite alphabet, such as the case of DNA sequences。 Although the Kolmogorov complexity is not algorithmically computable, it can be approximated by lossless normal compressors。 In this paper, we use a specific DNA compressor to approximate the Kolmogorov complexity and we assess it regarding its normality。 Then, we use it on several datasets, that are constituted by different DNA sequences, representing complete genomes of different species and domains。 We show several evolution-related insights associated with the complexity, namely that, globally, archaea have higher relative complexity than bacteria and eukaryotes。

Kolmogorov complexityCompressionDNA sequences

Diogo Pratas、Armando J. Pinho

展开 >

IEETA, University of Aveiro, 3810-193 Aveiro, Portugal

Iberian conference on pattern recognition and image analysis

Faro(PT)

Pattern recognition and image analysis

259-266

2017