Identifying a Type of Genetic Code in an Anonymous, Prokaryotic DNA Sequence
MetadataShow full item record
Here we present an ab-initio approach for predicting the genetic code of an anonymous prokaryotic DNA sequence. To the best of our knowledge it is the first tool of such kind. In times of metagenomics more and more non-cultivable species are sequenced coming with an increasing number of discoveries of alternations of the canonical genetic code. The Genetic Code Detector (GCD) delineated below is capable of identifying the genetic code of complete genomes with a sensitivity and specificity of 1.0. Furthermore, it performs well on contigs as small as 10Kbp with a specificity of 0.99 and a sensitivity of 0.92. Recently, the class of crAssphage has been discovered which show two different genetic codes. Hence, it is of interest to predict the position in the genome where the genetic codes changes. The presented GCD is capable to predict the switching point with a mean error of 0.53 genes and a standard deviation of 6.47 genes.