Journal of the American Chemical Society, Vol.134, No.21, 8814-8816, 2012
DNA-Water Interactions Distinguish Messenger RNA Genes from Transfer RNA Genes
Physicochemical properties of DNA sequences as a guide to developing insights into genome organization has received little attention. Here, we utilize the energetics of DNA to further advance the knowledge on its language at a molecular level. Specifically, we ask the question whether physicochemical properties of different functional units on genomes differ. We extract intramolecular and solvation energies of different DNA base pair steps from a comprehensive set of molecular dynamics simulations. We then investigate the solvation behavior of DNA sequences coding for mRNAs and tRNAs. Distinguishing mRNA genes from tRNA genes is a tricky problem in genome annotation without assumptions on length of DNA and secondary structure of the product of transcription. We find that solvation energetics of DNA behaves as an extremely efficient property in discriminating 2 063 537 genes coding for mRNAs from 56 251 genes coding for tRNAs in all (similar to 1500) completely sequenced prokaryotic genomes.