Recently, Trifonov's group proposed a 10-mer DNA motif YYYYYRRRRR as a solution of the long-standing problem of sequence-based nucleosome positioning. To test whether this generic decamer represents a biological meaningful signal, we compare the distribution of this motif in primates and Archaea, which are known to contain nucleosomes, and in Eubacteria, which do not possess nucleosomes. The distribution of the motif is analyzed by the mutual information function (MIF) with a shifted version of itself (MIF profile). We found common features in the patterns of this generic decamer on MIF profiles among primate species, and interestingly we found conspicuous but dissimilar MIF profiles for each Archaea tested. The overall MIF profiles for each chromosome in each primate species also follow a similar pattern. Trifonov’s generic decamer may be a highly conserved motif for the nucleosome positioning, but we argue that this is not the only motif. The distribution of this generic decamer exhibits previously unidentified periodicities, which are associated to highly repetitive sequences in the genome. Alu repetitive elements contribute to the most fundamental structure of nucleosome positioning in higher Eukaryotes. In some regions of primate chromosomes, the distribution of the decamer shows symmetrical patterns including inverted repeats. 1. Introduction It is generally accepted that the chromatin organization of eukaryotic DNA is strongly governed by a code inherent to the DNA sequence. Modulating the accessibility of individual DNA sequences involves many complex interactions, the most prevalent of which are the interactions between histone octamers and DNA in compacted chromosomes [1, 2]. The condensation of DNA into an ordered chromatin structure allows the cell to solve the topological problems associated with storing huge amount of information of chromosomal DNA within the nucleus. In Eukaryotes and Archaea, DNA is packaged into chromatin in orderly repetitive protein-DNA complexes called nucleosomes. Each nucleosome consists of approximately 146-147?bp of dsDNA wound 1.7-1.8 times around a histone octamer [3–5] to form the basic unit of chromatin structure, the nucleosome. Each octamer is composed of two H3-H4 histone dimers bridged together as a stable tetramer that is flanked by two separate H2A-H2B dimers [6]. Stretches of DNA called linker up to 100?bp, often with an increment of 10?bp, separate adjacent nucleosomes. Multiple nuclear proteins bind to this linker region, some of which may be responsible for the ordered wrapping of strings of
References
[1]
G. Felsenfeld and M. Groudine, “Controlling the double helix,” Nature, vol. 421, no. 6921, pp. 448–453, 2003.
[2]
A. Valouev, S. M. Johnson, S. D. Boyd, C. L. Smith, A. Z. Fire, and A. Sidow, “Determinants of nucleosome organization in primary human cells,” Nature, vol. 474, no. 7352, pp. 516–522, 2011.
[3]
D. E. Sterner and S. L. Berger, “Acetylation of histones and transcription-related factors,” Microbiology and Molecular Biology Reviews, vol. 64, no. 2, pp. 435–459, 2000.
[4]
K. Luger, T. J. Rechsteiner, A. J. Flaus, M. M. Y. Waye, and T. J. Richmond, “Characterization of nucleosome core particles containing histone proteins made in bacteria,” Journal of Molecular Biology, vol. 272, no. 3, pp. 301–311, 1997.
[5]
C. A. Davey, D. F. Sargent, K. Luger, A. W. Maeder, and T. J. Richmond, “Solvent mediated interactions in the structure of the nucleosome core particle at 1.9 ? resolution,” Journal of Molecular Biology, vol. 319, no. 5, pp. 1097–1113, 2002.
[6]
E. I. Campos and D. Reinberg, “Histones: annotating chromatin,” Annual Review of Genetics, vol. 43, pp. 559–599, 2009.
[7]
R. L. Redner, J. Wang, and J. M. Liu, “Chromatin remodeling and leukemia: new therapeutic paradigms,” Blood, vol. 94, no. 2, pp. 417–428, 1999.
[8]
T. J. Richmond and C. A. Davey, “The structure of DNA in the nucleosome core,” Nature, vol. 423, no. 6936, pp. 145–150, 2003.
[9]
K. Sandman, J. A. Krzycki, B. Dobrinski, B. Lurz, and J. N. Reeve, “HMf, a DNA-binding protein isolated from the hyperthermophilic archaeon Methanothermus fervidus, is most closely related to histones,” Proceedings of the National Academy of Sciences of the United States of America, vol. 87, no. 15, pp. 5788–5791, 1990.
[10]
R. L. Fahrner, D. Cascio, J. A. Lake, and A. Slesarev, “An ancestral nuclear protein assembly: crystal structure of the Methanopyrus kandleri histone,” Protein Science, vol. 10, no. 10, pp. 2002–2007, 2001.
[11]
I. Ioshikhes, A. Bolshoy, K. Derenshteyn, M. Borodovsky, and E. N. Trifonov, “Nucleosome RNA sequence pattern revealed by multiple alignment of experimentally mapped sequences,” Journal of Molecular Biology, vol. 262, no. 2, pp. 129–139, 1996.
[12]
S. C. Satchwell, H. R. Drew, and A. A. Travers, “Sequence periodicities in chicken nucleosome core DNA,” Journal of Molecular Biology, vol. 191, no. 4, pp. 659–675, 1986.
[13]
A. Valouev, J. Ichikawa, T. Tonthat et al., “A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning,” Genome Research, vol. 18, no. 7, pp. 1051–1063, 2008.
[14]
A. B. Cohanim, Y. Kashi, and E. N. Trifonov, “Yeast nucleosome DNA pattern: deconvolution from genome sequences of S. cerevisiae,” Journal of Biomolecular Structure and Dynamics, vol. 22, no. 6, pp. 687–693, 2005.
[15]
I. Ioshikhes, S. Hosid, and B. F. Pugh, “Variety of genomic DNA patterns for nucleosome positioning,” Genome Research, vol. 21, no. 11, pp. 1863–1871, 2011.
[16]
E. Segal, Y. Fondufe-Mittendorf, L. Chen et al., “A genomic code for nucleosome positioning,” Nature, vol. 442, no. 7104, pp. 772–778, 2006.
[17]
Y. Zhang, Z. Moqtaderi, B. P. Rattner et al., “Intrinsic histone-DNA interactions are not the major determinant of nucleosome positions in vivo,” Nature Structural and Molecular Biology, vol. 16, no. 8, pp. 847–852, 2009.
[18]
Y. Zhang, Z. Moqtaderi, B. P. Rattner et al., “Evidence against a genomic code for nucleosome positioning,” Nature Structural and Molecular Biology, vol. 17, no. 8, pp. 920–923, 2010.
[19]
N. Kaplan, I. Moore, Y. Fondufe-Mittendorf et al., “Nucleosome sequence preferences influence in vivo nucleosome organization,” Nature Structural and Molecular Biology, vol. 17, no. 8, pp. 918–920, 2010.
[20]
A. Travers, “The nature of DNA sequence preferences for nucleosome positioning. Comment on ‘Cracking the chromatin code: precise rule of nucleosome positioning’ by Trifonov,” Physics of Life Reviews, vol. 8, no. 1, pp. 53–55, 2011.
[21]
E. N. Trifonov, “Cracking the chromatin code: precise rule of nucleosome positioning,” Physics of Life Reviews, vol. 8, no. 1, pp. 39–50, 2011.
[22]
K. Sha, S. G. Gu, L. C. Pantalena-Filho et al., “Distributed probing of chromatin structure in vivo reveals pervasive chromatin accessibility for expressed and non-expressed genes during tissue differentiation in C. elegans,” BMC Genomics, vol. 11, no. 1, article 465, 2010.
[23]
M. Yaniv and S. C. Elgin, “Chromosomes and expression mechanisms: bringing together the roles of DNA, RNA and proteins,” Current Opinion in Genetics and Development, vol. 18, no. 2, pp. 107–108, 2008.
[24]
L. E. Gracey, Z. Chen, J. M. Maniar et al., “An in vitro-identified high-affinity nucleosome-positioning signal is capable of transiently positioning a nucleosome in vivo,” Epigenetics and Chromatin, vol. 3, no. 1, article 13, 2010.
[25]
S. Sasaki, C. C. Mello, A. Shimada et al., “Chromatin-associated periodicity in genetic variation downstream of transcriptional start sites,” Science, vol. 323, no. 5912, pp. 401–404, 2009.
[26]
D. Tillo, N. Kaplan, I. K. Moore et al., “High nucleosome occupancy is encoded at human regulatory sequences,” PLoS ONE, vol. 5, no. 2, Article ID e9129, 2010.
[27]
I. Ioshikhes, E. N. Trifonov, and M. Q. Zhang, “Periodical distribution of transcription factor sites in promoter regions and connection with chromatin structure,” Proceedings of the National Academy of Sciences of the United States of America, vol. 96, no. 6, pp. 2891–2895, 1999.
[28]
R. Sadeh and C. D. Allis, “Genome-wide “re”-modeling of nucleosome positions,” Cell, vol. 147, pp. 263–266, 2011.
[29]
S. Henikoff, “Nucleosome destabilization in the epigenetic regulation of gene expression,” Nature Reviews Genetics, vol. 9, no. 1, pp. 15–26, 2008.
[30]
F. Moreno-Herrero, R. Seidel, S. M. Johnson, A. Fire, and N. H. Dekker, “Structural analysis of hyperperiodic DNA from Caenorhabditis elegans,” Nucleic Acids Research, vol. 34, no. 10, pp. 3057–3066, 2006.
[31]
H. R. Widlund, H. Cao, S. Simonsson et al., “Identification and characterization of genomic nucleosome-positioning sequences,” Journal of Molecular Biology, vol. 267, no. 4, pp. 807–817, 1997.
[32]
P. Oudet, M. Gross Bellard, and P. Chambon, “Electron microscopic and biochemical evidence that chromatin structure is a repeating unit,” Cell, vol. 4, no. 4, pp. 281–300, 1975.
[33]
W. Linxweiler and W. H?rz, “Reconstitution of mononucleosomes: characterization of distinct particles that differ in the position of the histone core,” Nucleic Acids Research, vol. 12, no. 24, pp. 9395–9413, 1984.
[34]
H. R. Drew and C. R. Calladine, “Sequence-specific positioning of core histones on an 860 base-pair DNA. Experiment and theory,” Journal of Molecular Biology, vol. 195, no. 1, pp. 143–173, 1987.
[35]
N. Kaplan, I. K. Moore, Y. Fondufe-Mittendorf et al., “The DNA-encoded nucleosome organization of a eukaryotic genome,” Nature, vol. 458, no. 7236, pp. 362–366, 2009.
[36]
E. N. Trifonov, “Nucleosome positioning by sequence, state of the art and apparent finale,” Journal of Biomolecular Structure and Dynamics, vol. 27, no. 6, pp. 741–746, 2010.
[37]
I. Gabdank, D. Barash, and E. N. Trifonov, “Open access article nucleosome DNA bendability matrix (C. elegans),” Journal of Biomolecular Structure and Dynamics, vol. 26, no. 4, pp. 403–412, 2009.
[38]
S. M. Johnson, F. J. Tan, H. L. McCullough, D. P. Riordan, and A. Z. Fire, “Flexibility and constraint in the nucleosome core landscape of Caenorhabditis elegans chromatin,” Genome Research, vol. 16, no. 12, pp. 1505–1516, 2006.
[39]
S. G. Gu and A. Fire, “Partitioning the C. elegans genome by nucleosome modification, occupancy, and positioning,” Chromosoma, vol. 119, no. 1, pp. 73–87, 2010.
[40]
F. Salih, B. Salih, S. Kogan, and E. N. Trifonov, “Epigenetic nucleosomes: Alu sequences and CG as nucleosome positioning element,” Journal of Biomolecular Structure and Dynamics, vol. 26, no. 1, pp. 9–15, 2008.
[41]
F. Salih, B. Salih, and E. N. Trifonov, “Sequence structure of hidden 10.4-base repeat in the nucleosomes of C. elegans,” Journal of Biomolecular Structure and Dynamics, vol. 26, no. 3, pp. 273–281, 2008.
[42]
T. Bettecken and E. N. Trifonov, “Repertoires of the nucleosome-positioning dinucleotides,” PLoS ONE, vol. 4, no. 11, Article ID e7654, 2009.
[43]
S. L. Pereira and J. N. Reeve, “Histones and nucleosomes in Archaea and Eukarya: a comparative analysis,” Extremophiles, vol. 2, no. 3, pp. 141–148, 1998.
[44]
C. E. Shannon, “A mathematical theory of communication,” The Bell System Technical Journal, vol. 27, pp. 379–423, 1948.
[45]
W. Li, “Mutual information functions versus correlation functions,” Journal of Statistical Physics, vol. 60, no. 5-6, pp. 823–837, 1990.
[46]
Y. Tanaka, R. Yamashita, Y. Suzuki, and K. Nakai, “Effects of Alu elements on global nucleosome positioning in the human genome,” BMC Genomics, vol. 11, no. 1, article 309, 2010.
[47]
D. Holste, I. Grosse, S. Beirer, P. Schieg, and H. Herzel, “Repeats and correlations in human DNA sequences,” Physical Review E, vol. 67, no. 6, Article ID 061913, pp. 1–7, 2003.
[48]
A. Kundaje, S. Kyriazopoulou-Panagiotopoulou, M. Libbrecht et al., “Ubiquitous heterogeneity and asymmetry of the chromatin environment at regulatory elements,” Genome Research, vol. 22, pp. 1735–1747, 2012.
[49]
K. A. Bailey, F. Marc, K. Sandman, and J. N. Reeve, “Both DNA and histone fold sequences contribute to archaeal nucleosome stability,” The Journal of Biological Chemistry, vol. 277, no. 11, pp. 9293–9301, 2002.
[50]
E. S. Lander, L. M. Linton, and B. Birren, “International human genome sequencing consortium (2001) Initial sequencing and analysis of the human genome,” Nature, vol. 409, no. 6822, pp. 860–921.
[51]
M. A. Batzer and P. L. Deininger, “Alu repeats and human genomic diversity,” Nature Reviews Genetics, vol. 3, no. 5, pp. 370–379, 2002.
[52]
J. H?sler and K. Strub, “Alu RNP and Alu RNA regulate translation initiation in vitro,” Nucleic Acids Research, vol. 34, no. 8, pp. 2374–2385, 2006.
[53]
O. Weichenrieder, K. Wild, K. Strub, and S. Cusack, “Structure and assembly of the Alu domain of the mammalian signal recognition particle,” Nature, vol. 408, no. 6809, pp. 167–173, 2000.
[54]
K. Strub, J. Moss, and P. Walter, “Binding sites of the 9- and 14-kilodalton heterodimeric protein subunit of the signal recognition particle (SRP) are contained exclusively in the Alu domain of SRP RNA and contain a sequence motif that is conserved in evolution,” Molecular and Cellular Biology, vol. 11, no. 8, pp. 3949–3959, 1991.
[55]
A. J. Mighell, A. F. Markham, and P. A. Robinson, “Alu sequences,” FEBS Letters, vol. 417, no. 1, pp. 1–5, 1997.
[56]
J. Kreahling and B. R. Graveley, “The origins and implications of Aluternative splicing,” Trends in Genetics, vol. 20, no. 1, pp. 1–4, 2004.
[57]
W. Li, D. Sosa, and M. V. José, “Human repetitive sequence densities are mostly negatively correlated with R/Y-based nucleosome-positioning motifs and positively correlated with W/S-based motifs,” Genomics, vol. 101, no. 2, pp. 125–133, 2013.
[58]
T. Bettecken, Z. M. Frenkel, and E. N. Trifonov, “Human nucleosomes: special role of CG dinucleotides and Alu-nucleosomes,” BMC Genomics, vol. 12, article 273, 2011.
[59]
M. S. Ong, T. J. Richmond, and C. A. Davey, “DNA stretching and extreme kinking in the nucleosome core,” Journal of Molecular Biology, vol. 368, no. 4, pp. 1067–1074, 2007.