Using principal component analysis to find correlations between loop-related and thermodynamic variables for G-quadruplex-forming sequences. Jaumot, J. & Gargallo, R. Biochimie, 92(8):1016–1023 ST – Using principal component analysis, 2010.
Using principal component analysis to find correlations between loop-related and thermodynamic variables for G-quadruplex-forming sequences [link]Paper  doi  abstract   bibtex   
The application of Principal Component Analysis (PCA) is proposed here as a simple means of revealing correlations between thermodynamic variables corresponding to folding equilibria of intramolecular G-quadruplexes and Watson-Crick duplexes, and the length of loops in the corresponding guanine-rich DNA sequences. To this end, two previously studied data sets were analyzed (Arora and Maiti, J. Phys. Chem. B. 2009 and Kumar and Maiti, Nucleic Acids. Res. 2008). All of the sequences considered shared the common structure 5'- GGG - loop1 - GGG - loop2 - GGG - loop3 - GGG -3'. PCA of these data sets supported a series of correlations between the variables studied. First, the association of loop length with thermodynamic stability and quadruplex structure was corroborated. Secondly, it is proposed that the addition of ethylene glycol produces a stronger stabilization on those sequences showing long loop1 and/or loop3. Thirdly, it is proposed that a low content of adenine in loop1 and/or loop3 will produce an increase in the stability of G-quadruplex and its related Watson-Crick duplex.
@article{Jaumot2010,
	title = {Using principal component analysis to find correlations between loop-related and thermodynamic variables for {G}-quadruplex-forming sequences},
	volume = {92},
	issn = {0300-9084},
	url = {http://www.sciencedirect.com/science/article/B6VRJ-501FPRF-1/2/1f59aa9b8cb9285d3bf3665290bfb5ed},
	doi = {10.1016/j.biochi.2010.04.024},
	abstract = {The application of Principal Component Analysis (PCA) is proposed here as a simple means of revealing correlations between thermodynamic variables corresponding to folding equilibria of intramolecular G-quadruplexes and Watson-Crick duplexes, and the length of loops in the corresponding guanine-rich DNA sequences. To this end, two previously studied data sets were analyzed (Arora and Maiti, J. Phys. Chem. B. 2009 and Kumar and Maiti, Nucleic Acids. Res. 2008). All of the sequences considered shared the common structure 5'- GGG - loop1 - GGG - loop2 - GGG - loop3 - GGG -3'. PCA of these data sets supported a series of correlations between the variables studied. First, the association of loop length with thermodynamic stability and quadruplex structure was corroborated. Secondly, it is proposed that the addition of ethylene glycol produces a stronger stabilization on those sequences showing long loop1 and/or loop3. Thirdly, it is proposed that a low content of adenine in loop1 and/or loop3 will produce an increase in the stability of G-quadruplex and its related Watson-Crick duplex.},
	number = {8},
	journal = {Biochimie},
	author = {Jaumot, Joaquim and Gargallo, Raimundo},
	year = {2010},
	keywords = {\#nosource, G-quadruplex Principal component analysis Multivar},
	pages = {1016--1023 ST -- Using principal component analysis},
}

Downloads: 0