Mapping side chain interactions at protein helix termini. Newell, N. E BMC Bioinformatics, 16(1):231, 2015.
Mapping side chain interactions at protein helix termini [link]Paper  doi  abstract   bibtex   
Background: Interactions that involve one or more amino acid side chains near the ends of protein helices stabilize helix termini and shape the geometry of the adjacent loops, making a substantial contribution to overall protein structure. Previous work has identified key helix-terminal motifs, such as Asx/ST N-caps, the capping box, and hydrophobic and electrostatic interactions, but important questions remain, including: 1) What loop backbone geometries are favoured by each motif? 2) To what extent are multi-amino acid motifs likely to represent genuine cooperative interactions? 3) Can new motifs be identified in a large, recent dataset using the latest bioinformatics tools? Results: Three analytical tools are applied here to answer these questions. First, helix-terminal structures are partitioned by loop backbone geometry using a new 3D clustering algorithm. Next, Cascade Detection, a motif detection algorithm recently published by the author, is applied to each cluster to determine which sequence motifs are overrepresented in each geometry. Finally, the results for each motif are presented in a CapMap, a 3D conformational heatmap that displays the distribution of the motif's overrepresentation across loop geometries, enabling the rapid isolation and characterization of the associated side chain interaction. This work identifies a library of geometry-specific side chain interactions that provides a new, detailed picture of loop structure near the helix terminus. Highlights include determinations of the favoured loop geometries for the Asx/ST N-cap motifs, capping boxes, “big” boxes, and other hydrophobic, electrostatic, H-bond, and pi stacking interactions, many of which have not been described before. Conclusions: This work demonstrates that the combination of structural clustering and motif detection in the sequence space can efficiently identify side chain motifs and map them to the loop geometries which they support. Protein designers should find this study useful, because it identifies side chain interactions which are good candidates for inclusion in synthetic helix-terminal loops with specific desired geometries, since they are used in nature to support these geometries. The techniques described here can also be applied to map side chain interactions associated with other structural components of proteins such as beta and gamma turns. Keywords:
@article{newell_mapping_2015,
	title = {Mapping side chain interactions at protein helix termini},
	volume = {16},
	issn = {1471-2105},
	url = {http://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-015-0671-4 http://www.biomedcentral.com/1471-2105/16/231},
	doi = {10.1186/s12859-015-0671-4},
	abstract = {Background: Interactions that involve one or more amino acid side chains near the ends of protein helices stabilize helix termini and shape the geometry of the adjacent loops, making a substantial contribution to overall protein structure. Previous work has identified key helix-terminal motifs, such as Asx/ST N-caps, the capping box, and hydrophobic and electrostatic interactions, but important questions remain, including: 1) What loop backbone geometries are favoured by each motif? 2) To what extent are multi-amino acid motifs likely to represent genuine cooperative interactions? 3) Can new motifs be identified in a large, recent dataset using the latest bioinformatics tools? Results: Three analytical tools are applied here to answer these questions. First, helix-terminal structures are partitioned by loop backbone geometry using a new 3D clustering algorithm. Next, Cascade Detection, a motif detection algorithm recently published by the author, is applied to each cluster to determine which sequence motifs are overrepresented in each geometry. Finally, the results for each motif are presented in a CapMap, a 3D conformational heatmap that displays the distribution of the motif's overrepresentation across loop geometries, enabling the rapid isolation and characterization of the associated side chain interaction. This work identifies a library of geometry-specific side chain interactions that provides a new, detailed picture of loop structure near the helix terminus. Highlights include determinations of the favoured loop geometries for the Asx/ST N-cap motifs, capping boxes, “big” boxes, and other hydrophobic, electrostatic, H-bond, and pi stacking interactions, many of which have not been described before. Conclusions: This work demonstrates that the combination of structural clustering and motif detection in the sequence space can efficiently identify side chain motifs and map them to the loop geometries which they support. Protein designers should find this study useful, because it identifies side chain interactions which are good candidates for inclusion in synthetic helix-terminal loops with specific desired geometries, since they are used in nature to support these geometries. The techniques described here can also be applied to map side chain interactions associated with other structural components of proteins such as beta and gamma turns. Keywords:},
	number = {1},
	journal = {BMC Bioinformatics},
	author = {Newell, Nicholas E},
	year = {2015},
	keywords = {Asx N-cap, Big box, Capping box, CxxC motif, Data mining, Helix capping, Helix caps, Helix termini, Helix terminus, Helix-terminal loop, ST N-cap, Statistical feature detection, Statistical motif detection, asx n-cap, big box, capping box, cxxc motif, data mining, feature detection, helix capping, helix caps, helix termini, helix terminus, helix-terminal loop, st n-cap, statistical, statistical motif detection},
	pages = {231},
}

Downloads: 0