The Development of QSAR Models To Predict and Interpret the Biological Activity of Artemisinin Analogues. Guha, R. & Jurs, P. J.~Chem.~Inf.~Comput.~Sci., 44:1440--1449, 2004.
The Development of QSAR Models To Predict and Interpret the Biological Activity of Artemisinin Analogues [link]Paper  abstract   bibtex   
This work presents the development of Quantitative Structure-Activity Relationship (QSAR) models to predict the biological activity of 179 artemisinin analogues. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Both linear (multiple linear regression) and nonlinear (computational neural network) models are developed to link the structures to their reported biological activity. The best linear model was subjected to a PLS analysis to provide model interpretability. While the best linear model does not perform as well as the nonlinear model in terms of predictive ability, the application of PLS analysis allows for a sound physical interpretation of the structure-activity trend captured by the model. On the other hand, the best nonlinear model is superior in terms of pure predictive ability, having a training error of 0.47 log RA units (R2 = 0.96) and a prediction error of 0.76 log RA units (R2 = 0.88).
@article{Guha:2004ac,
	Abstract = {This work presents the development of Quantitative Structure-Activity Relationship (QSAR) models to predict the biological activity of 179 artemisinin analogues. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Both linear (multiple linear regression) and nonlinear (computational neural network) models are developed to link the structures to their reported biological activity. The best linear model was subjected to a PLS analysis to provide model interpretability. While the best linear model does not perform as well as the nonlinear model in terms of predictive ability, the application of PLS analysis allows for a sound physical interpretation of the structure-activity trend captured by the model. On the other hand, the best nonlinear model is superior in terms of pure predictive ability, having a training error of 0.47 log RA units (R2 = 0.96) and a prediction error of 0.76 log RA units (R2 = 0.88).},
	Author = {Guha, R. and Jurs, P.C.},
	Date-Added = {2007-12-11 17:01:03 -0500},
	Date-Modified = {2007-12-11 17:01:03 -0500},
	Journal = {J.~Chem.~Inf.~Comput.~Sci.},
	Keywords = {malaria; qsar; data mining; random forest; interpretation},
	Local-Url = {file://localhost/Users/rguha/Documents/articles/artem.pdf},
	Owner = {rajarshi},
	Pages = {1440--1449},
	Timestamp = {2007.04.11},
	Title = {The Development of {QSAR} Models To Predict and Interpret the Biological Activity of Artemisinin Analogues},
	Url = {http://dx.doi.org/10.1021/ci0499469},
	Volume = {44},
	Year = {2004},
	Bdsk-File-1 = {YnBsaXN0MDDUAQIDBAUIJidUJHRvcFgkb2JqZWN0c1gkdmVyc2lvblkkYXJjaGl2ZXLRBgdUcm9vdIABqAkKFRYXGyIjVSRudWxs0wsMDQ4RElpOUy5vYmplY3RzViRjbGFzc1dOUy5rZXlzog8QgASABoAHohMUgAKAA1lhbGlhc0RhdGFccmVsYXRpdmVQYXRo0hgMGRpXTlMuZGF0YU8RAV4AAAAAAV4AAgAAA212IAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAMU5bQNIKwAAABCNbQlhcnRlbS5wZGYAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEI3EvpT+VAAAAAAAAAAAAAIAAwAACSAAAAAAAAAAAAAAAAAAAAAIYXJ0aWNsZXMAEAAIAADFObNTAAAAEQAIAAC+lUSkAAAAAQAQABCNbQAKTIAACkxpAAB8EwACACxtdiA6VXNlcnM6cmd1aGE6RG9jdW1lbnRzOmFydGljbGVzOmFydGVtLnBkZgAOABQACQBhAHIAdABlAG0ALgBwAGQAZgAPAAgAAwBtAHYAIAASAChVc2Vycy9yZ3VoYS9Eb2N1bWVudHMvYXJ0aWNsZXMvYXJ0ZW0ucGRmABMAAS8AABUAAgAM//8AAIAF0hwdHh9YJGNsYXNzZXNaJGNsYXNzbmFtZaMfICFdTlNNdXRhYmxlRGF0YVZOU0RhdGFYTlNPYmplY3RfECIuLi8uLi9Eb2N1bWVudHMvYXJ0aWNsZXMvYXJ0ZW0ucGRm0hwdJCWiJSFcTlNEaWN0aW9uYXJ5EgABhqBfEA9OU0tleWVkQXJjaGl2ZXIACAARABYAHwAoADIANQA6ADwARQBLAFIAXQBkAGwAbwBxAHMAdQB4AHoAfACGAJMAmACgAgICBAIJAhICHQIhAi8CNgI/AmQCaQJsAnkCfgAAAAAAAAIBAAAAAAAAACgAAAAAAAAAAAAAAAAAAAKQ},
	Bdsk-Url-1 = {http://dx.doi.org/10.1021/ci0499469}}
Downloads: 0