An accurate and interpretable bayesian classification model for prediction of hERG liability. Sun, H. ChemMedChem, 1(3):315--322, Mar, 2006.
doi  abstract   bibtex   
Drug-induced QT interval prolongation has been identified as a critical side-effect of non-cardiovascular therapeutic agents and has resulted in the withdrawal of many drugs from the market. As almost all cases of drug-induced QT prolongation can be traced to the blockade of a voltage-dependent potassium ion channel encoded by the hERG (the human ether-à-go-go-related gene), early identification of potential hERG channel blockers will decrease the risk of cardiotoxicity-induced attritions in the later and more expensive development stage. Presented herein is a naive Bayes classifier to categorize hERG blockers into active and inactive classes, by using a universal, generic molecular descriptor system.1 The naive Bayes classifier was built from a training set containing 1979 corporate compounds, and exhibited an ROC accuracy of 0.87. The model was validated on an external test set of 66 drugs, of which 58 were correctly classified. The cumulative probabilities reflected the confidence of prediction and were proven useful for the identification of hERG blockers. Relative performance was compared for two classifiers constructed from either an atom-type-based molecular descriptor or the long range functional class fingerprint descriptor FCFP_6. The combination of an atom-typing descriptor and the naive Bayes classification technique enables the interpretation of the resulting model, which offers extra information for the design of compounds free of undesirable hERG activity.
@article{Sun:2006kx,
	Abstract = {Drug-induced QT interval prolongation has been identified as a critical side-effect of non-cardiovascular therapeutic agents and has resulted in the withdrawal of many drugs from the market. As almost all cases of drug-induced QT prolongation can be traced to the blockade of a voltage-dependent potassium ion channel encoded by the hERG (the human ether-{\`a}-go-go-related gene), early identification of potential hERG channel blockers will decrease the risk of cardiotoxicity-induced attritions in the later and more expensive development stage. Presented herein is a naive Bayes classifier to categorize hERG blockers into active and inactive classes, by using a universal, generic molecular descriptor system.1 The naive Bayes classifier was built from a training set containing 1979 corporate compounds, and exhibited an ROC accuracy of 0.87. The model was validated on an external test set of 66 drugs, of which 58 were correctly classified. The cumulative probabilities reflected the confidence of prediction and were proven useful for the identification of hERG blockers. Relative performance was compared for two classifiers constructed from either an atom-type-based molecular descriptor or the long range functional class fingerprint descriptor FCFP_6. The combination of an atom-typing descriptor and the naive Bayes classification technique enables the interpretation of the resulting model, which offers extra information for the design of compounds free of undesirable hERG activity.},
	Author = {Sun, Hongmao},
	Date-Added = {2011-11-22 09:35:22 -0500},
	Date-Modified = {2011-11-22 09:35:45 -0500},
	Doi = {10.1002/cmdc.200500047},
	Journal = {{ChemMedChem}},
	Journal-Full = {ChemMedChem},
	Keywords = {herg;virtual screen},
	Mesh = {Bayes Theorem; Ether-A-Go-Go Potassium Channels; Models, Theoretical; Potassium Channel Blockers},
	Month = {Mar},
	Number = {3},
	Pages = {315--322},
	Pmid = {16892366},
	Pst = {ppublish},
	Title = {An accurate and interpretable bayesian classification model for prediction of {hERG} liability},
	Volume = {1},
	Year = {2006},
	Bdsk-Url-1 = {http://dx.doi.org/10.1002/cmdc.200500047}}

Downloads: 0