Visualization of Large-Scale Aqueous Solubility Data Using a Novel Hierarchical Data Visualization Technique. Yamashita, F., Itoh, T., & Hashida, M. J.~Chem.~Inf.~Model., 46(3):1054--1059, 2006.
abstract   bibtex   
It is a difficult task to recognize the trends in molecular physical properties relevant to a specific chemical class and find a way to optimize potential compounds. We present here a novel hierarchical data visualization technique, named ``HeiankyoView'', to visualize large-scale multidimensional chemical information. HeiankyoView represents hierarchically organized data objects by mapping leaf nodes as colored square icons and nonleaf nodes as rectangular borders. In this way, data objects can be expressed as equishaped icons without overlapping one another in the two-dimensional display space. HeiankyoView has been applied to visualize aqueous solubility data for 908 compounds collected from the published literature. When the results of a recursive partitioning analysis and hierarchical clustering analysis were visualized, the trends hidden in the solubility data could be effectively displayed as intuitively understandable visual images. Most interestingly, the data visualization technique, without any statistical computations, was able to assist us in extracting from such large-scale data meaningful information establishing that ClogP and the molecular weight are critical factors in determining aqueous solubility. Thus, HeiankyoView is a powerful tool to help us understand structure−activity relationships intuitively from a large-scale data set.
@article{Yamashita:2006aa,
	Abstract = {It is a difficult task to recognize the trends in molecular physical properties relevant to a specific chemical class and find a way to optimize potential compounds. We present here a novel hierarchical data visualization technique, named ``HeiankyoView'', to visualize large-scale multidimensional chemical information. HeiankyoView represents hierarchically organized data objects by mapping leaf nodes as colored square icons and nonleaf nodes as rectangular borders. In this way, data objects can be expressed as equishaped icons without overlapping one another in the two-dimensional display space. HeiankyoView has been applied to visualize aqueous solubility data for 908 compounds collected from the published literature. When the results of a recursive partitioning analysis and hierarchical clustering analysis were visualized, the trends hidden in the solubility data could be effectively displayed as intuitively understandable visual images. Most interestingly, the data visualization technique, without any statistical computations, was able to assist us in extracting from such large-scale data meaningful information establishing that ClogP and the molecular weight are critical factors in determining aqueous solubility. Thus, HeiankyoView is a powerful tool to help us understand structure−activity relationships intuitively from a large-scale data set.},
	Author = {Yamashita, F. and Itoh, T. and Hashida, M.},
	Date-Added = {2007-12-11 17:01:03 -0500},
	Date-Modified = {2009-02-21 09:44:16 -0500},
	Journal = {J.~Chem.~Inf.~Model.},
	Keywords = {clustering; cluster; aqueous solubility},
	Number = {3},
	Pages = {1054--1059},
	Title = {Visualization of Large-Scale Aqueous Solubility Data Using a Novel Hierarchical Data Visualization Technique},
	Volume = {46},
	Year = {2006},
	Bdsk-File-1 = {YnBsaXN0MDDUAQIDBAUIJidUJHRvcFgkb2JqZWN0c1gkdmVyc2lvblkkYXJjaGl2ZXLRBgdUcm9vdIABqAkKFRYXGyIjVSRudWxs0wsMDQ4RElpOUy5vYmplY3RzViRjbGFzc1dOUy5rZXlzog8QgASABoAHohMUgAKAA1lhbGlhc0RhdGFccmVsYXRpdmVQYXRo0hgMGRpXTlMuZGF0YU8RAW4AAAAAAW4AAgAAA212IAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAMU5bQNIKwAAABCNbQ1jaTA1MDQ3NzAucGRmAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEI4wxcV9XQAAAAAAAAAAAAIAAwAACSAAAAAAAAAAAAAAAAAAAAAIYXJ0aWNsZXMAEAAIAADFObNTAAAAEQAIAADFxcOtAAAAAQAQABCNbQAKTIAACkxpAAB8EwACADBtdiA6VXNlcnM6cmd1aGE6RG9jdW1lbnRzOmFydGljbGVzOmNpMDUwNDc3MC5wZGYADgAcAA0AYwBpADAANQAwADQANwA3ADAALgBwAGQAZgAPAAgAAwBtAHYAIAASACxVc2Vycy9yZ3VoYS9Eb2N1bWVudHMvYXJ0aWNsZXMvY2kwNTA0NzcwLnBkZgATAAEvAAAVAAIADP//AACABdIcHR4fWCRjbGFzc2VzWiRjbGFzc25hbWWjHyAhXU5TTXV0YWJsZURhdGFWTlNEYXRhWE5TT2JqZWN0XxAmLi4vLi4vRG9jdW1lbnRzL2FydGljbGVzL2NpMDUwNDc3MC5wZGbSHB0kJaIlIVxOU0RpY3Rpb25hcnkSAAGGoF8QD05TS2V5ZWRBcmNoaXZlcgAIABEAFgAfACgAMgA1ADoAPABFAEsAUgBdAGQAbABvAHEAcwB1AHgAegB8AIYAkwCYAKACEgIUAhkCIgItAjECPwJGAk8CeAJ9AoACjQKSAAAAAAAAAgEAAAAAAAAAKAAAAAAAAAAAAAAAAAAAAqQ=},
	Bdsk-Url-1 = {http://dx.doi.org/10.1021/ci0504770}}

Downloads: 0