PerfScope: Practical Online Server Performance Bug Inference in Production Cloud Computing Infrastructures. Dean, D. J., Nguyen, H., Gu, X., Zhang, H., Rhee, J., Arora, N., & Jiang, G. In Proc. of the ACM Symp. on Cloud Computing, 2014.
abstract   bibtex   
Performance bugs which manifest in a production cloud computing infrastructure are notoriously difficult to diagnose because of both the difficulty of reproducing those bugs and the lack of debugging information. In this paper, we present PerfScope, a practical online performance bug inference tool to help the developer understand how a performance bug happened during the production run. PerfScope achieves online bug inference to obviate the need for offline bug reproduction. PerfScope does not require application source code or any runtime instrumentation to the production system. PerfScope is application-agnostic, which can support both interpreted and compiled programs running inside a cloud infrastructure. We have implemented PerfScope and tested it using real performance bugs on seven popular open source server systems (Hadoop, HDFS, Cassandra, Tomcat, Apache, Lighttpd, MySQL). The results show that PerfScope can narrow down the search scope of the bug-related functions to a small percentage (0.03-2.3%) and rank the real bug-related functions within top five candidates in the majority of cases. PerfScope only imposes on average 1.8% runtime overhead to the tested server applications.
@inproceedings{dean_perfscope:_2014,
	title = {{PerfScope}: {Practical} {Online} {Server} {Performance} {Bug} {Inference} in {Production} {Cloud} {Computing} {Infrastructures}},
	abstract = {Performance bugs which manifest in a production cloud computing infrastructure are notoriously difficult to diagnose because of both the difficulty of reproducing those bugs and the lack of debugging information. In this paper, we present PerfScope, a practical online performance bug inference tool to help the developer understand how a performance bug happened during the production run. PerfScope achieves online bug inference to obviate the need for offline bug reproduction. PerfScope does not require application source code or any runtime instrumentation to the production system. PerfScope is application-agnostic, which can support both interpreted and compiled programs running inside a cloud infrastructure. We have implemented PerfScope and tested it using real performance bugs on seven popular open source server systems (Hadoop, HDFS, Cassandra, Tomcat, Apache, Lighttpd, MySQL). The results show that PerfScope can narrow down the search scope of the bug-related functions to a small percentage (0.03-2.3\%) and rank the real bug-related functions within top five candidates in the majority of cases. PerfScope only imposes on average 1.8\% runtime overhead to the tested server applications.},
	booktitle = {Proc. of the {ACM} {Symp}. on {Cloud} {Computing}},
	author = {Dean, Daniel J. and Nguyen, Hiep and Gu, Xiaohui and Zhang, Hui and Rhee, Junghwan and Arora, Nipun and Jiang, Geoff},
	year = {2014},
	keywords = {performance, reliability, testing and debugging, veniok}
}

Downloads: 0