Evaluating Dynamic Software Update Safety Using Systematic Testing. Hayden, C., Smith, E., Hardisty, E., Hicks, M., & Foster, J. IEEE Trans. Softw. Eng., 38(6):1340--1354, 2012.
abstract   bibtex   
Dynamic software updating (DSU) systems patch programs on the fly without incurring downtime. To avoid failures due to the updating process itself, many DSU systems employ timing restrictions. However, timing restrictions are theoretically imperfect, and their practical effectiveness is an open question. This paper presents the first significant empirical evaluation of three popular timing restrictions: activeness safety (AS), which prevents updates to active functions, con-freeness safety (CFS), which only allows modifications to active functions when doing so is provably type-safe, and manual identification of the event-handling loops during which an update may occur. We evaluated these timing restrictions using a series of DSU patches to three programs: OpenSSH, vsftpd, and ngIRCd. We systematically applied updates at each distinct update point reached during execution of a suite of system tests for these programs to determine which updates pass and which fail. We found that all three timing restrictions prevented most failures, but only manual identification allowed none. Further, although CFS and AS allowed many more update points, manual identification still supported updates with minimal delay. Finally, we found that manual identification required the least developer effort. Overall, we conclude that manual identification is most effective.
@article{hayden_evaluating_2012,
	title = {Evaluating {Dynamic} {Software} {Update} {Safety} {Using} {Systematic} {Testing}},
	volume = {38},
	abstract = {Dynamic software updating (DSU) systems patch programs on the fly without incurring downtime. To avoid failures due to the updating process itself, many DSU systems employ timing restrictions. However, timing restrictions are theoretically imperfect, and their practical effectiveness is an open question. This paper presents the first significant empirical evaluation of three popular timing restrictions: activeness safety (AS), which prevents updates to active functions, con-freeness safety (CFS), which only allows modifications to active functions when doing so is provably type-safe, and manual identification of the event-handling loops during which an update may occur. We evaluated these timing restrictions using a series of DSU patches to three programs: OpenSSH, vsftpd, and ngIRCd. We systematically applied updates at each distinct update point reached during execution of a suite of system tests for these programs to determine which updates pass and which fail. We found that all three timing restrictions prevented most failures, but only manual identification allowed none. Further, although CFS and AS allowed many more update points, manual identification still supported updates with minimal delay. Finally, we found that manual identification required the least developer effort. Overall, we conclude that manual identification is most effective.},
	number = {6},
	journal = {IEEE Trans. Softw. Eng.},
	author = {Hayden, C.M. and Smith, E.K. and Hardisty, E.A. and Hicks, M. and Foster, J.S.},
	year = {2012},
	keywords = {AS, CFS, DSU systems, Dynamic software updating (DSU), OpenSSH, Servers, VSFTPD, active functions, activeness safety, con-freeness safety, dynamic software updating safety evaluation, event-handling loop identification, failure prevention, hot-swapping, manual identification, ngIRCd, program testing, program tracing, safety-critical software, software fault tolerance, software maintenance, software reliability, software testing, systematic testing, testing, timing restrictions},
	pages = {1340--1354}
}

Downloads: 0