PrivacyCAT: Privacy-Aware Code Analysis at Scale. Mao, K., Åhs, C. T, Cela, S., Distefano, D., Gardner, N., Grigore, R., Gustafsson, P., Hajdu, Á., Kapus, T., Marescotti, M., Cunha Sampaio, G., & Suzanne, T. In Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Practice, pages 106–117, 2024. ACM.
PrivacyCAT: Privacy-Aware Code Analysis at Scale [pdf]Pdf  PrivacyCAT: Privacy-Aware Code Analysis at Scale [link]Link  doi  abstract   bibtex   8 downloads  
Static and dynamic code analyses have been widely adopted in industry to enhance software reliability, security, and performance by automatically detecting bugs in the code. In this paper, we introduce PrivacyCAT, a code analysis system developed and deployed at WhatsApp to protect user privacy. PrivacyCAT automatically detects privacy defects in code at early stages (before reaching production and affecting users), and therefore, it prevents such vulnerabilities from evolving into privacy incidents. PrivacyCAT comprises of a collection of static and dynamic taint analysers. We report on the technical development of PrivacyCAT and the results of two years of its large-scale industrial deployment at WhatsApp. We present our experience in designing its system architecture, and continuous integration process. We discuss the unique challenges encountered in developing and deploying such kind of analyses within an industrial context. Since its deployment in 2021, PrivacyCAT has safeguarded data privacy in $74%$ of privacy site events (SEVs). It has prevented 493 potential privacy SEVs from being introduced into the codebases, enabling developers to maintain a high privacy standard for the code that supports over two billion WhatsApp users.

Downloads: 8