Cross-Language Source Code Plagiarism Detection using Explicit Semantic Analysis and Scored Greedy String Tilling. Foltýnek, T., Vsiansky, R., Meuschke, N., Dlabolova, D., & Gipp, B. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (JCDL), Virtual Event, August, 2020. Venue Rating: CORE A*
Cross-Language Source Code Plagiarism Detection using Explicit Semantic Analysis and Scored Greedy String Tilling [pdf]Paper  doi  abstract   bibtex   
We present a method for source code plagiarism detection that is independent of the programming language. Our method EsaGst combines Explicit Semantic Analysis and Greedy String Tiling. Using 25 cases of source code plagiarism in C++, Java, JavaScript, PHP, and Python, we show that EsaGst outperforms a baseline method in identifying plagiarism across programming languages.

Downloads: 0