TEIMMA: The First Content Reuse Annotator for Text, Images, and Math. Satpute, A., Greiner-Petter, A., Schubotz, M., Meuschke, N., Aizawa, A., Teschke, O., & Gipp, B. In 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL), pages 271–273, Santa Fe, NM, USA, June, 2023. IEEE.
This demo paper presents TEIMMA, the first tool for annotating the reuse of text, images, and mathematical formulae in a document pair. Annotating content reuse is particularly useful for developing plagiarism detection algorithms. Real-world content reuse is often obfuscated, which makes such cases challenging to identify. TEIMMA lets users enter the obfuscation type, enabling novel classifications of confirmed plagiarism cases. It supports recording different reuse types for text, images, and mathematical formulae in HTML, and it helps users by visualizing the content reuse in a document pair using similarity detection methods for text and math.
