High precision but variable recall – comparing the performance of five deduplication tools. Janka, H. & Metzendorf, M. Journal of EAHIL, 20(1):12–17, March, 2024.
Deduplication methods for multiple database searches conducted for evidence syntheses differ in terms of time invested, accuracy, and comprehensiveness of identified duplicates. Deduplication tools can significantly contribute to a more efficient conduct of the search task in evidence syntheses. Widely used tools for deduplication include reference management software (e.g. EndNote), built-in deduplication features in systematic review software (e.g. Covidence, Rayyan), and automated deduplication tools (e.g. Deduklick, SRA Deduplicator). Newer tools leverage machine learning algorithms crafted by information specialists that encompass natural language normalization and rule-based approaches. We investigated five frequently used automated and semi-automated deduplication tools regarding their performance, core features, and time efficiency in comparison to manual deduplication in EndNote using six datasets.
@article{janka_high_2024,
	title = {High precision but variable recall – comparing the performance of five deduplication tools},
	volume = {20},
	copyright = {Copyright (c) 2024 Heidrun Ilonka Janka},
	issn = {1841-0715},
	url = {https://ojs.eahil.eu/JEAHIL/article/view/607},
	doi = {10.32384/jeahil20607},
	abstract = {Deduplication methods for multiple database searches conducted for evidence syntheses differ in terms of time invested, accuracy, and comprehensiveness of identified duplicates. Deduplication tools can significantly contribute to a more efficient conduct of the search task in evidence syntheses. Widely used tools for deduplication include reference management software (e.g. EndNote), built-in deduplication features in systematic review software (e.g. Covidence, Rayyan), and automated deduplication tools (e.g. Deduklick, SRA Deduplicator). Newer tools leverage machine learning algorithms crafted by information specialists that encompass natural language normalization and rule-based approaches. We investigated five frequently used automated and semi-automated deduplication tools regarding their performance, core features, and time efficiency in comparison to manual deduplication in EndNote using six datasets.},
	language = {en},
	number = {1},
	urldate = {2025-10-02},
	journal = {Journal of EAHIL},
	author = {Janka, Heidrun and Metzendorf, Maria-Inti},
	month = mar,
	year = {2024},
	keywords = {\_annoté\_FF},
	pages = {12--17},
}
