The topological nature of tag jumping in environmental DNA metabarcoding studies. Rodriguez-Martinez, S., Klaminder, J., Morlock, M. A., Dalén, L., & Huang, D. Y. Molecular Ecology Resources, 23(3):621–631, April, 2023. Publisher: John Wiley & Sons, Ltd
The topological nature of tag jumping in environmental DNA metabarcoding studies [link]Paper  doi  abstract   bibtex   
Abstract Metabarcoding of environmental DNA constitutes a state-of-the-art tool for environmental studies. One fundamental principle implicit in most metabarcoding studies is that individual sample amplicons can still be identified after being pooled with others?based on their unique combinations of tags?during the so-called demultiplexing step that follows sequencing. Nevertheless, it has been recognized that tags can sometimes be changed (i.e., tag jumping), which ultimately leads to sample crosstalk. Here, using four DNA metabarcoding data sets derived from the analysis of soils and sediments, we show that tag jumping follows very specific and systematic patterns. Specifically, we find a strong correlation between the number of reads in blank samples and their topological position in the tag matrix (described by vertical and horizontal vectors). This observed spatial pattern of artefactual sequences could be explained by polymerase activity, which leads to the exchange of the 3? tag of single stranded tagged sequences through the formation of heteroduplexes with mixed barcodes. Importantly, tag jumping substantially distorted our data sets?despite our use of methods suggested to minimize this error. We developed a topological model to estimate the noise based on the counts in our blanks, which suggested that 40%?80% of the taxa in our soil and sedimentary samples were likely false positives introduced through tag jumping. We highlight that the amount of false positive detections caused by tag jumping strongly biased our community analyses.
@article{rodriguez-martinez_topological_2023,
	title = {The topological nature of tag jumping in environmental {DNA} metabarcoding studies},
	volume = {23},
	issn = {1755-098X},
	url = {https://doi.org/10.1111/1755-0998.13745},
	doi = {10.1111/1755-0998.13745},
	abstract = {Abstract Metabarcoding of environmental DNA constitutes a state-of-the-art tool for environmental studies. One fundamental principle implicit in most metabarcoding studies is that individual sample amplicons can still be identified after being pooled with others?based on their unique combinations of tags?during the so-called demultiplexing step that follows sequencing. Nevertheless, it has been recognized that tags can sometimes be changed (i.e., tag jumping), which ultimately leads to sample crosstalk. Here, using four DNA metabarcoding data sets derived from the analysis of soils and sediments, we show that tag jumping follows very specific and systematic patterns. Specifically, we find a strong correlation between the number of reads in blank samples and their topological position in the tag matrix (described by vertical and horizontal vectors). This observed spatial pattern of artefactual sequences could be explained by polymerase activity, which leads to the exchange of the 3? tag of single stranded tagged sequences through the formation of heteroduplexes with mixed barcodes. Importantly, tag jumping substantially distorted our data sets?despite our use of methods suggested to minimize this error. We developed a topological model to estimate the noise based on the counts in our blanks, which suggested that 40\%?80\% of the taxa in our soil and sedimentary samples were likely false positives introduced through tag jumping. We highlight that the amount of false positive detections caused by tag jumping strongly biased our community analyses.},
	number = {3},
	urldate = {2023-07-22},
	journal = {Molecular Ecology Resources},
	author = {Rodriguez-Martinez, Saul and Klaminder, Jonatan and Morlock, Marina A. and Dalén, Love and Huang, Doreen Yu-Tuan},
	month = apr,
	year = {2023},
	note = {Publisher: John Wiley \& Sons, Ltd},
	keywords = {a-DNA, detection limits, e-DNA, false positive, index hopping, sample crosstalk},
	pages = {621--631},
}

Downloads: 0