Optimization of the genotyping-by-sequencing strategy for population genomic analysis in conifers. Pan, J., Wang, B., Pei, Z., Zhao, W., Gao, J., Mao, J., & Wang, X. Molecular Ecology Resources, 15(4):711–722, 2015. _eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/1755-0998.12342
Optimization of the genotyping-by-sequencing strategy for population genomic analysis in conifers [link]Paper  doi  abstract   bibtex   
Flexibility and low cost make genotyping-by-sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI-MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference-free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000–11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking.
@article{pan_optimization_2015,
	title = {Optimization of the genotyping-by-sequencing strategy for population genomic analysis in conifers},
	volume = {15},
	issn = {1755-0998},
	url = {https://onlinelibrary.wiley.com/doi/abs/10.1111/1755-0998.12342},
	doi = {10.1111/1755-0998.12342},
	abstract = {Flexibility and low cost make genotyping-by-sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI-MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference-free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000–11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking.},
	language = {en},
	number = {4},
	urldate = {2023-04-27},
	journal = {Molecular Ecology Resources},
	author = {Pan, Jin and Wang, Baosheng and Pei, Zhi-Yong and Zhao, Wei and Gao, Jie and Mao, Jian-Feng and Wang, Xiao-Ru},
	year = {2015},
	note = {\_eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/1755-0998.12342},
	keywords = {SNP validation, conifer genome, genotyping-by-sequencing, library preparation parameters, sequencing depth},
	pages = {711--722},
}

Downloads: 0