Graph-based pan-genome: increased opportunities in plant genomics. Wang, S., Qian, Y., Zhao, R., Chen, L., & Song, J.
abstract   bibtex   
Due to the development of sequencing technology and the great reduction in sequencing costs, the genomes of an increasing number of plant species have been assembled, and the numerous genomes have revealed large amounts of variation. However, a single reference genome does not allow the exploration of species diversity; therefore, the concept of the pan-genome was developed. A pan-genome is a collection of all sequences available for a species, including a large number of consensus sequences, large structural variations (SVs), and small variations, including single nucleotide polymorphisms (SNPs) and insertions/deletions (InDels). A simple linear pan-genome does not allow these SVs to be intuitively characterized, so graph-based pan-genomes have been developed. These pan-genomes store sequence and SV information in the form of nodes and paths to store and display species variation information in a more intuitive manner. The key role of graph-based pan-genome is to expand the coordinate system of the linear reference genome to accommodate more regions of genetic diversity. Here, we review the origin and development of graph-based pan-genomes, explore their application in plant research, and further highlight the application of graph-based pan-genomes for future plant breeding.
@article{wang_graph-based_nodate,
	title = {Graph-based pan-genome: increased opportunities in plant genomics},
	abstract = {Due to the development of sequencing technology and the great reduction in sequencing costs, the genomes of an increasing number of plant species have been assembled, and the numerous genomes have revealed large amounts of variation. However, a single reference genome does not allow the exploration of species diversity; therefore, the concept of the pan-genome was developed. A pan-genome is a collection of all sequences available for a species, including a large number of consensus sequences, large structural variations (SVs), and small variations, including single nucleotide polymorphisms (SNPs) and insertions/deletions (InDels). A simple linear pan-genome does not allow these SVs to be intuitively characterized, so graph-based pan-genomes have been developed. These pan-genomes store sequence and SV information in the form of nodes and paths to store and display species variation information in a more intuitive manner. The key role of graph-based pan-genome is to expand the coordinate system of the linear reference genome to accommodate more regions of genetic diversity. Here, we review the origin and development of graph-based pan-genomes, explore their application in plant research, and further highlight the application of graph-based pan-genomes for future plant breeding.},
	author = {Wang, Shuo and Qian, Yongqing and Zhao, Rupeng and Chen, Lingling and Song, Jiaming},
	keywords = {pangenome, pangenome graph, review},
}

Downloads: 0