PGen: large-scale genomic variations analysis workflow and browser in SoyKB. Liu, Y., Khan, S. M., Wang, J., Rynge, M., Zhang, Y., Zeng, S., Chen, S., Maldonado dos Santos, J. V., Valliyodan, B., Calyam, P. P., Merchant, N., Nguyen, H. T., Xu, D., & Joshi, T. BMC Bioinformatics, 17(13):337, 2016.
PGen: large-scale genomic variations analysis workflow and browser in SoyKB [link]Paper  doi  abstract   bibtex   
With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed PGen, an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way.
@Article{	  liu2016,
  Author	= {Liu, Yang and Khan, Saad M. and Wang, Juexin and Rynge,
		  Mats and Zhang, Yuanxun and Zeng, Shuai and Chen, Shiyuan
		  and Maldonado dos Santos, Joao V. and Valliyodan, Babu and
		  Calyam, Prasad P. and Merchant, Nirav and Nguyen, Henry T.
		  and Xu, Dong and Joshi, Trupti},
  Title		= {PGen: large-scale genomic variations analysis workflow and
		  browser in SoyKB},
  Journal	= {BMC Bioinformatics},
  Year		= {2016},
  Volume	= {17},
  Number	= {13},
  Pages		= {337},
  Abstract	= {With the advances in next-generation sequencing (NGS)
		  technology and significant reductions in sequencing costs,
		  it is now possible to sequence large collections of
		  germplasm in crops for detecting genome-scale genetic
		  variations and to apply the knowledge towards improvements
		  in traits. To efficiently facilitate large-scale NGS
		  resequencing data analysis of genomic variations, we have
		  developed PGen, an integrated and optimized workflow using
		  the Extreme Science and Engineering Discovery Environment
		  (XSEDE) high-performance computing (HPC) virtual system,
		  iPlant cloud data storage resources and Pegasus workflow
		  management system (Pegasus-WMS). The workflow allows users
		  to identify single nucleotide polymorphisms (SNPs) and
		  insertion-deletions (indels), perform SNP annotations and
		  conduct copy number variation analyses on multiple
		  resequencing datasets in a user-friendly and seamless
		  way.},
  ISSN		= {1471-2105},
  DOI		= {10.1186/s12859-016-1227-y},
  URL		= {http://dx.doi.org/10.1186/s12859-016-1227-y}
}

Downloads: 0