Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing. Xue, Y., Chen, Y., Ayub, Q., Huang, N., Ball, E. V, Mort, M., Phillips, A. D, Shaw, K., Stenson, P. D, Cooper, D. N, Tyler-Smith, C., & Consortium, T. 1. G. P. American journal of human genetics, 91(6):1022–1032, December, 2012.
Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing. [link]Paper  doi  abstract   bibtex   
We have assessed the numbers of potentially deleterious variants in the genomes of apparently healthy humans by using (1) low-coverage whole-genome sequence data from 179 individuals in the 1000 Genomes Pilot Project and (2) current predictions and databases of deleterious variants. Each individual carried 281-515 missense substitutions, 40-85 of which were homozygous, predicted to be highly damaging. They also carried 40-110 variants classified by the Human Gene Mutation Database (HGMD) as disease-causing mutations (DMs), 3-24 variants in the homozygous state, and many polymorphisms putatively associated with disease. Whereas many of these DMs are likely to represent disease-allele-annotation errors, between 0 and 8 DMs (0-1 homozygous) per individual are predicted to be highly damaging, and some of them provide information of medical relevance. These analyses emphasize the need for improved annotation of disease alleles both in mutation databases and in the primary literature; some HGMD mutation data have been recategorized on the basis of the present findings, an iterative process that is both necessary and ongoing. Our estimates of deleterious-allele numbers are likely to be subject to both overcounting and undercounting. However, our current best mean estimates of \textasciitilde400 damaging variants and \textasciitilde2 bona fide disease mutations per individual are likely to increase rather than decrease as sequencing studies ascertain rare variants more effectively and as additional disease alleles are discovered.
@article{xue_deleterious-_2012,
	title = {Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing.},
	volume = {91},
	url = {http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=23217326&retmode=ref&cmd=prlinks},
	doi = {10.1016/j.ajhg.2012.10.015},
	abstract = {We have assessed the numbers of potentially deleterious variants in the genomes of apparently healthy humans by using (1) low-coverage whole-genome sequence data from 179 individuals in the 1000 Genomes Pilot Project and (2) current predictions and databases of deleterious variants. Each individual carried 281-515 missense substitutions, 40-85 of which were homozygous, predicted to be highly damaging. They also carried 40-110 variants classified by the Human Gene Mutation Database (HGMD) as disease-causing mutations (DMs), 3-24 variants in the homozygous state, and many polymorphisms putatively associated with disease. Whereas many of these DMs are likely to represent disease-allele-annotation errors, between 0 and 8 DMs (0-1 homozygous) per individual are predicted to be highly damaging, and some of them provide information of medical relevance. These analyses emphasize the need for improved annotation of disease alleles both in mutation databases and in the primary literature; some HGMD mutation data have been recategorized on the basis of the present findings, an iterative process that is both necessary and ongoing. Our estimates of deleterious-allele numbers are likely to be subject to both overcounting and undercounting. However, our current best mean estimates of {\textbackslash}textasciitilde400 damaging variants and {\textbackslash}textasciitilde2 bona fide disease mutations per individual are likely to increase rather than decrease as sequencing studies ascertain rare variants more effectively and as additional disease alleles are discovered.},
	language = {English},
	number = {6},
	journal = {American journal of human genetics},
	author = {Xue, Yali and Chen, Yuan and Ayub, Qasim and Huang, Ni and Ball, Edward V and Mort, Matthew and Phillips, Andrew D and Shaw, Katy and Stenson, Peter D and Cooper, David N and Tyler-Smith, Chris and Consortium, The 1000 Genomes Project},
	month = dec,
	year = {2012},
	pmid = {23217326},
	pmcid = {PMC3516590},
	pages = {1022--1032},
}

Downloads: 0