Your browser doesn't support javascript.
loading
Low frequency variants, collapsed based on biological knowledge, uncover complexity of population stratification in 1000 genomes project data.
Moore, Carrie B; Wallace, John R; Wolfe, Daniel J; Frase, Alex T; Pendergrass, Sarah A; Weiss, Kenneth M; Ritchie, Marylyn D.
Afiliación
  • Moore CB; Center for Human Genetic Research, Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America ; Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, T
  • Wallace JR; Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America.
  • Wolfe DJ; Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America.
  • Frase AT; Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America.
  • Pendergrass SA; Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America.
  • Weiss KM; Department of Anthropology, The Pennsylvania State University, University Park, Pennsylvania, United States of America.
  • Ritchie MD; Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America.
PLoS Genet ; 9(12): e1003959, 2013.
Article en En | MEDLINE | ID: mdl-24385916
ABSTRACT
Analyses investigating low frequency variants have the potential for explaining additional genetic heritability of many complex human traits. However, the natural frequencies of rare variation between human populations strongly confound genetic analyses. We have applied a novel collapsing method to identify biological features with low frequency variant burden differences in thirteen populations sequenced by the 1000 Genomes Project. Our flexible collapsing tool utilizes expert biological knowledge from multiple publicly available database sources to direct feature selection. Variants were collapsed according to genetically driven features, such as evolutionary conserved regions, regulatory regions genes, and pathways. We have conducted an extensive comparison of low frequency variant burden differences (MAF<0.03) between populations from 1000 Genomes Project Phase I data. We found that on average 26.87% of gene bins, 35.47% of intergenic bins, 42.85% of pathway bins, 14.86% of ORegAnno regulatory bins, and 5.97% of evolutionary conserved regions show statistically significant differences in low frequency variant burden across populations from the 1000 Genomes Project. The proportion of bins with significant differences in low frequency burden depends on the ancestral similarity of the two populations compared and types of features tested. Even closely related populations had notable differences in low frequency burden, but fewer differences than populations from different continents. Furthermore, conserved or functionally relevant regions had fewer significant differences in low frequency burden than regions under less evolutionary constraint. This degree of low frequency variant differentiation across diverse populations and feature elements highlights the critical importance of considering population stratification in the new era of DNA sequencing and low frequency variant genomic analyses.
Asunto(s)

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Variación Genética / Genoma Humano / Genética de Población Tipo de estudio: Prognostic_studies Idioma: En Revista: PLoS Genet Asunto de la revista: GENETICA Año: 2013 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Variación Genética / Genoma Humano / Genética de Población Tipo de estudio: Prognostic_studies Idioma: En Revista: PLoS Genet Asunto de la revista: GENETICA Año: 2013 Tipo del documento: Article