Identification of genetic outliers due to sub-structure and cryptic relationships.
Schlauch, Daniel; Fier, Heide; Lange, Christoph.
Affiliation
  • Schlauch D; Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, MA 02115, USA.
  • Fier H; Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA 02115, USA.
  • Lange C; Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, MA 02115, USA.
Bioinformatics ; 33(13): 1972-1979, 2017 Jul 01.
Article in English | MEDLINE | ID: mdl-28334167
ABSTRACT
MOTIVATION:

In order to minimize the effects of genetic confounding on the analysis of high-throughput genetic association studies, e.g. (whole-genome) sequencing (WGS) studies, genome-wide association studies (GWAS), etc., we propose a general framework to assess and to test formally for genetic heterogeneity among study subjects. As the approach fully utilizes the recent ancestor information captured by rare variants, it is especially powerful in WGS studies. Even for relatively moderate sample sizes, the proposed testing framework is able to identify study subjects that are genetically too similar, e.g. cryptic relationships, or that are genetically too different, e.g. population substructure. The approach is computationally fast, enabling the application to whole-genome sequencing data, and straightforward to implement.

RESULTS:

Simulation studies illustrate the overall performance of our approach. In an application to the 1000 Genomes Project, we outline an analysis/cleaning pipeline that utilizes our approach to formally assess whether study subjects are related and whether population substructure is present. In the analysis of the 1000 Genomes Project data, our approach revealed subjects that are most likely related, but had previously passed standard QC filters.

AVAILABILITY AND IMPLEMENTATION:

An implementation of our method, Similarity Test for Estimating Genetic Outliers (STEGO), is available in the R package stego from GitHub at https://github.com/dschlauch/stego.

CONTACT:

dschlauch@fas.harvard.edu

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.
Subjects

Full text: 1 | Database: MEDLINE | Main subject: Software / Human Genome / Genetic Association Studies / Whole Genome Sequencing | Type of study: Diagnostic_studies | Limits: Humans | Language: En | Year of publication: 2017 | Document type: Article
