Your browser doesn't support javascript.
loading
FlashPCA2: principal component analysis of Biobank-scale genotype datasets.
Abraham, Gad; Qiu, Yixuan; Inouye, Michael.
Afiliación
  • Abraham G; Centre for Systems Genomics, School of BioSciences.
  • Qiu Y; Department of Pathology, University of Melbourne, Parkville, VIC 3010, Australia.
  • Inouye M; Department of Statistics, Purdue University, West Lafayette, IN 47907-2066, USA.
Bioinformatics ; 33(17): 2776-2778, 2017 Sep 01.
Article en En | MEDLINE | ID: mdl-28475694
ABSTRACT
MOTIVATION Principal component analysis (PCA) is a crucial step in quality control of genomic data and a common approach for understanding population genetic structure. With the advent of large genotyping studies involving hundreds of thousands of individuals, standard approaches are no longer feasible. However, when the full decomposition is not required, substantial computational savings can be made.

RESULTS:

We present FlashPCA2, a tool that can perform partial PCA on 1 million individuals faster than competing approaches, while requiring substantially less memory. AVAILABILITY AND IMPLEMENTATION https//github.com/gabraham/flashpca . CONTACT gad.abraham@unimelb.edu.au. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Genómica / Análisis de Componente Principal / Técnicas de Genotipaje / Genética de Población Límite: Humans Idioma: En Año: 2017 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Genómica / Análisis de Componente Principal / Técnicas de Genotipaje / Genética de Población Límite: Humans Idioma: En Año: 2017 Tipo del documento: Article