Your browser doesn't support javascript.
loading
Accurate, scalable cohort variant calls using DeepVariant and GLnexus.
Yun, Taedong; Li, Helen; Chang, Pi-Chuan; Lin, Michael F; Carroll, Andrew; McLean, Cory Y.
Afiliação
  • Yun T; Google Health, Cambridge, MA 02142, USA.
  • Li H; Google Health, Palo Alto, CA 94304, USA.
  • Chang PC; Google Health, Palo Alto, CA 94304, USA.
  • Lin MF; mlin.net LLC, Honolulu, HI 96816, USA.
  • Carroll A; Google Health, Palo Alto, CA 94304, USA.
  • McLean CY; Google Health, Cambridge, MA 02142, USA.
Bioinformatics ; 36(24): 5582-5589, 2021 Apr 05.
Article em En | MEDLINE | ID: mdl-33399819
MOTIVATION: Population-scale sequenced cohorts are foundational resources for genetic analyses, but processing raw reads into analysis-ready cohort-level variants remains challenging. RESULTS: We introduce an open-source cohort-calling method that uses the highly accurate caller DeepVariant and scalable merging tool GLnexus. Using callset quality metrics based on variant recall and precision in benchmark samples and Mendelian consistency in father-mother-child trios, we optimize the method across a range of cohort sizes, sequencing methods and sequencing depths. The resulting callsets show consistent quality improvements over those generated using existing best practices with reduced cost. We further evaluate our pipeline in the deeply sequenced 1000 Genomes Project (1KGP) samples and show superior callset quality metrics and imputation reference panel performance compared to an independently generated GATK Best Practices pipeline. AVAILABILITY AND IMPLEMENTATION: We publicly release the 1KGP individual-level variant calls and cohort callset (https://console.cloud.google.com/storage/browser/brain-genomics-public/research/cohort/1KGP) to foster additional development and evaluation of cohort merging methods as well as broad studies of genetic variation. Both DeepVariant (https://github.com/google/deepvariant) and GLnexus (https://github.com/dnanexus-rnd/GLnexus) are open-source, and the optimized GLnexus setup discovered in this study is also integrated into GLnexus public releases v1.2.2 and later. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Guideline Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Guideline Idioma: En Ano de publicação: 2021 Tipo de documento: Article