Pesquisa | Biblioteca Virtual em Saúde

Benchmarking computational variant effect predictors by their ability to infer human traits.

Tabet, Daniel R; Kuang, Da; Lancaster, Megan C; Li, Roujia; Liu, Karen; Weile, Jochen; Coté, Atina G; Wu, Yingzhou; Hegele, Robert A; Roden, Dan M; Roth, Frederick P.

Genome Biol ; 25(1): 172, 2024 07 01.

Artigo em Inglês | MEDLINE | ID: mdl-38951922

RESUMO

BACKGROUND: Computational variant effect predictors offer a scalable and increasingly reliable means of interpreting human genetic variation, but concerns of circularity and bias have limited previous methods for evaluating and comparing predictors. Population-level cohorts of genotyped and phenotyped participants that have not been used in predictor training can facilitate an unbiased benchmarking of available methods. Using a curated set of human gene-trait associations with a reported rare-variant burden association, we evaluate the correlations of 24 computational variant effect predictors with associated human traits in the UK Biobank and All of Us cohorts. RESULTS: AlphaMissense outperformed all other predictors in inferring human traits based on rare missense variants in UK Biobank and All of Us participants. The overall rankings of computational variant effect predictors in these two cohorts showed a significant positive correlation. CONCLUSION: We describe a method to assess computational variant effect predictors that sidesteps the limitations of previous evaluations. This approach is generalizable to future predictors and could continue to inform predictor choice for personal and clinical genetics.

Assuntos

Benchmarking , Variação Genética , Humanos , Fenótipo , Biologia Computacional/métodos , Genótipo

Minimum information and guidelines for reporting a multiplexed assay of variant effect.

Claussnitzer, Melina; Parikh, Victoria N; Wagner, Alex H; Arbesfeld, Jeremy A; Bult, Carol J; Firth, Helen V; Muffley, Lara A; Nguyen Ba, Alex N; Riehle, Kevin; Roth, Frederick P; Tabet, Daniel; Bolognesi, Benedetta; Glazer, Andrew M; Rubin, Alan F.

Genome Biol ; 25(1): 100, 2024 04 19.

Artigo em Inglês | MEDLINE | ID: mdl-38641812

RESUMO

Multiplexed assays of variant effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines have led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.

Assuntos

Metadados , Projetos de Pesquisa , Reprodutibilidade dos Testes

Pacybara: accurate long-read sequencing for barcoded mutagenized allelic libraries.

Weile, Jochen; Ferra, Gabrielle; Boyle, Gabriel; Pendyala, Sriram; Amorosi, Clara; Yeh, Chiann-Ling; Cote, Atina G; Kishore, Nishka; Tabet, Daniel; van Loggerenberg, Warren; Rayhan, Ashyad; Fowler, Douglas M; Dunham, Maitreya J; Roth, Frederick P.

Bioinformatics ; 40(4)2024 Mar 29.

Artigo em Inglês | MEDLINE | ID: mdl-38569896

RESUMO

MOTIVATION: Long-read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. RESULTS: Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or nonunique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues. AVAILABILITY AND IMPLEMENTATION: Pacybara, freely available at https://github.com/rothlab/pacybara, is implemented using R, Python, and bash for Linux. It runs on GNU/Linux HPC clusters via Slurm, PBS, or GridEngine schedulers. A single-machine simplex version is also available.

Assuntos

Sequenciamento de Nucleotídeos em Larga Escala , Software , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biblioteca Gênica , Genótipo , Análise por Conglomerados

Minimum information and guidelines for reporting a Multiplexed Assay of Variant Effect.

ArXiv ; 2023 Jun 26.

Artigo em Inglês | MEDLINE | ID: mdl-37426450

RESUMO

Multiplexed Assays of Variant Effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines has led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.

Genome-scale mapping of DNA damage suppressors through phenotypic CRISPR-Cas9 screens.

Zhao, Yichao; Tabet, Daniel; Rubio Contreras, Diana; Lao, Linjiang; Kousholt, Arne Nedergaard; Weile, Jochen; Melo, Henrique; Hoeg, Lisa; Feng, Sumin; Coté, Atina G; Lin, Zhen-Yuan; Setiaputra, Dheva; Jonkers, Jos; Gingras, Anne-Claude; Gómez Herreros, Fernando; Roth, Frederick P; Durocher, Daniel.

Mol Cell ; 83(15): 2792-2809.e9, 2023 08 03.

Artigo em Inglês | MEDLINE | ID: mdl-37478847

RESUMO

To maintain genome integrity, cells must accurately duplicate their genome and repair DNA lesions when they occur. To uncover genes that suppress DNA damage in human cells, we undertook flow-cytometry-based CRISPR-Cas9 screens that monitored DNA damage. We identified 160 genes whose mutation caused spontaneous DNA damage, a list enriched in essential genes, highlighting the importance of genomic integrity for cellular fitness. We also identified 227 genes whose mutation caused DNA damage in replication-perturbed cells. Among the genes characterized, we discovered that deoxyribose-phosphate aldolase DERA suppresses DNA damage caused by cytarabine (Ara-C) and that GNB1L, a gene implicated in 22q11.2 syndrome, promotes biogenesis of ATR and related phosphatidylinositol 3-kinase-related kinases (PIKKs). These results implicate defective PIKK biogenesis as a cause of some phenotypes associated with 22q11.2 syndrome. The phenotypic mapping of genes that suppress DNA damage therefore provides a rich resource to probe the cellular pathways that influence genome maintenance.

Assuntos

Sistemas CRISPR-Cas , Dano ao DNA , Humanos , Mutação , Reparo do DNA , Fenótipo

A comprehensive map of human glucokinase variant activity.

Gersing, Sarah; Cagiada, Matteo; Gebbia, Marinella; Gjesing, Anette P; Coté, Atina G; Seesankar, Gireesh; Li, Roujia; Tabet, Daniel; Weile, Jochen; Stein, Amelie; Gloyn, Anna L; Hansen, Torben; Roth, Frederick P; Lindorff-Larsen, Kresten; Hartmann-Petersen, Rasmus.

Genome Biol ; 24(1): 97, 2023 04 26.

Artigo em Inglês | MEDLINE | ID: mdl-37101203

RESUMO

BACKGROUND: Glucokinase (GCK) regulates insulin secretion to maintain appropriate blood glucose levels. Sequence variants can alter GCK activity to cause hyperinsulinemic hypoglycemia or hyperglycemia associated with GCK-maturity-onset diabetes of the young (GCK-MODY), collectively affecting up to 10 million people worldwide. Patients with GCK-MODY are frequently misdiagnosed and treated unnecessarily. Genetic testing can prevent this but is hampered by the challenge of interpreting novel missense variants. RESULT: Here, we exploit a multiplexed yeast complementation assay to measure both hyper- and hypoactive GCK variation, capturing 97% of all possible missense and nonsense variants. Activity scores correlate with in vitro catalytic efficiency, fasting glucose levels in carriers of GCK variants and with evolutionary conservation. Hypoactive variants are concentrated at buried positions, near the active site, and at a region of known importance for GCK conformational dynamics. Some hyperactive variants shift the conformational equilibrium towards the active state through a relative destabilization of the inactive conformation. CONCLUSION: Our comprehensive assessment of GCK variant activity promises to facilitate variant interpretation and diagnosis, expand our mechanistic understanding of hyperactive variants, and inform development of therapeutics targeting GCK.

Assuntos

Diabetes Mellitus Tipo 2 , Glucoquinase , Humanos , Glucoquinase/genética , Glucoquinase/química , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/diagnóstico , Mutação de Sentido Incorreto , Testes Genéticos , Mutação

Pacybara: Accurate long-read sequencing for barcoded mutagenized allelic libraries.

bioRxiv ; 2023 Dec 07.

Artigo em Inglês | MEDLINE | ID: mdl-36865234

RESUMO

Long read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or non-unique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues.

Scalable Functional Assays for the Interpretation of Human Genetic Variation.

Tabet, Daniel; Parikh, Victoria; Mali, Prashant; Roth, Frederick P; Claussnitzer, Melina.

Annu Rev Genet ; 56: 441-465, 2022 11 30.

Artigo em Inglês | MEDLINE | ID: mdl-36055970

RESUMO

Scalable sequence-function studies have enabled the systematic analysis and cataloging of hundreds of thousands of coding and noncoding genetic variants in the human genome. This has improved clinical variant interpretation and provided insights into the molecular, biophysical, and cellular effects of genetic variants at an astonishing scale and resolution across the spectrum of allele frequencies. In this review, we explore current applications and prospects for the field and outline the principles underlying scalable functional assay design, with a focus on the study of single-nucleotide coding and noncoding variants.

Assuntos

Variação Genética , Genoma Humano , Humanos , Genoma Humano/genética

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA