Search | BVS Bolivia

Benchmarking computational variant effect predictors by their ability to infer human traits.

Tabet, Daniel R; Kuang, Da; Lancaster, Megan C; Li, Roujia; Liu, Karen; Weile, Jochen; Coté, Atina G; Wu, Yingzhou; Hegele, Robert A; Roden, Dan M; Roth, Frederick P.

Genome Biol ; 25(1): 172, 2024 07 01.

Article in English | MEDLINE | ID: mdl-38951922

ABSTRACT

BACKGROUND: Computational variant effect predictors offer a scalable and increasingly reliable means of interpreting human genetic variation, but concerns of circularity and bias have limited previous methods for evaluating and comparing predictors. Population-level cohorts of genotyped and phenotyped participants that have not been used in predictor training can facilitate an unbiased benchmarking of available methods. Using a curated set of human gene-trait associations with a reported rare-variant burden association, we evaluate the correlations of 24 computational variant effect predictors with associated human traits in the UK Biobank and All of Us cohorts. RESULTS: AlphaMissense outperformed all other predictors in inferring human traits based on rare missense variants in UK Biobank and All of Us participants. The overall rankings of computational variant effect predictors in these two cohorts showed a significant positive correlation. CONCLUSION: We describe a method to assess computational variant effect predictors that sidesteps the limitations of previous evaluations. This approach is generalizable to future predictors and could continue to inform predictor choice for personal and clinical genetics.

Subject(s)

Benchmarking , Genetic Variation , Humans , Phenotype , Computational Biology/methods , Genotype
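The abstract above reports that the overall predictor rankings in the two cohorts were significantly positively correlated. As a rough illustration of what such a comparison involves, the sketch below computes a Spearman rank correlation between two sets of per-predictor scores. The abstract does not say which statistic the authors used, so Spearman is an assumption here, and the predictor names and score values are made up for the example.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-predictor performance scores in two cohorts (made-up numbers,
# not taken from the paper).
cohort_a = {"predictor_1": 0.30, "predictor_2": 0.25, "predictor_3": 0.10, "predictor_4": 0.18}
cohort_b = {"predictor_1": 0.28, "predictor_2": 0.20, "predictor_3": 0.12, "predictor_4": 0.22}

names = sorted(cohort_a)
rho = spearman([cohort_a[n] for n in names], [cohort_b[n] for n in names])
print(round(rho, 3))
```

A value of rho near 1 would mean the two cohorts order the predictors almost identically; judging significance would additionally require a test such as a permutation test, which this sketch omits.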

Pacybara: accurate long-read sequencing for barcoded mutagenized allelic libraries.

Weile, Jochen; Ferra, Gabrielle; Boyle, Gabriel; Pendyala, Sriram; Amorosi, Clara; Yeh, Chiann-Ling; Cote, Atina G; Kishore, Nishka; Tabet, Daniel; van Loggerenberg, Warren; Rayhan, Ashyad; Fowler, Douglas M; Dunham, Maitreya J; Roth, Frederick P.

Bioinformatics ; 40(4), 2024 03 29.

Article in English | MEDLINE | ID: mdl-38569896

ABSTRACT

MOTIVATION: Long-read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. RESULTS: Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or nonunique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues. AVAILABILITY AND IMPLEMENTATION: Pacybara, freely available at https://github.com/rothlab/pacybara, is implemented using R, Python, and bash for Linux. It runs on GNU/Linux HPC clusters via Slurm, PBS, or GridEngine schedulers. A single-machine simplex version is also available.

Subject(s)

High-Throughput Nucleotide Sequencing , Software , DNA Sequence Analysis/methods , High-Throughput Nucleotide Sequencing/methods , Gene Library , Genotype , Cluster Analysis
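The Pacybara abstract above describes clustering reads by the similarity of error-prone barcodes and detecting barcodes associated with multiple genotypes. The sketch below is a toy illustration of that general idea only; it is not Pacybara's algorithm or interface. The greedy seeding strategy, the Hamming-distance threshold, the `min_fraction` cutoff, and all barcodes and genotype labels are assumptions invented for the example.

```python
from collections import Counter, defaultdict


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("barcodes must have equal length")
    return sum(x != y for x, y in zip(a, b))


def cluster_barcodes(reads, max_dist=1):
    """Greedily group reads whose barcodes lie within max_dist of a cluster seed.

    reads: list of (barcode, genotype) tuples. Seeds are taken in order of
    decreasing barcode abundance, so frequent barcodes absorb rarer,
    likely erroneous neighbours.
    """
    counts = Counter(barcode for barcode, _ in reads)
    seeds = []
    assignment = {}
    for barcode, _ in counts.most_common():
        for seed in seeds:
            if hamming(barcode, seed) <= max_dist:
                assignment[barcode] = seed
                break
        else:
            # No existing seed is close enough: this barcode starts a cluster.
            seeds.append(barcode)
            assignment[barcode] = barcode
    clusters = defaultdict(list)
    for barcode, genotype in reads:
        clusters[assignment[barcode]].append(genotype)
    return dict(clusters)


def ambiguous_clusters(clusters, min_fraction=0.25):
    """Return seeds whose second most common genotype makes up at least
    min_fraction of the cluster's reads (a hypothetical cutoff)."""
    flagged = []
    for seed, genotypes in clusters.items():
        ranked = Counter(genotypes).most_common()
        if len(ranked) > 1 and ranked[1][1] / len(genotypes) >= min_fraction:
            flagged.append(seed)
    return flagged


# Made-up reads: one barcode with a single sequencing error, and one barcode
# that is linked to two different genotypes.
reads = [
    ("ACGTAC", "A12V"), ("ACGTAC", "A12V"), ("ACGTAC", "A12V"),
    ("ACGTAT", "A12V"),  # one mismatch from ACGTAC
    ("TTGCAA", "G45D"), ("TTGCAA", "G45D"),
    ("TTGCAA", "L7P"), ("TTGCAA", "L7P"),
]

clusters = cluster_barcodes(reads)
print(clusters)
print(ambiguous_clusters(clusters))
```

Real long-read barcodes also contain insertions and deletions, so an edit distance rather than the Hamming distance would be needed in practice; the fixed-length comparison here is a simplification.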

Minimum information and guidelines for reporting a multiplexed assay of variant effect.

Claussnitzer, Melina; Parikh, Victoria N; Wagner, Alex H; Arbesfeld, Jeremy A; Bult, Carol J; Firth, Helen V; Muffley, Lara A; Nguyen Ba, Alex N; Riehle, Kevin; Roth, Frederick P; Tabet, Daniel; Bolognesi, Benedetta; Glazer, Andrew M; Rubin, Alan F.

Genome Biol ; 25(1): 100, 2024 04 19.

Article in English | MEDLINE | ID: mdl-38641812

ABSTRACT

Multiplexed assays of variant effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines have led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.

Subject(s)

Metadata , Research Design , Reproducibility of Results

Genome-scale mapping of DNA damage suppressors through phenotypic CRISPR-Cas9 screens.

Zhao, Yichao; Tabet, Daniel; Rubio Contreras, Diana; Lao, Linjiang; Kousholt, Arne Nedergaard; Weile, Jochen; Melo, Henrique; Hoeg, Lisa; Feng, Sumin; Coté, Atina G; Lin, Zhen-Yuan; Setiaputra, Dheva; Jonkers, Jos; Gingras, Anne-Claude; Gómez Herreros, Fernando; Roth, Frederick P; Durocher, Daniel.

Mol Cell ; 83(15): 2792-2809.e9, 2023 08 03.

Article in English | MEDLINE | ID: mdl-37478847

ABSTRACT

To maintain genome integrity, cells must accurately duplicate their genome and repair DNA lesions when they occur. To uncover genes that suppress DNA damage in human cells, we undertook flow-cytometry-based CRISPR-Cas9 screens that monitored DNA damage. We identified 160 genes whose mutation caused spontaneous DNA damage, a list enriched in essential genes, highlighting the importance of genomic integrity for cellular fitness. We also identified 227 genes whose mutation caused DNA damage in replication-perturbed cells. Among the genes characterized, we discovered that deoxyribose-phosphate aldolase DERA suppresses DNA damage caused by cytarabine (Ara-C) and that GNB1L, a gene implicated in 22q11.2 syndrome, promotes biogenesis of ATR and related phosphatidylinositol 3-kinase-related kinases (PIKKs). These results implicate defective PIKK biogenesis as a cause of some phenotypes associated with 22q11.2 syndrome. The phenotypic mapping of genes that suppress DNA damage therefore provides a rich resource to probe the cellular pathways that influence genome maintenance.

Subject(s)

CRISPR-Cas Systems , DNA Damage , Humans , Mutation , DNA Repair , Phenotype

Minimum information and guidelines for reporting a Multiplexed Assay of Variant Effect.

ArXiv ; 2023 Jun 26.

Article in English | MEDLINE | ID: mdl-37426450

ABSTRACT

Multiplexed Assays of Variant Effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines have led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.

A comprehensive map of human glucokinase variant activity.

Gersing, Sarah; Cagiada, Matteo; Gebbia, Marinella; Gjesing, Anette P; Coté, Atina G; Seesankar, Gireesh; Li, Roujia; Tabet, Daniel; Weile, Jochen; Stein, Amelie; Gloyn, Anna L; Hansen, Torben; Roth, Frederick P; Lindorff-Larsen, Kresten; Hartmann-Petersen, Rasmus.

Genome Biol ; 24(1): 97, 2023 04 26.

Article in English | MEDLINE | ID: mdl-37101203

ABSTRACT

BACKGROUND: Glucokinase (GCK) regulates insulin secretion to maintain appropriate blood glucose levels. Sequence variants can alter GCK activity to cause hyperinsulinemic hypoglycemia or hyperglycemia associated with GCK-maturity-onset diabetes of the young (GCK-MODY), collectively affecting up to 10 million people worldwide. Patients with GCK-MODY are frequently misdiagnosed and treated unnecessarily. Genetic testing can prevent this but is hampered by the challenge of interpreting novel missense variants. RESULTS: Here, we exploit a multiplexed yeast complementation assay to measure both hyper- and hypoactive GCK variation, capturing 97% of all possible missense and nonsense variants. Activity scores correlate with in vitro catalytic efficiency, with fasting glucose levels in carriers of GCK variants, and with evolutionary conservation. Hypoactive variants are concentrated at buried positions, near the active site, and at a region of known importance for GCK conformational dynamics. Some hyperactive variants shift the conformational equilibrium towards the active state through a relative destabilization of the inactive conformation. CONCLUSION: Our comprehensive assessment of GCK variant activity promises to facilitate variant interpretation and diagnosis, expand our mechanistic understanding of hyperactive variants, and inform development of therapeutics targeting GCK.

Subject(s)

Type 2 Diabetes Mellitus , Glucokinase , Humans , Glucokinase/genetics , Glucokinase/chemistry , Type 2 Diabetes Mellitus/genetics , Type 2 Diabetes Mellitus/diagnosis , Missense Mutation , Genetic Testing , Mutation

Pacybara: Accurate long-read sequencing for barcoded mutagenized allelic libraries.

bioRxiv ; 2023 Dec 07.

Article in English | MEDLINE | ID: mdl-36865234

ABSTRACT

Long-read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or non-unique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues.

Scalable Functional Assays for the Interpretation of Human Genetic Variation.

Tabet, Daniel; Parikh, Victoria; Mali, Prashant; Roth, Frederick P; Claussnitzer, Melina.

Annu Rev Genet ; 56: 441-465, 2022 11 30.

Article in English | MEDLINE | ID: mdl-36055970

ABSTRACT

Scalable sequence-function studies have enabled the systematic analysis and cataloging of hundreds of thousands of coding and noncoding genetic variants in the human genome. This has improved clinical variant interpretation and provided insights into the molecular, biophysical, and cellular effects of genetic variants at an astonishing scale and resolution across the spectrum of allele frequencies. In this review, we explore current applications and prospects for the field and outline the principles underlying scalable functional assay design, with a focus on the study of single-nucleotide coding and noncoding variants.

Subject(s)

Genetic Variation , Human Genome , Humans , Human Genome/genetics
