Search | Portal Regional de la BVS

1.

Long-read single-cell RNA sequencing enables the study of cancer subclone-specific genotype and phenotype in chronic lymphocytic leukemia.

Black, Gage S; Huang, Xiaomeng; Qiao, Yi; Moos, Philip; Sampath, Deepa; Stephens, Deborah M; Woyach, Jennifer A; Marth, Gabor T.

bioRxiv ; 2024 Mar 16.

Article in English | MEDLINE | ID: mdl-38559060

ABSTRACT

Bruton's tyrosine kinase (BTK) inhibitors are effective for the treatment of chronic lymphocytic leukemia (CLL) due to BTK's role in B cell survival and proliferation. Treatment resistance is most commonly caused by the emergence of the hallmark BTK C481S mutation that inhibits drug binding. In this study, we aimed to investigate whether the presence of additional CLL driver mutations in cancer subclones harboring a BTK C481S mutation accelerates subclone expansion. In addition, we sought to determine whether BTK-mutated subclones exhibit distinct transcriptomic behavior when compared to other cancer subclones. To achieve these goals, we employed our recently published method (Qiao et al. 2024) that combines bulk DNA sequencing and single-cell RNA sequencing (scRNA-seq) data to genotype individual cells for the presence or absence of subclone-defining mutations. While the most common approach for scRNA-seq relies on short-read sequencing, transcript coverage is limited because the vast majority of the reads are concentrated at the priming end of the transcript. Here, we utilized MAS-seq, a long-read scRNA-seq technology, to substantially increase transcript coverage across the entire length of the transcripts and expand the set of informative mutations to link cells to cancer subclones in six CLL patients who acquired BTK C481S mutations during BTK inhibitor treatment. We found that BTK-mutated subclones often acquire additional mutations in CLL driver genes, leading to faster subclone proliferation. When examining subclone-specific gene expression, we found that in one patient, BTK-mutated subclones are transcriptionally distinct from the rest of the malignant B cell population, with an overexpression of CLL-relevant genes.

2.

Differences in molecular sampling and data processing explain variation among single-cell and single-nucleus RNA-seq experiments.

Chamberlin, John T; Lee, Younghee; Marth, Gabor T; Quinlan, Aaron R.

Genome Res ; 34(2): 179-188, 2024 Mar 20.

Article in English | MEDLINE | ID: mdl-38355308

ABSTRACT

A mechanistic understanding of the biological and technical factors that impact transcript measurements is essential to designing and analyzing single-cell and single-nucleus RNA sequencing experiments. Nuclei contain the same pre-mRNA population as cells, but they contain a small subset of the mRNAs. Nonetheless, early studies argued that single-nucleus analysis yielded results comparable to cellular samples if pre-mRNA measurements were included. However, typical workflows do not distinguish between pre-mRNA and mRNA when estimating gene expression, and variation in their relative abundances across cell types has received limited attention. These gaps are especially important given that incorporating pre-mRNA has become commonplace for both assays, despite known gene length bias in pre-mRNA capture. Here, we reanalyze public data sets from mouse and human to describe the mechanisms and contrasting effects of mRNA and pre-mRNA sampling on gene expression and marker gene selection in single-cell and single-nucleus RNA-seq. We show that pre-mRNA levels vary considerably among cell types, which mediates the degree of gene length bias and limits the generalizability of a recently published normalization method intended to correct for this bias. As an alternative, we repurpose an existing post hoc gene length-based correction method from conventional RNA-seq gene set enrichment analysis. Finally, we show that inclusion of pre-mRNA in bioinformatic processing can impart a larger effect than assay choice itself, which is pivotal to the effective reuse of existing data. These analyses advance our understanding of the sources of variation in single-cell and single-nucleus RNA-seq experiments and provide useful guidance for future studies.

Subject(s)

Cell Nucleus ; RNA Precursors ; Humans ; Animals ; Mice ; RNA-Seq ; RNA, Messenger/genetics ; Sequence Analysis, RNA/methods ; Cell Nucleus/genetics ; Gene Expression Profiling/methods ; Single-Cell Analysis

3.

A Bayesian framework to study tumor subclone-specific expression by combining bulk DNA and single-cell RNA sequencing data.

Qiao, Yi; Huang, Xiaomeng; Moos, Philip J; Ahmann, Jonathan M; Pomicter, Anthony D; Deininger, Michael W; Byrd, John C; Woyach, Jennifer A; Stephens, Deborah M; Marth, Gabor T.

Genome Res ; 34(1): 94-105, 2024 Feb 07.

Article in English | MEDLINE | ID: mdl-38195207

ABSTRACT

Genetic and gene expression heterogeneity is an essential hallmark of many tumors, allowing the cancer to evolve and to develop resistance to treatment. Currently, the most commonly used data types for studying genetic and gene expression heterogeneity are, respectively, bulk tumor/normal whole-genome or whole-exome sequencing (WGS, WES) and single-cell RNA sequencing (scRNA-seq). However, tools are currently lacking to link genomic tumor subclonality with transcriptomic heterogeneity by integrating genomic and single-cell transcriptomic data collected from the same tumor. To address this gap, we developed scBayes, a Bayesian probabilistic framework that uses tumor subclonal structure inferred from bulk DNA sequencing data to determine the subclonal identity of cells from single-cell gene expression (scRNA-seq) measurements. Grouping together cells representing the same genetically defined tumor subclones allows comparison of gene expression across different subclones, or investigation of gene expression changes within the same subclone across time (i.e., progression, treatment response, or relapse) or space (i.e., at multiple metastatic sites and organs). We used simulated data sets, in silico synthetic data sets, as well as biological data sets generated from cancer samples to extensively characterize and validate the performance of our method, as well as to show improvements over existing methods. We show the validity and utility of our approach by applying it to published data sets and recapitulating the findings, as well as arriving at novel insights into cancer subclonal expression behavior in our own data sets. We further show that our method is applicable to a wide range of single-cell sequencing technologies including single-cell DNA sequencing as well as Smart-seq and 10x Genomics scRNA-seq protocols.
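Illustrative note: the idea of assigning a cell to a genetically defined subclone from its read evidence can be sketched as a small Bayesian calculation. The following is a minimal sketch under stated assumptions, not the scBayes implementation: the function name `assign_cell`, the binomial read model, the fixed alternate-allele fraction of 0.5 for a variant that is present, and the 1% error rate are all hypothetical choices made for illustration.

```python
import math


def assign_cell(read_counts, subclone_genotypes, priors=None,
                alt_frac_if_present=0.5, error_rate=0.01):
    """Return the posterior probability of each subclone for one cell.

    read_counts: {variant_id: (alt_reads, ref_reads)} observed in the cell.
    subclone_genotypes: {subclone_name: set of variant_ids it carries}.
    All parameter values are illustrative assumptions.
    """
    names = list(subclone_genotypes)
    if priors is None:
        # Uniform prior over subclones when none is supplied.
        priors = {name: 1.0 / len(names) for name in names}

    log_post = {}
    for name in names:
        lp = math.log(priors[name])
        for variant, (alt, ref) in read_counts.items():
            # Expected alternate-allele read fraction under this subclone.
            carried = variant in subclone_genotypes[name]
            p_alt = alt_frac_if_present if carried else error_rate
            # Binomial log-likelihood; the binomial coefficient is the same
            # for every subclone, so it cancels and is omitted.
            lp += alt * math.log(p_alt) + ref * math.log(1.0 - p_alt)
        log_post[name] = lp

    # Normalize in log space for numerical stability.
    top = max(log_post.values())
    weights = {name: math.exp(lp - top) for name, lp in log_post.items()}
    total = sum(weights.values())
    return {name: w / total for name, w in weights.items()}


# Hypothetical example: three subclones defined by two variants.
subclones = {"normal": set(), "C1": {"v1"}, "C2": {"v1", "v2"}}
cell = {"v1": (3, 2), "v2": (0, 4)}
posterior = assign_cell(cell, subclones)
```

In this toy example the cell shows alternate reads at `v1` but only reference reads at `v2`, so the posterior favors the subclone that carries `v1` alone. A real analysis would also need to handle allelic dropout and expression imbalance, which this sketch ignores.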

Subject(s)

Neoplasms ; Humans ; Exome Sequencing ; Bayes Theorem ; Neoplasms/genetics ; Gene Expression Profiling/methods ; Sequence Analysis, RNA/methods ; Single-Cell Analysis/methods

4.

quickBAM: a parallelized BAM file access API for high-throughput sequence analysis informatics.

Pitman, Anders; Huang, Xiaomeng; Marth, Gabor T; Qiao, Yi.

Bioinformatics ; 39(8), 2023 Aug 01.

Article in English | MEDLINE | ID: mdl-37498562

ABSTRACT

MOTIVATION: In time-critical clinical settings, such as precision medicine, genomic data needs to be processed as fast as possible to arrive at data-informed treatment decisions in a timely fashion. While sequencing throughput has dramatically increased over the past decade, bioinformatics analysis throughput has not been able to keep up with the pace of computer hardware improvement, and consequently has now turned into the primary bottleneck. Modern computer hardware is capable of much higher performance than current genomic informatics algorithms can typically utilize, which presents opportunities for significant performance improvement. Accessing the raw sequencing data from BAM files, for example, is a necessary and time-consuming step in nearly all sequence analysis tools; however, existing programming libraries for BAM access do not take full advantage of the parallel input/output capabilities of storage devices. RESULTS: In an effort to stimulate the development of a new generation of faster sequence analysis tools, we developed quickBAM, a software library to accelerate sequencing data access by exploiting the parallelism in commodity storage hardware currently widely available. We demonstrate that analysis software ported to quickBAM consistently outperforms its original version, in some cases finishing an analysis in under 3 min while the original version took 1.5 h, using the same storage solution. AVAILABILITY AND IMPLEMENTATION: quickBAM is open source and freely available at https://gitlab.com/yiq/quickbam/. We envision that quickBAM will enable a new generation of high-performance informatics tools, either directly boosting their performance if they are currently bottlenecked by data access, or allowing data access to keep up with further optimizations in algorithms and compute techniques.
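Illustrative note: the general idea of exploiting parallel input/output can be sketched by splitting a file into byte ranges and reading them concurrently. This is a minimal sketch of generic parallel range reads, assuming an ordinary uncompressed file; it is not the quickBAM API (quickBAM is a separate library, and real BAM files are block-compressed, so a practical reader would need to split on block boundaries, typically guided by an index). The function names `read_range` and `parallel_read` are hypothetical.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def read_range(path, start, length):
    """Read `length` bytes from `path`, beginning at byte offset `start`."""
    # Each task opens its own handle so that seeks do not interfere.
    with open(path, "rb") as handle:
        handle.seek(start)
        return handle.read(length)


def parallel_read(path, n_workers=4):
    """Read a whole file as concurrent byte ranges and return its bytes."""
    size = os.path.getsize(path)
    # Ceiling division so that the ranges cover the entire file.
    chunk = max(1, -(-size // n_workers))
    ranges = [(start, min(chunk, size - start))
              for start in range(0, size, chunk)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        # map() preserves input order, so parts join back in file order.
        parts = pool.map(lambda r: read_range(path, *r), ranges)
        return b"".join(parts)
```

Whether this pattern is faster than a single sequential read depends on the storage device and file system; the sketch only shows the access pattern, not a performance claim.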

Subject(s)

Algorithms ; Software ; High-Throughput Nucleotide Sequencing/methods ; Genomics ; Informatics ; Sequence Analysis, DNA/methods

5.

Archetype tasks link intratumoral heterogeneity to plasticity and cancer hallmarks in small cell lung cancer.

Groves, Sarah M; Ildefonso, Geena V; McAtee, Caitlin O; Ozawa, Patricia M M; Ireland, Abbie S; Stauffer, Philip E; Wasdin, Perry T; Huang, Xiaomeng; Qiao, Yi; Lim, Jing Shan; Bader, Jackie; Liu, Qi; Simmons, Alan J; Lau, Ken S; Iams, Wade T; Hardin, Doug P; Saff, Edward B; Holmes, William R; Tyson, Darren R; Lovly, Christine M; Rathmell, Jeffrey C; Marth, Gabor; Sage, Julien; Oliver, Trudy G; Weaver, Alissa M; Quaranta, Vito.

Cell Syst ; 13(9): 690-710.e17, 2022 Sep 21.

Article in English | MEDLINE | ID: mdl-35981544

ABSTRACT

Small cell lung cancer (SCLC) tumors comprise heterogeneous mixtures of cell states, categorized into neuroendocrine (NE) and non-neuroendocrine (non-NE) transcriptional subtypes. NE to non-NE state transitions, fueled by plasticity, likely underlie adaptability to treatment and dismal survival rates. Here, we apply an archetypal analysis to model plasticity by recasting SCLC phenotypic heterogeneity through multi-task evolutionary theory. Cell line and tumor transcriptomics data fit well in a five-dimensional convex polytope whose vertices optimize tasks reminiscent of pulmonary NE cells, the SCLC normal counterparts. These tasks, supported by knowledge and experimental data, include proliferation, slithering, metabolism, secretion, and injury repair, reflecting cancer hallmarks. SCLC subtypes, either at the population or single-cell level, can be positioned in archetypal space by bulk or single-cell transcriptomics, respectively, and characterized as task specialists or multi-task generalists by the distance from archetype vertex signatures. In the archetype space, modeling single-cell plasticity as a Markovian process along an underlying state manifold indicates that task trade-offs, in response to microenvironmental perturbations or treatment, may drive cell plasticity. Stifling phenotypic transitions and plasticity may provide new targets for much-needed translational advances in SCLC. A record of this paper's Transparent Peer Review process is included in the supplemental information.

Subject(s)

Lung Neoplasms ; Small Cell Lung Carcinoma ; Cell Plasticity ; Humans ; Lung Neoplasms/genetics ; Lung Neoplasms/pathology ; Small Cell Lung Carcinoma/genetics ; Small Cell Lung Carcinoma/metabolism ; Small Cell Lung Carcinoma/pathology

6.

Author Correction: Gene.iobio: an interactive web tool for versatile, clinically-driven variant interrogation and prioritization.

Di Sera, Tonya; Velinder, Matt; Ward, Alistair; Qiao, Yi; Georges, Stephanie; Miller, Chase; Pitman, Anders; Richards, Will; Ekawade, Aditya; Viskochil, David; Carey, John C; Pace, Laura; Bale, Jim; Clardy, Stacey L; Andrews, Ashley; Botto, Lorenzo; Marth, Gabor.

Sci Rep ; 12(1): 5800, 2022 Apr 06.

Article in English | MEDLINE | ID: mdl-35388139

7.

Subclonal evolution of CLL driver mutations is associated with relapse in ibrutinib- and acalabrutinib-treated patients.

Black, Gage S; Huang, Xiaomeng; Qiao, Yi; Tarapcsak, Szabolcs; Rogers, Kerry A; Misra, Shrilekha; Byrd, John C; Marth, Gabor T; Stephens, Deborah M; Woyach, Jennifer A.

Blood ; 140(4): 401-405, 2022 Jul 28.

Article in English | MEDLINE | ID: mdl-35476648

Subject(s)

Leukemia, Lymphocytic, Chronic, B-Cell ; Adenine/analogs & derivatives ; Benzamides ; Humans ; Leukemia, Lymphocytic, Chronic, B-Cell/drug therapy ; Leukemia, Lymphocytic, Chronic, B-Cell/genetics ; Mutation ; Neoplasm Recurrence, Local ; Piperidines ; Protein Kinase Inhibitors/therapeutic use ; Pyrazines

8.

A human breast cancer-derived xenograft and organoid platform for drug discovery and precision oncology.

Guillen, Katrin P; Fujita, Maihi; Butterfield, Andrew J; Scherer, Sandra D; Bailey, Matthew H; Chu, Zhengtao; DeRose, Yoko S; Zhao, Ling; Cortes-Sanchez, Emilio; Yang, Chieh-Hsiang; Toner, Jennifer; Wang, Guoying; Qiao, Yi; Huang, Xiaomeng; Greenland, Jeffery A; Vahrenkamp, Jeffery M; Lum, David H; Factor, Rachel E; Nelson, Edward W; Matsen, Cindy B; Poretta, Jane M; Rosenthal, Regina; Beck, Anna C; Buys, Saundra S; Vaklavas, Christos; Ward, John H; Jensen, Randy L; Jones, Kevin B; Li, Zheqi; Oesterreich, Steffi; Dobrolecki, Lacey E; Pathi, Satya S; Woo, Xing Yi; Berrett, Kristofer C; Wadsworth, Mark E; Chuang, Jeffrey H; Lewis, Michael T; Marth, Gabor T; Gertz, Jason; Varley, Katherine E; Welm, Bryan E; Welm, Alana L.

Nat Cancer ; 3(2): 232-250, 2022 Feb.

Article in English | MEDLINE | ID: mdl-35221336

ABSTRACT

Models that recapitulate the complexity of human tumors are urgently needed to develop more effective cancer therapies. We report a bank of human patient-derived xenografts (PDXs) and matched organoid cultures from tumors that represent the greatest unmet need: endocrine-resistant, treatment-refractory and metastatic breast cancers. We leverage matched PDXs and PDX-derived organoids (PDxO) for drug screening that is feasible and cost-effective with in vivo validation. Moreover, we demonstrate the feasibility of using these models for precision oncology in real time with clinical care in a case of triple-negative breast cancer (TNBC) with early metastatic recurrence. Our results uncovered a Food and Drug Administration (FDA)-approved drug with high efficacy against the models. Treatment with this therapy resulted in a complete response for the individual and a progression-free survival (PFS) period more than three times longer than their previous therapies. This work provides valuable methods and resources for functional precision medicine and drug development for human breast cancer.

Subject(s)

Organoids ; Triple Negative Breast Neoplasms ; Drug Discovery ; Heterografts ; Humans ; Precision Medicine/methods ; Triple Negative Breast Neoplasms/drug therapy ; United States ; Xenograft Model Antitumor Assays

9.

Comprehensive variant calling from whole-genome sequencing identifies a complex inversion that disrupts ZFPM2 in familial congenital diaphragmatic hernia.

Nicholas, Thomas J; Al-Sweel, Najla; Farrell, Andrew; Mao, Rong; Bayrak-Toydemir, Pinar; Miller, Christine E; Bentley, Dawn; Palmquist, Rachel; Moore, Barry; Hernandez, Edgar J; Cormier, Michael J; Fredrickson, Eric; Noble, Katherine; Rynearson, Shawn; Holt, Carson; Karren, Mary Anne; Bonkowsky, Joshua L; Tristani-Firouzi, Martin; Yandell, Mark; Marth, Gabor; Quinlan, Aaron R; Brunelli, Luca; Toydemir, Reha M; Shayota, Brian J; Carey, John C; Boyden, Steven E; Malone Jenkins, Sabrina.

Mol Genet Genomic Med ; 10(4): e1888, 2022 Apr.

Article in English | MEDLINE | ID: mdl-35119225

ABSTRACT

BACKGROUND: Genetic disorders contribute to significant morbidity and mortality in critically ill newborns. Despite advances in genome sequencing technologies, a majority of neonatal cases remain unsolved. Complex structural variants (SVs) often elude conventional genome sequencing variant calling pipelines and will explain a portion of these unsolved cases. METHODS: As part of the Utah NeoSeq project, we used a research-based, rapid whole-genome sequencing (WGS) protocol to investigate the genomic etiology for a newborn with a left-sided congenital diaphragmatic hernia (CDH) and cardiac malformations, whose mother also had a history of CDH and atrial septal defect. RESULTS: Using both a novel alignment-free variant caller and traditional alignment-based variant callers, we identified a maternally inherited complex SV on chromosome 8, consisting of an inversion flanked by deletions. This complex inversion, further confirmed using orthogonal molecular techniques, disrupts the ZFPM2 gene, which is associated with both CDH and various congenital heart defects. CONCLUSIONS: Our results demonstrate that complex structural events, which often are unidentifiable or not reported by clinically validated testing procedures, can be discovered and accurately characterized with conventional short-read sequencing, and they underscore the utility of WGS as a first-line diagnostic tool.

Subject(s)

Hernias, Diaphragmatic, Congenital ; DNA-Binding Proteins/genetics ; Genomics ; Hernias, Diaphragmatic, Congenital/genetics ; Humans ; Infant, Newborn ; Transcription Factors/genetics ; Whole Genome Sequencing/methods

10.

Clin.iobio: A Collaborative Diagnostic Workflow to Enable Team-Based Precision Genomics.

Ward, Alistair; Velinder, Matt; Di Sera, Tonya; Ekawade, Aditya; Malone Jenkins, Sabrina; Moore, Barry; Mao, Rong; Bayrak-Toydemir, Pinar; Marth, Gabor.

J Pers Med ; 12(1), 2022 Jan 08.

Article in English | MEDLINE | ID: mdl-35055388

ABSTRACT

The primary goal of precision genomics is the identification of causative genetic variants in targeted or whole-genome sequencing data. The ultimate clinical hope is that these findings lead to an efficacious change in treatment for the patient. In current clinical practice, these findings are typically returned by expert analysts as static, text-based reports. Ideally, these reports summarize the quality of the data obtained, integrate known gene-phenotype associations, follow allele segregation and affected status within the sequenced samples, and weigh computational evidence of pathogenicity. These findings are used to prioritize the variant(s) most likely to cause the given patient's phenotypes. In most diagnostic settings, a team of experts contributes to these reports, including bioinformaticians, clinicians, and genetic counselors, among others. However, these experts often do not have the necessary tools to review genomic findings, test genetic hypotheses, or query specific gene and variant information. Additionally, team members often rely on different tools and methods based on their given expertise, resulting in further difficulties in communicating and discussing genomic findings. Here, we present clin.iobio, a web-based solution to collaborative genomic analysis that enables diagnostic team members to focus on their area of expertise within the diagnostic process, while allowing them to easily review and contribute to all steps of the diagnostic process. Clin.iobio integrates tools from the popular iobio genomic visualization suite into a comprehensive diagnostic workflow, encompassing (1) genomic data quality review, (2) dynamic phenotype-driven gene prioritization, (3) variant prioritization using a comprehensive set of knowledge bases and annotations, and (4) an exportable findings summary. In conclusion, clin.iobio is a comprehensive solution to team-based precision genomics, the findings of which stand to inform genomic considerations in clinical practice.

11.

The Extracellular Milieu of Toxoplasma's Lytic Cycle Drives Lab Adaptation, Primarily by Transcriptional Reprogramming.

Primo, Vincent A; Rezvani, Yasaman; Farrell, Andrew; Murphy, Connor Q; Lou, Jingjing; Vajdi, Amir; Marth, Gabor T; Zarringhalam, Kourosh; Gubbels, Marc-Jan.

mSystems ; 6(6): e0119621, 2021 Dec 21.

Article in English | MEDLINE | ID: mdl-34874774

ABSTRACT

Evolve and resequencing (E&R) was applied to lab adaptation of Toxoplasma gondii for over 1,500 generations with the goal of mapping host-independent in vitro virulence traits. Phenotypic assessments of steps across the lytic cycle revealed that only traits needed in the extracellular milieu evolved. Nonsynonymous single-nucleotide polymorphisms (SNPs) in only one gene, a P4 flippase, reached fixation across two different evolving populations, whereas dramatic changes in the transcriptional signature of extracellular parasites were identified. Newly developed computational tools correlated phenotypes evolving at different rates with specific transcriptomic changes. A set of 300 phenotype-associated genes was mapped, of which nearly 50% are annotated as hypothetical. Validation of a select number of genes by knockouts confirmed their role in lab adaptation and highlighted novel mechanisms underlying in vitro virulence traits. Further analyses of differentially expressed genes revealed the development of a "pro-tachyzoite" profile as well as the upregulation of the fatty acid biosynthesis (FASII) pathway. The latter aligned with both the P4 flippase SNP and a low abundance of medium-chain fatty acids at low passage, indicating this is a limiting factor in extracellular parasites. In addition, partial overlap with the bradyzoite differentiation transcriptome in extracellular parasites indicated that stress pathways are involved in both situations. This was reflected in the partial overlap between the assembled ApiAP2 and Myb transcription factor network underlying the adapting extracellular state with the bradyzoite differentiation program. Overall, E&R is a new genomic tool successfully applied to map the development of polygenic traits underlying in vitro virulence of T. gondii. IMPORTANCE: It has been well established that prolonged in vitro cultivation of Toxoplasma gondii augments progression of the lytic cycle. This lab adaptation results in increased capacities to divide, migrate, and survive outside a host cell, all of which are considered host-independent virulence factors. However, the mechanistic basis underlying these enhanced virulence features is unknown. Here, E&R was utilized to empirically characterize the phenotypic, genomic, and transcriptomic changes in the non-lab-adapted strain, GT1, during 2.5 years of lab adaptation. This identified the shutdown of stage differentiation and upregulation of lipid biosynthetic pathways as the key processes being modulated. Furthermore, lab adaptation was primarily driven by transcriptional reprogramming, which rejected the starting hypothesis that genetic mutations would drive lab adaptation. Overall, the work empirically shows that lab adaptation augments T. gondii's in vitro virulence by transcriptional reprogramming and that E&R is a powerful new tool to map multigenic traits.

12.

Gene.iobio: an interactive web tool for versatile, clinically-driven variant interrogation and prioritization.

Di Sera, Tonya; Velinder, Matt; Ward, Alistair; Qiao, Yi; Georges, Stephanie; Miller, Chase; Pitman, Anders; Richards, Will; Ekawade, Aditya; Viskochil, David; Carey, John C; Pace, Laura; Bale, Jim; Clardy, Stacey L; Andrews, Ashley; Botto, Lorenzo; Marth, Gabor.

Sci Rep ; 11(1): 20307, 2021 Oct 13.

Article in English | MEDLINE | ID: mdl-34645894

ABSTRACT

With increasing utilization of comprehensive genomic data to guide clinical care, anticipated to become the standard of care in many clinical settings, the practice of diagnostic medicine is undergoing a notable shift. However, the move from single-gene or panel-based genetic testing to exome and genome sequencing has not been matched by the development of tools to enable diagnosticians to interpret increasingly complex or uncertain genomic findings. Here, we present gene.iobio, a real-time, intuitive and interactive web application for clinically-driven variant interrogation and prioritization. We show gene.iobio is a novel and effective approach that significantly improves upon and reimagines existing methods. In a radical departure from existing methods that present variants and genomic data in text and table formats, gene.iobio provides an interactive, intuitive and visually-driven analysis environment. We demonstrate that adoption of gene.iobio in clinical and research settings empowers clinical care providers to interact directly with patient genomic data both for establishing clinical diagnoses and informing patient care, using sophisticated genomic analyses that previously were only accessible via complex command line tools.

Subject(s)

Computational Biology/methods ; Genomics/methods ; Adult ; Algorithms ; Alleles ; Databases, Genetic ; Exome ; Genetic Testing ; Humans ; Internet ; Male ; Phenotype ; Receptors, Cell Surface/genetics ; Sequence Analysis, DNA ; Software ; Vacuolar Proton-Translocating ATPases/genetics ; Exome Sequencing

13.

Novel temporal and spatial patterns of metastatic colonization from breast cancer rapid-autopsy tumor biopsies.

Huang, Xiaomeng; Qiao, Yi; Brady, Samuel W; Factor, Rachel E; Downs-Kelly, Erinn; Farrell, Andrew; McQuerry, Jasmine A; Shrestha, Gajendra; Jenkins, David; Johnson, W Evan; Cohen, Adam L; Bild, Andrea H; Marth, Gabor T.

Genome Med ; 13(1): 170, 2021 Oct 28.

Article in English | MEDLINE | ID: mdl-34711268

ABSTRACT

BACKGROUND: Metastatic breast cancer is a deadly disease with a low 5-year survival rate. Tracking metastatic spread in living patients is difficult, and the process is thus poorly understood. METHODS: Via rapid autopsy, we collected 30 tumor samples over 3 timepoints and across 8 organs from a triple-negative metastatic breast cancer patient. The large number of sites sampled, together with deep whole-genome sequencing and advanced computational analysis, allowed us to comprehensively reconstruct the tumor's evolution at subclonal resolution. RESULTS: The most distinctive, previously unreported aspect of the tumor's evolution that we observed in this patient was the presence of "subclone incubators," defined as metastatic sites where substantial tumor evolution occurs before colonization of additional sites and organs by subclones that initially evolved at the incubator site. Overall, we identified four discrete waves of metastatic expansions, each of which resulted in a number of new, genetically similar metastasis sites that were also enriched for particular organs (e.g., abdominal vs bone and brain). The lung played a critical role in facilitating metastatic spread in this patient: the lung was the first site of metastatic escape from the primary breast lesion, subclones at this site were likely the source of all four subsequent metastatic waves, and multiple sites in the lung acted as subclone incubators. Finally, functional annotation revealed that many known drivers or metastasis-promoting tumor mutations in this patient were shared by some, but not all metastatic sites, highlighting the need for more comprehensive surveys of a patient's metastases for effective clinical intervention. CONCLUSIONS: Our analysis revealed the presence of substantial tumor evolution at metastatic incubator sites in a patient, with potentially important clinical implications. Our study demonstrated that sampling of a large number of metastatic sites affords unprecedented detail for studying metastatic evolution.

Subject(s)

Autopsy ; Breast Neoplasms/classification ; Breast Neoplasms/genetics ; Neoplasm Metastasis ; Biopsy ; Evolution, Molecular ; Female ; Humans ; Middle Aged ; Mutation ; Phylogeny

14.

Mobile element insertions and associated structural variants in longitudinal breast cancer samples.

Steely, Cody J; Russell, Kristi L; Feusier, Julie E; Qiao, Yi; Tavtigian, Sean V; Marth, Gabor; Jorde, Lynn B.

Sci Rep ; 11(1): 13020, 2021 Jun 22.

Article in English | MEDLINE | ID: mdl-34158539

ABSTRACT

While mobile elements are largely inactive in healthy somatic tissues, increased activity has been found in cancer tissues, with significant variation among different cancer types. In addition to insertion events, mobile elements have also been found to mediate many structural variation events in the genome. Here, to better understand the timing and impact of mobile element insertions and associated structural variants in cancer, we examined their activity in longitudinal samples of four metastatic breast cancer patients. We identified 11 mobile element insertions or associated structural variants and found that the majority of these occurred early in tumor progression. Most of the variants impact intergenic regions; however, we identified a translocation interrupting MAP2K4 involving Alu elements and a deletion in YTHDF2 involving mobile elements that likely inactivate reported tumor suppressor genes. The high variant allele fraction of the translocation, the loss of the other copy of MAP2K4, the recurrent loss-of-function mutations found in this gene in other cancers, and the important function of MAP2K4 indicate that this translocation is potentially a driver mutation. Overall, using a unique longitudinal dataset, we find that most variants are likely passenger mutations in the four patients we examined, but some variants impact tumor progression.

Subject(s)

Breast Neoplasms/genetics ; DNA Transposable Elements/genetics ; Genomic Structural Variation ; Mutagenesis, Insertional/genetics ; Alleles ; Chromosomes, Human/genetics ; Female ; Gene Dosage ; Humans ; Longitudinal Studies ; MAP Kinase Kinase 4/genetics

15.

OncoGEMINI: software for investigating tumor variants from multiple biopsies with integrated cancer annotations.

Nicholas, Thomas J; Cormier, Michael J; Huang, Xiaomeng; Qiao, Yi; Marth, Gabor T; Quinlan, Aaron R.

Genome Med ; 13(1): 46, 2021 Mar 26.

Article in English | MEDLINE | ID: mdl-33771218

ABSTRACT

BACKGROUND: DNA sequencing has unveiled extensive tumor heterogeneity in several different cancer types, with many exhibiting diverse subclonal populations. Identifying and tracing mutations throughout the expansion and progression of a tumor represents a significant challenge. Furthermore, prioritizing the subset of such mutations most likely to contribute to tumor evolution or that could serve as potential therapeutic targets represents an ongoing problem. RESULTS: Here, we describe OncoGEMINI, a new tool designed for exploring the complex patterns and trajectory of somatic and inherited variation observed in heterogeneous tumors biopsied over the course of treatment. This is accomplished by creating a searchable database of variants that includes tumor sampling time points and allows for filtering methods that reflect specific changes in variant allele frequencies over time. Additionally, by incorporating existing annotations and resources that facilitate the interpretation of cancer mutations (e.g., CIViC, DGIdb), OncoGEMINI enables rapid searches for, and potential identification of, mutations that may be driving subclonal evolution. CONCLUSIONS: By combining relevant genomic annotations alongside specific filtering tools, OncoGEMINI provides powerful and customizable approaches that enable the quick identification of individual tumor variants that meet specified criteria. It can be applied to a wide range of tumor-derived sequence data, but is especially designed for studies with multiple samples, including longitudinal datasets. It is available under an MIT license at github.com/fakedrtom/oncogemini .
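Illustrative note: filtering variants by how their allele frequencies change across sampling time points can be sketched in a few lines. This is a minimal sketch under stated assumptions, not OncoGEMINI's interface: the function name `rising_variants`, the dictionary layout, the variant labels, and the threshold values are all hypothetical, and each variant is assumed to have at least one VAF value listed in sampling order.

```python
def rising_variants(vafs_by_variant, min_increase=0.2, max_start=0.05):
    """Return IDs of variants whose VAF starts low and rises over time.

    vafs_by_variant: {variant_id: [VAF at time point 1, time point 2, ...]},
    with values ordered by sampling time. Thresholds are illustrative.
    """
    hits = []
    for variant, vafs in vafs_by_variant.items():
        # Require that the VAF never drops between consecutive samples.
        non_decreasing = all(later >= earlier
                             for earlier, later in zip(vafs, vafs[1:]))
        starts_low = vafs[0] <= max_start
        rises_enough = vafs[-1] - vafs[0] >= min_increase
        if non_decreasing and starts_low and rises_enough:
            hits.append(variant)
    return hits


# Hypothetical VAF trajectories across three biopsies.
example = {
    "variant_A": [0.00, 0.10, 0.45],  # emerges and expands
    "variant_B": [0.40, 0.42, 0.41],  # stable, already present at baseline
    "variant_C": [0.02, 0.03, 0.04],  # stays rare
}
candidates = rising_variants(example)
```

With these made-up trajectories only `variant_A` passes the filter, since it is the only one that begins near zero and increases by at least the chosen margin. Real data would also call for depth and quality filters, which are omitted here.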

Subject(s)

Breast Neoplasms/genetics ; Breast Neoplasms/pathology ; Genetic Variation ; Software ; Biopsy ; Databases, Genetic ; Female ; Humans ; Longitudinal Studies ; Neoplasm Metastasis

16.

Deep whole-genome sequencing of multiple proband tissues and parental blood reveals the complex genetic etiology of congenital diaphragmatic hernias.

Bogenschutz, Eric L; Fox, Zac D; Farrell, Andrew; Wynn, Julia; Moore, Barry; Yu, Lan; Aspelund, Gudrun; Marth, Gabor; Yandell, Mark; Shen, Yufeng; Chung, Wendy K; Kardon, Gabrielle.

HGG Adv ; 1(1), 2020 Oct 22.

Article in English | MEDLINE | ID: mdl-33263113

ABSTRACT

The diaphragm is critical for respiration and separation of the thoracic and abdominal cavities, and defects in diaphragm development are the cause of congenital diaphragmatic hernias (CDH), a common and often lethal birth defect. The genetic etiology of CDH is complex. Single-nucleotide variants (SNVs), insertions/deletions (indels), and structural variants (SVs) in more than 150 genes have been associated with CDH, although few genes are recurrently mutated in multiple individuals and mutated genes are incompletely penetrant. This suggests that multiple genetic variants in combination, other not-yet-investigated classes of variants, and/or nongenetic factors contribute to CDH etiology. However, no studies have comprehensively investigated in affected individuals the contribution of all possible classes of variants throughout the genome to CDH etiology. In our study, we used a unique cohort of four individuals with isolated CDH with samples from blood, skin, and diaphragm connective tissue and parental blood and deep whole-genome sequencing to assess germline and somatic de novo and inherited SNVs, indels, and SVs. In each individual we found a different mutational landscape that included germline de novo and inherited SNVs and indels in multiple genes. We also found in two individuals a 343 bp deletion interrupting an annotated enhancer of the CDH-associated gene GATA4, and we hypothesize that this common SV (found in 1%-2% of the population) acts as a sensitizing allele for CDH. Overall, our comprehensive reconstruction of the genetic architecture of four CDH individuals demonstrates that the etiology of CDH is heterogeneous and multifactorial.

17.

ped_draw: pedigree drawing with ease.

Velinder, Matt; Lee, Dillon; Marth, Gabor.

BMC Bioinformatics ; 21(1): 569, 2020 Dec 09.

Article in English | MEDLINE | ID: mdl-33297934

ABSTRACT

BACKGROUND: Pedigree files are ubiquitously used within bioinformatics and genetics studies to convey critical information about relatedness, sex and affected status of study samples. While the text-based format of ped files is efficient for computational methods, it is not immediately intuitive to a bioinformatician or geneticist trying to understand family structures, many of which encode the affected status of individuals across multiple generations. The visualization of pedigrees as connected nodes with descriptive shapes and shading provides a far more interpretable format to recognize visual patterns and intuit family structures. Despite these advantages of a visual pedigree, it remains difficult to quickly and accurately visualize a pedigree given a pedigree text file. RESULTS: Here we describe ped_draw, a command line and web tool, as a simple and easy solution to pedigree visualization. Ped_draw is capable of drawing complex multi-generational pedigrees and conforms to the accepted standards for depicting pedigrees visually. The command line tool can be used as a simple one-line command, utilizing graphviz to generate an image file. The web tool, https://peddraw.github.io, allows the user to paste a pedigree file, type to construct a pedigree file in the text box, or upload a pedigree file. Users can save the generated image file in various formats. CONCLUSIONS: We believe ped_draw is a useful pedigree drawing tool that improves on current methods due to its ease of use and approachability. Ped_draw allows users with various levels of expertise to quickly and easily visualize pedigrees.

Subject(s)

Computational Biology/methods , Pedigree , Software , Humans
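
To make the ped-file-to-graph idea in the abstract above concrete, here is a minimal Python sketch that converts six-column pedigree text (family ID, individual ID, father ID, mother ID, sex, phenotype) into Graphviz DOT source. It is not ped_draw's code. The function name `ped_to_dot` is hypothetical, and the layout is deliberately simplified: parents are linked straight to children instead of through the mating nodes that a standards-conforming pedigree drawing would use.

```python
def ped_to_dot(ped_text: str) -> str:
    """Convert six-column PED text into Graphviz DOT source (simplified layout)."""
    shapes = {"1": "box", "2": "circle"}  # PED sex codes: 1 = male, 2 = female
    nodes = []
    edges = []
    for raw in ped_text.splitlines():
        row = raw.strip()
        if not row or row.startswith("#"):
            continue  # skip blank lines and comments
        fam, ind, father, mother, sex, pheno = row.split()[:6]
        shape = shapes.get(sex, "diamond")  # unknown sex drawn as a diamond
        affected = pheno == "2"             # PED phenotype code 2 = affected
        fill = "black" if affected else "white"
        font = "white" if affected else "black"
        nodes.append(
            f'  "{ind}" [shape={shape}, style=filled, '
            f"fillcolor={fill}, fontcolor={font}];"
        )
        for parent in (father, mother):
            if parent != "0":  # "0" marks a missing parent (a founder)
                edges.append(f'  "{parent}" -> "{ind}";')
    return "\n".join(["digraph pedigree {"] + nodes + edges + ["}"])


example_ped = """
fam1 dad 0 0 1 1
fam1 mom 0 0 2 1
fam1 kid dad mom 1 2
"""
dot_source = ped_to_dot(example_ped)
print(dot_source)
```

Saving the printed text to a file such as `trio.dot` and running `dot -Tpng trio.dot -o trio.png` should render it, assuming Graphviz is installed.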

18.

gene.iobio: an interactive web tool for versatile, clinically-driven variant interrogation and prioritization.

Di Sera, Tonya; Velinder, Matt; Ward, Alistair; Qiao, Yi; Georges, Stephanie; Miller, Chase; Pitman, Anders; Richards, Will; Ekawade, Aditya; Viskochil, David; Carey, John C; Pace, Laura; Bale, Jim; Clardy, Stacey L; Andrews, Ashley; Botto, Lorenzo; Marth, Gabor.

medRxiv ; 2020 Nov 06.

Article in English | MEDLINE | ID: mdl-33173897

ABSTRACT

With increasing utilization of comprehensive genomic data to guide clinical care, anticipated to become the standard of care in many clinical settings, the practice of diagnostic medicine is undergoing a notable shift. However, the move from single-gene or panel-based genetic testing to exome and genome sequencing has not been matched by the development of tools to enable diagnosticians to interpret increasingly complex genomic findings. A new paradigm has emerged, where genome-based tests are often evaluated by a large multi-disciplinary collaborative team, typically including a diagnostic pathologist, a bioinformatician, a genetic counselor, and often a subspeciality clinician. This team-based approach calls for new computational tools to allow every member of the clinical care provider team, at varying levels of genetic knowledge and diagnostic expertise, to quickly and easily analyze and interpret complex genomic data. Here, we present gene.iobio, a real-time, intuitive and interactive web application for clinically-driven variant interrogation and prioritization. We show gene.iobio is a novel and effective approach that significantly improves upon and reimagines existing methods. In a radical departure from existing methods that present variants and genomic data in text and table formats, gene.iobio provides an interactive, intuitive and visually-driven analysis environment. We demonstrate that adoption of gene.iobio in clinical and research settings empowers clinical care providers to interact directly with patient genomic data both for establishing clinical diagnoses and informing patient care, using sophisticated genomic analyses that previously were only accessible via complex command line tools.

19.

Somalier: rapid relatedness estimation for cancer and germline studies using efficient genome sketches.

Pedersen, Brent S; Bhetariya, Preetida J; Brown, Joe; Kravitz, Stephanie N; Marth, Gabor; Jensen, Randy L; Bronner, Mary P; Underhill, Hunter R; Quinlan, Aaron R.

Genome Med ; 12(1): 62, 2020 07 14.

Article in English | MEDLINE | ID: mdl-32664994

ABSTRACT

BACKGROUND: When interpreting sequencing data from multiple spatial or longitudinal biopsies, detecting sample mix-ups is essential, yet more difficult than in studies of germline variation. In most genomic studies of tumors, genetic variation is detected through pairwise comparisons of the tumor and a matched normal tissue from the sample donor. In many cases, only somatic variants are reported, which hinders the use of existing tools that detect sample swaps solely based on genotypes of inherited variants. To address this problem, we have developed Somalier, a tool that operates directly on alignments and does not require jointly called germline variants. Instead, Somalier extracts a small sketch of informative genetic variation for each sample. Sketches from hundreds of germline or somatic samples can then be compared in under a second, making Somalier a useful tool for measuring relatedness in large cohorts. Somalier produces both text output and an interactive visual report that facilitates the detection and correction of sample swaps using multiple relatedness metrics. RESULTS: We introduce the tool and demonstrate its utility on a cohort of five glioma samples each with a normal, tumor, and cell-free DNA sample. Applying Somalier to high-coverage sequence data from the 1000 Genomes Project also identifies several related samples. We also demonstrate that it can distinguish pairs of whole-genome and RNA-seq samples from the same individuals in the Genotype-Tissue Expression (GTEx) project. CONCLUSIONS: Somalier is a tool that can rapidly evaluate relatedness from sequencing data. It can be applied to diverse sequencing data types and genome builds and is available under an MIT license at github.com/brentp/somalier.

Subject(s)

Computational Biology/methods , Genome, Human , Genomics/methods , Neoplasms/genetics , Software , Algorithms , DNA Mutational Analysis , Genetic Variation , Germ Cells/metabolism , High-Throughput Nucleotide Sequencing , Humans , Sequence Analysis, DNA , Web Browser
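
The abstract above describes comparing small per-sample sketches of genetic variation to detect sample swaps. The Python sketch below shows the general principle. It does not reproduce Somalier's sketch format or its actual relatedness metrics: the genotype coding (0/1/2 alternate-allele counts, -1 for missing), the `compare_sketches` function, and the two summary values are simplified, hypothetical stand-ins.

```python
from typing import Sequence, Tuple


def compare_sketches(a: Sequence[int], b: Sequence[int]) -> Tuple[int, float]:
    """Return (ibs0, concordance) for two genotype vectors over the same sites.

    ibs0 counts sites where the samples share no allele (0 vs 2).
    concordance is the fraction of usable sites with identical genotypes.
    """
    ibs0 = 0
    same = 0
    usable = 0
    for ga, gb in zip(a, b):
        if ga < 0 or gb < 0:
            continue  # skip sites that are missing in either sample
        usable += 1
        if ga == gb:
            same += 1
        elif abs(ga - gb) == 2:
            ibs0 += 1  # opposite homozygotes: no allele shared
    concordance = same / usable if usable else 0.0
    return ibs0, concordance


# Toy data: tumor and normal from one donor, plus an unrelated sample.
tumor = [0, 1, 2, 1, 0, 2, -1, 1]
normal = [0, 1, 2, 1, 0, 2, 1, 1]
other = [2, 0, 0, 1, 2, 0, 1, 2]

print("tumor vs normal:", compare_sketches(tumor, normal))
print("tumor vs other: ", compare_sketches(tumor, other))
```

In this toy data the same-donor pair has no opposite-homozygote sites and full concordance, while the unrelated pair has several. That contrast is the kind of signal that flags a mislabeled sample.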

20.

Genomic analyses implicate noncoding de novo variants in congenital heart disease.

Richter, Felix; Morton, Sarah U; Kim, Seong Won; Kitaygorodsky, Alexander; Wasson, Lauren K; Chen, Kathleen M; Zhou, Jian; Qi, Hongjian; Patel, Nihir; DePalma, Steven R; Parfenov, Michael; Homsy, Jason; Gorham, Joshua M; Manheimer, Kathryn B; Velinder, Matthew; Farrell, Andrew; Marth, Gabor; Schadt, Eric E; Kaltman, Jonathan R; Newburger, Jane W; Giardini, Alessandro; Goldmuntz, Elizabeth; Brueckner, Martina; Kim, Richard; Porter, George A; Bernstein, Daniel; Chung, Wendy K; Srivastava, Deepak; Tristani-Firouzi, Martin; Troyanskaya, Olga G; Dickel, Diane E; Shen, Yufeng; Seidman, Jonathan G; Seidman, Christine E; Gelb, Bruce D.

Nat Genet ; 52(8): 769-777, 2020 08.

Article in English | MEDLINE | ID: mdl-32601476

ABSTRACT

A genetic etiology is identified for one-third of patients with congenital heart disease (CHD), with 8% of cases attributable to coding de novo variants (DNVs). To assess the contribution of noncoding DNVs to CHD, we compared genome sequences from 749 CHD probands and their parents with those from 1,611 unaffected trios. Neural network prediction of noncoding DNV transcriptional impact identified a burden of DNVs in individuals with CHD (n = 2,238 DNVs) compared to controls (n = 4,177; P = 8.7 × 10⁻⁴). Independent analyses of enhancers showed an excess of DNVs in associated genes (27 genes versus 3.7 expected, P = 1 × 10⁻⁵). We observed significant overlap between these transcription-based approaches (odds ratio (OR) = 2.5, 95% confidence interval (CI) 1.1-5.0, P = 5.4 × 10⁻³). CHD DNVs altered transcription levels in 5 of 31 enhancers assayed. Finally, we observed a DNV burden in RNA-binding-protein regulatory sites (OR = 1.13, 95% CI 1.1-1.2, P = 8.8 × 10⁻⁵). Our findings demonstrate an enrichment of potentially disruptive regulatory noncoding DNVs in a fraction of CHD at least as high as that observed for damaging coding DNVs.

Subject(s)

Genetic Variation/genetics , Heart Defects, Congenital/genetics , RNA, Untranslated/genetics , Adolescent , Adult , Animals , Female , Genetic Predisposition to Disease/genetics , Genomics , Heart/physiology , Humans , Male , Mice , Middle Aged , Open Reading Frames/genetics , RNA-Binding Proteins/genetics , Transcription, Genetic/genetics , Young Adult
