Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
NPJ Precis Oncol ; 8(1): 134, 2024 Jun 19.
Artículo en Inglés | MEDLINE | ID: mdl-38898127

RESUMEN

While alterations in nucleus size, shape, and color are ubiquitous in cancer, comprehensive quantification of nuclear morphology across a whole-slide histologic image remains a challenge. Here, we describe the development of a pan-tissue, deep learning-based digital pathology pipeline for exhaustive nucleus detection, segmentation, and classification and the utility of this pipeline for nuclear morphologic biomarker discovery. Manually-collected nucleus annotations were used to train an object detection and segmentation model for identifying nuclei, which was deployed to segment nuclei in H&E-stained slides from the BRCA, LUAD, and PRAD TCGA cohorts. Interpretable features describing the shape, size, color, and texture of each nucleus were extracted from segmented nuclei and compared to measurements of genomic instability, gene expression, and prognosis. The nuclear segmentation and classification model trained herein performed comparably to previously reported models. Features extracted from the model revealed differences sufficient to distinguish between BRCA, LUAD, and PRAD. Furthermore, cancer cell nuclear area was associated with increased aneuploidy score and homologous recombination deficiency. In BRCA, increased fibroblast nuclear area was indicative of poor progression-free and overall survival and was associated with gene expression signatures related to extracellular matrix remodeling and anti-tumor immunity. Thus, we developed a powerful pan-tissue approach for nucleus segmentation and featurization, enabling the construction of predictive models and the identification of features linking nuclear morphology with clinically-relevant prognostic biomarkers across multiple cancer types.

2.
J Thorac Oncol ; 2023 Dec 07.
Artículo en Inglés | MEDLINE | ID: mdl-38070597

RESUMEN

INTRODUCTION: Pathologic response (PathR) by histopathologic assessment of resected specimens may be an early clinical end point associated with long-term outcomes with neoadjuvant therapy. Digital pathology may improve the efficiency and precision of PathR assessment. LCMC3 (NCT02927301) evaluated neoadjuvant atezolizumab in patients with resectable NSCLC and reported a 20% major PathR rate. METHODS: We determined PathR in primary tumor resection specimens using guidelines-based visual techniques and developed a convolutional neural network model using the same criteria to digitally measure the percent viable tumor on whole-slide images. Concordance was evaluated between visual determination of percent viable tumor (n = 151) performed by one of the 47 local pathologists and three central pathologists. RESULTS: For concordance among visual determination of percent viable tumor, the interclass correlation coefficient was 0.87 (95% confidence interval [CI]: 0.84-0.90). Agreement for visually assessed 10% or less viable tumor (major PathR [MPR]) in the primary tumor was 92.1% (Fleiss kappa = 0.83). Digitally assessed percent viable tumor (n = 136) correlated with visual assessment (Pearson r = 0.73; digital/visual slope = 0.28). Digitally assessed MPR predicted visually assessed MPR with outstanding discrimination (area under receiver operating characteristic curve, 0.98) and was associated with longer disease-free survival (hazard ratio [HR] = 0.30; 95% CI: 0.09-0.97, p = 0.033) and overall survival (HR = 0.14, 95% CI: 0.02-1.06, p = 0.027) versus no MPR. Digitally assessed PathR strongly correlated with visual measurements. CONCLUSIONS: Artificial intelligence-powered digital pathology exhibits promise in assisting pathologic assessments in neoadjuvant NSCLC clinical trials. The development of artificial intelligence-powered approaches in clinical settings may aid pathologists in clinical operations, including routine PathR assessments, and subsequently support improved patient care and long-term outcomes.

3.
medRxiv ; 2023 Apr 25.
Artículo en Inglés | MEDLINE | ID: mdl-37162870

RESUMEN

Clinical trials in nonalcoholic steatohepatitis (NASH) require histologic scoring for assessment of inclusion criteria and endpoints. However, guidelines for scoring key features have led to variability in interpretation, impacting clinical trial outcomes. We developed an artificial intelligence (AI)-based measurement (AIM) tool for scoring NASH histology (AIM-NASH). AIM-NASH predictions for NASH Clinical Research Network (CRN) grades of necroinflammation and stages of fibrosis aligned with expert consensus scores and were reproducible. Continuous scores produced by AIM-NASH for key histological features of NASH correlated with mean pathologist scores and with noninvasive biomarkers and strongly predicted patient outcomes. In a retrospective analysis of the ATLAS trial, previously unmet pathological endpoints were met when scored by the AIM-NASH algorithm alone. Overall, these results suggest that AIM-NASH may assist pathologists in histologic review of NASH clinical trials, reducing inter-rater variability on trial outcomes and offering a more sensitive and reproducible measure of patient therapeutic response.

4.
Cell Rep Med ; 4(4): 101016, 2023 04 18.
Artículo en Inglés | MEDLINE | ID: mdl-37075704

RESUMEN

Nonalcoholic steatohepatitis (NASH) is the most common chronic liver disease globally and a leading cause for liver transplantation in the US. Its pathogenesis remains imprecisely defined. We combined two high-resolution modalities to tissue samples from NASH clinical trials, machine learning (ML)-based quantification of histological features and transcriptomics, to identify genes that are associated with disease progression and clinical events. A histopathology-driven 5-gene expression signature predicted disease progression and clinical events in patients with NASH with F3 (pre-cirrhotic) and F4 (cirrhotic) fibrosis. Notably, the Notch signaling pathway and genes implicated in liver-related diseases were enriched in this expression signature. In a validation cohort where pharmacologic intervention improved disease histology, multiple Notch signaling components were suppressed.


Asunto(s)
Aprendizaje Profundo , Enfermedad del Hígado Graso no Alcohólico , Humanos , Enfermedad del Hígado Graso no Alcohólico/complicaciones , Transcriptoma/genética , Progresión de la Enfermedad , Cirrosis Hepática/genética , Cirrosis Hepática/tratamiento farmacológico
5.
Mod Pathol ; 36(6): 100124, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-36841434

RESUMEN

Ulcerative colitis is a chronic inflammatory bowel disease that is characterized by a relapsing and remitting course. Assessment of disease activity critically informs treatment decisions. In addition to endoscopic remission, histologic remission is emerging as a treatment target and a key factor in the evaluation of disease activity and therapeutic efficacy. However, manual pathologist evaluation is semiquantitative and limited in granularity. Machine learning approaches are increasingly being developed to aid pathologists in accurate and reproducible scoring of histology, enabling precise quantitation of clinically relevant features. Here, we report the development and validation of convolutional neural network models that quantify histologic features pertinent to ulcerative colitis disease activity, directly from hematoxylin and eosin-stained whole slide images. Tissue and cell model predictions were used to generate quantitative human-interpretable features to fully characterize the histology samples. Tissue and cell predictions showed comparable agreement to pathologist annotations, and the extracted slide-level human-interpretable features demonstrated strong correlations with disease severity and pathologist-assigned Nancy histological index scores. Moreover, using a random forest classifier based on 13 human-interpretable features derived from the tissue and cell models, we were able to accurately predict Nancy histological index scores, with a weighted kappa (κ = 0.91) and Spearman correlation (⍴ = 0.89, P < .001) when compared with pathologist consensus Nancy histological index scores. We were also able to predict histologic remission, based on the absence of neutrophil extravasation, with a high accuracy of 0.97. This work demonstrates the potential of computer vision to enable a standardized and robust assessment of ulcerative colitis histopathology for translational research and improved evaluation of disease activity and prognosis.


Asunto(s)
Colitis Ulcerosa , Enfermedades Inflamatorias del Intestino , Humanos , Colitis Ulcerosa/tratamiento farmacológico , Inteligencia Artificial , Índice de Severidad de la Enfermedad , Enfermedades Inflamatorias del Intestino/patología , Mucosa Intestinal/patología , Colonoscopía
6.
Mod Pathol ; 35(11): 1529-1539, 2022 11.
Artículo en Inglés | MEDLINE | ID: mdl-35840720

RESUMEN

Assessment of programmed death ligand 1 (PD-L1) expression by immunohistochemistry (IHC) has emerged as an important predictive biomarker across multiple tumor types. However, manual quantitation of PD-L1 positivity can be difficult and leads to substantial inter-observer variability. Although the development of artificial intelligence (AI) algorithms may mitigate some of the challenges associated with manual assessment and improve the accuracy of PD-L1 expression scoring, use of AI-based approaches to oncology biomarker scoring and drug development has been sparse, primarily due to the lack of large-scale clinical validation studies across multiple cohorts and tumor types. We developed AI-powered algorithms to evaluate PD-L1 expression on tumor cells by IHC and compared it with manual IHC scoring in urothelial carcinoma, non-small cell lung cancer, melanoma, and squamous cell carcinoma of the head and neck (prospectively determined during the phase II and III CheckMate clinical trials). 1,746 slides were retrospectively analyzed, the largest investigation of digital pathology algorithms on clinical trial datasets performed to date. AI-powered quantification of PD-L1 expression on tumor cells identified more PD-L1-positive samples compared with manual scoring at cutoffs of ≥1% and ≥5% in most tumor types. Additionally, similar improvements in response and survival were observed in patients identified as PD-L1-positive compared with PD-L1-negative using both AI-powered and manual methods, while improved associations with survival were observed in patients with certain tumor types identified as PD-L1-positive using AI-powered scoring only. Our study demonstrates the potential for implementation of digital pathology-based methods in future clinical practice to identify more patients who would benefit from treatment with immuno-oncology therapy compared with current guidelines using manual assessment.


Asunto(s)
Carcinoma de Pulmón de Células no Pequeñas , Carcinoma de Células Transicionales , Neoplasias Pulmonares , Neoplasias de la Vejiga Urinaria , Humanos , Antígeno B7-H1/metabolismo , Carcinoma de Pulmón de Células no Pequeñas/patología , Nivolumab/uso terapéutico , Ipilimumab , Inteligencia Artificial , Neoplasias Pulmonares/patología , Estudios Retrospectivos , Anticuerpos Monoclonales/uso terapéutico , Biomarcadores de Tumor/metabolismo
7.
Hepatology ; 74(6): 3146-3160, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34333790

RESUMEN

BACKGROUND AND AIMS: The hepatic venous pressure gradient (HVPG) is the standard for estimating portal pressure but requires expertise for interpretation. We hypothesized that HVPG could be extrapolated from liver histology using a machine learning (ML) algorithm. APPROACH AND RESULTS: Patients with NASH with compensated cirrhosis from a phase 2b trial were included. HVPG and biopsies from baseline and weeks 48 and 96 were reviewed centrally, and biopsies evaluated with a convolutional neural network (PathAI, Boston, MA). Using trichrome-stained biopsies in the training set (n = 130), an ML model was developed to recognize fibrosis patterns associated with HVPG, and the resultant ML HVPG score was validated in a held-out test set (n = 88). Associations between the ML HVPG score with measured HVPG and liver-related events, and performance of the ML HVPG score for clinically significant portal hypertension (CSPH) (HVPG ≥ 10 mm Hg), were determined. The ML-HVPG score was more strongly correlated with HVPG than hepatic collagen by morphometry (ρ = 0.47 vs. ρ = 0.28; P < 0.001). The ML HVPG score differentiated patients with normal (0-5 mm Hg) and elevated (5.5-9.5 mm Hg) HVPG and CSPH (median: 1.51 vs. 1.93 vs. 2.60; all P < 0.05). The areas under receiver operating characteristic curve (AUROCs) (95% CI) of the ML-HVPG score for CSPH were 0.85 (0.80, 0.90) and 0.76 (0.68, 0.85) in the training and test sets, respectively. Discrimination of the ML-HVPG score for CSPH improved with the addition of a ML parameter for nodularity, Enhanced Liver Fibrosis, platelets, aspartate aminotransferase (AST), and bilirubin (AUROC in test set: 0.85; 95% CI: 0.78, 0.92). Although baseline ML-HVPG score was not prognostic, changes were predictive of clinical events (HR: 2.13; 95% CI: 1.26, 3.59) and associated with hemodynamic response and fibrosis improvement. CONCLUSIONS: An ML model based on trichrome-stained liver biopsy slides can predict CSPH in patients with NASH with cirrhosis.


Asunto(s)
Hipertensión Portal/diagnóstico , Procesamiento de Imagen Asistido por Computador/métodos , Cirrosis Hepática/complicaciones , Hígado/patología , Enfermedad del Hígado Graso no Alcohólico/complicaciones , Biopsia , Ensayos Clínicos Fase II como Asunto , Diagnóstico Diferencial , Femenino , Humanos , Hipertensión Portal/etiología , Cirrosis Hepática/patología , Aprendizaje Automático , Masculino , Persona de Mediana Edad , Enfermedad del Hígado Graso no Alcohólico/patología , Presión Portal , Pronóstico , Curva ROC , Ensayos Clínicos Controlados Aleatorios como Asunto
8.
Nat Commun ; 12(1): 1613, 2021 03 12.
Artículo en Inglés | MEDLINE | ID: mdl-33712588

RESUMEN

Computational methods have made substantial progress in improving the accuracy and throughput of pathology workflows for diagnostic, prognostic, and genomic prediction. Still, lack of interpretability remains a significant barrier to clinical integration. We present an approach for predicting clinically-relevant molecular phenotypes from whole-slide histopathology images using human-interpretable image features (HIFs). Our method leverages >1.6 million annotations from board-certified pathologists across >5700 samples to train deep learning models for cell and tissue classification that can exhaustively map whole-slide images at two and four micron-resolution. Cell- and tissue-type model outputs are combined into 607 HIFs that quantify specific and biologically-relevant characteristics across five cancer types. We demonstrate that these HIFs correlate with well-known markers of the tumor microenvironment and can predict diverse molecular signatures (AUROC 0.601-0.864), including expression of four immune checkpoint proteins and homologous recombination deficiency, with performance comparable to 'black-box' methods. Our HIF-based approach provides a comprehensive, quantitative, and interpretable window into the composition and spatial architecture of the tumor microenvironment.


Asunto(s)
Neoplasias/clasificación , Neoplasias/diagnóstico por imagen , Neoplasias/patología , Patología Molecular/métodos , Fenotipo , Algoritmos , Aprendizaje Profundo , Humanos , Procesamiento de Imagen Asistido por Computador , Medicina de Precisión , Microambiente Tumoral
9.
Hepatology ; 74(1): 133-147, 2021 07.
Artículo en Inglés | MEDLINE | ID: mdl-33570776

RESUMEN

BACKGROUND AND AIMS: Manual histological assessment is currently the accepted standard for diagnosing and monitoring disease progression in NASH, but is limited by variability in interpretation and insensitivity to change. Thus, there is a critical need for improved tools to assess liver pathology in order to risk stratify NASH patients and monitor treatment response. APPROACH AND RESULTS: Here, we describe a machine learning (ML)-based approach to liver histology assessment, which accurately characterizes disease severity and heterogeneity, and sensitively quantifies treatment response in NASH. We use samples from three randomized controlled trials to build and then validate deep convolutional neural networks to measure key histological features in NASH, including steatosis, inflammation, hepatocellular ballooning, and fibrosis. The ML-based predictions showed strong correlations with expert pathologists and were prognostic of progression to cirrhosis and liver-related clinical events. We developed a heterogeneity-sensitive metric of fibrosis response, the Deep Learning Treatment Assessment Liver Fibrosis score, which measured antifibrotic treatment effects that went undetected by manual pathological staging and was concordant with histological disease progression. CONCLUSIONS: Our ML method has shown reproducibility and sensitivity and was prognostic for disease progression, demonstrating the power of ML to advance our understanding of disease heterogeneity in NASH, risk stratify affected patients, and facilitate the development of therapies.


Asunto(s)
Aprendizaje Profundo , Procesamiento de Imagen Asistido por Computador/métodos , Cirrosis Hepática/diagnóstico , Hígado/patología , Enfermedad del Hígado Graso no Alcohólico/diagnóstico , Biopsia , Humanos , Cirrosis Hepática/patología , Enfermedad del Hígado Graso no Alcohólico/patología , Ensayos Clínicos Controlados Aleatorios como Asunto , Reproducibilidad de los Resultados , Índice de Severidad de la Enfermedad
10.
Hepatology ; 73(2): 625-643, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33169409

RESUMEN

BACKGROUND AND AIMS: Advanced fibrosis attributable to NASH is a leading cause of end-stage liver disease. APPROACH AND RESULTS: In this phase 2b trial, 392 patients with bridging fibrosis or compensated cirrhosis (F3-F4) were randomized to receive placebo, selonsertib 18 mg, cilofexor 30 mg, or firsocostat 20 mg, alone or in two-drug combinations, once-daily for 48 weeks. The primary endpoint was a ≥1-stage improvement in fibrosis without worsening of NASH between baseline and 48 weeks based on central pathologist review. Exploratory endpoints included changes in NAFLD Activity Score (NAS), liver histology assessed using a machine learning (ML) approach, liver biochemistry, and noninvasive markers. The majority had cirrhosis (56%) and NAS ≥5 (83%). The primary endpoint was achieved in 11% of placebo-treated patients versus cilofexor/firsocostat (21%; P = 0.17), cilofexor/selonsertib (19%; P = 0.26), firsocostat/selonsertib (15%; P = 0.62), firsocostat (12%; P = 0.94), and cilofexor (12%; P = 0.96). Changes in hepatic collagen by morphometry were not significant, but cilofexor/firsocostat led to a significant decrease in ML NASH CRN fibrosis score (P = 0.040) and a shift in biopsy area from F3-F4 to ≤F2 fibrosis patterns. Compared to placebo, significantly higher proportions of cilofexor/firsocostat patients had a ≥2-point NAS reduction; reductions in steatosis, lobular inflammation, and ballooning; and significant improvements in alanine aminotransferase (ALT), aspartate aminotransferase (AST), bilirubin, bile acids, cytokeratin-18, insulin, estimated glomerular filtration rate, ELF score, and liver stiffness by transient elastography (all P ≤ 0.05). Pruritus occurred in 20%-29% of cilofexor versus 15% of placebo-treated patients. CONCLUSIONS: In patients with bridging fibrosis and cirrhosis, 48 weeks of cilofexor/firsocostat was well tolerated, led to improvements in NASH activity, and may have an antifibrotic effect. This combination offers potential for fibrosis regression with longer-term therapy in patients with advanced fibrosis attributable to NASH.


Asunto(s)
Azetidinas/administración & dosificación , Enfermedad Hepática en Estado Terminal/prevención & control , Isobutiratos/administración & dosificación , Ácidos Isonicotínicos/administración & dosificación , Cirrosis Hepática/tratamiento farmacológico , Enfermedad del Hígado Graso no Alcohólico/tratamiento farmacológico , Oxazoles/administración & dosificación , Pirimidinas/administración & dosificación , Anciano , Azetidinas/efectos adversos , Benzamidas/administración & dosificación , Benzamidas/efectos adversos , Biomarcadores/sangre , Biopsia , Esquema de Medicación , Quimioterapia Combinada/efectos adversos , Quimioterapia Combinada/métodos , Enfermedad Hepática en Estado Terminal/patología , Femenino , Humanos , Imidazoles/administración & dosificación , Imidazoles/efectos adversos , Isobutiratos/efectos adversos , Ácidos Isonicotínicos/efectos adversos , Hígado/efectos de los fármacos , Hígado/patología , Cirrosis Hepática/complicaciones , Cirrosis Hepática/diagnóstico , Cirrosis Hepática/patología , Pruebas de Función Hepática , Masculino , Persona de Mediana Edad , Enfermedad del Hígado Graso no Alcohólico/diagnóstico , Enfermedad del Hígado Graso no Alcohólico/etiología , Enfermedad del Hígado Graso no Alcohólico/patología , Oxazoles/efectos adversos , Piridinas/administración & dosificación , Piridinas/efectos adversos , Pirimidinas/efectos adversos , Índice de Severidad de la Enfermedad , Resultado del Tratamiento
11.
Microb Genom ; 4(11)2018 11.
Artículo en Inglés | MEDLINE | ID: mdl-30418868

RESUMEN

Accurate orthologue identification is a vital component of bacterial comparative genomic studies, but many popular sequence-similarity-based approaches do not scale well to the large numbers of genomes that are now generated routinely. Furthermore, most approaches do not take gene synteny into account, which is useful information for disentangling paralogues. Here, we present SynerClust, a user-friendly synteny-aware tool based on synergy that can process thousands of genomes. SynerClust was designed to analyse genomes with high levels of local synteny, particularly prokaryotes, which have operon structure. SynerClust's run-time is optimized by selecting cluster representatives at each node in the phylogeny; thus, avoiding the need for exhaustive pairwise similarity searches. In benchmarking against Roary, Hieranoid2, PanX and Reciprocal Best Hit, SynerClust was able to more completely identify sets of core genes for datasets that included diverse strains, while using substantially less memory, and with scalability comparable to the fastest tools. Due to its scalability, ease of installation and use, and suitability for a variety of computing environments, orthogroup clustering using SynerClust will enable many large-scale prokaryotic comparative genomics efforts.


Asunto(s)
Genoma Bacteriano , Programas Informáticos , Sintenía , Algoritmos , Análisis por Conglomerados , Enterobacteriaceae/genética , Escherichia coli/genética , Genómica/métodos , Mycobacterium tuberculosis/genética
12.
Cell Syst ; 4(3): 291-305.e7, 2017 03 22.
Artículo en Inglés | MEDLINE | ID: mdl-28189581

RESUMEN

A systems-level understanding of Gram-positive bacteria is important from both an environmental and health perspective and is most easily obtained when high-quality, validated genomic resources are available. To this end, we constructed two ordered, barcoded, erythromycin-resistance- and kanamycin-resistance-marked single-gene deletion libraries of the Gram-positive model organism, Bacillus subtilis. The libraries comprise 3,968 and 3,970 genes, respectively, and overlap in all but four genes. Using these libraries, we update the set of essential genes known for this organism, provide a comprehensive compendium of B. subtilis auxotrophic genes, and identify genes required for utilizing specific carbon and nitrogen sources, as well as those required for growth at low temperature. We report the identification of enzymes catalyzing several missing steps in amino acid biosynthesis. Finally, we describe a suite of high-throughput phenotyping methodologies and apply them to provide a genome-wide analysis of competence and sporulation. Altogether, we provide versatile resources for studying gene function and pathway and network architecture in Gram-positive bacteria.


Asunto(s)
Bacillus subtilis/genética , Ensayos Analíticos de Alto Rendimiento/métodos , Aminoácidos , Eliminación de Gen , Biblioteca de Genes , Biblioteca Genómica , Genómica , Eliminación de Secuencia/genética , Esporas Bacterianas/genética
13.
Mol Cell Proteomics ; 14(2): 430-40, 2015 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-25473088

RESUMEN

The function of a large percentage of proteins is modulated by post-translational modifications (PTMs). Currently, mass spectrometry (MS) is the only proteome-wide technology that can identify PTMs. Unfortunately, the inability to detect a PTM by MS is not proof that the modification is not present. The detectability of peptides varies significantly making MS potentially blind to a large fraction of peptides. Learning from published algorithms that generally focus on predicting the most detectable peptides we developed a tool that incorporates protein abundance into the peptide prediction algorithm with the aim to determine the detectability of every peptide within a protein. We tested our tool, "Peptide Prediction with Abundance" (PPA), on in-house acquired as well as published data sets from other groups acquired on different instrument platforms. Incorporation of protein abundance into the prediction allows us to assess not only the detectability of all peptides but also whether a peptide of interest is likely to become detectable upon enrichment. We validated the ability of our tool to predict changes in protein detectability with a dilution series of 31 purified proteins at several different concentrations. PPA predicted the concentration dependent peptide detectability in 78% of the cases correctly, demonstrating its utility for predicting the protein enrichment needed to observe a peptide of interest in targeted experiments. This is especially important in the analysis of PTMs. PPA is available as a web-based or executable package that can work with generally applicable defaults or retrained from a pilot MS data set.


Asunto(s)
Algoritmos , Espectrometría de Masas/métodos , Péptidos/metabolismo , Secuencia de Aminoácidos , Bases de Datos de Proteínas , Humanos , Datos de Secuencia Molecular , Biblioteca de Péptidos , Péptidos/química , Reproducibilidad de los Resultados , Saccharomyces cerevisiae/metabolismo
15.
Elife ; 2: e00603, 2013 06 18.
Artículo en Inglés | MEDLINE | ID: mdl-23795289

RESUMEN

Divergence in gene regulation can play a major role in evolution. Here, we used a phylogenetic framework to measure mRNA profiles in 15 yeast species from the phylum Ascomycota and reconstruct the evolution of their modular regulatory programs along a time course of growth on glucose over 300 million years [corrected]. We found that modules have diverged proportionally to phylogenetic distance, with prominent changes in gene regulation accompanying changes in lifestyle and ploidy, especially in carbon metabolism. Paralogs have significantly contributed to regulatory divergence, typically within a very short window from their duplication. Paralogs from a whole genome duplication (WGD) event have a uniquely substantial contribution that extends over a longer span. Similar patterns occur when considering the evolution of the heat shock regulatory program measured in eight of the species, suggesting that these are general evolutionary principles. DOI:http://dx.doi.org/10.7554/eLife.00603.001.


Asunto(s)
Evolución Molecular , Regulación Fúngica de la Expresión Génica , Genes Fúngicos , Ascomicetos/clasificación , Ascomicetos/genética , Duplicación de Gen , Perfilación de la Expresión Génica , Filogenia , Transcripción Genética
16.
Genome Res ; 23(6): 1039-50, 2013 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-23640720

RESUMEN

Comparative functional genomics studies the evolution of biological processes by analyzing functional data, such as gene expression profiles, across species. A major challenge is to compare profiles collected in a complex phylogeny. Here, we present Arboretum, a novel scalable computational algorithm that integrates expression data from multiple species with species and gene phylogenies to infer modules of coexpressed genes in extant species and their evolutionary histories. We also develop new, generally applicable measures of conservation and divergence in gene regulatory modules to assess the impact of changes in gene content and expression on module evolution. We used Arboretum to study the evolution of the transcriptional response to heat shock in eight species of Ascomycota fungi and to reconstruct modules of the ancestral environmental stress response (ESR). We found substantial conservation in the stress response across species and in the reconstructed components of the ancestral ESR modules. The greatest divergence was in the most induced stress, primarily through module expansion. The divergence of the heat stress response exceeds that observed in the response to glucose depletion in the same species. Arboretum and its associated analyses provide a comprehensive framework to systematically study regulatory evolution of condition-specific responses.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Evolución Molecular , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Análisis por Conglomerados , Duplicación de Gen , Respuesta al Choque Térmico/genética , Especificidad de la Especie , Estrés Fisiológico/genética , Levaduras/genética
17.
Mol Syst Biol ; 8: 619, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-23089682

RESUMEN

Evolutionary rewiring of regulatory networks is an important source of diversity among species. Previous evidence suggested substantial divergence of regulatory networks across species. However, systematically assessing the extent of this plasticity and its functional implications has been challenging due to limited experimental data and the noisy nature of computational predictions. Here, we introduce a novel approach to study cis-regulatory evolution, and use it to trace the regulatory history of 88 DNA motifs of transcription factors across 23 Ascomycota fungi. While motifs are conserved, we find a pervasive gain and loss in the regulation of their target genes. Despite this turnover, the biological processes associated with a motif are generally conserved. We explain these trends using a model with a strong selection to conserve the overall function of a transcription factor, and a much weaker selection over the specific genes it targets. The model also accounts for the turnover of bound targets measured experimentally across species in yeasts and mammals. Thus, selective pressures on regulatory networks mostly tolerate local rewiring, and may allow for subtle fine-tuning of gene regulation during evolution.


Asunto(s)
Ascomicetos/genética , Evolución Molecular , Redes Reguladoras de Genes/genética , Modelos Genéticos , Selección Genética , Animales , Secuencia de Bases , Secuencia Conservada , ADN de Hongos/genética , Proteínas Fúngicas/metabolismo , Mamíferos/genética , Datos de Secuencia Molecular , Motivos de Nucleótidos/genética , Filogenia , Unión Proteica , Secuencias Reguladoras de Ácidos Nucleicos/genética , Especificidad de la Especie , Factores de Transcripción/metabolismo
18.
Nature ; 487(7407): 370-4, 2012 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-22722833

RESUMEN

Novel protein-coding genes can arise either through re-organization of pre-existing genes or de novo. Processes involving re-organization of pre-existing genes, notably after gene duplication, have been extensively described. In contrast, de novo gene birth remains poorly understood, mainly because translation of sequences devoid of genes, or 'non-genic' sequences, is expected to produce insignificant polypeptides rather than proteins with specific biological functions. Here we formalize an evolutionary model according to which functional genes evolve de novo through transitory proto-genes generated by widespread translational activity in non-genic sequences. Testing this model at the genome scale in Saccharomyces cerevisiae, we detect translation of hundreds of short species-specific open reading frames (ORFs) located in non-genic sequences. These translation events seem to provide adaptive potential, as suggested by their differential regulation upon stress and by signatures of retention by natural selection. In line with our model, we establish that S. cerevisiae ORFs can be placed within an evolutionary continuum ranging from non-genic sequences to genes. We identify ~1,900 candidate proto-genes among S. cerevisiae ORFs and find that de novo gene birth from such a reservoir may be more prevalent than sporadic gene duplication. Our work illustrates that evolution exploits seemingly dispensable sequences to generate adaptive functional innovation.


Asunto(s)
Evolución Molecular , Genes Fúngicos/genética , Saccharomyces/genética , Secuencia de Bases , Secuencia Conservada , Variación Genética , Datos de Secuencia Molecular , Sistemas de Lectura Abierta , Filogenia , Biosíntesis de Proteínas , Saccharomyces/clasificación , Saccharomyces cerevisiae/clasificación , Saccharomyces cerevisiae/genética , Alineación de Secuencia
19.
BMC Genomics ; 13: 120, 2012 Mar 28.
Artículo en Inglés | MEDLINE | ID: mdl-22452820

RESUMEN

BACKGROUND: The sequence of the pathogen Mycobacterium tuberculosis (Mtb) strain H37Rv has been available for over a decade, but the biology of the pathogen remains poorly understood. Genome sequences from other Mtb strains and closely related bacteria present an opportunity to apply the power of comparative genomics to understand the evolution of Mtb pathogenesis. We conducted a comparative analysis using 31 genomes from the Tuberculosis Database (TBDB.org), including 8 strains of Mtb and M. bovis, 11 additional Mycobacteria, 4 Corynebacteria, 2 Streptomyces, Rhodococcus jostii RHA1, Nocardia farcinia, Acidothermus cellulolyticus, Rhodobacter sphaeroides, Propionibacterium acnes, and Bifidobacterium longum. RESULTS: Our results highlight the functional importance of lipid metabolism and its regulation, and reveal variation between the evolutionary profiles of genes implicated in saturated and unsaturated fatty acid metabolism. It also suggests that DNA repair and molybdopterin cofactors are important in pathogenic Mycobacteria. By analyzing sequence conservation and gene expression data, we identify nearly 400 conserved noncoding regions. These include 37 predicted promoter regulatory motifs, of which 14 correspond to previously validated motifs, as well as 50 potential noncoding RNAs, of which we experimentally confirm the expression of four. CONCLUSIONS: Our analysis of protein evolution highlights gene families that are associated with the adaptation of environmental Mycobacteria to obligate pathogenesis. These families include fatty acid metabolism, DNA repair, and molybdopterin biosynthesis. Our analysis reinforces recent findings suggesting that small noncoding RNAs are more common in Mycobacteria than previously expected. Our data provide a foundation for understanding the genome and biology of Mtb in a comparative context, and are available online and through TBDB.org.


Asunto(s)
Actinobacteria/genética , Evolución Molecular , Mycobacterium tuberculosis/genética , Mycobacterium/genética , Actinobacteria/clasificación , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Coenzimas/genética , Coenzimas/metabolismo , Reparación del ADN , Bases de Datos Genéticas , Ácidos Grasos/genética , Ácidos Grasos/metabolismo , Genoma Bacteriano , Genómica , Metabolismo de los Lípidos/genética , Metaloproteínas/genética , Metaloproteínas/metabolismo , Cofactores de Molibdeno , Mycobacterium/clasificación , Mycobacterium tuberculosis/clasificación , Filogenia , Pteridinas/metabolismo , ARN no Traducido/química , ARN no Traducido/metabolismo
20.
Science ; 332(6032): 930-6, 2011 May 20.
Artículo en Inglés | MEDLINE | ID: mdl-21511999

RESUMEN

The fission yeast clade--comprising Schizosaccharomyces pombe, S. octosporus, S. cryophilus, and S. japonicus--occupies the basal branch of Ascomycete fungi and is an important model of eukaryote biology. A comparative annotation of these genomes identified a near extinction of transposons and the associated innovation of transposon-free centromeres. Expression analysis established that meiotic genes are subject to antisense transcription during vegetative growth, which suggests a mechanism for their tight regulation. In addition, trans-acting regulators control new genes within the context of expanded functional modules for meiosis and stress response. Differences in gene content and regulation also explain why, unlike the budding yeast of Saccharomycotina, fission yeasts cannot use ethanol as a primary carbon source. These analyses elucidate the genome structure and gene regulation of fission yeast and provide tools for investigation across the Schizosaccharomyces clade.


Asunto(s)
Genoma Fúngico , Schizosaccharomyces/genética , Centrómero/genética , Centrómero/fisiología , Centrómero/ultraestructura , Elementos Transponibles de ADN , Evolución Molecular , Perfilación de la Expresión Génica , Regulación Fúngica de la Expresión Génica , Genes del Tipo Sexual de los Hongos , Genómica , Glucosa/metabolismo , Meiosis , Anotación de Secuencia Molecular , Datos de Secuencia Molecular , Filogenia , ARN sin Sentido/genética , ARN de Hongos/genética , ARN Interferente Pequeño/genética , ARN no Traducido/genética , Elementos Reguladores de la Transcripción , Schizosaccharomyces/crecimiento & desarrollo , Schizosaccharomyces/metabolismo , Proteínas de Schizosaccharomyces pombe/genética , Proteínas de Schizosaccharomyces pombe/metabolismo , Análisis de Secuencia de ADN , Especificidad de la Especie , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Transcripción Genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...