Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Proc Natl Acad Sci U S A ; 121(13): e2313367121, 2024 Mar 26.
Artículo en Inglés | MEDLINE | ID: mdl-38517978

RESUMEN

The chronology and phylogeny of bacterial evolution are difficult to reconstruct due to a scarce fossil record. The analysis of bacterial genomes remains challenging because of large sequence divergence, the plasticity of bacterial genomes due to frequent gene loss, horizontal gene transfer, and differences in selective pressure from one locus to another. Therefore, taking advantage of the rich and rapidly accumulating genomic data requires accurate modeling of genome evolution. An important technical consideration is that loci with high effective mutation rates may diverge beyond the detection limit of the alignment algorithms used, biasing the genome-wide divergence estimates toward smaller divergences. In this article, we propose a novel method to gain insight into bacterial evolution based on statistical properties of genome comparisons. We find that the length distribution of sequence matches is shaped by the effective mutation rates of different loci, by the horizontal transfers, and by the aligner sensitivity. Based on these inputs, we build a model and show that it accounts for the empirically observed distributions, taking the Enterobacteriaceae family as an example. Our method allows to distinguish segments of vertical and horizontal origins and to estimate the time divergence and exchange rate between any pair of taxa from genome-wide alignments. Based on the estimated time divergences, we construct a time-calibrated phylogenetic tree to demonstrate the accuracy of the method.


Asunto(s)
Genoma Bacteriano , Modelos Genéticos , Filogenia , Genoma Bacteriano/genética , Genómica/métodos , Bacterias/genética , Evolución Molecular
2.
Am J Respir Crit Care Med ; 208(10): 1075-1087, 2023 Nov 15.
Artículo en Inglés | MEDLINE | ID: mdl-37708400

RESUMEN

Rationale: IL-33 is a proinflammatory cytokine thought to play a role in the pathogenesis of asthma and chronic obstructive pulmonary disease (COPD). A recent clinical trial using an anti-IL-33 antibody showed a reduction in exacerbation and improved lung function in ex-smokers but not current smokers with COPD. Objectives: This study aimed to understand the effects of smoking status on IL-33. Methods: We investigated the association of smoking status with the level of gene expression of IL-33 in the airways in eight independent transcriptomic studies of lung airways. Additionally, we performed Western blot analysis and immunohistochemistry for IL-33 in lung tissue to assess protein levels. Measurements and Main Results: Across the bulk RNA-sequencing datasets, IL-33 gene expression and its signaling pathway were significantly lower in current versus former or never-smokers and increased upon smoking cessation (P < 0.05). Single-cell sequencing showed that IL-33 is predominantly expressed in resting basal epithelial cells and decreases during the differentiation process triggered by smoke exposure. We also found a higher transitioning of this cellular subpopulation into a more differentiated cell type during chronic smoking, potentially driving the reduction of IL-33. Protein analysis demonstrated lower IL-33 levels in lung tissue from current versus former smokers with COPD and a lower proportion of IL-33-positive basal cells in current versus ex-smoking controls. Conclusions: We provide strong evidence that cigarette smoke leads to an overall reduction in IL-33 expression in transcriptomic and protein level, and this may be due to the decrease in resting basal cells. Together, these findings may explain the clinical observation that a recent antibody-based anti-IL-33 treatment is more effective in former than current smokers with COPD.


Asunto(s)
Enfermedad Pulmonar Obstructiva Crónica , Fumadores , Humanos , Interleucina-33/genética , Fumar/genética , Enfermedad Pulmonar Obstructiva Crónica/patología , Perfilación de la Expresión Génica
3.
Nucleic Acids Res ; 47(10): 5114-5125, 2019 06 04.
Artículo en Inglés | MEDLINE | ID: mdl-30916335

RESUMEN

The replication program of vertebrate genomes is driven by the chromosomal distribution and timing of activation of tens of thousands of replication origins. Genome-wide studies have shown the association of origins with promoters and CpG islands, and their enrichment in G-quadruplex motifs (G4). However, the genetic determinants driving their activity remain poorly understood. To gain insight on the constraints operating on origins, we conducted the first evolutionary comparison of origins across vertebrates. We generated a genome-wide map of chicken origins (the first of a bird genome), and performed a comparison with human and mouse maps. The analysis of intra-species polymorphism revealed a strong depletion of genetic diversity at the core of replication initiation loci. This depletion is not linked to the presence of G4 motifs, promoters or CpG islands. In contrast, we show that origins experienced a rapid turnover during vertebrate evolution, since pairwise comparisons of origin maps revealed that <24% of them are conserved among vertebrates. This study unravels the existence of a novel determinant of origins, the precise functional role of which remains to be determined. Despite the importance of replication initiation for the fitness of organisms, the distribution of origins along vertebrate chromosomes is highly flexible.


Asunto(s)
Islas de CpG , Replicación del ADN , Genoma , Origen de Réplica , Animales , Pollos , G-Cuádruplex , Células HeLa , Humanos , Células K562 , Ratones , Polimorfismo Genético , Polimorfismo de Nucleótido Simple , Reproducibilidad de los Resultados , Especificidad de la Especie
5.
Mol Biol Evol ; 32(2): 524-35, 2015 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-25398628

RESUMEN

Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One such feature is the match length distribution (MLD) of exactly matching sequence segments within an individual genome or between the genomes of related species. These have been observed to exhibit characteristic power law decays in many species. Here, we show that simple dynamical models consisting solely of duplication and mutation processes can already explain the characteristic features of MLDs observed in genomic sequences. Surprisingly, we find that these features are largely insensitive to details of the underlying mutational processes and do not necessarily rely on the action of natural selection. Our results demonstrate how analyzing statistical features of DNA sequences can help us reveal and quantify the different mutational processes that underlie genome evolution.


Asunto(s)
Genoma/genética , Genómica/métodos , Animales , Evolución Biológica , Evolución Molecular , Duplicación de Gen/genética , Humanos , Duplicaciones Segmentarias en el Genoma/genética , Selección Genética
6.
Commun Biol ; 7(1): 823, 2024 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-38971915

RESUMEN

Recent progress in image-based spatial RNA profiling enables to spatially resolve tens to hundreds of distinct RNA species with high spatial resolution. It presents new avenues for comprehending tissue organization. In this context, the ability to assign detected RNA transcripts to individual cells is crucial for downstream analyses, such as in-situ cell type calling. Yet, accurate cell segmentation can be challenging in tissue data, in particular in the absence of a high-quality membrane marker. To address this issue, we introduce ComSeg, a segmentation algorithm that operates directly on single RNA positions and that does not come with implicit or explicit priors on cell shape. ComSeg is applicable in complex tissues with arbitrary cell shapes. Through comprehensive evaluations on simulated and experimental datasets, we show that ComSeg outperforms existing state-of-the-art methods for in-situ single-cell RNA profiling and in-situ cell type calling. ComSeg is available as a documented and open source pip package at https://github.com/fish-quant/ComSeg .


Asunto(s)
Algoritmos , Perfilación de la Expresión Génica , Análisis de la Célula Individual , Transcriptoma , Perfilación de la Expresión Génica/métodos , Análisis de la Célula Individual/métodos , Procesamiento de Imagen Asistido por Computador/métodos , Humanos , Animales , Programas Informáticos , ARN/genética , Hibridación Fluorescente in Situ/métodos
7.
Genome Med ; 16(1): 54, 2024 04 08.
Artículo en Inglés | MEDLINE | ID: mdl-38589970

RESUMEN

BACKGROUND: Lung cancer is the leading cause of cancer-related death in the world. In contrast to many other cancers, a direct connection to modifiable lifestyle risk in the form of tobacco smoke has long been established. More than 50% of all smoking-related lung cancers occur in former smokers, 40% of which occur more than 15 years after smoking cessation. Despite extensive research, the molecular processes for persistent lung cancer risk remain unclear. We thus set out to examine whether risk stratification in the clinic and in the general population can be improved upon by the addition of genetic data and to explore the mechanisms of the persisting risk in former smokers. METHODS: We analysed transcriptomic data from accessible airway tissues of 487 subjects, including healthy volunteers and clinic patients of different smoking statuses. We developed a computational model to assess smoking-associated gene expression changes and their reversibility after smoking is stopped, comparing healthy subjects to clinic patients with and without lung cancer. RESULTS: We find persistent smoking-associated immune alterations to be a hallmark of the clinic patients. Integrating previous GWAS data using a transcriptional network approach, we demonstrate that the same immune- and interferon-related pathways are strongly enriched for genes linked to known genetic risk factors, demonstrating a causal relationship between immune alteration and lung cancer risk. Finally, we used accessible airway transcriptomic data to derive a non-invasive lung cancer risk classifier. CONCLUSIONS: Our results provide initial evidence for germline-mediated personalized smoke injury response and risk in the general population, with potential implications for managing long-term lung cancer incidence and mortality.


Asunto(s)
Neoplasias Pulmonares , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/metabolismo , Fumar/efectos adversos , Fumar/genética , Pulmón/metabolismo , Nicotiana , Mucosa Nasal/metabolismo , Transcriptoma
8.
Phys Rev Lett ; 110(14): 148101, 2013 Apr 05.
Artículo en Inglés | MEDLINE | ID: mdl-25167038

RESUMEN

Recently, an enrichment of identical matching sequences has been found in many eukaryotic genomes. Their length distribution exhibits a power law tail raising the question of what evolutionary mechanism or functional constraints would be able to shape this distribution. Here we introduce a simple and evolutionarily neutral model, which involves only point mutations and segmental duplications, and produces the same statistical features as observed for genomic data. Further, we extend a mathematical model for random stick breaking to analytically show that the exponent of the power law tail is -3 and universal as it does not depend on the microscopic details of the model.


Asunto(s)
ADN/genética , Evolución Molecular , Modelos Genéticos , Eucariontes , Duplicación de Gen , Genoma , Genoma Humano , Humanos , Mutación Puntual
9.
Nat Commun ; 14(1): 4843, 2023 08 10.
Artículo en Inglés | MEDLINE | ID: mdl-37563125

RESUMEN

Replication of vertebrate genomes is tightly regulated to ensure accurate duplication, but our understanding of the interplay between genetic and epigenetic factors in this regulation remains incomplete. Here, we investigated the involvement of three elements enriched at gene promoters and replication origins: guanine-rich motifs potentially forming G-quadruplexes (pG4s), nucleosome-free regions (NFRs), and the histone variant H2A.Z, in the firing of origins of replication in vertebrates. We show that two pG4s on the same DNA strand (dimeric pG4s) are sufficient to induce the assembly of an efficient minimal replication origin without inducing transcription in avian DT40 cells. Dimeric pG4s in replication origins are associated with formation of an NFR next to precisely-positioned nucleosomes enriched in H2A.Z on this minimal origin and genome-wide. Thus, our data suggest that dimeric pG4s are important for the organization and duplication of vertebrate genomes. It supports the hypothesis that a nucleosome close to an NFR is a shared signal for the formation of replication origins in eukaryotes.


Asunto(s)
G-Cuádruplex , Nucleosomas , Animales , Nucleosomas/genética , Origen de Réplica/genética , Replicación del ADN/genética , Histonas/genética , Histonas/metabolismo , Vertebrados/genética , Vertebrados/metabolismo
10.
Nat Commun ; 14(1): 2445, 2023 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-37117166

RESUMEN

Radiation Induced Lung Injury (RILI) is one of the main limiting factors of thorax irradiation, which can induce acute pneumonitis as well as pulmonary fibrosis, the latter being a life-threatening condition. The order of cellular and molecular events in the progression towards fibrosis is key to the physiopathogenesis of the disease, yet their coordination in space and time remains largely unexplored. Here, we present an interactive murine single cell atlas of the lung response to irradiation, generated from C57BL6/J female mice. This tool opens the door for exploration of the spatio-temporal dynamics of the mechanisms that lead to radiation-induced pulmonary fibrosis. It depicts with unprecedented detail cell type-specific radiation-induced responses associated with either lung regeneration or the failure thereof. A better understanding of the mechanisms leading to lung fibrosis will help finding new therapeutic options that could improve patients' quality of life.


Asunto(s)
Lesión Pulmonar , Fibrosis Pulmonar , Traumatismos por Radiación , Neumonitis por Radiación , Femenino , Animales , Ratones , Fibrosis Pulmonar/etiología , Fibrosis Pulmonar/patología , Neumonitis por Radiación/etiología , Neumonitis por Radiación/patología , Calidad de Vida , Pulmón/patología , Lesión Pulmonar/etiología , Lesión Pulmonar/patología , Tórax
11.
Elife ; 102021 06 14.
Artículo en Inglés | MEDLINE | ID: mdl-34121661

RESUMEN

Horizontal gene transfer (HGT) is an essential force in microbial evolution. Despite detailed studies on a variety of systems, a global picture of HGT in the microbial world is still missing. Here, we exploit that HGT creates long identical DNA sequences in the genomes of distant species, which can be found efficiently using alignment-free methods. Our pairwise analysis of 93,481 bacterial genomes identified 138,273 HGT events. We developed a model to explain their statistical properties as well as estimate the transfer rate between pairs of taxa. This reveals that long-distance HGT is frequent: our results indicate that HGT between species from different phyla has occurred in at least 8% of the species. Finally, our results confirm that the function of sequences strongly impacts their transfer rate, which varies by more than three orders of magnitude between different functional categories. Overall, we provide a comprehensive view of HGT, illuminating a fundamental process driving bacterial evolution.


Asunto(s)
Bacterias , Evolución Molecular , Transferencia de Gen Horizontal/genética , Genoma Bacteriano/genética , Archaea/clasificación , Archaea/genética , Bacterias/clasificación , Bacterias/genética , Genoma Arqueal/genética , Genómica , Alineación de Secuencia , Análisis de Secuencia de ADN
12.
Nat Commun ; 12(1): 5330, 2021 09 09.
Artículo en Inglés | MEDLINE | ID: mdl-34504093

RESUMEN

Most autosomal genes are thought to be expressed from both alleles, with some notable exceptions, including imprinted genes and genes showing random monoallelic expression (RME). The extent and nature of RME has been the subject of debate. Here we investigate the expression of several candidate RME genes in F1 hybrid mouse cells before and after differentiation, to define how they become persistently, monoallelically expressed. Clonal monoallelic expression is not present in embryonic stem cells, but we observe high frequencies of monoallelism in neuronal progenitor cells by assessing expression status in more than 200 clones. We uncover unforeseen modes of allelic expression that appear to be gene-specific and epigenetically regulated. This non-canonical allelic regulation has important implications for development and disease, including autosomal dominant disorders and opens up therapeutic perspectives.


Asunto(s)
Alelos , Desequilibrio Alélico , Epigénesis Genética , Enfermedades Musculares/genética , Células-Madre Neurales/metabolismo , Enfermedades Neurodegenerativas/genética , Ácido Anhídrido Hidrolasas/genética , Ácido Anhídrido Hidrolasas/metabolismo , Proteínas Adaptadoras Transductoras de Señales/genética , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Animales , Proteínas Reguladoras de la Apoptosis/genética , Proteínas Reguladoras de la Apoptosis/metabolismo , Diferenciación Celular , Quimera , Células Clonales , Metilación de ADN , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismo , Modelos Animales de Enfermedad , Femenino , Dosificación de Gen , Frecuencia de los Genes , Sitios Genéticos , Impresión Genómica , Masculino , Ratones , Enfermedades Musculares/metabolismo , Enfermedades Musculares/patología , Neoplasias/genética , Neoplasias/metabolismo , Neoplasias/patología , Células-Madre Neurales/patología , Enfermedades Neurodegenerativas/metabolismo , Enfermedades Neurodegenerativas/patología , Osteoporosis/genética , Osteoporosis/metabolismo , Osteoporosis/patología , Receptores de Ácido Kaínico/genética , Receptores de Ácido Kaínico/metabolismo , Receptor de Ácido Kaínico GluK2
13.
Genetics ; 204(2): 475-482, 2016 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-27474728

RESUMEN

For several decades, sequence alignment has been a widely used tool in bioinformatics. For instance, finding homologous sequences with a known function in large databases is used to get insight into the function of nonannotated genomic regions. Very efficient tools like BLAST have been developed to identify and rank possible homologous sequences. To estimate the significance of the homology, the ranking of alignment scores takes a background model for random sequences into account. Using this model we can estimate the probability to find two exactly matching subsequences by chance in two unrelated sequences. For two homologous sequences, the corresponding probability is much higher, which allows us to identify them. Here we focus on the distribution of lengths of exact sequence matches between protein-coding regions of pairs of evolutionarily distant genomes. We show that this distribution exhibits a power-law tail with an exponent [Formula: see text] Developing a simple model of sequence evolution by substitutions and segmental duplications, we show analytically and computationally that paralogous and orthologous gene pairs contribute differently to this distribution. Our model explains the differences observed in the comparison of coding and noncoding parts of genomes, thus providing a better understanding of statistical properties of genomic sequences and their evolution.


Asunto(s)
Biología Computacional/métodos , Evolución Molecular , Alineación de Secuencia/métodos , Homología de Secuencia , Genoma , Genómica , Modelos Genéticos , Probabilidad , Duplicaciones Segmentarias en el Genoma/genética
14.
Sci Rep ; 6: 30851, 2016 08 04.
Artículo en Inglés | MEDLINE | ID: mdl-27488939

RESUMEN

Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf's law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes.


Asunto(s)
Elementos Alu/genética , Evolución Molecular , Genoma Humano/genética , Modelos Genéticos , Humanos
15.
PLoS One ; 10(3): e0120206, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25826216

RESUMEN

A Yule tree is the result of a branching process with constant birth and death rates. Such a process serves as an instructive null model of many empirical systems, for instance, the evolution of species leading to a phylogenetic tree. However, often in phylogeny the only available information is the pairwise distances between a small fraction of extant species representing the leaves of the tree. In this article we study statistical properties of the pairwise distances in a Yule tree. Using a method based on a recursion, we derive an exact, analytic and compact formula for the expected number of pairs separated by a certain time distance. This number turns out to follow a increasing exponential function. This property of a Yule tree can serve as a simple test for empirical data to be well described by a Yule process. We further use this recursive method to calculate the expected number of the n-most closely related pairs of leaves and the number of cherries separated by a certain time distance. To make our results more useful for realistic scenarios, we explicitly take into account that the leaves of a tree may be incompletely sampled and derive a criterion for poorly sampled phylogenies. We show that our result can account for empirical data, using two families of birds species.


Asunto(s)
Hojas de la Planta , Árboles/crecimiento & desarrollo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA