Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 58
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 179(4): 984-1002.e36, 2019 10 31.
Artigo em Inglês | MEDLINE | ID: mdl-31675503

RESUMO

Genomic studies in African populations provide unique opportunities to understand disease etiology, human diversity, and population history. In the largest study of its kind, comprising genome-wide data from 6,400 individuals and whole-genome sequences from 1,978 individuals from rural Uganda, we find evidence of geographically correlated fine-scale population substructure. Historically, the ancestry of modern Ugandans was best represented by a mixture of ancient East African pastoralists. We demonstrate the value of the largest sequence panel from Africa to date as an imputation resource. Examining 34 cardiometabolic traits, we show systematic differences in trait heritability between European and African populations, probably reflecting the differential impact of genes and environment. In a multi-trait pan-African GWAS of up to 14,126 individuals, we identify novel loci associated with anthropometric, hematological, lipid, and glycemic traits. We find that several functionally important signals are driven by Africa-specific variants, highlighting the value of studying diverse populations across the region.


Assuntos
População Negra/genética , Predisposição Genética para Doença , Genoma Humano/genética , Genômica , Feminino , Frequência do Gene/genética , Estudo de Associação Genômica Ampla , Humanos , Masculino , Polimorfismo de Nucleotídeo Único/genética , Uganda/epidemiologia , Sequenciamento Completo do Genoma
2.
Nature ; 606(7913): 335-342, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35650444

RESUMO

Clonal expansions driven by somatic mutations become pervasive across human tissues with age, including in the haematopoietic system, where the phenomenon is termed clonal haematopoiesis1-4. The understanding of how and when clonal haematopoiesis develops, the factors that govern its behaviour, how it interacts with ageing and how these variables relate to malignant progression remains limited5,6. Here we track 697 clonal haematopoiesis clones from 385 individuals 55 years of age or older over a median of 13 years. We find that 92.4% of clones expanded at a stable exponential rate over the study period, with different mutations driving substantially different growth rates, ranging from 5% (DNMT3A and TP53) to more than 50% per year (SRSF2P95H). Growth rates of clones with the same mutation differed by approximately ±5% per year, proportionately affecting slow drivers more substantially. By combining our time-series data with phylogenetic analysis of 1,731 whole-genome sequences of haematopoietic colonies from 7 individuals from an older age group, we reveal distinct patterns of lifelong clonal behaviour. DNMT3A-mutant clones preferentially expanded early in life and displayed slower growth in old age, in the context of an increasingly competitive oligoclonal landscape. By contrast, splicing gene mutations drove expansion only later in life, whereas TET2-mutant clones emerged across all ages. Finally, we show that mutations driving faster clonal growth carry a higher risk of malignant progression. Our findings characterize the lifelong natural history of clonal haematopoiesis and give fundamental insights into the interactions between somatic mutation, ageing and clonal selection.


Assuntos
Hematopoiese Clonal , Células Clonais , Idoso , Envelhecimento , Hematopoiese Clonal/genética , Células Clonais/citologia , Genoma Humano , Humanos , Estudos Longitudinais , Pessoa de Meia-Idade , Mutação , Filogenia
3.
Nature ; 606(7913): 343-350, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35650442

RESUMO

Age-related change in human haematopoiesis causes reduced regenerative capacity1, cytopenias2, immune dysfunction3 and increased risk of blood cancer4-6, but the reason for such abrupt functional decline after 70 years of age remains unclear. Here we sequenced 3,579 genomes from single cell-derived colonies of haematopoietic cells across 10 human subjects from 0 to 81 years of age. Haematopoietic stem cells or multipotent progenitors (HSC/MPPs) accumulated a mean of 17 mutations per year after birth and lost 30 base pairs per year of telomere length. Haematopoiesis in adults less than 65 years of age was massively polyclonal, with high clonal diversity and a stable population of 20,000-200,000 HSC/MPPs contributing evenly to blood production. By contrast, haematopoiesis in individuals aged over 75 showed profoundly decreased clonal diversity. In each of the older subjects, 30-60% of haematopoiesis was accounted for by 12-18 independent clones, each contributing 1-34% of blood production. Most clones had begun their expansion before the subject was 40 years old, but only 22% had known driver mutations. Genome-wide selection analysis estimated that between 1 in 34 and 1 in 12 non-synonymous mutations were drivers, accruing at constant rates throughout life, affecting more genes than identified in blood cancers. Loss of the Y chromosome conferred selective benefits in males. Simulations of haematopoiesis, with constant stem cell population size and constant acquisition of driver mutations conferring moderate fitness benefits, entirely explained the abrupt change in clonal structure in the elderly. Rapidly decreasing clonal diversity is a universal feature of haematopoiesis in aged humans, underpinned by pervasive positive selection acting on many more genes than currently identified.


Assuntos
Envelhecimento , Hematopoiese Clonal , Células Clonais , Longevidade , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Envelhecimento/genética , Criança , Pré-Escolar , Hematopoiese Clonal/genética , Células Clonais/citologia , Feminino , Neoplasias Hematológicas/genética , Neoplasias Hematológicas/patologia , Células-Tronco Hematopoéticas/citologia , Humanos , Lactente , Recém-Nascido , Masculino , Pessoa de Meia-Idade , Células-Tronco Multipotentes/citologia , Adulto Jovem
4.
Nature ; 604(7906): 517-524, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35418684

RESUMO

The rates and patterns of somatic mutation in normal tissues are largely unknown outside of humans1-7. Comparative analyses can shed light on the diversity of mutagenesis across species, and on long-standing hypotheses about the evolution of somatic mutation rates and their role in cancer and ageing. Here we performed whole-genome sequencing of 208 intestinal crypts from 56 individuals to study the landscape of somatic mutation across 16 mammalian species. We found that somatic mutagenesis was dominated by seemingly endogenous mutational processes in all species, including 5-methylcytosine deamination and oxidative damage. With some differences, mutational signatures in other species resembled those described in humans8, although the relative contribution of each signature varied across species. Notably, the somatic mutation rate per year varied greatly across species and exhibited a strong inverse relationship with species lifespan, with no other life-history trait studied showing a comparable association. Despite widely different life histories among the species we examined-including variation of around 30-fold in lifespan and around 40,000-fold in body mass-the somatic mutation burden at the end of lifespan varied only by a factor of around 3. These data unveil common mutational processes across mammals, and suggest that somatic mutation rates are evolutionarily constrained and may be a contributing factor in ageing.


Assuntos
Longevidade , Taxa de Mutação , Animais , Humanos , Longevidade/genética , Mamíferos/genética , Mutagênese/genética , Mutação
5.
Nature ; 598(7881): 473-478, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34646017

RESUMO

The progression of chronic liver disease to hepatocellular carcinoma is caused by the acquisition of somatic mutations that affect 20-30 cancer genes1-8. Burdens of somatic mutations are higher and clonal expansions larger in chronic liver disease9-13 than in normal liver13-16, which enables positive selection to shape the genomic landscape9-13. Here we analysed somatic mutations from 1,590 genomes across 34 liver samples, including healthy controls, alcohol-related liver disease and non-alcoholic fatty liver disease. Seven of the 29 patients with liver disease had mutations in FOXO1, the major transcription factor in insulin signalling. These mutations affected a single hotspot within the gene, impairing the insulin-mediated nuclear export of FOXO1. Notably, six of the seven patients with FOXO1S22W hotspot mutations showed convergent evolution, with variants acquired independently by up to nine distinct hepatocyte clones per patient. CIDEB, which regulates lipid droplet metabolism in hepatocytes17-19, and GPAM, which produces storage triacylglycerol from free fatty acids20,21, also had a significant excess of mutations. We again observed frequent convergent evolution: up to fourteen independent clones per patient with CIDEB mutations and up to seven clones per patient with GPAM mutations. Mutations in metabolism genes were distributed across multiple anatomical segments of the liver, increased clone size and were seen in both alcohol-related liver disease and non-alcoholic fatty liver disease, but rarely in hepatocellular carcinoma. Master regulators of metabolic pathways are a frequent target of convergent somatic mutation in alcohol-related and non-alcoholic fatty liver disease.


Assuntos
Hepatopatias/genética , Hepatopatias/metabolismo , Fígado/metabolismo , Mutação/genética , Transporte Ativo do Núcleo Celular/genética , Proteínas Reguladoras de Apoptose/genética , Linhagem Celular Tumoral , Doença Crônica , Estudos de Coortes , Ácidos Graxos não Esterificados/metabolismo , Feminino , Proteína Forkhead Box O1/genética , Proteína Forkhead Box O1/metabolismo , Humanos , Resistência à Insulina , Hepatopatias Alcoólicas/genética , Hepatopatias Alcoólicas/metabolismo , Masculino , Hepatopatia Gordurosa não Alcoólica/genética , Hepatopatia Gordurosa não Alcoólica/metabolismo , Triglicerídeos/metabolismo
6.
Nature ; 593(7859): 405-410, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-33911282

RESUMO

Somatic mutations drive the development of cancer and may contribute to ageing and other diseases1,2. Despite their importance, the difficulty of detecting mutations that are only present in single cells or small clones has limited our knowledge of somatic mutagenesis to a minority of tissues. Here, to overcome these limitations, we developed nanorate sequencing (NanoSeq), a duplex sequencing protocol with error rates of less than five errors per billion base pairs in single DNA molecules from cell populations. This rate is two orders of magnitude lower than typical somatic mutation loads, enabling the study of somatic mutations in any tissue independently of clonality. We used this single-molecule sensitivity to study somatic mutations in non-dividing cells across several tissues, comparing stem cells to differentiated cells and studying mutagenesis in the absence of cell division. Differentiated cells in blood and colon displayed remarkably similar mutation loads and signatures to their corresponding stem cells, despite mature blood cells having undergone considerably more divisions. We then characterized the mutational landscape of post-mitotic neurons and polyclonal smooth muscle, confirming that neurons accumulate somatic mutations at a constant rate throughout life without cell division, with similar rates to mitotically active tissues. Together, our results suggest that mutational processes that are independent of cell division are important contributors to somatic mutagenesis. We anticipate that the ability to reliably detect mutations in single DNA molecules could transform our understanding of somatic mutagenesis and enable non-invasive studies on large-scale cohorts.


Assuntos
Células Sanguíneas/metabolismo , Diferenciação Celular/genética , Análise Mutacional de DNA/métodos , Músculo Liso/metabolismo , Mutação , Neurônios/metabolismo , Imagem Individual de Molécula/métodos , Células-Tronco/metabolismo , Doença de Alzheimer/genética , Células Sanguíneas/citologia , Divisão Celular , Estudos de Coortes , Colo/citologia , Epitélio/metabolismo , Granulócitos/citologia , Granulócitos/metabolismo , Voluntários Saudáveis , Humanos , Masculino , Pessoa de Meia-Idade , Músculo Liso/citologia , Mutagênese , Taxa de Mutação , Neurônios/citologia , Células-Tronco/citologia
7.
Nature ; 578(7793): 102-111, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-32025015

RESUMO

The discovery of drivers of cancer has traditionally focused on protein-coding genes1-4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available.


Assuntos
Genoma Humano/genética , Mutação/genética , Neoplasias/genética , Quebras de DNA , Bases de Dados Genéticas , Regulação Neoplásica da Expressão Gênica , Estudo de Associação Genômica Ampla , Humanos , Mutação INDEL
8.
Nucleic Acids Res ; 2024 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-38953162

RESUMO

Ribosome profiling experiments support the translation of a range of novel human open reading frames. By contrast, most peptides from large-scale proteomics experiments derive from just one source, 5' untranslated regions. Across the human genome we find evidence for 192 translated upstream regions, most of which would produce protein isoforms with extended N-terminal ends. Almost all of these N-terminal extensions are from highly abundant genes, which suggests that the novel regions we detect are just the tip of the iceberg. These upstream regions have characteristics that are not typical of coding exons. Their GC-content is remarkably high, even higher than 5' regions in other genes, and a large majority have non-canonical start codons. Although some novel upstream regions have cross-species conservation - five have orthologues in invertebrates for example - the reading frames of two thirds are not conserved beyond simians. These non-conserved regions also have no evidence of purifying selection, which suggests that much of this translation is not functional. In addition, non-conserved upstream regions have significantly more peptides in cancer cell lines than would be expected, a strong indication that an aberrant or noisy translation initiation process may play an important role in translation from upstream regions.

9.
Nature ; 574(7779): 538-542, 2019 10.
Artigo em Inglês | MEDLINE | ID: mdl-31645727

RESUMO

The most common causes of chronic liver disease are excess alcohol intake, viral hepatitis and non-alcoholic fatty liver disease, with the clinical spectrum ranging in severity from hepatic inflammation to cirrhosis, liver failure or hepatocellular carcinoma (HCC). The genome of HCC exhibits diverse mutational signatures, resulting in recurrent mutations across more than 30 cancer genes1-7. Stem cells from normal livers have a low mutational burden and limited diversity of signatures8, which suggests that the complexity of HCC arises during the progression to chronic liver disease and subsequent malignant transformation. Here, by sequencing whole genomes of 482 microdissections of 100-500 hepatocytes from 5 normal and 9 cirrhotic livers, we show that cirrhotic liver has a higher mutational burden than normal liver. Although rare in normal hepatocytes, structural variants, including chromothripsis, were prominent in cirrhosis. Driver mutations, such as point mutations and structural variants, affected 1-5% of clones. Clonal expansions of millimetres in diameter occurred in cirrhosis, with clones sequestered by the bands of fibrosis that surround regenerative nodules. Some mutational signatures were universal and equally active in both non-malignant hepatocytes and HCCs; some were substantially more active in HCCs than chronic liver disease; and others-arising from exogenous exposures-were present in a subset of patients. The activity of exogenous signatures between adjacent cirrhotic nodules varied by up to tenfold within each patient, as a result of clone-specific and microenvironmental forces. Synchronous HCCs exhibited the same mutational signatures as background cirrhotic liver, but with higher burden. Somatic mutations chronicle the exposures, toxicity, regeneration and clonal structure of liver tissue as it progresses from health to disease.


Assuntos
Células Clonais/citologia , Células Clonais/patologia , Fibrose/genética , Fibrose/patologia , Fígado/citologia , Fígado/metabolismo , Mutação , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/patologia , Células Clonais/metabolismo , Análise Mutacional de DNA , Hepatócitos/citologia , Hepatócitos/metabolismo , Hepatócitos/patologia , Humanos , Fígado/patologia , Masculino , Pessoa de Meia-Idade , Filogenia , Células-Tronco/citologia , Células-Tronco/metabolismo , Células-Tronco/patologia
11.
Nucleic Acids Res ; 49(14): 8232-8246, 2021 08 20.
Artigo em Inglês | MEDLINE | ID: mdl-34302486

RESUMO

Most coding genes in the human genome are annotated with multiple alternative transcripts. However, clear evidence for the functional relevance of the protein isoforms produced by these alternative transcripts is often hard to find. Alternative isoforms generated from tandem exon duplication-derived substitutions are an exception. These splice events are rare, but have important functional consequences. Here, we have catalogued the 236 tandem exon duplication-derived substitutions annotated in the GENCODE human reference set. We find that more than 90% of the events have a last common ancestor in teleost fish, so are at least 425 million years old, and twenty-one can be traced back to the Bilateria clade. Alternative isoforms generated from tandem exon duplication-derived substitutions also have significantly more clinical impact than other alternative isoforms. Tandem exon duplication-derived substitutions have >25 times as many pathogenic and likely pathogenic mutations as other alternative events. Tandem exon duplication-derived substitutions appear to have vital functional roles in the cell and may have played a prominent part in metazoan evolution.


Assuntos
Evolução Molecular , Peixes/genética , Genoma Humano/genética , Isoformas de Proteínas/genética , Processamento Alternativo/genética , Animais , Éxons/genética , Duplicação Gênica/genética , Humanos , Anotação de Sequência Molecular , Alinhamento de Sequência
12.
Mol Cancer ; 21(1): 183, 2022 09 22.
Artigo em Inglês | MEDLINE | ID: mdl-36131292

RESUMO

BACKGROUND: Up to 80% of cases of prostate cancer present with multifocal independent tumour lesions leading to the concept of a field effect present in the normal prostate predisposing to cancer development. In the present study we applied Whole Genome DNA Sequencing (WGS) to a group of morphologically normal tissue (n = 51), including benign prostatic hyperplasia (BPH) and non-BPH samples, from men with and men without prostate cancer. We assess whether the observed genetic changes in morphologically normal tissue are linked to the development of cancer in the prostate. RESULTS: Single nucleotide variants (P = 7.0 × 10-03, Wilcoxon rank sum test) and small insertions and deletions (indels, P = 8.7 × 10-06) were significantly higher in morphologically normal samples, including BPH, from men with prostate cancer compared to those without. The presence of subclonal expansions under selective pressure, supported by a high level of mutations, were significantly associated with samples from men with prostate cancer (P = 0.035, Fisher exact test). The clonal cell fraction of normal clones was always higher than the proportion of the prostate estimated as epithelial (P = 5.94 × 10-05, paired Wilcoxon signed rank test) which, along with analysis of primary fibroblasts prepared from BPH specimens, suggests a stromal origin. Constructed phylogenies revealed lineages associated with benign tissue that were completely distinct from adjacent tumour clones, but a common lineage between BPH and non-BPH morphologically normal tissues was often observed. Compared to tumours, normal samples have significantly less single nucleotide variants (P = 3.72 × 10-09, paired Wilcoxon signed rank test), have very few rearrangements and a complete lack of copy number alterations. CONCLUSIONS: Cells within regions of morphologically normal tissue (both BPH and non-BPH) can expand under selective pressure by mechanisms that are distinct from those occurring in adjacent cancer, but that are allied to the presence of cancer. Expansions, which are probably stromal in origin, are characterised by lack of recurrent driver mutations, by almost complete absence of structural variants/copy number alterations, and mutational processes similar to malignant tissue. Our findings have implications for treatment (focal therapy) and early detection approaches.


Assuntos
Hiperplasia Prostática , Neoplasias da Próstata , Células Clonais/patologia , Humanos , Masculino , Nucleotídeos , Próstata/patologia , Hiperplasia Prostática/genética , Hiperplasia Prostática/patologia , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia
13.
Trends Biochem Sci ; 42(2): 98-110, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27712956

RESUMO

Alternative splicing is commonly believed to be a major source of cellular protein diversity. However, although many thousands of alternatively spliced transcripts are routinely detected in RNA-seq studies, reliable large-scale mass spectrometry-based proteomics analyses identify only a small fraction of annotated alternative isoforms. The clearest finding from proteomics experiments is that most human genes have a single main protein isoform, while those alternative isoforms that are identified tend to be the most biologically plausible: those with the most cross-species conservation and those that do not compromise functional domains. Indeed, most alternative exons do not seem to be under selective pressure, suggesting that a large majority of predicted alternative transcripts may not even be translated into proteins.


Assuntos
Processamento Alternativo/genética , Proteoma/genética , Éxons , Isoformas de Proteínas/genética , Proteômica
14.
Haematologica ; 106(11): 2918-2926, 2021 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-33054126

RESUMO

Nodal peripheral T-cell lymphoma not otherwise specified (PTCL-NOS) remains a diagnosis encompassing a heterogenous group of PTCL cases not fitting criteria for more homogeneous subtypes. They are characterized by a poor clinical outcome when treated with anthracycline-containing regimens. A better understanding of their biology could improve prognostic stratification and foster the development of novel therapeutic approaches. Recent targeted and whole exome sequencing studies have shown recurrent copy number abnormalities (CNAs) with prognostic significance. Here, investigating 5 formalin-fixed, paraffin embedded cases of PTCL-NOS by whole genome sequencing (WGS), we found a high prevalence of structural variants and complex events, such as chromothripsis likely responsible for the observed CNAs. Among them, CDKN2A and PTEN deletions emerged as the most frequent aberration, as confirmed in a final cohort of 143 patients with nodal PTCL. The incidence of CDKN2A and PTEN deletions among PTCL-NOS was 46% and 26%, respectively. Furthermore, we found that co-occurrence of CDKN2A and PTEN deletions is an event associated with PTCL-NOS with absolute specificity. In contrast, these deletions were rare and never co-occurred in angioimmunoblastic and anaplastic lymphomas. CDKN2A deletion was associated with shorter overall survival in multivariate analysis corrected by age, IPI, transplant eligibility and GATA3 expression (adjusted HR =2.53; 95% CI 1.006-6.3; p=0.048). These data suggest that CDKN2A deletions may be relevant for refining the prognosis of PTCL-NOS and their significance should be evaluated in prospective trials.


Assuntos
Inibidor p16 de Quinase Dependente de Ciclina/genética , Linfoma de Células T Periférico , Antraciclinas , Estudos de Coortes , Deleção de Genes , Humanos , Linfoma de Células T Periférico/diagnóstico , Linfoma de Células T Periférico/genética , PTEN Fosfo-Hidrolase , Prognóstico , Estudos Prospectivos
15.
Nucleic Acids Res ; 46(14): 7070-7084, 2018 08 21.
Artigo em Inglês | MEDLINE | ID: mdl-29982784

RESUMO

Seventeen years after the sequencing of the human genome, the human proteome is still under revision. One in eight of the 22 210 coding genes listed by the Ensembl/GENCODE, RefSeq and UniProtKB reference databases are annotated differently across the three sets. We have carried out an in-depth investigation on the 2764 genes classified as coding by one or more sets of manual curators and not coding by others. Data from large-scale genetic variation analyses suggests that most are not under protein-like purifying selection and so are unlikely to code for functional proteins. A further 1470 genes annotated as coding in all three reference sets have characteristics that are typical of non-coding genes or pseudogenes. These potential non-coding genes also appear to be undergoing neutral evolution and have considerably less supporting transcript and protein evidence than other coding genes. We believe that the three reference databases currently overestimate the number of human coding genes by at least 2000, complicating and adding noise to large-scale biomedical experiments. Determining which potential non-coding genes do not code for proteins is a difficult but vitally important task since the human reference proteome is a fundamental pillar of most basic research and supports almost all large-scale biomedical projects.


Assuntos
Genes , Anticorpos , Variações do Número de Cópias de DNA , Variação Genética , Genoma Humano , Humanos , Anotação de Sequência Molecular , Proteínas/genética , Proteínas/imunologia , Proteínas/metabolismo , Pseudogenes
16.
Plant Mol Biol ; 97(4-5): 337-345, 2018 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-29850988

RESUMO

KEY MESSAGE: We provide novel genomic resources for Taxus baccata in the form of a reference transcriptome, SSR and SNP markers, and orthologous single-copy genes, useful for phylogenomic and population genomic applications. English yew (T. baccata) is the only European representative of the Taxaceae family, a conifer group originated in the Jurassic period. The wide extent of environmental heterogeneity within the species' range, together with its long presence in Europe, make English yew an ideal species to investigate adaptive evolution in conifers. To enlarge the genomic resources available for this species, we used Illumina short read sequencing followed by de novo assembly to build the transcriptome of English yew. In addition to a fully annotated transcriptome as well as large sets of new potential SSR and SNP markers for T. baccata, we provide a data set of orthologous single-copy genes across three Taxus species using Picea sitchensis as outgroup, and discuss ortholog uses and limitations for phylogenomic and population genomic applications.


Assuntos
Taxus/genética , Transcriptoma , DNA Complementar/química , DNA Complementar/genética , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites/genética , Anotação de Sequência Molecular , Filogenia , Picea/genética , Polimorfismo de Nucleotídeo Único/genética , Análise de Sequência de DNA
18.
Bioinformatics ; 31(14): 2257-61, 2015 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-25735770

RESUMO

Transposable elements constitute a large fraction of vertebrate genomes and, during evolution, may be co-opted for new functions. Exonization of transposable elements inserted within or close to host genes is one possible way to generate new genes, and alternative splicing of the new exons may represent an intermediate step in this process. The genes TMPO and ZNF451 are present in all vertebrate lineages. Although they are not evolutionarily related, mammalian TMPO and ZNF451 do have something in common-they both code for splice isoforms that contain LAP2alpha domains. We found that these LAP2alpha domains have sequence similarity to repetitive sequences in non-mammalian genomes, which are in turn related to the first ORF from a DIRS1-like retrotransposon. This retrotransposon domestication happened separately and resulted in proteins that combine retrotransposon and host protein domains. The alternative splicing of the retrotransposed sequence allowed the production of both the new and the untouched original isoforms, which may have contributed to the success of the colonization process. The LAP2alpha-specific isoform of TMPO (LAP2α) has been co-opted for important roles in the cell, whereas the ZNF451 LAP2alpha isoform is evolving under strong purifying selection but remains uncharacterized.


Assuntos
Processamento Alternativo , Proteínas de Ligação a DNA/genética , Mamíferos/genética , Proteínas de Membrana/genética , Retroelementos , Fatores de Transcrição/genética , Aminoaciltransferases , Animais , Evolução Molecular , Éxons , Genoma , Humanos , Isoformas de Proteínas/genética , Vertebrados/genética
19.
PLoS Comput Biol ; 11(6): e1004325, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-26061177

RESUMO

Alternative splicing of messenger RNA can generate a wide variety of mature RNA transcripts, and these transcripts may produce protein isoforms with diverse cellular functions. While there is much supporting evidence for the expression of alternative transcripts, the same is not true for the alternatively spliced protein products. Large-scale mass spectroscopy experiments have identified evidence of alternative splicing at the protein level, but with conflicting results. Here we carried out a rigorous analysis of the peptide evidence from eight large-scale proteomics experiments to assess the scale of alternative splicing that is detectable by high-resolution mass spectroscopy. We find fewer splice events than would be expected: we identified peptides for almost 64% of human protein coding genes, but detected just 282 splice events. This data suggests that most genes have a single dominant isoform at the protein level. Many of the alternative isoforms that we could identify were only subtly different from the main splice isoform. Very few of the splice events identified at the protein level disrupted functional domains, in stark contrast to the two thirds of splice events annotated in the human genome that would lead to the loss or damage of functional domains. The most striking result was that more than 20% of the splice isoforms we identified were generated by substituting one homologous exon for another. This is significantly more than would be expected from the frequency of these events in the genome. These homologous exon substitution events were remarkably conserved--all the homologous exons we identified evolved over 460 million years ago--and eight of the fourteen tissue-specific splice isoforms we identified were generated from homologous exons. The combination of proteomics evidence, ancient origin and tissue-specific splicing indicates that isoforms generated from homologous exons may have important cellular roles.


Assuntos
Processamento Alternativo/genética , Éxons/genética , Isoformas de Proteínas/genética , Sequência de Aminoácidos , Animais , Biologia Computacional , Bases de Dados Genéticas , Humanos , Camundongos , Modelos Moleculares , Dados de Sequência Molecular , Especificidade de Órgãos/genética , Peptídeos/química , Peptídeos/genética , Peptídeos/metabolismo , Conformação Proteica , Isoformas de Proteínas/química , Isoformas de Proteínas/metabolismo , Alinhamento de Sequência , Análise de Sequência de DNA
20.
Biochim Biophys Acta ; 1840(5): 1468-81, 2014 May.
Artigo em Inglês | MEDLINE | ID: mdl-24355433

RESUMO

BACKGROUND: Membrane intrinsic proteins (MIPs) are the proteins in charge of regulating water transport into cells. Because of this essential function, the MIP family is ancient, widespread, and highly diverse. SCOPE OF REVIEW: The rapidly accumulating genomic and transcriptomic data from previously poorly known groups such as unicellular eukaryotes, fungi, green algae, mosses, and non-vertebrate animals are contributing to expand our view of MIP evolution throughout the diversity of life. Here, by analyzing more than 1700 sequences, we provide an updated and comprehensive phylogeny of MIPs MAJOR CONCLUSIONS: The reconstructed phylogeny supports (i) deep orthology of X intrinsic proteins (XIPs; present from unicellular eukaryotes to plants); (ii) that the origin of small intrinsic proteins (SIPs) traces back to the common ancestor of all plants; and (iii) the expansion of aquaglyceroporins (GLPs) in Oomycetes, as well as their loss in vascular plants and in the ancestor of endopterygote insects. Additionally, conserved positions in the protein, and residues involved in glycerol selectivity are reviewed within a phylogenetic framework. Furthermore, functional diversification of human and Arabidopsis paralogs are analyzed in an evolutionary genomic context. GENERAL SIGNIFICANCE: Our results show that while bacteria and archaea generally function with one copy of each a water channel (aquaporin or AQP) and a GLP, recurrent independent expansions have greatly diversified the structures and functions of the different members of both MIP paralog subfamilies throughout eukaryote evolution (and not only in flowering plants and vertebrates, as previously thought). This article is part of a Special Issue entitled Aquaporins.


Assuntos
Evolução Química , Proteínas de Membrana/química , Sequência de Aminoácidos , Animais , Proteínas Arqueais/química , Proteínas de Bactérias/química , Proteínas Fúngicas/química , Proteínas de Membrana/classificação , Dados de Sequência Molecular , Filogenia , Plantas/química , Conformação Proteica , Homologia de Sequência de Aminoácidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA