RESUMO
Triple-negative breast cancers (TNBCs) are a heterogeneous set of tumors defined by an absence of actionable therapeutic targets (ER, PR, and HER-2). Microdissected normal ductal epithelium from healthy volunteers represents a novel comparator to reveal insights into TNBC heterogeneity and to inform drug development. Using RNA-sequencing data from our institution and The Cancer Genome Atlas (TCGA) we compared the transcriptomes of 94 TNBCs, 20 microdissected normal breast tissues from healthy volunteers from the Susan G. Komen for the Cure Tissue Bank, and 10 histologically normal tissues adjacent to tumor. Pathway analysis comparing TNBCs to optimized normal controls of microdissected normal epithelium versus classic controls composed of adjacent normal tissue revealed distinct molecular signatures. Differential gene expression of TNBC compared with normal comparators demonstrated important findings for TNBC-specific clinical trials testing targeted agents; lack of over-expression for negative studies and over-expression in studies with drug activity. Next, by comparing each individual TNBC to the set of microdissected normals, we demonstrate that TNBC heterogeneity is attributable to transcriptional chaos, is associated with non-silent DNA mutational load, and explains transcriptional heterogeneity in addition to known molecular subtypes. Finally, chaos analysis identified 146 core genes dysregulated in >90 % of TNBCs revealing an over-expressed central network. In conclusion, use of microdissected normal ductal epithelium from healthy volunteers enables an optimized approach for studying TNBC and uncovers biological heterogeneity mediated by transcriptional chaos.
Assuntos
Neoplasias de Mama Triplo Negativas/genética , Neoplasias de Mama Triplo Negativas/patologia , Estudos de Casos e Controles , Análise por Conglomerados , Feminino , Proteína Forkhead Box M1 , Fatores de Transcrição Forkhead/metabolismo , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Glândulas Mamárias Humanas/metabolismo , Microdissecção , Mutação , Análise de Sequência de RNA , Transcrição Gênica , Neoplasias de Mama Triplo Negativas/tratamento farmacológicoRESUMO
High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions-particularly those expressed with low abundance- is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (Ribosomal protein S6 kinase beta-1) were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample.
Assuntos
Algoritmos , Fusão Gênica/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Análise de Sequência de RNA/métodos , Software , Sequência de Bases , Dados de Sequência MolecularRESUMO
In the Circulating Cell-free Genome Atlas (NCT02889978) substudy 1, we evaluate several approaches for a circulating cell-free DNA (cfDNA)-based multi-cancer early detection (MCED) test by defining clinical limit of detection (LOD) based on circulating tumor allele fraction (cTAF), enabling performance comparisons. Among 10 machine-learning classifiers trained on the same samples and independently validated, when evaluated at 98% specificity, those using whole-genome (WG) methylation, single nucleotide variants with paired white blood cell background removal, and combined scores from classifiers evaluated in this study show the highest cancer signal detection sensitivities. Compared with clinical stage and tumor type, cTAF is a more significant predictor of classifier performance and may more closely reflect tumor biology. Clinical LODs mirror relative sensitivities for all approaches. The WG methylation feature best predicts cancer signal origin. WG methylation is the most promising technology for MCED and informs development of a targeted methylation MCED test.
Assuntos
Ácidos Nucleicos Livres , Neoplasias , Humanos , Ácidos Nucleicos Livres/genética , Detecção Precoce de Câncer , Neoplasias/diagnóstico , Neoplasias/genética , Biomarcadores Tumorais/genética , Metilação de DNARESUMO
MOTIVATION: Gene duplications and losses (GDLs) are important events in genome evolution. They result in expansion or contraction of gene families, with a likely role in phenotypic evolution. As more genomes become available and their annotations are improved, software programs capable of rapidly and accurately identifying the content of ancestral genomes and the timings of GDLs become necessary to understand the unique evolution of each lineage. RESULTS: We report EvolMAP, a new algorithm and software that utilizes a species tree-based gene clustering method to join all-to-all symmetrical similarity comparisons of multiple gene sets in order to infer the gene composition of multiple ancestral genomes. The algorithm further uses Dollo parsimony-based comparison of the inferred ancestral genes to pinpoint the timings of GDLs onto evolutionary intervals marked by speciation events. Using EvolMAP, first we analyzed the expansion of four families of G-protein coupled receptors (GPCRs) within animal lineages. Additional to demonstrating the unique expansion tree for each family, results also show that the ancestral eumetazoan genome contained many fewer GPCRs than modern animals, and these families expanded through concurrent lineage-specific duplications. Second, we analyzed the history of GDLs in mammalian genomes by comparing seven proteomes. In agreement with previous studies, we report that the mammalian gene family sizes have changed drastically through their evolution. Interestingly, although we identified a potential source of duplication for 75% of the gained genes, remaining 25% did not have clear-cut sources, revealing thousands of genes that have likely gained their distinct sequence identities within the descent of mammals. AVAILABILITY: Query server, source code and executable are available at http://kosik-web.mcdb.ucsb.edu/evolmap/index.htm .
Assuntos
Genoma , Alinhamento de Sequência , Algoritmos , Animais , Evolução Molecular , Duplicação GênicaRESUMO
Horizontal gene transfer (HGT) is common between prokaryotes and phagotrophic eukaryotes. In metazoans, the scale and significance of HGT remains largely unexplored but is usually linked to a close association with parasites and endosymbionts. Marine sponges (Porifera), which host many microorganisms in their tissues and lack an isolated germ line, are potential carriers of genes transferred from prokaryotes. In this study, we identified a number of potential horizontally transferred genes within the genome of the sponge, Amphimedon queenslandica. We further identified homologs of some of these genes in other sponges. The transferred genes, most of which possess catalytic activity for carbohydrate or protein metabolism, have assimilated host genome characteristics and are actively expressed. The diversity of functions contributed by the horizontally transferred genes is likely an important factor in the adaptation and evolution of A. queenslandica. These findings highlight the potential importance of HGT on the success of sponges in diverse ecological niches.
Assuntos
Genoma/genética , Poríferos/genética , Animais , Evolução Molecular , Transferência Genética Horizontal/genética , FilogeniaRESUMO
Triple-negative breast cancer (TNBC) is characterized by the absence of expression of estrogen receptor, progesterone receptor, and HER-2. Thirty percent of patients recur after first-line treatment, and metastatic TNBC (mTNBC) has a poor prognosis with median survival of one year. Here, we present initial analyses of whole genome and transcriptome sequencing data from 14 prospective mTNBC. We have cataloged the collection of somatic genomic alterations in these advanced tumors, particularly those that may inform targeted therapies. Genes mutated in multiple tumors included TP53, LRP1B, HERC1, CDH5, RB1, and NF1. Notable genes involved in focal structural events were CTNNA1, PTEN, FBXW7, BRCA2, WT1, FGFR1, KRAS, HRAS, ARAF, BRAF, and PGCP. Homozygous deletion of CTNNA1 was detected in 2 of 6 African Americans. RNA sequencing revealed consistent overexpression of the FOXM1 gene when tumor gene expression was compared with nonmalignant breast samples. Using an outlier analysis of gene expression comparing one cancer with all the others, we detected expression patterns unique to each patient's tumor. Integrative DNA/RNA analysis provided evidence for deregulation of mutated genes, including the monoallelic expression of TP53 mutations. Finally, molecular alterations in several cancers supported targeted therapeutic intervention on clinical trials with known inhibitors, particularly for alterations in the RAS/RAF/MEK/ERK and PI3K/AKT/mTOR pathways. In conclusion, whole genome and transcriptome profiling of mTNBC have provided insights into somatic events occurring in this difficult to treat cancer. These genomic data have guided patients to investigational treatment trials and provide hypotheses for future trials in this irremediable cancer.
Assuntos
Neoplasias da Mama/genética , Transcriptoma , Adulto , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/metabolismo , Neoplasias da Mama/patologia , Cromossomos Humanos Par 7 , Análise Mutacional de DNA , Feminino , Proteína Forkhead Box M1 , Fatores de Transcrição Forkhead/genética , Fatores de Transcrição Forkhead/metabolismo , Expressão Gênica , Genes Neoplásicos , Genoma Humano , Estudo de Associação Genômica Ampla , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Pessoa de Meia-Idade , Terapia de Alvo Molecular , Metástase Neoplásica , Estudos Prospectivos , Receptor ErbB-2/metabolismo , Receptores de Estrogênio/metabolismo , Receptores de Progesterona/metabolismo , Análise de Sequência de RNA , Deleção de Sequência , Transdução de Sinais , Resultado do Tratamento , Proteína Supressora de Tumor p53/genética , alfa Catenina/genéticaRESUMO
We describe the draft genome of the microcrustacean Daphnia pulex, which is only 200 megabases and contains at least 30,907 genes. The high gene count is a consequence of an elevated rate of gene duplication resulting in tandem gene clusters. More than a third of Daphnia's genes have no detectable homologs in any other available proteome, and the most amplified gene families are specific to the Daphnia lineage. The coexpansion of gene families interacting within metabolic pathways suggests that the maintenance of duplicated genes is not random, and the analysis of gene expression under different environmental conditions reveals that numerous paralogs acquire divergent expression patterns soon after duplication. Daphnia-specific genes, including many additional loci within sequenced regions that are otherwise devoid of annotations, are the most responsive genes to ecological challenges.
Assuntos
Daphnia/genética , Ecossistema , Genoma , Adaptação Fisiológica , Sequência de Aminoácidos , Animais , Sequência de Bases , Mapeamento Cromossômico , Daphnia/fisiologia , Meio Ambiente , Evolução Molecular , Conversão Gênica , Duplicação Gênica , Expressão Gênica , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Genes , Genes Duplicados , Redes e Vias Metabólicas/genética , Anotação de Sequência Molecular , Dados de Sequência Molecular , Família Multigênica , Filogenia , Análise de Sequência de DNARESUMO
Ion channels establish and regulate membrane potentials in excitable and non-excitable cells. How functional diversification of ion channels contributed to the evolution of nervous systems may be understood by studying organisms at key positions in the evolution of animal multicellularity. We have carried out the first analysis of ion channels cloned from a marine sponge, Amphimedon queenslandica. Phylogenetic comparison of sequences encoding for poriferan inward-rectifier K(+) (Kir) channels suggests that Kir channels from sponges, cnidarians and triploblastic metazoans each arose from a single channel and that duplications arose independently in the different groups. In Xenopus oocytes, AmqKirA and AmqKirB produced K(+) currents with strong inward rectification, as seen in the mammalian Kir2 channels, which are found in excitable cells. The pore properties of AmqKir channels demonstrated strong K(+) selectivity and block by Cs(+) and Ba(2+). We present an original analysis of sponge ion channel physiology and an examination of the phylogenetic relationships of this channel with other cloned Kir channels.
Assuntos
Evolução Biológica , Regulação da Expressão Gênica/fisiologia , Poríferos/metabolismo , Canais de Potássio Corretores do Fluxo de Internalização/genética , Canais de Potássio Corretores do Fluxo de Internalização/metabolismo , Sequência de Aminoácidos , Animais , Bário/metabolismo , Sequência de Bases , Venenos de Abelha/farmacologia , Césio/metabolismo , Eletrofisiologia , Transporte de Íons/efeitos dos fármacos , Dados de Sequência Molecular , Poríferos/genética , Potássio/metabolismo , Canais de Potássio Corretores do Fluxo de Internalização/químicaRESUMO
BACKGROUND: The evolution of complex sub-cellular structures such as the synapse requires the assembly of multiple proteins, each conferring added functionality to the integrated structure. Tracking the early evolution of synapses has not been possible without genomic information from the earliest branching animals. As the closest extant relatives to the Eumetazoa, Porifera (sponges) represent a pivotal group for understanding the evolution of nervous systems, because sponges lack neurons with clearly recognizable synapses, in contrast to eumetazoan animals. METHODOLOGY/PRINCIPAL FINDINGS: We show that the genome of the demosponge Amphimedon queenslandica possesses a nearly complete set of post-synaptic protein homologs whose conserved interaction motifs suggest assembly into a complex structure. In the critical synaptic scaffold gene, dlg, residues that make hydrogen bonds and van der Waals interactions with the PDZ ligand are 100% conserved between sponge and human, as is the motif organization of the scaffolds. Expression in Amphimedon of multiple post-synaptic gene homologs in larval flask cells further supports the existence of an assembled structure. Among the few post-synaptic genes absent from Amphimedon, but present in Eumetazoa, are receptor genes including the entire ionotropic glutamate receptor family. CONCLUSIONS/SIGNIFICANCE: Highly conserved protein interaction motifs and co-expression in sponges of multiple proteins whose homologs interact in eumetazoan synapses indicate that a complex protein scaffold was present at the origin of animals, perhaps predating nervous systems. A relatively small number of crucial innovations to this pre-existing structure may represent the founding changes that led to a post-synaptic element.