Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Genome Res ; 27(6): 997-1003, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28298429

RESUMEN

Rapid species radiation due to adaptive changes or occupation of new ecospaces challenges our understanding of ancestral speciation and the relationships of modern species. At the molecular level, rapid radiation with successive speciations over short time periods-too short to fix polymorphic alleles-is described as incomplete lineage sorting. Incomplete lineage sorting leads to random fixation of genetic markers and hence, random signals of relationships in phylogenetic reconstructions. The situation is further complicated when you consider that the genome is a mosaic of ancestral and modern incompletely sorted sequence blocks that leads to reconstructed affiliations to one or the other relative, depending on the fixation of their shared ancestral polymorphic alleles. The laurasiatherian relationships among Chiroptera, Perissodactyla, Cetartiodactyla, and Carnivora present a prime example for such enigmatic affiliations. We performed whole-genome screenings for phylogenetically diagnostic retrotransposon insertions involving the representatives bat (Chiroptera), horse (Perissodactyla), cow (Cetartiodactyla), and dog (Carnivora), and extracted among 162,000 preselected cases 102 virtually homoplasy-free, phylogenetically informative retroelements to draw a complete picture of the highly complex evolutionary relations within Laurasiatheria. All possible evolutionary scenarios received considerable retrotransposon support, leaving us with a network of affiliations. However, the Cetartiodactyla-Carnivora relationship as well as the basal position of Chiroptera and an ancestral laurasiatherian hybridization process did exhibit some very clear, distinct signals. The significant accordance of retrotransposon presence/absence patterns and flanking nucleotide changes suggest an important influence of mosaic genome structures in the reconstruction of species histories.


Asunto(s)
Quirópteros/genética , Especiación Genética , Genoma , Caballos/genética , Filogenia , Retroelementos , Animales , Bovinos , Quirópteros/clasificación , Mapeo Cromosómico , Perros , Marcadores Genéticos , Caballos/clasificación , Hibridación Genética , Mutagénesis Insercional , Análisis de Secuencia de ADN , Programas Informáticos
2.
BMC Bioinformatics ; 19(1): 341, 2018 Sep 26.
Artículo en Inglés | MEDLINE | ID: mdl-30257653

RESUMEN

BACKGROUND: We describe a prototype implementation of a platform that could underlie a Precision Oncology Rapid Learning system. RESULTS: We describe the prototype platform, and examine some important issues and details. In the Appendix we provide a complete walk-through of the prototype platform. CONCLUSIONS: The design choices made in this implementation rest upon ten constitutive hypotheses, which, taken together, define a particular view of how a rapid learning medical platform might be defined, organized, and implemented.


Asunto(s)
Oncología Médica , Medicina de Precisión , Programas Informáticos , Algoritmos , Educación Médica , Humanos , Publicaciones
3.
Proc Natl Acad Sci U S A ; 112(47): E6544-52, 2015 Nov 24.
Artículo en Inglés | MEDLINE | ID: mdl-26460041

RESUMEN

Evidence from numerous cancers suggests that increased aggressiveness is accompanied by up-regulation of signaling pathways and acquisition of properties common to stem cells. It is unclear if different subtypes of late-stage cancer vary in stemness properties and whether or not these subtypes are transcriptionally similar to normal tissue stem cells. We report a gene signature specific for human prostate basal cells that is differentially enriched in various phenotypes of late-stage metastatic prostate cancer. We FACS-purified and transcriptionally profiled basal and luminal epithelial populations from the benign and cancerous regions of primary human prostates. High-throughput RNA sequencing showed the basal population to be defined by genes associated with stem cell signaling programs and invasiveness. Application of a 91-gene basal signature to gene expression datasets from patients with organ-confined or hormone-refractory metastatic prostate cancer revealed that metastatic small cell neuroendocrine carcinoma was molecularly more stem-like than either metastatic adenocarcinoma or organ-confined adenocarcinoma. Bioinformatic analysis of the basal cell and two human small cell gene signatures identified a set of E2F target genes common between prostate small cell neuroendocrine carcinoma and primary prostate basal cells. Taken together, our data suggest that aggressive prostate cancer shares a conserved transcriptional program with normal adult prostate basal stem cells.


Asunto(s)
Perfilación de la Expresión Génica , Neoplasias de la Próstata/genética , Neoplasias de la Próstata/patología , Células Madre/metabolismo , Antígenos CD/metabolismo , Células Epiteliales/metabolismo , Femenino , Regulación Neoplásica de la Expresión Génica , Redes Reguladoras de Genes , Humanos , Masculino , Glándulas Mamarias Humanas/citología , Metástasis de la Neoplasia , Tumores Neuroendocrinos/genética , Tumores Neuroendocrinos/patología , Fenotipo , Proteínas Proto-Oncogénicas c-myc/metabolismo , Análisis de Secuencia de ARN , Transducción de Señal/genética , Factores de Transcripción/metabolismo
4.
PLoS Comput Biol ; 12(3): e1004790, 2016 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-26960204

RESUMEN

We present a novel regularization scheme called The Generalized Elastic Net (GELnet) that incorporates gene pathway information into feature selection. The proposed formulation is applicable to a wide variety of problems in which the interpretation of predictive features using known molecular interactions is desired. The method naturally steers solutions toward sets of mechanistically interlinked genes. Using experiments on synthetic data, we demonstrate that pathway-guided results maintain, and often improve, the accuracy of predictors even in cases where the full gene network is unknown. We apply the method to predict the drug response of breast cancer cell lines. GELnet is able to reveal genetic determinants of sensitivity and resistance for several compounds. In particular, for an EGFR/HER2 inhibitor, it finds a possible trans-differentiation resistance mechanism missed by the corresponding pathway agnostic approach.


Asunto(s)
Mapeo Cromosómico/métodos , Modelos Genéticos , Reconocimiento de Normas Patrones Automatizadas/métodos , Mapeo de Interacción de Proteínas/métodos , Proteoma/genética , Transducción de Señal/genética , Animales , Simulación por Computador , Humanos
5.
Mol Biol Evol ; 32(12): 3194-204, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26337548

RESUMEN

Freed from the competition of large raptors, Paleocene carnivores could expand their newly acquired habitats in search of prey. Such changing conditions might have led to their successful distribution and rapid radiation. Today, molecular evolutionary biologists are faced, however, with the consequences of such accelerated adaptive radiations, because they led to sequential speciation more rapidly than phylogenetic markers could be fixed. The repercussions being that current genealogies based on such markers are incongruent with species trees.Our aim was to explore such conflicting phylogenetic zones of evolution during the early arctoid radiation, especially to distinguish diagnostic from misleading phylogenetic signals, and to examine other carnivore-related speciation events. We applied a combination of high-throughput computational strategies to screen carnivore and related genomes in silico for randomly inserted retroposed elements that we then used to identify inconsistent phylogenetic patterns in the Arctoidea group, which is well known for phylogenetic discordances.Our combined retrophylogenomic and in vitro wet lab approach detected hundreds of carnivore-specific insertions, many of them confirming well-established splits or identifying and solving conflicting species distributions. Our systematic genome-wide screens for Long INterspersed Elements detected homoplasy-free markers with insertion-specific truncation points that we used to distinguish phylogenetically informative markers from conflicting signals. The results were independently confirmed by phylogenetic diagnostic Short INterspersed Elements. As statistical analysis ruled out ancestral hybridization, these doubly verified but still conflicting patterns were statistically determined to be genomic remnants from a time of ancestral incomplete lineage sorting that especially accompanied large parts of Arctoidea evolution.


Asunto(s)
Carnívoros/genética , Animales , Evolución Biológica , Evolución Molecular , Especiación Genética , Genómica , Hibridación Genética , Elementos de Nucleótido Esparcido Largo , Datos de Secuencia Molecular , Filogenia , Elementos de Nucleótido Esparcido Corto
6.
Nucleic Acids Res ; 42(Database issue): D865-72, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24217909

RESUMEN

The Consensus Coding Sequence (CCDS) project (http://www.ncbi.nlm.nih.gov/CCDS/) is a collaborative effort to maintain a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assemblies by the National Center for Biotechnology Information (NCBI) and Ensembl genome annotation pipelines. Identical annotations that pass quality assurance tests are tracked with a stable identifier (CCDS ID). Members of the collaboration, who are from NCBI, the Wellcome Trust Sanger Institute and the University of California Santa Cruz, provide coordinated and continuous review of the dataset to ensure high-quality CCDS representations. We describe here the current status and recent growth in the CCDS dataset, as well as recent changes to the CCDS web and FTP sites. These changes include more explicit reporting about the NCBI and Ensembl annotation releases being compared, new search and display options, the addition of biologically descriptive information and our approach to representing genes for which support evidence is incomplete. We also present a summary of recent and future curation targets.


Asunto(s)
Bases de Datos Genéticas , Proteínas/genética , Animales , Exones , Genómica , Humanos , Internet , Ratones , Anotación de Secuencia Molecular , Análisis de Secuencia
7.
Genome Res ; 19(12): 2324-33, 2009 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-19767417

RESUMEN

Since its start, the Mammalian Gene Collection (MGC) has sought to provide at least one full-protein-coding sequence cDNA clone for every human and mouse gene with a RefSeq transcript, and at least 6200 rat genes. The MGC cloning effort initially relied on random expressed sequence tag screening of cDNA libraries. Here, we summarize our recent progress using directed RT-PCR cloning and DNA synthesis. The MGC now contains clones with the entire protein-coding sequence for 92% of human and 89% of mouse genes with curated RefSeq (NM-accession) transcripts, and for 97% of human and 96% of mouse genes with curated RefSeq transcripts that have one or more PubMed publications, in addition to clones for more than 6300 rat genes. These high-quality MGC clones and their sequences are accessible without restriction to researchers worldwide.


Asunto(s)
Clonación Molecular/métodos , Biología Computacional/métodos , ADN Complementario/genética , Biblioteca de Genes , Genes/genética , Mamíferos/genética , Animales , ADN/biosíntesis , Humanos , Ratones , National Institutes of Health (U.S.) , Ratas , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Estados Unidos
8.
Bioinformatics ; 24(5): 637-44, 2008 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-18218656

RESUMEN

MOTIVATION: Computational annotation of protein coding genes in genomic DNA is a widely used and essential tool for analyzing newly sequenced genomes. However, current methods suffer from inaccuracy and do poorly with certain types of genes. Including additional sources of evidence of the existence and structure of genes can improve the quality of gene predictions. For many eukaryotic genomes, expressed sequence tags (ESTs) are available as evidence for genes. Related genomes that have been sequenced, annotated, and aligned to the target genome provide evidence of existence and structure of genes. RESULTS: We incorporate several different evidence sources into the gene finder AUGUSTUS. The sources of evidence are gene and transcript annotations from related species syntenically mapped to the target genome using TransMap, evolutionary conservation of DNA, mRNA and ESTs of the target species, and retroposed genes. The predictions include alternative splice variants where evidence supports it. Using only ESTs we were able to correctly predict at least one splice form exactly correct in 57% of human genes. Also using evidence from other species and human mRNAs, this number rises to 77%. Syntenic mapping is well-suited to annotate genomes closely related to genomes that are already annotated or for which extensive transcript evidence is available. Native cDNA evidence is most helpful when the alignments are used as compound information rather than independent positionwise information. AVAILABILITY: AUGUSTUS is open source and available at http://augustus.gobics.de. The gene predictions for human can be browsed and downloaded at the UCSC Genome Browser (http://genome.ucsc.edu).


Asunto(s)
ADN Complementario/genética , Alineación de Secuencia , Empalme Alternativo , Animales , Etiquetas de Secuencia Expresada , Humanos
9.
Nucleic Acids Res ; 35(Web Server issue): W152-8, 2007 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-17545196

RESUMEN

GeneHub-GEPIS is a web application that performs digital expression analysis in human and mouse tissues based on an integrated gene database. Using aggregated expressed sequence tag (EST) library information and EST counts, the application calculates the normalized gene expression levels across a large panel of normal and tumor tissues, thus providing rapid expression profiling for a given gene. The backend GeneHub component of the application contains pre-defined gene structures derived from mRNA transcript sequences from major databases and includes extensive cross references for commonly used gene identifiers. ESTs are then linked to genes based on their precise genomic locations as determined by GMAP. This genome-based approach reduces incorrect matches between ESTs and genes, thus minimizing the noise seen with previous tools. In addition, the gene-centric design makes it possible to add several important features, including text searching capabilities, the ability to accept diverse input values, expression analysis for microRNAs, basic gene annotation, batch analysis and linking between mouse and human genes. GeneHub-GEPIS is available at http://www.cgl.ucsf.edu/Research/genentech/genehub-gepis/ or http://www.gepis.org/.


Asunto(s)
Algoritmos , Mapeo Cromosómico/métodos , Perfilación de la Expresión Génica/métodos , Neoplasias/genética , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Interfaz Usuario-Computador , Biomarcadores de Tumor/genética , Etiquetas de Secuencia Expresada , Pruebas Genéticas/métodos , Humanos , Internet , Neoplasias/diagnóstico , Sistemas en Línea , Alineación de Secuencia/métodos
10.
PLoS Genet ; 2(10): e168, 2006 Oct 13.
Artículo en Inglés | MEDLINE | ID: mdl-17040131

RESUMEN

Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202 genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements are dramatically changed in human but not in other primates, with seven times more substitutions in human than in chimp. The accelerated elements, and in particular the top five, show a strong bias for adenine and thymine to guanine and cytosine nucleotide changes and are disproportionately located in high recombination and high guanine and cytosine content environments near telomeres, suggesting either biased gene conversion or isochore selection. In addition, there is some evidence of directional selection in the regions containing the two most accelerated regions. A combination of evolutionary forces has contributed to accelerated evolution of the fastest evolving elements in the human genome.


Asunto(s)
Evolución Molecular , Genoma Humano/genética , Selección Genética , Animales , Emparejamiento Base , Secuencia de Bases , Secuencia Conservada , Humanos , Datos de Secuencia Molecular , Recombinación Genética , Elementos Reguladores de la Transcripción/genética , Análisis de Secuencia de ADN , Especificidad de la Especie
11.
Prostate Cancer Prostatic Dis ; 22(4): 531-538, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-30804427

RESUMEN

BACKGROUND: Metastatic castration resistant prostate cancer (mCRPC) is incurable and progression after drugs that target the androgen receptor-signaling axis is inevitable. Thus, there is an urgent need to develop more effective treatments beyond hormonal manipulation. We sought to identify activated kinases in mCRPC as therapeutic targets for existing, approved agents, with the goal of identifying candidate drugs for rapid translation into proof of concept Phase II trials in mCRPC. METHODS: To identify evidence of activation of druggable kinases in these patients, we compared mRNA expression from metastatic biopsies of patients with mCRPC (n = 101) to mRNA expression in localized prostate from TCGA and used this analysis to infer differential kinase activity. In addition, we assessed the differential phosphorylation levels for key MAPK pathway kinases between mCRPC and localized prostate cancers. RESULTS: Transcriptomic profiling of 101 patients with mCRPC as compared to patients with localized prostate cancer identified evidence of hyperactive ERK1, and whole genome sequencing revealed frequent amplifications of members of the MAPK pathway in 32% of this cohort. Next, we confirmed elevated levels of phosphorylated ERK1/2 in castration resistant prostate cancer as compared to untreated primary prostate cancer. We observed that the presence of detectable phosphorylated ERK1/2 in the primary tumor is associated with biochemical failure after radical prostatectomy independent of clinicopathologic features. ERK1 is the immediate downstream target of MEK1/2, which is druggable with trametinib, an approved therapeutic for melanoma. Trametinib elicited a profound biochemical and clinical response in a patient who had failed multiple prior treatments for mCRPC. CONCLUSIONS: We conclude that pharmacologic targeting of the MEK/ERK pathway may be a viable treatment strategy for patients with refractory metastatic prostate cancer. An ongoing Phase II trial tests this hypothesis.


Asunto(s)
Sistema de Señalización de MAP Quinasas/efectos de los fármacos , Proteína Quinasa 3 Activada por Mitógenos/antagonistas & inhibidores , Neoplasias de la Próstata Resistentes a la Castración/tratamiento farmacológico , Inhibidores de Proteínas Quinasas/uso terapéutico , Anciano , Antineoplásicos , Biopsia , Supervivencia sin Enfermedad , Amplificación de Genes , Regulación Neoplásica de la Expresión Génica , Humanos , Sistema de Señalización de MAP Quinasas/genética , Masculino , Persona de Mediana Edad , Proteína Quinasa 3 Activada por Mitógenos/genética , Proteína Quinasa 3 Activada por Mitógenos/metabolismo , Terapia Molecular Dirigida/métodos , Fosforilación/efectos de los fármacos , Estudios Prospectivos , Próstata/patología , Neoplasias de la Próstata Resistentes a la Castración/mortalidad , Neoplasias de la Próstata Resistentes a la Castración/patología , Inhibidores de Proteínas Quinasas/farmacología , Piridonas/farmacología , Piridonas/uso terapéutico , Pirimidinonas/farmacología , Pirimidinonas/uso terapéutico , RNA-Seq
12.
BMC Genomics ; 9: 466, 2008 Oct 08.
Artículo en Inglés | MEDLINE | ID: mdl-18842134

RESUMEN

BACKGROUND: Evolution via point mutations is a relatively slow process and is unlikely to completely explain the differences between primates and other mammals. By contrast, 45% of the human genome is composed of retroposed elements, many of which were inserted in the primate lineage. A subset of retroposed mRNAs (retrocopies) shows strong evidence of expression in primates, often yielding functional retrogenes. RESULTS: To identify and analyze the relatively recently evolved retrogenes, we carried out BLASTZ alignments of all human mRNAs against the human genome and scored a set of features indicative of retroposition. Of over 12,000 putative retrocopy-derived genes that arose mainly in the primate lineage, 726 with strong evidence of transcript expression were examined in detail. These mRNA retroposition events fall into three categories: I) 34 retrocopies and antisense retrocopies that added potential protein coding space and UTRs to existing genes; II) 682 complete retrocopy duplications inserted into new loci; and III) an unexpected set of 13 retrocopies that contributed out-of-frame, or antisense sequences in combination with other types of transposed elements (SINEs, LINEs, LTRs), even unannotated sequence to form potentially novel genes with no homologs outside primates. In addition to their presence in human, several of the gene candidates also had potentially viable ORFs in chimpanzee, orangutan, and rhesus macaque, underscoring their potential of function. CONCLUSION: mRNA-derived retrocopies provide raw material for the evolution of genes in a wide variety of ways, duplicating and amending the protein coding region of existing genes as well as generating the potential for new protein coding space, or non-protein coding RNAs, by unexpected contributions out of frame, in reverse orientation, or from previously non-protein coding sequence.


Asunto(s)
Evolución Molecular , Genoma Humano , Retroelementos , Animales , Exones , Duplicación de Gen , Humanos , Primates/genética
13.
PLoS Biol ; 3(1): e5, 2005 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-15630477

RESUMEN

We have solved the three-dimensional crystal structure of the stem-loop II motif (s2m) RNA element of the SARS virus genome to 2.7-A resolution. SARS and related coronaviruses and astroviruses all possess a motif at the 3' end of their RNA genomes, called the s2m, whose pathogenic importance is inferred from its rigorous sequence conservation in an otherwise rapidly mutable RNA genome. We find that this extreme conservation is clearly explained by the requirement to form a highly structured RNA whose unique tertiary structure includes a sharp 90 degrees kink of the helix axis and several novel longer-range tertiary interactions. The tertiary base interactions create a tunnel that runs perpendicular to the main helical axis whose interior is negatively charged and binds two magnesium ions. These unusual features likely form interaction surfaces with conserved host cell components or other reactive sites required for virus function. Based on its conservation in viral pathogen genomes and its absence in the human genome, we suggest that these unusual structural features in the s2m RNA element are attractive targets for the design of anti-viral therapeutic agents. Structural genomics has sought to deduce protein function based on three-dimensional homology. Here we have extended this approach to RNA by proposing potential functions for a rigorously conserved set of RNA tertiary structural interactions that occur within the SARS RNA genome itself. Based on tertiary structural comparisons, we propose the s2m RNA binds one or more proteins possessing an oligomer-binding-like fold, and we suggest a possible mechanism for SARS viral RNA hijacking of host protein synthesis, both based upon observed s2m RNA macromolecular mimicry of a relevant ribosomal RNA fold.


Asunto(s)
Genoma Viral , ARN Viral/química , Coronavirus Relacionado al Síndrome Respiratorio Agudo Severo/genética , Sitios de Unión , Secuencia Conservada , Cristalografía por Rayos X , Humanos , Datos de Secuencia Molecular , Mutación , Conformación de Ácido Nucleico , Purinas/química , ARN Mensajero/metabolismo
14.
Nucleic Acids Res ; 34(Database issue): D407-10, 2006 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-16381898

RESUMEN

As more archaeal genomes are sequenced, effective research and analysis tools are needed to integrate the diverse information available for any given locus. The feature-rich UCSC Genome Browser, created originally to annotate the human genome, can be applied to any sequenced organism. We have created a UCSC Archaeal Genome Browser, available at http://archaea.ucsc.edu/, currently with 26 archaeal genomes. It displays G/C content, gene and operon annotation from multiple sources, sequence motifs (promoters and Shine-Dalgarno), microarray data, multi-genome alignments and protein conservation across phylogenetic and habitat categories. We encourage submission of new experimental and bioinformatic analysis from contributors. The purpose of this tool is to aid biological discovery and facilitate greater collaboration within the archaeal research community.


Asunto(s)
Bases de Datos Genéticas , Genoma Arqueal , Proteínas Arqueales/química , Proteínas Arqueales/genética , Proteínas Arqueales/metabolismo , Composición de Base , Perfilación de la Expresión Génica , Genes Arqueales , Genómica , Internet , Operón , Filogenia , Alineación de Secuencia , Interfaz Usuario-Computador
15.
Cancer Res ; 77(21): e111-e114, 2017 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-29092953

RESUMEN

Vast amounts of molecular data are being collected on tumor samples, which provide unique opportunities for discovering trends within and between cancer subtypes. Such cross-cancer analyses require computational methods that enable intuitive and interactive browsing of thousands of samples based on their molecular similarity. We created a portal called TumorMap to assist in exploration and statistical interrogation of high-dimensional complex "omics" data in an interactive and easily interpretable way. In the TumorMap, samples are arranged on a hexagonal grid based on their similarity to one another in the original genomic space and are rendered with Google's Map technology. While the important feature of this public portal is the ability for the users to build maps from their own data, we pre-built genomic maps from several previously published projects. We demonstrate the utility of this portal by presenting results obtained from The Cancer Genome Atlas project data. Cancer Res; 77(21); e111-4. ©2017 AACR.


Asunto(s)
Biología Computacional/métodos , Genómica/métodos , Neoplasias/genética , Programas Informáticos , Mapeo Cromosómico/métodos , Redes Reguladoras de Genes/genética , Predisposición Genética a la Enfermedad/genética , Genoma Humano/genética , Humanos , Mutación , Neoplasias/patología , Reproducibilidad de los Resultados , Interfaz Usuario-Computador
16.
Nat Commun ; 7: 12997, 2016 10 06.
Artículo en Inglés | MEDLINE | ID: mdl-27708261

RESUMEN

Tarsiers are phylogenetically located between the most basal strepsirrhines and the most derived anthropoid primates. While they share morphological features with both groups, they also possess uncommon primate characteristics, rendering their evolutionary history somewhat obscure. To investigate the molecular basis of such attributes, we present here a new genome assembly of the Philippine tarsier (Tarsius syrichta), and provide extended analyses of the genome and detailed history of transposable element insertion events. We describe the silencing of Alu monomers on the lineage leading to anthropoids, and recognize an unexpected abundance of long terminal repeat-derived and LINE1-mobilized transposed elements (Tarsius interspersed elements; TINEs). For the first time in mammals, we identify a complete mitochondrial genome insertion within the nuclear genome, then reveal tarsier-specific, positive gene selection and posit population size changes over time. The genomic resources and analyses presented here will aid efforts to more fully understand the ancient characteristics of primate genomes.


Asunto(s)
Silenciador del Gen , Genoma Mitocondrial , Genoma , Elementos de Nucleótido Esparcido Largo , Tarsiidae/genética , Animales , Encéfalo/metabolismo , Núcleo Celular/metabolismo , Elementos Transponibles de ADN , Femenino , Cadenas de Markov , MicroARNs/metabolismo , Mitocondrias/metabolismo , Músculos/metabolismo , Filogenia , ARN Nucleolar Pequeño/metabolismo
17.
Cancer Cell ; 29(4): 536-547, 2016 Apr 11.
Artículo en Inglés | MEDLINE | ID: mdl-27050099

RESUMEN

MYCN amplification and overexpression are common in neuroendocrine prostate cancer (NEPC). However, the impact of aberrant N-Myc expression in prostate tumorigenesis and the cellular origin of NEPC have not been established. We define N-Myc and activated AKT1 as oncogenic components sufficient to transform human prostate epithelial cells to prostate adenocarcinoma and NEPC with phenotypic and molecular features of aggressive, late-stage human disease. We directly show that prostate adenocarcinoma and NEPC can arise from a common epithelial clone. Further, N-Myc is required for tumor maintenance, and destabilization of N-Myc through Aurora A kinase inhibition reduces tumor burden. Our findings establish N-Myc as a driver of NEPC and a target for therapeutic intervention.


Asunto(s)
Adenocarcinoma/genética , Transformación Celular Neoplásica/genética , Células Epiteliales/patología , Proteínas de Neoplasias/fisiología , Tumores Neuroendocrinos/genética , Neoplasias de la Próstata/genética , Proteínas Proto-Oncogénicas c-myc/fisiología , Adenocarcinoma/patología , Animales , Antineoplásicos/uso terapéutico , Aurora Quinasa A/antagonistas & inhibidores , Aurora Quinasa A/fisiología , Azepinas/uso terapéutico , Línea Celular Tumoral , Activación Enzimática , Células Epiteliales/metabolismo , Exoma , Regulación Neoplásica de la Expresión Génica , Genes myc , Humanos , Captura por Microdisección con Láser , Masculino , Ratones Endogámicos NOD , Ratones SCID , Terapia Molecular Dirigida , Invasividad Neoplásica , Metástasis de la Neoplasia , Proteínas de Neoplasias/genética , Células Madre Neoplásicas/metabolismo , Células Madre Neoplásicas/patología , Tumores Neuroendocrinos/patología , Orquiectomía , Compuestos de Fenilurea/uso terapéutico , Neoplasias de la Próstata/patología , Inhibidores de Proteínas Quinasas/uso terapéutico , Proteínas Proto-Oncogénicas c-akt/fisiología , Pirimidinas/uso terapéutico , Proteínas Recombinantes de Fusión/metabolismo , Transducción Genética , Ensayos Antitumor por Modelo de Xenoinjerto
18.
Genome Res ; 19(5): 868-75, 2009 May.
Artículo en Inglés | MEDLINE | ID: mdl-19261842

RESUMEN

One and a half centuries after Charles Darwin and Alfred Russel Wallace outlined our current understanding of evolution, a new scientific era is dawning that enables direct observations of genetic variation. However, pure sequence-based molecular attempts to resolve the basal origin of placental mammals have so far resulted only in apparently conflicting hypotheses. By contrast, in the mammalian genomes where they were highly active, the insertion of retroelements and their comparative insertion patterns constitute a neutral, virtually homoplasy-free archive of evolutionary histories. The "presence" of a retroelement at an orthologous genomic position in two species indicates their common ancestry in contrast to its "absence" in more distant species. To resolve the placental origin controversy we extracted approximately 2 million potentially phylogenetically informative, retroposon-containing loci from representatives of the major placental mammalian lineages and found highly significant evidence challenging all current single hypotheses of their basal origin. The Exafroplacentalia hypothesis (Afrotheria as the sister group to all remaining placentals) is significantly supported by five retroposon insertions, the Epitheria hypothesis (Xenarthra as the sister group to all remaining placentals) by nine insertion patterns, and the Atlantogenata hypothesis (a monophyletic clade comprising Xenarthra and Afrotheria as the sister group to Boreotheria comprising all remaining placentals) by eight insertion patterns. These findings provide significant support for a "soft" polytomy of the major mammalian clades. Ancestral successive hybridization events and/or incomplete lineage sorting associated with short speciation intervals are viable explanations for the mosaic retroposon insertion patterns of recent placental mammals and for the futile search for a clear root dichotomy.


Asunto(s)
Mamíferos/genética , Retroelementos/genética , Animales , Secuencia de Bases , Evolución Molecular , Femenino , Genoma , Humanos , Mamíferos/clasificación , Datos de Secuencia Molecular , Mutagénesis Insercional , Filogenia , Placenta/metabolismo , Alineación de Secuencia , Xenarthra/clasificación , Xenarthra/genética
19.
Genome Res ; 19(7): 1316-23, 2009 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-19498102

RESUMEN

Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.


Asunto(s)
Secuencia de Consenso , Genoma , Sistemas de Lectura Abierta/genética , Animales , Humanos , Ratones , Alineación de Secuencia
20.
Genome Res ; 17(6): 839-51, 2007 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-17568002

RESUMEN

Arising from either retrotransposition or genomic duplication of functional genes, pseudogenes are "genomic fossils" valuable for exploring the dynamics and evolution of genes and genomes. Pseudogene identification is an important problem in computational genomics, and is also critical for obtaining an accurate picture of a genome's structure and function. However, no consensus computational scheme for defining and detecting pseudogenes has been developed thus far. As part of the ENCyclopedia Of DNA Elements (ENCODE) project, we have compared several distinct pseudogene annotation strategies and found that different approaches and parameters often resulted in rather distinct sets of pseudogenes. We subsequently developed a consensus approach for annotating pseudogenes (derived from protein coding genes) in the ENCODE regions, resulting in 201 pseudogenes, two-thirds of which originated from retrotransposition. A survey of orthologs for these pseudogenes in 28 vertebrate genomes showed that a significant fraction ( approximately 80%) of the processed pseudogenes are primate-specific sequences, highlighting the increasing retrotransposition activity in primates. Analysis of sequence conservation and variation also demonstrated that most pseudogenes evolve neutrally, and processed pseudogenes appear to have lost their coding potential immediately or soon after their emergence. In order to explore the functional implication of pseudogene prevalence, we have extensively examined the transcriptional activity of the ENCODE pseudogenes. We performed systematic series of pseudogene-specific RACE analyses. These, together with complementary evidence derived from tiling microarrays and high throughput sequencing, demonstrated that at least a fifth of the 201 pseudogenes are transcribed in one or more cell lines or tissues.


Asunto(s)
Evolución Molecular , Duplicación de Gen , Seudogenes , Transcripción Genética , Animales , Línea Celular , Humanos , Primates/genética , Retroelementos , Análisis de Secuencia de ADN , Especificidad de la Especie
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA