Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
2.
J Comput Biol ; 18(3): 469-81, 2011 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-21385048

RESUMEN

We introduce a data structure, analysis, and visualization scheme called a cactus graph for comparing sets of related genomes. In common with multi-break point graphs and A-Bruijn graphs, cactus graphs can represent duplications and general genomic rearrangements, but additionally, they naturally decompose the common substructures in a set of related genomes into a hierarchy of chains that can be visualized as two-dimensional multiple alignments and nets that can be visualized in circular genome plots. Supplementary Material is available at www.liebertonline.com/cmb .


Asunto(s)
Gráficos por Computador , Genoma , Genómica/métodos , Alineación de Secuencia/métodos , Algoritmos , Animales , Secuencia de Bases , ADN/genética , Evolución Molecular , Humanos , Datos de Secuencia Molecular
3.
Nature ; 469(7331): 529-33, 2011 Jan 27.
Artículo en Inglés | MEDLINE | ID: mdl-21270892

RESUMEN

'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.


Asunto(s)
Variación Genética , Genoma/genética , Pongo abelii/genética , Pongo pygmaeus/genética , Animales , Centrómero/genética , Cerebrósidos/metabolismo , Cromosomas , Evolución Molecular , Femenino , Reordenamiento Génico/genética , Especiación Genética , Genética de Población , Humanos , Masculino , Filogenia , Densidad de Población , Dinámica Poblacional , Especificidad de la Especie
4.
Nucleic Acids Res ; 39(Database issue): D871-5, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21037257

RESUMEN

The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Regulación de la Expresión Génica , Genómica , Humanos , Internet , Programas Informáticos , Interfaz Usuario-Computador
5.
Proc Natl Acad Sci U S A ; 105(38): 14254-61, 2008 Sep 23.
Artículo en Inglés | MEDLINE | ID: mdl-18787111

RESUMEN

We formalize the problem of recovering the evolutionary history of a set of genomes that are related to an unseen common ancestor genome by operations of speciation, deletion, insertion, duplication, and rearrangement of segments of bases. The problem is examined in the limit as the number of bases in each genome goes to infinity. In this limit, the chromosomes are represented by continuous circles or line segments. For such an infinite-sites model, we present a polynomial-time algorithm to find the most parsimonious evolutionary history of any set of related present-day genomes.


Asunto(s)
Evolución Molecular , Genoma , Modelos Genéticos , Algoritmos , Animales , Simulación por Computador , Humanos , Ratones , Mutación/genética , Cromosoma X
6.
J Comput Biol ; 15(8): 1007-27, 2008 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-18774902

RESUMEN

Accurately reconstructing the large-scale gene order in an ancestral genome is a critical step to better understand genome evolution. In this paper, we propose a heuristic algorithm, called DUPCAR, for reconstructing ancestral genomic orders with duplications. The method starts from the order of genes in modern genomes and predicts predecessor and successor relationships in the ancestor. Then a greedy algorithm is used to reconstruct the ancestral orders by connecting genes into contiguous regions based on predicted adjacencies. Computer simulation was used to validate the algorithm. We also applied the method to reconstruct the ancestral chromosome X of placental mammals and the ancestral genomes of the ciliate Paramecium tetraurelia.


Asunto(s)
Algoritmos , Duplicación de Gen , Genoma , Modelos Genéticos , Animales , Simulación por Computador , Evolución Molecular , Humanos , Paramecium tetraurelia/genética , Filogenia
7.
Genome Res ; 16(12): 1557-65, 2006 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-16983148

RESUMEN

This article analyzes mammalian genome rearrangements at higher resolution than has been published to date. We identify 3171 intervals, covering approximately 92% of the human genome, within which we find no rearrangements larger than 50 kilobases (kb) in the lineages leading to human, mouse, rat, and dog from their most recent common ancestor. Combining intervals that are adjacent in all contemporary species produces 1338 segments that may contain large insertions or deletions but that are free of chromosome fissions or fusions as well as inversions or translocations >50 kb in length. We describe a new method for predicting the ancestral order and orientation of those intervals from their observed adjacencies in modern species. We combine the results from this method with data from chromosome painting experiments to produce a map of an early mammalian genome that accounts for 96.8% of the available human genome sequence data. The precision is further increased by mapping inversions as small as 31 bp. Analysis of the predicted evolutionary breakpoints in the human lineage confirms certain published observations but disagrees with others. Although only a few mammalian genomes are currently sequenced to high precision, our theoretical analyses and computer simulations indicate that our results are reasonably accurate and that they will become highly accurate in the foreseeable future. Our methods were developed as part of a project to reconstruct the genome sequence of the last ancestor of human, dogs, and most other placental mammals.


Asunto(s)
Evolución Molecular , Genoma Humano , Genoma , Algoritmos , Animales , Composición de Base , Emparejamiento Base , Rotura Cromosómica , Inversión Cromosómica , Mapeo Cromosómico , Pintura Cromosómica , Cromosomas , Simulación por Computador , Perros , Eliminación de Gen , Reordenamiento Génico , Humanos , Ratones , Modelos Genéticos , Ratas , Alineación de Secuencia/métodos , Homología de Secuencia de Ácido Nucleico
8.
Science ; 309(5731): 134-7, 2005 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-15994558

RESUMEN

We report the genome sequence of Theileria parva, an apicomplexan pathogen causing economic losses to smallholder farmers in Africa. The parasite chromosomes exhibit limited conservation of gene synteny with Plasmodium falciparum, and its plastid-like genome represents the first example where all apicoplast genes are encoded on one DNA strand. We tentatively identify proteins that facilitate parasite segregation during host cell cytokinesis and contribute to persistent infection of transformed host cells. Several biosynthetic pathways are incomplete or absent, suggesting substantial metabolic dependence on the host cell. One protein family that may generate parasite antigenic diversity is not telomere-associated.


Asunto(s)
Genoma de Protozoos , Linfocitos/parasitología , Proteínas Protozoarias/genética , Theileria parva/genética , Algoritmos , Animales , Antígenos de Protozoos/genética , Bovinos , Proliferación Celular , Cromosomas/genética , Secuencia Conservada , Enzimas/genética , Enzimas/metabolismo , Genes Protozoarios , Linfocitos/citología , Mitocondrias/metabolismo , Datos de Secuencia Molecular , Orgánulos/genética , Orgánulos/fisiología , Plasmodium falciparum/genética , Estructura Terciaria de Proteína , Proteínas Protozoarias/química , Proteínas Protozoarias/metabolismo , Análisis de Secuencia de ADN , Sintenía , Telómero/genética , Theileria parva/crecimiento & desarrollo , Theileria parva/patogenicidad , Theileria parva/fisiología
9.
Nature ; 433(7028): 865-8, 2005 Feb 24.
Artículo en Inglés | MEDLINE | ID: mdl-15729342

RESUMEN

Entamoeba histolytica is an intestinal parasite and the causative agent of amoebiasis, which is a significant source of morbidity and mortality in developing countries. Here we present the genome of E. histolytica, which reveals a variety of metabolic adaptations shared with two other amitochondrial protist pathogens: Giardia lamblia and Trichomonas vaginalis. These adaptations include reduction or elimination of most mitochondrial metabolic pathways and the use of oxidative stress enzymes generally associated with anaerobic prokaryotes. Phylogenomic analysis identifies evidence for lateral gene transfer of bacterial genes into the E. histolytica genome, the effects of which centre on expanding aspects of E. histolytica's metabolic repertoire. The presence of these genes and the potential for novel metabolic pathways in E. histolytica may allow for the development of new chemotherapeutic agents. The genome encodes a large number of novel receptor kinases and contains expansions of a variety of gene families, including those associated with virulence. Additional genome features include an abundance of tandemly repeated transfer-RNA-containing arrays, which may have a structural function in the genome. Analysis of the genome provides new insights into the workings and genome evolution of a major human pathogen.


Asunto(s)
Entamoeba histolytica/genética , Genoma de Protozoos , Parásitos/genética , Animales , Entamoeba histolytica/metabolismo , Entamoeba histolytica/patogenicidad , Evolución Molecular , Fermentación , Transferencia de Gen Horizontal/genética , Glucólisis , Estrés Oxidativo/genética , Parásitos/metabolismo , Parásitos/patogenicidad , Filogenia , Transducción de Señal , Virulencia/genética
10.
Science ; 307(5713): 1321-4, 2005 Feb 25.
Artículo en Inglés | MEDLINE | ID: mdl-15653466

RESUMEN

Cryptococcus neoformans is a basidiomycetous yeast ubiquitous in the environment, a model for fungal pathogenesis, and an opportunistic human pathogen of global importance. We have sequenced its approximately 20-megabase genome, which contains approximately 6500 intron-rich gene structures and encodes a transcriptome abundant in alternatively spliced and antisense messages. The genome is rich in transposons, many of which cluster at candidate centromeric regions. The presence of these transposons may drive karyotype instability and phenotypic variation. C. neoformans encodes unique genes that may contribute to its unusual virulence properties, and comparison of two phenotypically distinct strains reveals variation in gene content in addition to sequence polymorphisms between the genomes.


Asunto(s)
Cryptococcus neoformans/genética , Genoma Fúngico , Empalme Alternativo , Pared Celular/metabolismo , Cromosomas Fúngicos/genética , Biología Computacional , Cryptococcus neoformans/patogenicidad , Cryptococcus neoformans/fisiología , Elementos Transponibles de ADN , Proteínas Fúngicas/metabolismo , Biblioteca de Genes , Genes Fúngicos , Humanos , Intrones , Datos de Secuencia Molecular , Fenotipo , Polimorfismo Genético , Polimorfismo de Nucleótido Simple , Polisacáridos/metabolismo , ARN sin Sentido , Análisis de Secuencia de ADN , Transcripción Genética , Virulencia , Factores de Virulencia/metabolismo
11.
Nucleic Acids Res ; 31(16): 4856-63, 2003 Aug 15.
Artículo en Inglés | MEDLINE | ID: mdl-12907728

RESUMEN

We report here the sequence of chromosome II from Trypanosoma brucei, the causative agent of African sleeping sickness. The 1.2-Mb pairs encode about 470 predicted genes organised in 17 directional clusters on either strand, the largest cluster of which has 92 genes lined up over a 284-kb region. An analysis of the GC skew reveals strand compositional asymmetries that coincide with the distribution of protein-coding genes, suggesting these asymmetries may be the result of transcription-coupled repair on coding versus non-coding strand. A 5-cM genetic map of the chromosome reveals recombinational 'hot' and 'cold' regions, the latter of which is predicted to include the putative centromere. One end of the chromosome consists of a 250-kb region almost exclusively composed of RHS (pseudo)genes that belong to a newly characterised multigene family containing a hot spot of insertion for retroelements. Interspersed with the RHS genes are a few copies of truncated RNA polymerase pseudogenes as well as expression site associated (pseudo)genes (ESAGs) 3 and 4, and 76 bp repeats. These features are reminiscent of a vestigial variant surface glycoprotein (VSG) gene expression site. The other end of the chromosome contains a 30-kb array of VSG genes, the majority of which are pseudogenes, suggesting that this region may be a site for modular de novo construction of VSG gene diversity during transposition/gene conversion events.


Asunto(s)
Cromosomas/genética , ADN Protozoario/genética , Trypanosoma brucei brucei/genética , Animales , Antígenos de Protozoos/genética , Mapeo Cromosómico , ADN Protozoario/química , Duplicación de Gen , Genes Protozoarios/genética , Datos de Secuencia Molecular , Seudogenes/genética , Recombinación Genética , Análisis de Secuencia de ADN
12.
Proc Natl Acad Sci U S A ; 100(14): 8502-7, 2003 Jul 08.
Artículo en Inglés | MEDLINE | ID: mdl-12799466

RESUMEN

The study of genetic variation in malaria parasites has practical significance for developing strategies to control the disease. Vaccines based on highly polymorphic antigens may be confounded by allelic restriction of the host immune response. In response to drug pressure, a highly plastic genome may generate resistant mutants more easily than a monomorphic one. Additionally, the study of the distribution of genomic polymorphisms may provide information leading to the identification of genes associated with traits such as parasite development and drug resistance. Indeed, the age and diversity of the human malaria parasite Plasmodium falciparum has been the subject of recent debate, because an ancient parasite with a complex genome is expected to present greater challenges for drug and vaccine development. The genome diversity of the important human pathogen Plasmodium vivax, however, remains essentially unknown. Here we analyze an approximately 100-kb contiguous chromosome segment from five isolates, revealing 191 single-nucleotide polymorphisms (SNPs) and 44 size polymorphisms. The SNPs are not evenly distributed across the segment with blocks of high and low diversity. Whereas the majority (approximately 63%) of the SNPs are in intergenic regions, introns contain significantly less SNPs than intergenic sequences. Polymorphic tandem repeats are abundant and are more uniformly distributed at a frequency of about one polymorphic tandem repeat per 3 kb. These data show that P. vivax has a highly diverse genome, and provide useful information for further understanding the genome diversity of the parasite.


Asunto(s)
Genes Protozoarios , Genoma de Protozoos , Plasmodium vivax/genética , Polimorfismo de Nucleótido Simple , Animales , Mapeo Cromosómico , ADN Protozoario/genética , Variación Genética , Haplotipos/genética , Intrones/genética , Datos de Secuencia Molecular , Plasmodium falciparum/genética , Reacción en Cadena de la Polimerasa , Proteínas Protozoarias/genética , Alineación de Secuencia , Análisis de Secuencia de ADN , Especificidad de la Especie , Secuencias Repetidas en Tándem
13.
Nucleic Acids Res ; 31(1): 229-33, 2003 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-12519988

RESUMEN

Rice is not only a major food staple for the world's population but it also is a model species for a major group of flowering plants, the monocotyledonous plants. Draft genomic sequence of two subspecies of rice, Oryza sativa spp. japonica and indica ssp. are publicly available. To provide the community with a resource to data-mine the rice genome, we have constructed an annotation resource for rice (http://www.tigr.org/tdb/e2k1/osa1/). In this resource, we have annotated the rice genome for gene content, identified motifs/domains within the predicted genes, constructed a rice repeat database, identified related sequences in other plant species, and identified syntenic sequences between rice and maize. All of the data is available through web-based interfaces, FTP downloads, and a Distributed Annotation System.


Asunto(s)
Bases de Datos Genéticas , Genoma de Planta , Oryza/genética , Cromosomas Artificiales , Cromosomas de las Plantas , Biología Computacional , Proteínas de Plantas/química , Plantas/genética , Secuencias Repetitivas de Ácidos Nucleicos , Alineación de Secuencia , Homología de Secuencia , Sintenía , Zea mays/genética
14.
Nature ; 419(6906): 512-9, 2002 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-12368865

RESUMEN

Species of malaria parasite that infect rodents have long been used as models for malaria disease research. Here we report the whole-genome shotgun sequence of one species, Plasmodium yoelii yoelii, and comparative studies with the genome of the human malaria parasite Plasmodium falciparum clone 3D7. A synteny map of 2,212 P. y. yoelii contiguous DNA sequences (contigs) aligned to 14 P. falciparum chromosomes reveals marked conservation of gene synteny within the body of each chromosome. Of about 5,300 P. falciparum genes, more than 3,300 P. y. yoelii orthologues of predominantly metabolic function were identified. Over 800 copies of a variant antigen gene located in subtelomeric regions were found. This is the first genome sequence of a model eukaryotic parasite, and it provides insight into the use of such systems in the modelling of Plasmodium biology and disease.


Asunto(s)
Genoma de Protozoos , Plasmodium yoelii/genética , Animales , ADN Protozoario , Modelos Animales de Enfermedad , Humanos , Malaria/parasitología , Familia de Multigenes , Plasmodium falciparum/genética , Recombinación Genética , Roedores , Alineación de Secuencia , Análisis de Secuencia de ADN , Especificidad de la Especie , Sintenía , Telómero
15.
Nature ; 419(6906): 531-4, 2002 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-12368868

RESUMEN

The mosquito-borne malaria parasite Plasmodium falciparum kills an estimated 0.7-2.7 million people every year, primarily children in sub-Saharan Africa. Without effective interventions, a variety of factors-including the spread of parasites resistant to antimalarial drugs and the increasing insecticide resistance of mosquitoes-may cause the number of malaria cases to double over the next two decades. To stimulate basic research and facilitate the development of new drugs and vaccines, the genome of Plasmodium falciparum clone 3D7 has been sequenced using a chromosome-by-chromosome shotgun strategy. We report here the nucleotide sequences of chromosomes 10, 11 and 14, and a re-analysis of the chromosome 2 sequence. These chromosomes represent about 35% of the 23-megabase P. falciparum genome.


Asunto(s)
ADN Protozoario , Plasmodium falciparum/genética , Animales , Cromosomas , Genoma de Protozoos , Proteoma , Proteínas Protozoarias/genética , Análisis de Secuencia de ADN
16.
Nature ; 419(6906): 498-511, 2002 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-12368864

RESUMEN

The parasite Plasmodium falciparum is responsible for hundreds of millions of cases of malaria, and kills more than one million African children annually. Here we report an analysis of the genome sequence of P. falciparum clone 3D7. The 23-megabase nuclear genome consists of 14 chromosomes, encodes about 5,300 genes, and is the most (A + T)-rich genome sequenced to date. Genes involved in antigenic variation are concentrated in the subtelomeric regions of the chromosomes. Compared to the genomes of free-living eukaryotic microbes, the genome of this intracellular parasite encodes fewer enzymes and transporters, but a large proportion of genes are devoted to immune evasion and host-parasite interactions. Many nuclear-encoded proteins are targeted to the apicoplast, an organelle involved in fatty-acid and isoprenoid metabolism. The genome sequence provides the foundation for future studies of this organism, and is being exploited in the search for new drugs and vaccines to fight malaria.


Asunto(s)
Genoma de Protozoos , Plasmodium falciparum/genética , Animales , Estructuras Cromosómicas , Reparación del ADN , Replicación del ADN , ADN Protozoario/biosíntesis , ADN Protozoario/genética , Evolución Molecular , Humanos , Vacunas contra la Malaria , Malaria Falciparum/inmunología , Malaria Falciparum/parasitología , Malaria Falciparum/prevención & control , Proteínas de Transporte de Membrana/genética , Proteínas de Transporte de Membrana/metabolismo , Datos de Secuencia Molecular , Plasmodium falciparum/inmunología , Plasmodium falciparum/metabolismo , Plastidios/genética , Proteoma , Proteínas Protozoarias/genética , Proteínas Protozoarias/metabolismo , Proteínas Protozoarias/fisiología , Recombinación Genética , Análisis de Secuencia de ADN/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...