Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
2.
Nat Commun ; 7: 10507, 2016 Feb 09.
Artículo en Inglés | MEDLINE | ID: mdl-26856261

RESUMEN

Ticks transmit more pathogens to humans and animals than any other arthropod. We describe the 2.1 Gbp nuclear genome of the tick, Ixodes scapularis (Say), which vectors pathogens that cause Lyme disease, human granulocytic anaplasmosis, babesiosis and other diseases. The large genome reflects accumulation of repetitive DNA, new lineages of retro-transposons, and gene architecture patterns resembling ancient metazoans rather than pancrustaceans. Annotation of scaffolds representing ∼57% of the genome, reveals 20,486 protein-coding genes and expansions of gene families associated with tick-host interactions. We report insights from genome analyses into parasitic processes unique to ticks, including host 'questing', prolonged feeding, cuticle synthesis, blood meal concentration, novel methods of haemoglobin digestion, haem detoxification, vitellogenesis and prolonged off-host survival. We identify proteins associated with the agent of human granulocytic anaplasmosis, an emerging disease, and the encephalitis-causing Langat virus, and a population structure correlated to life-history traits and transmission of the Lyme disease agent.


Asunto(s)
Anaplasma phagocytophilum , Vectores Arácnidos/genética , Genoma/genética , Ixodes/genética , Canales Iónicos Activados por Ligandos/genética , Animales , Perfilación de la Expresión Génica , Genómica , Enfermedad de Lyme/transmisión , Oocitos , Xenopus laevis
3.
Genome Biol ; 15(3): R59, 2014 Mar 04.
Artículo en Inglés | MEDLINE | ID: mdl-24647006

RESUMEN

BACKGROUND: The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. RESULTS: We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. CONCLUSIONS: In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied.


Asunto(s)
Mapeo Contig/métodos , Genoma de Planta , Pinus taeda/genética , Análisis de Secuencia de ADN/métodos , ADN de Plantas/genética , Haploidia
4.
Genome Biol ; 14(6): r53, 2013 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-23731509

RESUMEN

BACKGROUND: Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. RESULTS: We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. CONCLUSIONS: We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits.


Asunto(s)
Frutas/genética , Regulación de la Expresión Génica de las Plantas , Genes de Plantas , Genoma de Planta , Carácter Cuantitativo Heredable , Cacao/genética , Cacao/metabolismo , Mapeo Cromosómico , Cromosomas de las Plantas , Color , Frutas/metabolismo , Tamaño del Genoma , Secuenciación de Nucleótidos de Alto Rendimiento , Sitios de Carácter Cuantitativo , ARN Interferente Pequeño/genética , ARN Interferente Pequeño/metabolismo , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Transcripción Genética
5.
Bioessays ; 33(7): 555, 2011 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-21633965
6.
BMC Mol Biol ; 11: 50, 2010 Jun 29.
Artículo en Inglés | MEDLINE | ID: mdl-20587017

RESUMEN

BACKGROUND: The planktonic microcrustacean Daphnia pulex is among the best-studied animals in ecological, toxicological and evolutionary research. One aspect that has sustained interest in the study system is the ability of D. pulex to develop inducible defence structures when exposed to predators, such as the phantom midge larvae Chaoborus. The available draft genome sequence for D. pulex is accelerating research to identify genes that confer plastic phenotypes that are regularly cued by environmental stimuli. Yet for quantifying gene expression levels, no experimentally validated set of internal control genes exists for the accurate normalization of qRT-PCR data. RESULTS: In this study, we tested six candidate reference genes for normalizing transcription levels of D. pulex genes; alpha tubulin (aTub), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), TATA box binding protein (Tbp) syntaxin 16 (Stx16), X-box binding protein 1 (Xbp1) and CAPON, a protein associated with the neuronal nitric oxide synthase, were selected on the basis of an earlier study and from microarray studies. One additional gene, a matrix metalloproteinase (MMP), was tested to validate its transcriptional response to Chaoborus, which was earlier observed in a microarray study. The transcription profiles of these seven genes were assessed by qRT-PCR from RNA of juvenile D. pulex that showed induced defences in comparison to untreated control animals. We tested the individual suitability of genes for expression normalization using the programs geNorm, NormFinder and BestKeeper. Intriguingly, Xbp1, Tbp, CAPON and Stx16 were selected as ideal reference genes. Analyses on the relative expression level using the software REST showed that both classical housekeeping candidate genes (aTub and GAPDH) were significantly downregulated, whereas the MMP gene was shown to be significantly upregulated, as predicted. aTub is a particularly ill suited reference gene because five copies are found in the D. pulex genome sequence. When applying aTub for expression normalization Xbp1 and Tbp are falsely reported as significantly upregulated. CONCLUSIONS: Our results suggest that the genes Xbp1, Tbp, CAPON and Stx16 are suitable reference genes for accurate normalization in qRT-PCR studies using Chaoborus-induced D. pulex specimens. Furthermore, our study underscores the importance of verifying the expression stability of putative reference genes for normalization of expression levels.


Asunto(s)
Daphnia , Perfilación de la Expresión Génica/normas , Regulación de la Expresión Génica , Expresión Génica , Reacción en Cadena de la Polimerasa , Conducta Predatoria/fisiología , Animales , Daphnia/genética , Daphnia/fisiología , Dípteros/fisiología , Femenino , Perfilación de la Expresión Génica/métodos , Genes de Insecto , Proteínas de Insectos/genética , Metaloproteinasas de la Matriz/genética , Reacción en Cadena de la Polimerasa/métodos , Reacción en Cadena de la Polimerasa/normas , ARN/genética , Estándares de Referencia , Tubulina (Proteína)/genética
7.
J Phys Chem A ; 113(39): 10474-87, 2009 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-19736950

RESUMEN

A 2,4,6-trinitrotoluene (TNT) ignition model was developed using data from multiple sources. The one-step, first-order, pressure-dependent mechanism was used to predict ignition behavior from small- and large-scale experiments involving significant fluid motion. Bubbles created from decomposition gases were shown to cause vigorous boiling. The forced mixing caused by these bubbles was not modeled adequately using only free liquid convection. Thorough mixing and ample contact of the reactive species indicated that the TNT decomposition products were in equilibrium. The effect of impurities on the reaction rate was the primary uncertainty in the decomposition model.

8.
Bioinformatics ; 25(17): 2171-3, 2009 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-19578171

RESUMEN

MOTIVATION: Methods to improve tiling array expression signals are needed to accurately detect genome features. Royce et al. provide statistical normalizations of tile signal based on probe sequence content that promises improved accuracy, and should be independently verified. RESULTS: Assessment of the sequence content normalization methods identified a problem: confounding of probe sequence content with gene structure (intron/exon) sequence content. Normalization obscured tile signal changes at gene structure boundaries. This and other evidence suggests that simple sequence normalization does not improve detection of genes from tile expression data.


Asunto(s)
Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Análisis de Secuencia de ADN/métodos , Animales , Composición de Base/genética , Secuencia de Bases , Daphnia/genética , Drosophila melanogaster/genética , Exones/genética , Intrones/genética
9.
Bioinformatics ; 24(6): 744-50, 2008 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-18204064

RESUMEN

MOTIVATION: While it is common to refer to 'the genome sequence' as if it were a single, complete and contiguous DNA string, it is in fact an assembly of millions of small, partially overlapping DNA fragments. Sophisticated computer algorithms (assemblers and scaffolders) merge these DNA fragments into contigs, and place these contigs into sequence scaffolds using the paired-end sequences derived from large-insert DNA libraries. Each step in this automated process is susceptible to producing errors; hence, the resulting draft assembly represents (in practice) only a likely assembly that requires further validation. Knowing which parts of the draft assembly are likely free of errors is critical if researchers are to draw reliable conclusions from the assembled sequence data. RESULTS: We develop a machine-learning method to detect assembly errors in sequence assemblies. Several in silico measures for assembly validation have been proposed by various researchers. Using three benchmarking Drosophila draft genomes, we evaluate these techniques along with some new measures that we propose, including the good-minus-bad coverage (GMB), the good-to-bad-ratio (RGB), the average Z-score (AZ) and the average absolute Z-score (ASZ). Our results show that the GMB measure performs better than the others in both its sensitivity and its specificity for assembly error detection. Nevertheless, no single method performs sufficiently well to reliably detect genomic regions requiring attention for further experimental verification. To utilize the advantages of all these measures, we develop a novel machine learning approach that combines these individual measures to achieve a higher prediction accuracy (i.e. greater than 90%). Our combined evidence approach avoids the difficult and often ad hoc selection of many parameters the individual measures require, and significantly improves the overall precisions on the benchmarking data sets.


Asunto(s)
Algoritmos , Inteligencia Artificial , Mapeo Contig/métodos , Drosophila/genética , Reconocimiento de Normas Patrones Automatizadas/métodos , Análisis de Secuencia de ADN/métodos , Animales , Secuencia de Bases , Datos de Secuencia Molecular , Reproducibilidad de los Resultados , Sensibilidad y Especificidad
10.
Brief Bioinform ; 6(2): 194-8, 2005 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-15975228

RESUMEN

This software review looks at the utility of the Biomolecular Interaction Network Database (BIND) as a web database. BIND offers methods common to related biology databases and specialisations for its protein interaction data. Searching and browsing this database is easy and well integrated with the underlying data and the needs of scientists. Interaction networks are visualised with software that offers many useful options. The innovative ontoglyphs are used throughout to provide visual cues to protein functions, localisation and other aspects one needs to know for this data set. One can expect to get useful results that may be well integrated with one's research needs.


Asunto(s)
Sistemas de Administración de Bases de Datos , Bases de Datos de Proteínas , Almacenamiento y Recuperación de la Información/métodos , Mapeo de Interacción de Proteínas/métodos , Proteínas/química , Proteínas/metabolismo , Programas Informáticos , Interfaz Usuario-Computador , Sitios de Unión , Unión Proteica , Proteínas/clasificación , Proteínas/genética
11.
BMC Bioinformatics ; 6: 45, 2005 Mar 07.
Artículo en Inglés | MEDLINE | ID: mdl-15752432

RESUMEN

BACKGROUND: wFleaBase is a database with the necessary infrastructure to curate, archive and share genetic, molecular and functional genomic data and protocols for an emerging model organism, the microcrustacean Daphnia. Commonly known as the water-flea, Daphnia's ecological merit is unequaled among metazoans, largely because of its sentinel role within freshwater ecosystems and over 200 years of biological investigations. By consequence, the Daphnia Genomics Consortium (DGC) has launched an interdisciplinary research program to create the resources needed to study genes that affect ecological and evolutionary success in natural environments. DISCUSSION: These tools include the genome database wFleaBase, which currently contains functions to search and extract information from expressed sequenced tags, genome survey sequences and full genome sequencing projects. This new database is built primarily from core components of the Generic Model Organism Database project, and related bioinformatics tools. SUMMARY: Over the coming year, preliminary genetic maps and the nearly complete genomic sequence of Daphnia pulex will be integrated into wFleaBase, including gene predictions and ortholog assignments based on sequence similarities with eukaryote genes of known function. wFleaBase aims to serve a large ecological and evolutionary research community. Our challenge is to rapidly expand its content and to ultimately integrate genetic and functional genomic information with population-level responses to environmental challenges. URL: http://wfleabase.org/.


Asunto(s)
Biología Computacional/métodos , Daphnia/genética , Bases de Datos Genéticas , Genoma , Animales , Mapeo Cromosómico , Gráficos por Computador , Sistemas de Administración de Bases de Datos , Bases de Datos Factuales , Ecología , Ecosistema , Evolución Molecular , Genética de Población , Genómica , Servicios de Información , Almacenamiento y Recuperación de la Información , Internet , Programas Informáticos
12.
Brief Bioinform ; 5(3): 300-4, 2004 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-15383216

RESUMEN

This review looks at internet archives, repositories and lists for obtaining popular and useful biology and bioinformatics software. Resources include collections of free software, services for the collaborative development of new programs, software news media and catalogues of links to bioinformatics software and web tools. Problems with such resources arise from needs for continued curator effort to collect and update these, combined with less than optimal community support, funding and collaboration. Despite some problems, the available software repositories provide needed public access to many tools that are a foundation for analyses in bioscience research efforts.


Asunto(s)
Algoritmos , Archivos , Biología Computacional/métodos , Bases de Datos Factuales , Internet , Programas Informáticos
13.
Bioinformatics ; 20(17): 3238-40, 2004 Nov 22.
Artículo en Inglés | MEDLINE | ID: mdl-15059839

RESUMEN

Timely worldwide distribution of biosequence and bioinformatics data depends on high performance networking and advances in Internet transport methods. The Bio-Mirror project focuses on providing up-to-date distribution of this rapidly growing and changing data. It offers FTP, Web and Rsync access to many high-volume databanks from several sites around the world. Experiments with data grids and other methods offer future improvements in biology data distribution.


Asunto(s)
Biología Computacional/métodos , Sistemas de Administración de Bases de Datos , Bases de Datos Genéticas , Documentación/métodos , Difusión de la Información/métodos , Almacenamiento y Recuperación de la Información/métodos , Internet , Biología/métodos , Redes de Comunicación de Computadores , Integración de Sistemas
14.
Brief Bioinform ; 4(3): 292-6, 2003 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-14582522

RESUMEN

Life scientists who work with the supermarket of genome data will find the EnsMart database and software package offers a valuable door to a wealth of genes and genome features. Not only available to lab biologists on the web, this popular multi-organism genome database can be installed and used on your own Unix computer with relative ease. It offers a flexible, fast and practical data-mining framework for computer-savvy biologists and bioinformaticians.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genoma , Programas Informáticos , Animales , Genómica , Humanos , Almacenamiento y Recuperación de la Información/métodos
15.
Brief Bioinform ; 4(2): 192-6, 2003 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-12846399

RESUMEN

For bioscientists studying protein structure and function, the Protein Family Alignment Annotation Tool (Pfaat) is a useful and simple program for annotating collections of proteins. This open-source software includes methods for viewing and aligning protein families, and for annotating sequence structure and residues with known functions. It offers new options to aid the study of proteins, and an extensible annotation tool for bioinformatics developers.


Asunto(s)
Proteínas/clasificación , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Secuencia de Aminoácidos , Bases de Datos de Proteínas , Datos de Secuencia Molecular , Filogenia , Estructura Terciaria de Proteína , Proteínas/genética , Alineación de Secuencia
16.
Curr Protoc Bioinformatics ; Appendix 1: Appendix 1E, 2003 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-18428689

RESUMEN

One of the major challenges in using bioinformatics software is that there are a wide variety of sequence formats, e.g., GenBank, EMBL, and FASTA. It is often the case that a sequence or a set of sequences is in one format but is needed in another. This unit offers a solution to this problem--Readseq. Readseq is a program that can read and write 18 different formats.


Asunto(s)
Sistemas de Administración de Bases de Datos , Bases de Datos Genéticas , Almacenamiento y Recuperación de la Información/métodos , Lenguajes de Programación , Análisis de Secuencia/métodos , Interfaz Usuario-Computador , Internet
17.
Brief Bioinform ; 3(4): 405-9, 2002 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-12511068

RESUMEN

Pise is interface construction software for bioinformatics applications that run by command-line operations. It creates common, easy-to-use interfaces to these applications for the Web, or other uses. It is adaptable to new bioinformatics tools, and offers program chaining, Unix system batch and other controls, making it an attractive method for building and using your own bioinformatics web services.


Asunto(s)
Biología Computacional , Redes de Comunicación de Computadores , Programas Informáticos , Humanos , Lenguajes de Programación , Diseño de Software , Integración de Sistemas , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...