Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Plants (Basel) ; 13(5)2024 Mar 04.
Artigo em Inglês | MEDLINE | ID: mdl-38475562

RESUMO

Microsatellites or SSRs are small tandem repeats that are 1-6 bp long. They are usually highly polymorphic and form important portions of genomes. They have been extensively analyzed in humans, animals and model plants; however, information from non-flowering plants is generally lacking. Here, we examined 29 samples of Ophioglossaceae ferns, mainly from the genera Botrychium and Sceptridium. We analyzed the SSR distribution, density and composition in almost 400 nuclear exons and their flanking regions. We detected 45 SSRs in exons and 1475 SSRs in the flanking regions. In the exons, only di-, tri- and tetranucleotides were found, and all of them were 12 bp long. The annotation of the exons containing SSRs showed that they were related to various processes, such as metabolism, catalysis, transportation or plant growth. The flanking regions contained SSRs from all categories, with the most numerous being dinucleotides, followed by tetranucleotides. More than one-third of all the SSRs in the flanking regions were 12 bp long. The SSR densities in the exons were very low, ranging from 0 to 0.07 SSRs/kb, while those in the flanking regions ranged from 0.24 to 0.81 SSRs/kb; and those in the combined dataset ranged from 0.2 to 0.81 SSRs/kb. The majority of the detected SSRs in the flanking regions were polymorphic and present at the same loci across two or more samples but differing in the number of repeats. The SSRs detected here may serve as a basis for further population genetic, phylogenetic or evolutionary genetic studies, as well as for further studies focusing on SSRs in the genomes and their roles in adaptation, evolution and diseases.

2.
Nucleic Acids Res ; 51(W1): W443-W450, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37194694

RESUMO

PHASTEST (PHAge Search Tool with Enhanced Sequence Translation) is the successor to the PHAST and PHASTER prophage finding web servers. PHASTEST is designed to support the rapid identification, annotation and visualization of prophage sequences within bacterial genomes and plasmids. PHASTEST also supports rapid annotation and interactive visualization of all other genes (protein coding regions, tRNA/tmRNA/rRNA sequences) in bacterial genomes. Given that bacterial genome sequencing has become so routine, the need for fast tools to comprehensively annotate bacterial genomes has become progressively more important. PHASTEST not only offers faster and more accurate prophage annotations than its predecessors, it also provides more complete whole genome annotations and much improved genome visualization capabilities. In standardized tests, we found that PHASTEST is 31% faster and 2-3% more accurate in prophage identification than PHASTER. Specifically, PHASTEST can process a typical bacterial genome in 3.2 min (raw sequence) or in 1.3 min when given a pre-annotated GenBank file. Improvements in PHASTEST's ability to annotate bacterial genomes now make it a particularly powerful tool for whole genome annotation. In addition, PHASTEST now offers a much more modern and responsive visualization interface that allows users to generate, edit, annotate and interactively visualize (via zooming, rotating, dragging, panning, resetting), colourful, publication quality genome maps. PHASTEST continues to offer popular options such as an API for programmatic queries, a Docker image for local installations, support for multiple (metagenomic) queries and the ability to perform automated look-ups against thousands of previously PHAST-annotated bacterial genomes. PHASTEST is available online at https://phastest.ca.


Assuntos
Bases de Dados de Ácidos Nucleicos , Prófagos , Ferramenta de Busca , Software , Genoma Bacteriano , Anotação de Sequência Molecular , Plasmídeos , Prófagos/genética
3.
Nucleic Acids Res ; 51(W1): W484-W492, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37140037

RESUMO

Proksee (https://proksee.ca) provides users with a powerful, easy-to-use, and feature-rich system for assembling, annotating, analysing, and visualizing bacterial genomes. Proksee accepts Illumina sequence reads as compressed FASTQ files or pre-assembled contigs in raw, FASTA, or GenBank format. Alternatively, users can supply a GenBank accession or a previously generated Proksee map in JSON format. Proksee then performs assembly (for raw sequence data), generates a graphical map, and provides an interface for customizing the map and launching further analysis jobs. Notable features of Proksee include unique and informative assembly metrics provided via a custom reference database of assemblies; a deeply integrated high-performance genome browser for viewing and comparing analysis results at individual base resolution (developed specifically for Proksee); an ever-growing list of embedded analysis tools whose results can be seamlessly added to the map or searched and explored in other formats; and the option to export graphical maps, analysis results, and log files for data sharing and research reproducibility. All these features are provided via a carefully designed multi-server cloud-based system that can easily scale to meet user demand and that ensures the web server is robust and responsive.


Assuntos
Genoma Bacteriano , Software , Reprodutibilidade dos Testes , Bases de Dados de Ácidos Nucleicos , Internet
4.
Plants (Basel) ; 12(8)2023 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-37111932

RESUMO

The Pleistocene climatic oscillations (PCO) that provoked several cycles of glacial-interglacial periods are thought to have profoundly affected species distribution, richness and diversity around the world. While the effect of the PCO on population dynamics at temperate latitudes is well known, considerable questions remain about its impact on the biodiversity of neotropical mountains. Here, we use amplified fragment length polymorphism molecular markers (AFLPs) to investigate the phylogeography and genetic structure of 13 plant species belonging to the gentian genus Macrocarpaea (Gentianaceae) in the tropical Andes. These woody herbs, shrubs or small trees show complex and potentially reticulated relationships, including cryptic species. We show that populations of M. xerantifulva in the dry system of the Rio Marañón in northern Peru have lower levels of genetic diversity compared to other sampled species. We suggest that this is due to a recent demographic bottleneck resulting from the contraction of the montane wet forests into refugia because of the expansion of the dry system into the valley during the glacial cycles of the PCO. This may imply that the ecosystems of different valleys of the Andes might have responded differently to the PCO.

5.
Nucleic Acids Res ; 51(W1): W459-W467, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37099365

RESUMO

PlasMapper 3.0 is a web server that allows users to generate, edit, annotate and interactively visualize publication quality plasmid maps. Plasmid maps are used to plan, design, share and publish critical information about gene cloning experiments. PlasMapper 3.0 is the successor to PlasMapper 2.0 and offers many features found only in commercial plasmid mapping/editing packages. PlasMapper 3.0 allows users to paste or upload plasmid sequences as input or to upload existing plasmid maps from its large database of >2000 pre-annotated plasmids (PlasMapDB). This database can be searched by plasmid names, sequence features, restriction sites, preferred host organisms, and sequence length. PlasMapper 3.0 also supports the annotation of new or never-before-seen plasmids using its own feature database that contains common promoters, terminators, regulatory sequences, replication origins, selectable markers and other features found in most cloning plasmids. PlasMapper 3.0 has several interactive sequence editors/viewers that allow users to select and view plasmid regions, insert genes, modify restriction sites or perform codon optimization. The graphics for PlasMapper 3.0 have also been substantially upgraded. It now offers an interactive, full-color plasmid viewer/editor that allows users to zoom, rotate, re-color, linearize, circularize, edit annotated features and modify plasmid images or labels to improve the esthetic qualities of their plasmid map and textual displays. All the plasmid images and textual displays are downloadable in multiple formats. PlasMapper 3.0 is available online at https://plasmapper.ca.


Assuntos
Software , Interface Usuário-Computador , Plasmídeos/genética , Computadores , Sequência de Bases , Internet
6.
Front Plant Sci ; 14: 1294716, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38288414

RESUMO

Previous phylogenies showed conflicting relationships among the subfamilies and genera within the fern family Ophioglossaceae. However, their classification remains unsettled where contrasting classifications recognize four to 15 genera. Since these treatments are mostly based on phylogenetic evidence using limited, plastid-only loci, a phylogenomic understanding is actually necessary to provide conclusive insight into the systematics of the genera. In this study, we have therefore compiled datasets with the broadest sampling of Ophioglossaceae genera to date, including all fifteen currently recognized genera, especially for the first time the South African endemic genus Rhizoglossum. Notably, our comprehensive phylogenomic matrix is based on both plastome and mitogenome genes. Inferred from the coding sequences of 83 plastid and 37 mitochondrial genes, a strongly supported topology for these subfamilies is presented, and is established by analyses using different partitioning approaches and substitution models. At the generic level, most relationships are well resolved except for few within the subfamily Ophioglossoideae. With this new phylogenomic scheme, key morphological and genomic changes were further identified along this backbone. In addition, we confirmed numerous horizontally transferred (HGT) genes in the genera Botrypus, Helminthostachys, Mankyua, Sahashia, and Sceptridium. These HGT genes are most likely located in mitogenomes and are predominately donated from angiosperm Santalales or non-Ophioglossaceae ferns. By our in-depth searches of the organellar genomes, we also provided phylogenetic overviews for the plastid and mitochondrial MORFFO genes found in these Ophioglossaceae ferns.

7.
Biology (Basel) ; 10(9)2021 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-34571702

RESUMO

The evolutionary processes responsible for the extraordinary diversity in the middle elevation montane forests of the Tropical Andes (MMF; 1000-3500 m) remain poorly understood. It is not clear whether adaptive divergence, niche conservatism or geographical processes were the main contributors to the radiation of the respective lineages occurring there. We investigated the evolutionary history of plant lineages in the MMF. We used the vascular plant genus Macrocarpaea (Gentianaceae) as a model, as it consists of 118 morphologically diverse species, a majority of which are endemic to the MMF. We used a time-calibrated molecular phylogeny and morphological and climatic data to compare a set of evolutionary scenarios of various levels of complexity in a phylogenetic comparative framework. In this paper, we show that the hypothesis of adaptive radiation for Macrocarpaea in the MMF is unlikely. The genus remained confined to the upper montane forests (UMF > 1800 m) during more than a half of its evolutionary history, possibly due to evolutionary constraints. Later, coinciding with the beginning of the Pleistocene (around 2.58 Ma), a phylogenetically derived (recently branching) clade, here referred to as the M. micrantha clade (25 species), successfully colonized and radiated in the lower montane forests (LMF < 1800 m). This colonization was accompanied by the evolution of a new leaf phenotype that is unique to the species of the M. micrantha clade that likely represents an adaptation to life in this new environment (adaptive zone). Therefore, our results suggest that niche conservatism and geographical processes have dominated most of the diversification history of Macrocarpaea, but that a rare adaptive divergence event allowed a transition into a new adaptive zone and enabled progressive radiation in this zone through geographical processes.

8.
Sci Rep ; 10(1): 8044, 2020 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-32415111

RESUMO

Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still needed in regard to the trade-off between high-true and low-false positive variant identification rates. Instead of improving single CNV detection methods, variants can be identified in silico with high confidence when multiple methods and datasets are combined. Here, CNV were identified from whole-genome sequences (WGS) and genotype array (GEN) data on 96 Holstein animals. After CNV detection, two sets of high confidence CNV regions (CNVR) were created that contained variants found in both WGS and GEN data following an animal-based (n = 52) and a population-based (n = 36) pipeline. Furthermore, the change in false positive CNV identification rates using different GEN marker densities was evaluated. The population-based approach characterized CNVR, which were more often shared among animals (average 40% more samples per CNVR) and were more often linked to putative functions (48 vs 56% of CNVR) than CNV identified with the animal-based approach. Moreover, false positive identification rates up to 22% were estimated on GEN information. Further research using larger datasets should use a population-wide approach to identify high confidence CNVR.


Assuntos
Variações do Número de Cópias de DNA , Genoma , Genótipo , Sequenciamento Completo do Genoma , Animais , Cruzamento , Bovinos , Mapeamento Cromossômico , Biologia Computacional/métodos , Marcadores Genéticos , Genômica/métodos
9.
Environ Microbiol Rep ; 12(3): 342-354, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32216046

RESUMO

Arbuscular mycorrhizal fungi (AMF) play central roles in terrestrial ecosystems by interacting with both above and belowground communities as well as by influencing edaphic properties. The AMF communities associated with the roots of the fern Botrychium lunaria (Ophioglossaceae) were sampled in four transects at 2400 m a.s.l. in the Swiss Alps and analyzed using metabarcoding. Members of five Glomeromycota genera were identified across the 71 samples. Our analyses revealed the existence of a core microbiome composed of four abundant Glomus operational taxonomic units (OTUs), as well as a low OTU turnover between samples. The AMF communities were not spatially structured, which contrasts with most studies on AMF associated with angiosperms. pH, microbial connectivity and humus cover significantly shaped AMF beta diversity but only explained a minor fraction of variation in beta diversity. AMF OTUs associations were found to be significant by both cohesion and co-occurrence analyses, suggesting a role for fungus-fungus interactions in AMF community assembly. In particular, OTU co-occurrences were more frequent between different genera than among the same genus, rising the hypothesis of functional complementarity among the AMF associated to B. lunaria. Altogether, our results provide new insights into the ecology of fern symbionts in alpine grasslands.


Assuntos
Gleiquênias/microbiologia , Micobioma/genética , Micorrizas/genética , Genes Fúngicos , Glomeromycota/classificação , Glomeromycota/genética , Glomeromycota/isolamento & purificação , Pradaria , Metagenômica , Interações Microbianas , Microbiota , Filogenia , Raízes de Plantas/microbiologia , Microbiologia do Solo , Suíça
10.
Gigascience ; 8(6)2019 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-31241156

RESUMO

BACKGROUND: Copy number variants (CNVs) contribute to genetic diversity and phenotypic variation. We aimed to discover CNVs in taurine cattle using a large collection of whole-genome sequences and to provide an interactive database of the identified CNV regions (CNVRs) that includes visualizations of sequence read alignments, CNV boundaries, and genome annotations. RESULTS: CNVs were identified in each of 4 whole-genome sequencing datasets, which together represent >500 bulls from 17 breeds, using a popular multi-sample read-depth-based algorithm, cn.MOPS. Quality control and CNVR construction, performed dataset-wise to avoid batch effects, resulted in 26,223 CNVRs covering 107.75 unique Mb (4.05%) of the bovine genome. Hierarchical clustering of samples by CNVR genotypes indicated clear separation by breeds. An interactive HTML database was created that allows data filtering options, provides graphical and tabular data summaries including Hardy-Weinberg equilibrium tests on genotype proportions, and displays genes and quantitative trait loci at each CNVR. Notably, the database provides sequence read alignments at each CNVR genotype and the boundaries of constituent CNVs in individual samples. Besides numerous novel discoveries, we corroborated the genotypes reported for a CNVR at the KIT locus known to be associated with the piebald coat colour phenotype in Hereford and some Simmental cattle. CONCLUSIONS: We present a large comprehensive collection of taurine cattle CNVs in a novel interactive visual database that displays CNV boundaries, read depths, and genome features for individual CNVRs, thus providing users with a powerful means to explore and scrutinize CNVRs of interest more thoroughly.


Assuntos
Variações do Número de Cópias de DNA , Visualização de Dados , Bases de Dados Genéticas , Genética Populacional , Animais , Bovinos , Locos de Características Quantitativas , Sequenciamento Completo do Genoma
11.
Brief Bioinform ; 20(4): 1576-1582, 2019 07 19.
Artigo em Inglês | MEDLINE | ID: mdl-28968859

RESUMO

Graphical genome maps are widely used to assess genome features and sequence characteristics. The CGView (Circular Genome Viewer) software family is a popular collection of tools for generating genome maps for bacteria, organelles and viruses. In this review, we describe the capabilities of the original CGView program along with those of subsequent companion applications, including the CGView Server and the CGView Comparison Tool. We also discuss GView, a graphical user interface-enabled rewrite of CGView, and the GView Server, which offers several integrated analyses for identifying shared or unique genome regions relative to a collection of comparison genomes. We conclude with some remarks about our current development efforts related to CGView aimed at adding new functionality while increasing ease of use.


Assuntos
DNA Circular/genética , Genômica/estatística & dados numéricos , Software , Mapeamento Cromossômico , Biologia Computacional , Gráficos por Computador , Escherichia coli O157/genética , Genoma Bacteriano , Genoma Viral , Listeria monocytogenes/genética , Interface Usuário-Computador
12.
Mol Phylogenet Evol ; 120: 342-353, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29242164

RESUMO

Polyploidy is a major speciation process in vascular plants, and is postulated to be particularly important in shaping the diversity of extant ferns. However, limitations in the availability of bi-parental markers for ferns have greatly limited phylogenetic investigation of polyploidy in this group. With a large number of allopolyploid species, the genus Botrychium is a classic example in ferns where recurrent polyploidy is postulated to have driven frequent speciation events. Here, we use PacBio sequencing and the PURC bioinformatics pipeline to capture all homeologous or allelic copies of four long (∼1 kb) low-copy nuclear regions from a sample of 45 specimens (25 diploids and 20 polyploids) representing 37 Botrychium taxa, and three outgroups. This sample includes most currently recognized Botrychium species in Europe and North America, and the majority of our specimens were genotyped with co-dominant nuclear allozymes to ensure species identification. We analyzed the sequence data using maximum likelihood (ML) and Bayesian inference (BI) concatenated-data ("gene tree") approaches to explore the relationships among Botrychium species. Finally, we estimated divergence times among Botrychium lineages and inferred the multi-labeled polyploid species tree showing the origins of the polyploid taxa, and their relationships to each other and to their diploid progenitors. We found strong support for the monophyly of the major lineages within Botrychium and identified most of the parental donors of the polyploids; these results largely corroborate earlier morphological and allozyme-based investigations. Each polyploid had at least two distinct homeologs, indicating that all sampled polyploids are likely allopolyploids (rather than autopolyploids). Our divergence-time analyses revealed that these allopolyploid lineages originated recently-within the last two million years-and thus that the genus has undergone a recent radiation, correlated with multiple independent allopolyploidizations across the phylogeny. Also, we found strong parental biases in the formation of allopolyploids, with individual diploid species participating multiple times as either the maternal or paternal donor (but not both). Finally, we discuss the role of polyploidy in the evolutionary history of Botrychium and the interspecific reproductive barriers possibly involved in these parental biases.


Assuntos
Gleiquênias/classificação , Teorema de Bayes , Núcleo Celular/genética , Biologia Computacional , Criptocromos/química , Criptocromos/classificação , Criptocromos/genética , DNA de Plantas/química , DNA de Plantas/isolamento & purificação , DNA de Plantas/metabolismo , Gleiquênias/genética , Filogenia , Poliploidia , Análise de Sequência de DNA
13.
Anal Chem ; 90(1): 649-656, 2018 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-29035042

RESUMO

NMR is a widely used analytical technique with a growing number of repositories available. As a result, demands for a vendor-agnostic, open data format for long-term archiving of NMR data have emerged with the aim to ease and encourage sharing, comparison, and reuse of NMR data. Here we present nmrML, an open XML-based exchange and storage format for NMR spectral data. The nmrML format is intended to be fully compatible with existing NMR data for chemical, biochemical, and metabolomics experiments. nmrML can capture raw NMR data, spectral data acquisition parameters, and where available spectral metadata, such as chemical structures associated with spectral assignments. The nmrML format is compatible with pure-compound NMR data for reference spectral libraries as well as NMR data from complex biomixtures, i.e., metabolomics experiments. To facilitate format conversions, we provide nmrML converters for Bruker, JEOL and Agilent/Varian vendor formats. In addition, easy-to-use Web-based spectral viewing, processing, and spectral assignment tools that read and write nmrML have been developed. Software libraries and Web services for data validation are available for tool developers and end-users. The nmrML format has already been adopted for capturing and disseminating NMR data for small molecules by several open source data processing tools and metabolomics reference spectral libraries, e.g., serving as storage format for the MetaboLights data repository. The nmrML open access data standard has been endorsed by the Metabolomics Standards Initiative (MSI), and we here encourage user participation and feedback to increase usability and make it a successful standard.


Assuntos
Bases de Dados de Compostos Químicos/normas , Espectroscopia de Ressonância Magnética/estatística & dados numéricos , Metabolômica/métodos , Software
14.
Nucleic Acids Res ; 46(D1): D1074-D1082, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29126136

RESUMO

DrugBank (www.drugbank.ca) is a web-enabled database containing comprehensive molecular information about drugs, their mechanisms, their interactions and their targets. First described in 2006, DrugBank has continued to evolve over the past 12 years in response to marked improvements to web standards and changing needs for drug research and development. This year's update, DrugBank 5.0, represents the most significant upgrade to the database in more than 10 years. In many cases, existing data content has grown by 100% or more over the last update. For instance, the total number of investigational drugs in the database has grown by almost 300%, the number of drug-drug interactions has grown by nearly 600% and the number of SNP-associated drug effects has grown more than 3000%. Significant improvements have been made to the quantity, quality and consistency of drug indications, drug binding data as well as drug-drug and drug-food interactions. A great deal of brand new data have also been added to DrugBank 5.0. This includes information on the influence of hundreds of drugs on metabolite levels (pharmacometabolomics), gene expression levels (pharmacotranscriptomics) and protein expression levels (pharmacoprotoemics). New data have also been added on the status of hundreds of new drug clinical trials and existing drug repurposing trials. Many other important improvements in the content, interface and performance of the DrugBank website have been made and these should greatly enhance its ease of use, utility and potential applications in many areas of pharmacological research, pharmaceutical science and drug education.


Assuntos
Bases de Dados de Produtos Farmacêuticos , Interações Medicamentosas , Interações Alimento-Droga , Metaboloma/efeitos dos fármacos , Polimorfismo de Nucleotídeo Único , Transcriptoma/efeitos dos fármacos , Interface Usuário-Computador
15.
Sci Rep ; 7: 46203, 2017 04 10.
Artigo em Inglês | MEDLINE | ID: mdl-28393889

RESUMO

It has been shown that inter-individual variation in host response to porcine reproductive and respiratory syndrome (PRRS) has a heritable component, yet little is known about the underlying genetic architecture of gene expression in response to PRRS virus (PRRSV) infection. Here, we integrated genome-wide genotype, gene expression, viremia level, and weight gain data to identify genetic polymorphisms that are associated with variation in inter-individual gene expression and response to PRRSV infection in pigs. RNA-seq analysis of peripheral blood samples collected just prior to experimental challenge (day 0) and at 4, 7, 11 and 14 days post infection from 44 pigs revealed 6,430 differentially expressed genes at one or more time points post infection compared to the day 0 baseline. We mapped genetic polymorphisms that were associated with inter-individual differences in expression at each day and found evidence of cis-acting expression quantitative trait loci (cis-eQTL) for 869 expressed genes (qval < 0.05). Associations between cis-eQTL markers and host response phenotypes using 383 pigs suggest that host genotype-dependent differences in expression of GBP5, GBP6, CCHCR1 and CMPK2 affect viremia levels or weight gain in response to PRRSV infection.


Assuntos
Regulação Viral da Expressão Gênica , Interações Hospedeiro-Patógeno/genética , Síndrome Respiratória e Reprodutiva Suína/genética , Síndrome Respiratória e Reprodutiva Suína/virologia , Vírus da Síndrome Respiratória e Reprodutiva Suína/genética , Animais , Perfilação da Expressão Gênica , Estudo de Associação Genômica Ampla , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Suínos , Fatores de Tempo , Transcrição Gênica , Viremia/genética , Viremia/virologia , Aumento de Peso/genética
16.
Nucleic Acids Res ; 44(W1): W147-53, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27190236

RESUMO

Heatmapper is a freely available web server that allows users to interactively visualize their data in the form of heat maps through an easy-to-use graphical interface. Unlike existing non-commercial heat map packages, which either lack graphical interfaces or are specialized for only one or two kinds of heat maps, Heatmapper is a versatile tool that allows users to easily create a wide variety of heat maps for many different data types and applications. More specifically, Heatmapper allows users to generate, cluster and visualize: (i) expression-based heat maps from transcriptomic, proteomic and metabolomic experiments; (ii) pairwise distance maps; (iii) correlation maps; (iv) image overlay heat maps; (v) latitude and longitude heat maps and (vi) geopolitical (choropleth) heat maps. Heatmapper offers a number of simple and intuitive customization options for facile adjustments to each heat map's appearance and plotting parameters. Heatmapper also allows users to interactively explore their numeric data values by hovering their cursor over each heat map cell, or by using a searchable/sortable data table view. Heat map data can be easily uploaded to Heatmapper in text, Excel or tab delimited formatted tables and the resulting heat map images can be easily downloaded in common formats including PNG, JPG and PDF. Heatmapper is designed to appeal to a wide range of users, including molecular biologists, structural biologists, microbiologists, epidemiologists, environmental scientists, agriculture/forestry scientists, fish and wildlife biologists, climatologists, geologists, educators and students. Heatmapper is available at http://www.heatmapper.ca.


Assuntos
Mapeamento Potencial de Superfície Corporal/métodos , Mapeamento Cromossômico/métodos , Mapeamento Geográfico , Mapeamento de Interação de Proteínas/métodos , Termografia/métodos , Interface Usuário-Computador , Animais , Gráficos por Computador , Redes Reguladoras de Genes , Humanos , Armazenamento e Recuperação da Informação , Internet , Metaboloma , Proteoma , Transcriptoma
17.
Nucleic Acids Res ; 44(W1): W16-21, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27141966

RESUMO

PHASTER (PHAge Search Tool - Enhanced Release) is a significant upgrade to the popular PHAST web server for the rapid identification and annotation of prophage sequences within bacterial genomes and plasmids. Although the steps in the phage identification pipeline in PHASTER remain largely the same as in the original PHAST, numerous software improvements and significant hardware enhancements have now made PHASTER faster, more efficient, more visually appealing and much more user friendly. In particular, PHASTER is now 4.3× faster than PHAST when analyzing a typical bacterial genome. More specifically, software optimizations have made the backend of PHASTER 2.7X faster than PHAST, while the addition of 80 CPUs to the PHASTER compute cluster are responsible for the remaining speed-up. PHASTER can now process a typical bacterial genome in 3 min from the raw sequence alone, or in 1.5 min when given a pre-annotated GenBank file. A number of other optimizations have also been implemented, including automated algorithms to reduce the size and redundancy of PHASTER's databases, improvements in handling multiple (metagenomic) queries and higher user traffic, along with the ability to perform automated look-ups against 14 000 previously PHAST/PHASTER annotated bacterial genomes (which can lead to complete phage annotations in seconds as opposed to minutes). PHASTER's web interface has also been entirely rewritten. A new graphical genome browser has been added, gene/genome visualization tools have been improved, and the graphical interface is now more modern, robust and user-friendly. PHASTER is available online at www.phaster.ca.


Assuntos
Bactérias/genética , Bacteriófagos/genética , DNA Viral/genética , Genoma Bacteriano , Software , Algoritmos , Bactérias/virologia , Gráficos por Computador , Bases de Dados Genéticas , Ontologia Genética , Anotação de Sequência Molecular , Plasmídeos/química , Plasmídeos/metabolismo , Ferramenta de Busca , Fatores de Tempo
18.
Nucleic Acids Res ; 44(D1): D495-501, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26481353

RESUMO

ECMDB or the Escherichia coli Metabolome Database (http://www.ecmdb.ca) is a comprehensive database containing detailed information about the genome and metabolome of E. coli (K-12). First released in 2012, the ECMDB has undergone substantial expansion and many modifications over the past 4 years. This manuscript describes the most recent version of ECMDB (ECMDB 2.0). In particular, it provides a comprehensive update of the database that was previously described in the 2013 NAR Database Issue and details many of the additions and improvements made to the ECMDB over that time. Some of the most important or significant enhancements include a 13-fold increase in the number of metabolic pathway diagrams (from 125 to 1650), a 3-fold increase in the number of compounds linked to pathways (from 1058 to 3280), the addition of dozens of operon/metabolite signalling pathways, a 44% increase in the number of compounds in the database (from 2610 to 3760), a 7-fold increase in the number of compounds with NMR or MS spectra (from 412 to 3261) and a massive increase in the number of external links to other E. coli or chemical resources. These additions, along with many other enhancements aimed at improving the ease or speed of querying, searching and viewing the data within ECMDB should greatly facilitate the understanding of not only the metabolism of E. coli, but also allow the in-depth exploration of its extensive metabolic networks, its many signalling pathways and its essential biochemistry.


Assuntos
Bases de Dados de Compostos Químicos , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Genoma Bacteriano , Metaboloma , Escherichia coli K12/química , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Redes e Vias Metabólicas
20.
PLoS One ; 10(5): e0124219, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26017271

RESUMO

Many diseases cause significant changes to the concentrations of small molecules (a.k.a. metabolites) that appear in a person's biofluids, which means such diseases can often be readily detected from a person's "metabolic profile"-i.e., the list of concentrations of those metabolites. This information can be extracted from a biofluids Nuclear Magnetic Resonance (NMR) spectrum. However, due to its complexity, NMR spectral profiling has remained manual, resulting in slow, expensive and error-prone procedures that have hindered clinical and industrial adoption of metabolomics via NMR. This paper presents a system, BAYESIL, which can quickly, accurately, and autonomously produce a person's metabolic profile. Given a 1D 1H NMR spectrum of a complex biofluid (specifically serum or cerebrospinal fluid), BAYESIL can automatically determine the metabolic profile. This requires first performing several spectral processing steps, then matching the resulting spectrum against a reference compound library, which contains the "signatures" of each relevant metabolite. BAYESIL views spectral matching as an inference problem within a probabilistic graphical model that rapidly approximates the most probable metabolic profile. Our extensive studies on a diverse set of complex mixtures including real biological samples (serum and CSF), defined mixtures and realistic computer generated spectra; involving > 50 compounds, show that BAYESIL can autonomously find the concentration of NMR-detectable metabolites accurately (~ 90% correct identification and ~ 10% quantification error), in less than 5 minutes on a single CPU. These results demonstrate that BAYESIL is the first fully-automatic publicly-accessible system that provides quantitative NMR spectral profiling effectively-with an accuracy on these biofluids that meets or exceeds the performance of trained experts. We anticipate this tool will usher in high-throughput metabolomics and enable a wealth of new applications of NMR in clinical settings. BAYESIL is accessible at http://www.bayesil.ca.


Assuntos
Imageamento por Ressonância Magnética , Metabolômica/métodos , Algoritmos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA