Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 51(W1): W443-W450, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37194694

RESUMO

PHASTEST (PHAge Search Tool with Enhanced Sequence Translation) is the successor to the PHAST and PHASTER prophage finding web servers. PHASTEST is designed to support the rapid identification, annotation and visualization of prophage sequences within bacterial genomes and plasmids. PHASTEST also supports rapid annotation and interactive visualization of all other genes (protein coding regions, tRNA/tmRNA/rRNA sequences) in bacterial genomes. Given that bacterial genome sequencing has become so routine, the need for fast tools to comprehensively annotate bacterial genomes has become progressively more important. PHASTEST not only offers faster and more accurate prophage annotations than its predecessors, it also provides more complete whole genome annotations and much improved genome visualization capabilities. In standardized tests, we found that PHASTEST is 31% faster and 2-3% more accurate in prophage identification than PHASTER. Specifically, PHASTEST can process a typical bacterial genome in 3.2 min (raw sequence) or in 1.3 min when given a pre-annotated GenBank file. Improvements in PHASTEST's ability to annotate bacterial genomes now make it a particularly powerful tool for whole genome annotation. In addition, PHASTEST now offers a much more modern and responsive visualization interface that allows users to generate, edit, annotate and interactively visualize (via zooming, rotating, dragging, panning, resetting), colourful, publication quality genome maps. PHASTEST continues to offer popular options such as an API for programmatic queries, a Docker image for local installations, support for multiple (metagenomic) queries and the ability to perform automated look-ups against thousands of previously PHAST-annotated bacterial genomes. PHASTEST is available online at https://phastest.ca.


Assuntos
Bases de Dados de Ácidos Nucleicos , Prófagos , Ferramenta de Busca , Software , Genoma Bacteriano , Anotação de Sequência Molecular , Plasmídeos , Prófagos/genética
2.
Nucleic Acids Res ; 51(W1): W459-W467, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37099365

RESUMO

PlasMapper 3.0 is a web server that allows users to generate, edit, annotate and interactively visualize publication quality plasmid maps. Plasmid maps are used to plan, design, share and publish critical information about gene cloning experiments. PlasMapper 3.0 is the successor to PlasMapper 2.0 and offers many features found only in commercial plasmid mapping/editing packages. PlasMapper 3.0 allows users to paste or upload plasmid sequences as input or to upload existing plasmid maps from its large database of >2000 pre-annotated plasmids (PlasMapDB). This database can be searched by plasmid names, sequence features, restriction sites, preferred host organisms, and sequence length. PlasMapper 3.0 also supports the annotation of new or never-before-seen plasmids using its own feature database that contains common promoters, terminators, regulatory sequences, replication origins, selectable markers and other features found in most cloning plasmids. PlasMapper 3.0 has several interactive sequence editors/viewers that allow users to select and view plasmid regions, insert genes, modify restriction sites or perform codon optimization. The graphics for PlasMapper 3.0 have also been substantially upgraded. It now offers an interactive, full-color plasmid viewer/editor that allows users to zoom, rotate, re-color, linearize, circularize, edit annotated features and modify plasmid images or labels to improve the esthetic qualities of their plasmid map and textual displays. All the plasmid images and textual displays are downloadable in multiple formats. PlasMapper 3.0 is available online at https://plasmapper.ca.


Assuntos
Software , Interface Usuário-Computador , Plasmídeos/genética , Computadores , Sequência de Bases , Internet
3.
Nucleic Acids Res ; 51(W1): W484-W492, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37140037

RESUMO

Proksee (https://proksee.ca) provides users with a powerful, easy-to-use, and feature-rich system for assembling, annotating, analysing, and visualizing bacterial genomes. Proksee accepts Illumina sequence reads as compressed FASTQ files or pre-assembled contigs in raw, FASTA, or GenBank format. Alternatively, users can supply a GenBank accession or a previously generated Proksee map in JSON format. Proksee then performs assembly (for raw sequence data), generates a graphical map, and provides an interface for customizing the map and launching further analysis jobs. Notable features of Proksee include unique and informative assembly metrics provided via a custom reference database of assemblies; a deeply integrated high-performance genome browser for viewing and comparing analysis results at individual base resolution (developed specifically for Proksee); an ever-growing list of embedded analysis tools whose results can be seamlessly added to the map or searched and explored in other formats; and the option to export graphical maps, analysis results, and log files for data sharing and research reproducibility. All these features are provided via a carefully designed multi-server cloud-based system that can easily scale to meet user demand and that ensures the web server is robust and responsive.


Assuntos
Genoma Bacteriano , Software , Reprodutibilidade dos Testes , Bases de Dados de Ácidos Nucleicos , Internet
4.
Brief Bioinform ; 20(4): 1576-1582, 2019 07 19.
Artigo em Inglês | MEDLINE | ID: mdl-28968859

RESUMO

Graphical genome maps are widely used to assess genome features and sequence characteristics. The CGView (Circular Genome Viewer) software family is a popular collection of tools for generating genome maps for bacteria, organelles and viruses. In this review, we describe the capabilities of the original CGView program along with those of subsequent companion applications, including the CGView Server and the CGView Comparison Tool. We also discuss GView, a graphical user interface-enabled rewrite of CGView, and the GView Server, which offers several integrated analyses for identifying shared or unique genome regions relative to a collection of comparison genomes. We conclude with some remarks about our current development efforts related to CGView aimed at adding new functionality while increasing ease of use.


Assuntos
DNA Circular/genética , Genômica/estatística & dados numéricos , Software , Mapeamento Cromossômico , Biologia Computacional , Gráficos por Computador , Escherichia coli O157/genética , Genoma Bacteriano , Genoma Viral , Listeria monocytogenes/genética , Interface Usuário-Computador
5.
Nucleic Acids Res ; 46(D1): D1074-D1082, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29126136

RESUMO

DrugBank (www.drugbank.ca) is a web-enabled database containing comprehensive molecular information about drugs, their mechanisms, their interactions and their targets. First described in 2006, DrugBank has continued to evolve over the past 12 years in response to marked improvements to web standards and changing needs for drug research and development. This year's update, DrugBank 5.0, represents the most significant upgrade to the database in more than 10 years. In many cases, existing data content has grown by 100% or more over the last update. For instance, the total number of investigational drugs in the database has grown by almost 300%, the number of drug-drug interactions has grown by nearly 600% and the number of SNP-associated drug effects has grown more than 3000%. Significant improvements have been made to the quantity, quality and consistency of drug indications, drug binding data as well as drug-drug and drug-food interactions. A great deal of brand new data have also been added to DrugBank 5.0. This includes information on the influence of hundreds of drugs on metabolite levels (pharmacometabolomics), gene expression levels (pharmacotranscriptomics) and protein expression levels (pharmacoprotoemics). New data have also been added on the status of hundreds of new drug clinical trials and existing drug repurposing trials. Many other important improvements in the content, interface and performance of the DrugBank website have been made and these should greatly enhance its ease of use, utility and potential applications in many areas of pharmacological research, pharmaceutical science and drug education.


Assuntos
Bases de Dados de Produtos Farmacêuticos , Interações Medicamentosas , Interações Alimento-Droga , Metaboloma/efeitos dos fármacos , Polimorfismo de Nucleotídeo Único , Transcriptoma/efeitos dos fármacos , Interface Usuário-Computador
6.
Anal Chem ; 90(1): 649-656, 2018 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-29035042

RESUMO

NMR is a widely used analytical technique with a growing number of repositories available. As a result, demands for a vendor-agnostic, open data format for long-term archiving of NMR data have emerged with the aim to ease and encourage sharing, comparison, and reuse of NMR data. Here we present nmrML, an open XML-based exchange and storage format for NMR spectral data. The nmrML format is intended to be fully compatible with existing NMR data for chemical, biochemical, and metabolomics experiments. nmrML can capture raw NMR data, spectral data acquisition parameters, and where available spectral metadata, such as chemical structures associated with spectral assignments. The nmrML format is compatible with pure-compound NMR data for reference spectral libraries as well as NMR data from complex biomixtures, i.e., metabolomics experiments. To facilitate format conversions, we provide nmrML converters for Bruker, JEOL and Agilent/Varian vendor formats. In addition, easy-to-use Web-based spectral viewing, processing, and spectral assignment tools that read and write nmrML have been developed. Software libraries and Web services for data validation are available for tool developers and end-users. The nmrML format has already been adopted for capturing and disseminating NMR data for small molecules by several open source data processing tools and metabolomics reference spectral libraries, e.g., serving as storage format for the MetaboLights data repository. The nmrML open access data standard has been endorsed by the Metabolomics Standards Initiative (MSI), and we here encourage user participation and feedback to increase usability and make it a successful standard.


Assuntos
Bases de Dados de Compostos Químicos/normas , Espectroscopia de Ressonância Magnética/estatística & dados numéricos , Metabolômica/métodos , Software
7.
Mol Phylogenet Evol ; 120: 342-353, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29242164

RESUMO

Polyploidy is a major speciation process in vascular plants, and is postulated to be particularly important in shaping the diversity of extant ferns. However, limitations in the availability of bi-parental markers for ferns have greatly limited phylogenetic investigation of polyploidy in this group. With a large number of allopolyploid species, the genus Botrychium is a classic example in ferns where recurrent polyploidy is postulated to have driven frequent speciation events. Here, we use PacBio sequencing and the PURC bioinformatics pipeline to capture all homeologous or allelic copies of four long (∼1 kb) low-copy nuclear regions from a sample of 45 specimens (25 diploids and 20 polyploids) representing 37 Botrychium taxa, and three outgroups. This sample includes most currently recognized Botrychium species in Europe and North America, and the majority of our specimens were genotyped with co-dominant nuclear allozymes to ensure species identification. We analyzed the sequence data using maximum likelihood (ML) and Bayesian inference (BI) concatenated-data ("gene tree") approaches to explore the relationships among Botrychium species. Finally, we estimated divergence times among Botrychium lineages and inferred the multi-labeled polyploid species tree showing the origins of the polyploid taxa, and their relationships to each other and to their diploid progenitors. We found strong support for the monophyly of the major lineages within Botrychium and identified most of the parental donors of the polyploids; these results largely corroborate earlier morphological and allozyme-based investigations. Each polyploid had at least two distinct homeologs, indicating that all sampled polyploids are likely allopolyploids (rather than autopolyploids). Our divergence-time analyses revealed that these allopolyploid lineages originated recently-within the last two million years-and thus that the genus has undergone a recent radiation, correlated with multiple independent allopolyploidizations across the phylogeny. Also, we found strong parental biases in the formation of allopolyploids, with individual diploid species participating multiple times as either the maternal or paternal donor (but not both). Finally, we discuss the role of polyploidy in the evolutionary history of Botrychium and the interspecific reproductive barriers possibly involved in these parental biases.


Assuntos
Gleiquênias/classificação , Teorema de Bayes , Núcleo Celular/genética , Biologia Computacional , Criptocromos/química , Criptocromos/classificação , Criptocromos/genética , DNA de Plantas/química , DNA de Plantas/isolamento & purificação , DNA de Plantas/metabolismo , Gleiquênias/genética , Filogenia , Poliploidia , Análise de Sequência de DNA
8.
Nucleic Acids Res ; 44(W1): W16-21, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27141966

RESUMO

PHASTER (PHAge Search Tool - Enhanced Release) is a significant upgrade to the popular PHAST web server for the rapid identification and annotation of prophage sequences within bacterial genomes and plasmids. Although the steps in the phage identification pipeline in PHASTER remain largely the same as in the original PHAST, numerous software improvements and significant hardware enhancements have now made PHASTER faster, more efficient, more visually appealing and much more user friendly. In particular, PHASTER is now 4.3× faster than PHAST when analyzing a typical bacterial genome. More specifically, software optimizations have made the backend of PHASTER 2.7X faster than PHAST, while the addition of 80 CPUs to the PHASTER compute cluster are responsible for the remaining speed-up. PHASTER can now process a typical bacterial genome in 3 min from the raw sequence alone, or in 1.5 min when given a pre-annotated GenBank file. A number of other optimizations have also been implemented, including automated algorithms to reduce the size and redundancy of PHASTER's databases, improvements in handling multiple (metagenomic) queries and higher user traffic, along with the ability to perform automated look-ups against 14 000 previously PHAST/PHASTER annotated bacterial genomes (which can lead to complete phage annotations in seconds as opposed to minutes). PHASTER's web interface has also been entirely rewritten. A new graphical genome browser has been added, gene/genome visualization tools have been improved, and the graphical interface is now more modern, robust and user-friendly. PHASTER is available online at www.phaster.ca.


Assuntos
Bactérias/genética , Bacteriófagos/genética , DNA Viral/genética , Genoma Bacteriano , Software , Algoritmos , Bactérias/virologia , Gráficos por Computador , Bases de Dados Genéticas , Ontologia Genética , Anotação de Sequência Molecular , Plasmídeos/química , Plasmídeos/metabolismo , Ferramenta de Busca , Fatores de Tempo
9.
Nucleic Acids Res ; 44(W1): W147-53, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27190236

RESUMO

Heatmapper is a freely available web server that allows users to interactively visualize their data in the form of heat maps through an easy-to-use graphical interface. Unlike existing non-commercial heat map packages, which either lack graphical interfaces or are specialized for only one or two kinds of heat maps, Heatmapper is a versatile tool that allows users to easily create a wide variety of heat maps for many different data types and applications. More specifically, Heatmapper allows users to generate, cluster and visualize: (i) expression-based heat maps from transcriptomic, proteomic and metabolomic experiments; (ii) pairwise distance maps; (iii) correlation maps; (iv) image overlay heat maps; (v) latitude and longitude heat maps and (vi) geopolitical (choropleth) heat maps. Heatmapper offers a number of simple and intuitive customization options for facile adjustments to each heat map's appearance and plotting parameters. Heatmapper also allows users to interactively explore their numeric data values by hovering their cursor over each heat map cell, or by using a searchable/sortable data table view. Heat map data can be easily uploaded to Heatmapper in text, Excel or tab delimited formatted tables and the resulting heat map images can be easily downloaded in common formats including PNG, JPG and PDF. Heatmapper is designed to appeal to a wide range of users, including molecular biologists, structural biologists, microbiologists, epidemiologists, environmental scientists, agriculture/forestry scientists, fish and wildlife biologists, climatologists, geologists, educators and students. Heatmapper is available at http://www.heatmapper.ca.


Assuntos
Mapeamento Potencial de Superfície Corporal/métodos , Mapeamento Cromossômico/métodos , Mapeamento Geográfico , Mapeamento de Interação de Proteínas/métodos , Termografia/métodos , Interface Usuário-Computador , Animais , Gráficos por Computador , Redes Reguladoras de Genes , Humanos , Armazenamento e Recuperação da Informação , Internet , Metaboloma , Proteoma , Transcriptoma
10.
Nucleic Acids Res ; 44(D1): D495-501, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26481353

RESUMO

ECMDB or the Escherichia coli Metabolome Database (http://www.ecmdb.ca) is a comprehensive database containing detailed information about the genome and metabolome of E. coli (K-12). First released in 2012, the ECMDB has undergone substantial expansion and many modifications over the past 4 years. This manuscript describes the most recent version of ECMDB (ECMDB 2.0). In particular, it provides a comprehensive update of the database that was previously described in the 2013 NAR Database Issue and details many of the additions and improvements made to the ECMDB over that time. Some of the most important or significant enhancements include a 13-fold increase in the number of metabolic pathway diagrams (from 125 to 1650), a 3-fold increase in the number of compounds linked to pathways (from 1058 to 3280), the addition of dozens of operon/metabolite signalling pathways, a 44% increase in the number of compounds in the database (from 2610 to 3760), a 7-fold increase in the number of compounds with NMR or MS spectra (from 412 to 3261) and a massive increase in the number of external links to other E. coli or chemical resources. These additions, along with many other enhancements aimed at improving the ease or speed of querying, searching and viewing the data within ECMDB should greatly facilitate the understanding of not only the metabolism of E. coli, but also allow the in-depth exploration of its extensive metabolic networks, its many signalling pathways and its essential biochemistry.


Assuntos
Bases de Dados de Compostos Químicos , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Genoma Bacteriano , Metaboloma , Escherichia coli K12/química , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Redes e Vias Metabólicas
11.
BMC Genomics ; 15: 451, 2014 Jun 09.
Artigo em Inglês | MEDLINE | ID: mdl-24912542

RESUMO

BACKGROUND: Oocytes and early embryos contain minute amounts of DNA, RNA and proteins, making the study of early mammalian development highly challenging. The study of the embryo epigenome, in particular the DNA methylome, has been made accessible thanks to the possibility of amplifying specific sequences according to their initial methylation status. This paper describes a novel platform dedicated to the genome-wide study of bovine DNA methylation, including a complete pipeline for data analysis and visualization. The platform allows processing and integrating of DNA methylome and transcriptome data from the same sample. Procedures were optimized for genome-wide analysis of 10 ng of DNA (10 bovine blastocysts). Bovine sperm and blastocysts were compared as a test of platform capability. RESULTS: The hypermethylation of bovine sperm DNA compared to the embryo genome was confirmed. Differentially methylated regions were distributed across various classes of bovine sperm genomic feature including primarily promoter, intronic and exonic regions, non-CpG-island regions (shore, shelf and open-sea) and CpG islands with low-to-intermediate CpG density. The blastocyst genome bore more methylation marks than sperm DNA only in CpG islands with high CpG density. Long-terminal-repeat retrotransposons (LTR), LINE and SINE were more methylated in sperm DNA, as were low-complexity repetitive elements in blastocysts. CONCLUSIONS: This is the first early embryo compatible genome-wide epigenetics platform for bovine. Such platforms should improve the study of the potential epigenetic risks of assisted reproductive technologies (ART), the establishment sequence of embryonic cell lines and potential deviations in both gene expression and DNA methylation capable of having long-term impact.


Assuntos
Metilação de DNA , Epigênese Genética , Perfilação da Expressão Gênica/métodos , Transcriptoma , Animais , Blastocisto/metabolismo , Bovinos , Ilhas de CpG , Regulação da Expressão Gênica no Desenvolvimento , Impressão Genômica , Genômica/métodos , Masculino , Reprodutibilidade dos Testes , Espermatozoides/metabolismo , Navegador
12.
Am J Bot ; 101(1): 128-40, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24401328

RESUMO

PREMISE OF THE STUDY: Previous phylogenetic studies of moonworts (Botrychium sensu stricto (s.s.)) included few taxa from outside of North America. This low geographical representation limited interpretations of relationships of this group rich in cryptic species. With 18 out of 30 species in the genus being polyploid, understanding their evolutionary history remains a major challenge. METHODS: A new molecular phylogeny was reconstructed using Maximum Likelihood (ML) and Bayesian Inference (BI) analyses based on multiple accessions of the most wide-ranging Arctic taxa of Botrychium in North America and Europe using three noncoding plastid DNA regions (psbA-trnH(GUG), trnL(UAA)-trnF(GAA) intergenic spacer, and rpL16 intron). KEY RESULTS: The new phylogeny confirms the identity of several recently described species and proposed new taxa. Nine subclades are newly identified within the two major clades in Botrychium s.s.: Lanceolatum and Lunaria. Chloroplast DNA was variable enough to separate morphologically cryptic species in the Lunaria clade. On the contrary, much less variation is seen within the morphologically variable Lanceolatum clade despite sampling over the same broad geographic range. The chloroplast region psbA-trnH(GUG) is identified as an efficient DNA barcode for the identification of cryptic taxa in Botrychium s.s. CONCLUSIONS: The combined increase in species representation, samples from throughout the geographic range of each species, and sequencing of multiple plastid DNA regions supports morphologically cryptic species in Botrychium s.s.


Assuntos
Filogenia , Traqueófitas/genética , Sequência de Bases , Teorema de Bayes , DNA de Cloroplastos/genética , Bases de Dados Genéticas , Funções Verossimilhança , Plastídeos/genética , Análise de Sequência de DNA , Especificidade da Espécie
13.
Plants (Basel) ; 13(5)2024 Mar 04.
Artigo em Inglês | MEDLINE | ID: mdl-38475562

RESUMO

Microsatellites or SSRs are small tandem repeats that are 1-6 bp long. They are usually highly polymorphic and form important portions of genomes. They have been extensively analyzed in humans, animals and model plants; however, information from non-flowering plants is generally lacking. Here, we examined 29 samples of Ophioglossaceae ferns, mainly from the genera Botrychium and Sceptridium. We analyzed the SSR distribution, density and composition in almost 400 nuclear exons and their flanking regions. We detected 45 SSRs in exons and 1475 SSRs in the flanking regions. In the exons, only di-, tri- and tetranucleotides were found, and all of them were 12 bp long. The annotation of the exons containing SSRs showed that they were related to various processes, such as metabolism, catalysis, transportation or plant growth. The flanking regions contained SSRs from all categories, with the most numerous being dinucleotides, followed by tetranucleotides. More than one-third of all the SSRs in the flanking regions were 12 bp long. The SSR densities in the exons were very low, ranging from 0 to 0.07 SSRs/kb, while those in the flanking regions ranged from 0.24 to 0.81 SSRs/kb; and those in the combined dataset ranged from 0.2 to 0.81 SSRs/kb. The majority of the detected SSRs in the flanking regions were polymorphic and present at the same loci across two or more samples but differing in the number of repeats. The SSRs detected here may serve as a basis for further population genetic, phylogenetic or evolutionary genetic studies, as well as for further studies focusing on SSRs in the genomes and their roles in adaptation, evolution and diseases.

14.
Plants (Basel) ; 12(8)2023 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-37111932

RESUMO

The Pleistocene climatic oscillations (PCO) that provoked several cycles of glacial-interglacial periods are thought to have profoundly affected species distribution, richness and diversity around the world. While the effect of the PCO on population dynamics at temperate latitudes is well known, considerable questions remain about its impact on the biodiversity of neotropical mountains. Here, we use amplified fragment length polymorphism molecular markers (AFLPs) to investigate the phylogeography and genetic structure of 13 plant species belonging to the gentian genus Macrocarpaea (Gentianaceae) in the tropical Andes. These woody herbs, shrubs or small trees show complex and potentially reticulated relationships, including cryptic species. We show that populations of M. xerantifulva in the dry system of the Rio Marañón in northern Peru have lower levels of genetic diversity compared to other sampled species. We suggest that this is due to a recent demographic bottleneck resulting from the contraction of the montane wet forests into refugia because of the expansion of the dry system into the valley during the glacial cycles of the PCO. This may imply that the ecosystems of different valleys of the Andes might have responded differently to the PCO.

15.
Front Plant Sci ; 14: 1294716, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38288414

RESUMO

Previous phylogenies showed conflicting relationships among the subfamilies and genera within the fern family Ophioglossaceae. However, their classification remains unsettled where contrasting classifications recognize four to 15 genera. Since these treatments are mostly based on phylogenetic evidence using limited, plastid-only loci, a phylogenomic understanding is actually necessary to provide conclusive insight into the systematics of the genera. In this study, we have therefore compiled datasets with the broadest sampling of Ophioglossaceae genera to date, including all fifteen currently recognized genera, especially for the first time the South African endemic genus Rhizoglossum. Notably, our comprehensive phylogenomic matrix is based on both plastome and mitogenome genes. Inferred from the coding sequences of 83 plastid and 37 mitochondrial genes, a strongly supported topology for these subfamilies is presented, and is established by analyses using different partitioning approaches and substitution models. At the generic level, most relationships are well resolved except for few within the subfamily Ophioglossoideae. With this new phylogenomic scheme, key morphological and genomic changes were further identified along this backbone. In addition, we confirmed numerous horizontally transferred (HGT) genes in the genera Botrypus, Helminthostachys, Mankyua, Sahashia, and Sceptridium. These HGT genes are most likely located in mitogenomes and are predominately donated from angiosperm Santalales or non-Ophioglossaceae ferns. By our in-depth searches of the organellar genomes, we also provided phylogenetic overviews for the plastid and mitochondrial MORFFO genes found in these Ophioglossaceae ferns.

16.
BMC Genomics ; 13: 202, 2012 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-22621371

RESUMO

BACKGROUND: Continued sequencing efforts coupled with advances in sequencing technology will lead to the completion of a vast number of small genomes. Whole-genome comparisons represent an important part of the analysis of any new genome sequence, as they can provide a better understanding of the biology and evolution of the source organism. Visualization of the results is important, as it allows information from a variety of sources to be integrated and interpreted. However, existing graphical comparison tools lack features needed for efficiently comparing a new genome to hundreds or thousands of existing sequences. Moreover, existing tools are limited in terms of the types of comparisons that can be performed, the extent to which the output can be customized, and the ease with which the entire process can be automated. RESULTS: The CGView Comparison Tool (CCT) is a package for visually comparing bacterial, plasmid, chloroplast, or mitochondrial sequences of interest to existing genomes or sequence collections. The comparisons are conducted using BLAST, and the BLAST results are presented in the form of graphical maps that can also show sequence features, gene and protein names, COG (Clusters of Orthologous Groups of proteins) category assignments, and sequence composition characteristics. CCT can generate maps in a variety of sizes, including 400 Megapixel maps suitable for posters. Comparisons can be conducted within a particular species or genus, or all available genomes can be used. The entire map creation process, from downloading sequences to redrawing zoomed maps, can be completed easily using scripts included with the CCT. User-defined features or analysis results can be included on maps, and maps can be extensively customized. To simplify program setup, a CCT virtual machine that includes all dependencies preinstalled is available. Detailed tutorials illustrating the use of CCT are included with the CCT documentation. CONCLUSION: CCT can be used to visually compare a reference sequence to thousands of existing genomes or sequence collections (next-generation sequencing reads for example) on a standard desktop computer. It provides analysis and visualization functionality not available in any existing circular genome visualization tool. By visually presenting sequence conservation information along with functional classifications and sequence composition characteristics, CCT can be a useful tool for identifying rapidly evolving or novel sequences, horizontally transferred sequences, or unusual functional properties in newly sequenced genomes. CCT is freely available for download at http://stothard.afns.ualberta.ca/downloads/CCT/.


Assuntos
Genoma Bacteriano/genética , Análise de Sequência de DNA/métodos , Software , Cloroplastos/genética , Mitocôndrias/genética , Plasmídeos/genética
17.
BMC Genomics ; 13: 370, 2012 Aug 03.
Artigo em Inglês | MEDLINE | ID: mdl-22863022

RESUMO

BACKGROUND: The domestic pig is an important livestock species and there is strong interest in the factors that affect the development of viable embryos and offspring in this species. A limited understanding of the molecular mechanisms involved in early embryonic development has inhibited our ability to fully elucidate these factors. Next generation deep sequencing and microarray technologies are powerful tools for delineation of molecular pathways involved in the developing embryo. RESULTS: Here we present the development of a porcine-embryo-specific microarray platform created from a large expressed sequence tag (EST) analysis generated by Roche/454 next-generation sequencing of cDNAs constructed from critical stages of in vivo or in vitro porcine preimplantation embryos. Two cDNA libraries constructed from in vitro and in vivo produced preimplantation porcine embryos were normalized and sequenced using 454 Titanium pyrosequencing technology. Over one million high-quality EST sequences were obtained and used to develop the EMbryogene Porcine Version 1 (EMPV1) microarray composed of 43,795 probes. Based on an initial probe sequence annotation, the EMPV1 features 17,409 protein-coding, 473 pseudogenes, 46 retrotransposed, 2,359 non-coding RNA, 4,121 splice variants in 2,862 genes and a total of 12,324 Novel Transcript Regions (NTR). After re-annotation, the total unique genes increased from 11,961 to 16,281 and 1.9% of them belonged to a large olfactory receptor (OR) gene family. Quality control on the EMPV1 was performed and revealed an even distribution of ten clusters of spiked-in control spots and array to array (dye-swap) correlation was 0.97. CONCLUSIONS: Using next-generation deep sequencing we have produced a large EST dataset to allow for the selection of probe sequences for the development of the EMPV1 microarray platform. The quality of this embryo-specific array was confirmed with a high-level of reproducibility using current Agilent microarray technology. With more than an estimated 20,000 unique genes represented on the EMPV1, this platform will provide the foundation for future research into the in vivo and in vitro factors that affect the viability of porcine embryos, as well as the effects of these factors on the live offspring that result from these embryos.


Assuntos
Embrião de Mamíferos/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Animais , Suínos
18.
Bioinformatics ; 27(16): 2300-1, 2011 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-21697123

RESUMO

SUMMARY: NGS-SNP is a collection of command-line scripts for providing rich annotations for SNPs identified by the sequencing of whole genomes from any organism with reference sequences in Ensembl. Included among the annotations, several of which are not available from any existing SNP annotation tools, are the results of detailed comparisons with orthologous sequences. These comparisons can, for example, identify SNPs that affect conserved residues, or alter residues or genes linked to phenotypes in another species. AVAILABILITY: NGS-SNP is available both as a set of scripts and as a virtual machine. The virtual machine consists of a Linux operating system with all the NGS-SNP dependencies pre-installed. The source code and virtual machine are freely available for download at http://stothard.afns.ualberta.ca/downloads/NGS-SNP/. CONTACT: stothard@ualberta.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Anotação de Sequência Molecular/métodos , Polimorfismo de Nucleotídeo Único , Genômica , Software
19.
Mol Reprod Dev ; 78(9): 651-64, 2011 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-21812063

RESUMO

While most assisted reproductive technologies (ART) are considered routine for the reproduction of species of economical importance, such as the bovine, the impact of these manipulations on the developing embryo remains largely unknown. In an effort to obtain a comprehensive survey of the bovine embryo transcriptome and how it is modified by ART, resources were combined to design an embryo-specific microarray. Close to one million high-quality reads were produced from subtracted bovine embryo libraries using Roche 454 Titanium deep sequencing technology, which enabled the creation of an augmented bovine genome catalog. This catalog was enriched with bovine embryo transcripts, and included newly discovered indel type and 3'UTR variants. Using this augmented bovine genome catalog, the EmbryoGENE Bovine Microarray was designed and is composed of a total of 42,242 probes, including 21,139 known reference genes; 9,322 probes for novel transcribed regions (NTRs); 3,677 alternatively spliced exons; 3,353 3'-tiling probes; and 3,723 controls. A suite of bioinformatics tools was also developed to facilitate microrarray data analysis and database creation; it includes a quality control module, a Laboratory Information Management System (LIMS) and microarray analysis software. Results obtained during this study have already led to the identification of differentially expressed blastocyst targets, NTRs, splice variants of the indel type, and 3'UTR variants. We were able to confirm microarray results by real-time PCR, indicating that the EmbryoGENE bovine microarray has the power to detect physiologically relevant changes in gene expression.


Assuntos
Bovinos/embriologia , Bovinos/genética , Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Transcriptoma/fisiologia , Animais , Biologia Computacional , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Embrião de Mamíferos , Feminino , Perfilação da Expressão Gênica/normas , Células da Granulosa/metabolismo , Reação em Cadeia da Polimerase em Tempo Real , Reprodutibilidade dos Testes , Técnicas de Reprodução Assistida , Interface Usuário-Computador
20.
Biology (Basel) ; 10(9)2021 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-34571702

RESUMO

The evolutionary processes responsible for the extraordinary diversity in the middle elevation montane forests of the Tropical Andes (MMF; 1000-3500 m) remain poorly understood. It is not clear whether adaptive divergence, niche conservatism or geographical processes were the main contributors to the radiation of the respective lineages occurring there. We investigated the evolutionary history of plant lineages in the MMF. We used the vascular plant genus Macrocarpaea (Gentianaceae) as a model, as it consists of 118 morphologically diverse species, a majority of which are endemic to the MMF. We used a time-calibrated molecular phylogeny and morphological and climatic data to compare a set of evolutionary scenarios of various levels of complexity in a phylogenetic comparative framework. In this paper, we show that the hypothesis of adaptive radiation for Macrocarpaea in the MMF is unlikely. The genus remained confined to the upper montane forests (UMF > 1800 m) during more than a half of its evolutionary history, possibly due to evolutionary constraints. Later, coinciding with the beginning of the Pleistocene (around 2.58 Ma), a phylogenetically derived (recently branching) clade, here referred to as the M. micrantha clade (25 species), successfully colonized and radiated in the lower montane forests (LMF < 1800 m). This colonization was accompanied by the evolution of a new leaf phenotype that is unique to the species of the M. micrantha clade that likely represents an adaptation to life in this new environment (adaptive zone). Therefore, our results suggest that niche conservatism and geographical processes have dominated most of the diversification history of Macrocarpaea, but that a rare adaptive divergence event allowed a transition into a new adaptive zone and enabled progressive radiation in this zone through geographical processes.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA