Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 32(15): 2380-1, 2016 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-27153714

RESUMO

UNLABELLED: Nucleotide Similarity Scanner (NSimScan) is specialized for searching massive DNA databases for distant similarities. Its targeted applications include phylogenomics, comparative and functional studies of non-coding sequences, contamination detection, etc. NSimScan outperforms industry standard tools in combined sensitivity, accuracy and speed, operating at sensitivity similar to BLAST, accuracy of ssearch and speed of MegaBLAST. AVAILABILITY AND IMPLEMENTATION: NSimScan is available at https://github.com/abadona/qsimscan as a part of QSimScan package. It is implemented in C ++, distributed under MIT license and supported on Linux, OS X and Windows (with cygwin). CONTACT: dkaznadzey@yahoo.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bases de Dados de Ácidos Nucleicos , Software , DNA , Genômica , Filogenia
2.
Nucleic Acids Res ; 31(1): 164-71, 2003 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-12519973

RESUMO

The ERGO (http://ergo.integratedgenomics.com/ERGO/) genome analysis and discovery suite is an integration of biological data from genomics, biochemistry, high-throughput expression profiling, genetics and peer-reviewed journals to achieve a comprehensive analysis of genes and genomes. Far beyond any conventional systems that facilitate functional assignments, ERGO combines pattern-based analysis with comparative genomics by visualizing genes within the context of regulation, expression profiling, phylogenetic clusters, fusion events, networked cellular pathways and chromosomal neighborhoods of other functionally related genes. The result of this multifaceted approach is to provide an extensively curated database of the largest available integration of genomes, with a vast collection of reconstructed cellular pathways spanning all domains of life. Although access to ERGO is provided only under subscription, it is already widely used by the academic community. The current version of the system integrates 500 genomes from all domains of life in various levels of completion, 403 of which are available for subscription.


Assuntos
Bases de Dados Genéticas , Genoma , Genômica , Animais , Biologia Computacional , Perfilação da Expressão Gênica , Metabolismo , Proteínas/fisiologia
3.
FEMS Microbiol Lett ; 250(2): 175-84, 2005 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-16099605

RESUMO

Genome features of the Bacillus cereus group genomes (representative strains of Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis sub spp. israelensis) were analyzed and compared with the Bacillus subtilis genome. A core set of 1381 protein families among the four Bacillus genomes, with an additional set of 933 families common to the B. cereus group, was identified. Differences in signal transduction pathways, membrane transporters, cell surface structures, cell wall, and S-layer proteins suggesting differences in their phenotype were identified. The B. cereus group has signal transduction systems including a tyrosine kinase related to two-component system histidine kinases from B. subtilis. A model for regulation of the stress responsive sigma factor sigmaB in the B. cereus group different from the well studied regulation in B. subtilis has been proposed. Despite a high degree of chromosomal synteny among these genomes, significant differences in cell wall and spore coat proteins that contribute to the survival and adaptation in specific hosts has been identified.


Assuntos
Bacillus anthracis/genética , Bacillus cereus/genética , Bacillus subtilis/genética , Bacillus thuringiensis/genética , Genoma Bacteriano , Proteínas de Bactérias/genética , Parede Celular/genética , Genômica , Glicoproteínas de Membrana/genética , Proteínas de Membrana/genética , Proteínas de Membrana Transportadoras/genética , Transdução de Sinais/genética , Sintenia
4.
PLoS One ; 8(3): e58505, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23505522

RESUMO

In the era of metagenomics and diagnostics sequencing, the importance of protein comparison methods of boosted performance cannot be overstated. Here we present PSimScan (Protein Similarity Scanner), a flexible open source protein similarity search tool which provides a significant gain in speed compared to BLASTP at the price of controlled sensitivity loss. The PSimScan algorithm introduces a number of novel performance optimization methods that can be further used by the community to improve the speed and lower hardware requirements of bioinformatics software. The optimization starts at the lookup table construction, then the initial lookup table-based hits are passed through a pipeline of filtering and aggregation routines of increasing computational complexity. The first step in this pipeline is a novel algorithm that builds and selects 'similarity zones' aggregated from neighboring matches on small arrays of adjacent diagonals. PSimScan performs 5 to 100 times faster than the standard NCBI BLASTP, depending on chosen parameters, and runs on commodity hardware. Its sensitivity and selectivity at the slowest settings are comparable to the NCBI BLASTP's and decrease with the increase of speed, yet stay at the levels reasonable for many tasks. PSimScan is most advantageous when used on large collections of query sequences. Comparing the entire proteome of Streptocuccus pneumoniae (2,042 proteins) to the NCBI's non-redundant protein database of 16,971,855 records takes 6.5 hours on a moderately powerful PC, while the same task with the NCBI BLASTP takes over 66 hours. We describe innovations in the PSimScan algorithm in considerable detail to encourage bioinformaticians to improve on the tool and to use the innovations in their own software development.


Assuntos
Proteínas/química , Ferramenta de Busca , Algoritmos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Internet , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Software
5.
Proc Natl Acad Sci U S A ; 103(35): 13126-31, 2006 Aug 29.
Artigo em Inglês | MEDLINE | ID: mdl-16924101

RESUMO

Comparative analysis of 15 complete cyanobacterial genome sequences, including "near minimal" genomes of five strains of Prochlorococcus spp., revealed 1,054 protein families [core cyanobacterial clusters of orthologous groups of proteins (core CyOGs)] encoded in at least 14 of them. The majority of the core CyOGs are involved in central cellular functions that are shared with other bacteria; 50 core CyOGs are specific for cyanobacteria, whereas 84 are exclusively shared by cyanobacteria and plants and/or other plastid-carrying eukaryotes, such as diatoms or apicomplexans. The latter group includes 35 families of uncharacterized proteins, which could also be involved in photosynthesis. Only a few components of cyanobacterial photosynthetic machinery are represented in the genomes of the anoxygenic phototrophic bacteria Chlorobium tepidum, Rhodopseudomonas palustris, Chloroflexus aurantiacus, or Heliobacillus mobilis. These observations, coupled with recent geological data on the properties of the ancient phototrophs, suggest that photosynthesis originated in the cyanobacterial lineage under the selective pressures of UV light and depletion of electron donors. We propose that the first phototrophs were anaerobic ancestors of cyanobacteria ("procyanobacteria") that conducted anoxygenic photosynthesis using a photosystem I-like reaction center, somewhat similar to the heterocysts of modern filamentous cyanobacteria. From procyanobacteria, photosynthesis spread to other phyla by way of lateral gene transfer.


Assuntos
Cianobactérias/genética , Genoma Bacteriano/genética , Fotossíntese/genética , Proteínas de Bactérias/genética , Transferência Genética Horizontal , Genes Bacterianos/genética , Dados de Sequência Molecular , Família Multigênica/genética , Filogenia , Plantas/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA