Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Nucleic Acids Res ; 36(Web Server issue): W465-9, 2008 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-18424797

RESUMO

Phylogenetic analyses are central to many research areas in biology and typically involve the identification of homologous sequences, their multiple alignment, the phylogenetic reconstruction and the graphical representation of the inferred tree. The Phylogeny.fr platform transparently chains programs to automatically perform these tasks. It is primarily designed for biologists with no experience in phylogeny, but can also meet the needs of specialists; the first ones will find up-to-date tools chained in a phylogeny pipeline to analyze their data in a simple and robust way, while the specialists will be able to easily build and run sophisticated analyses. Phylogeny.fr offers three main modes. The 'One Click' mode targets non-specialists and provides a ready-to-use pipeline chaining programs with recognized accuracy and speed: MUSCLE for multiple alignment, PhyML for tree building, and TreeDyn for tree rendering. All parameters are set up to suit most studies, and users only have to provide their input sequences to obtain a ready-to-print tree. The 'Advanced' mode uses the same pipeline but allows the parameters of each program to be customized by users. The 'A la Carte' mode offers more flexibility and sophistication, as users can build their own pipeline by selecting and setting up the required steps from a large choice of tools to suit their specific needs. Prior to phylogenetic analysis, users can also collect neighbors of a query sequence by running BLAST on general or specialized databases. A guide tree then helps to select neighbor sequences to be used as input for the phylogeny pipeline. Phylogeny.fr is available at: http://www.phylogeny.fr/


Assuntos
Filogenia , Software , Internet , Alinhamento de Sequência , Análise de Sequência de DNA , Análise de Sequência de Proteína
2.
Science ; 290(5490): 347-50, 2000 Oct 13.
Artigo em Inglês | MEDLINE | ID: mdl-11030655

RESUMO

Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.


Assuntos
Proteínas de Bactérias/genética , DNA Bacteriano/genética , Sequências Repetitivas Dispersas , Fases de Leitura Aberta/genética , RNA Mensageiro/genética , Rickettsia conorii/genética , Rickettsia/genética , Proteínas de Bactérias/química , Sequência de Bases , Sequência Conservada , Evolução Molecular , Genoma Bacteriano , Dados de Sequência Molecular , Mutagênese Insercional , Conformação de Ácido Nucleico , Biossíntese de Proteínas , Conformação Proteica , Estrutura Secundária de Proteína , RNA Bacteriano/química , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , RNA Mensageiro/química , RNA Mensageiro/metabolismo
3.
Science ; 293(5537): 2093-8, 2001 Sep 14.
Artigo em Inglês | MEDLINE | ID: mdl-11557893

RESUMO

Rickettsia conorii is an obligate intracellular bacterium that causes Mediterranean spotted fever in humans. We determined the 1,268,755-nucleotide complete genome sequence of R. conorii, containing 1374 open reading frames. This genome exhibits 804 of the 834 genes of the previously determined R. prowazekii genome plus 552 supplementary open reading frames and a 10-fold increase in the number of repetitive elements. Despite these differences, the two genomes exhibit a nearly perfect colinearity that allowed the clear identification of different stages of gene alterations with gene remnants and 37 genes split in 105 fragments, of which 59 are transcribed. A 38-kilobase sequence inversion was dated shortly after the divergence of the genus.


Assuntos
Evolução Molecular , Genoma Bacteriano , Rickettsia conorii/genética , Rickettsia prowazekii/genética , Adaptação Fisiológica , Chlamydia/genética , Biologia Computacional , DNA Bacteriano/genética , DNA Intergênico , Dosagem de Genes , Inativação Gênica , Transferência Genética Horizontal , Genes Bacterianos , Fases de Leitura Aberta , Filogenia , Reação em Cadeia da Polimerase , Sequências Repetitivas de Ácido Nucleico , Rickettsia/genética , Rickettsia conorii/fisiologia , Rickettsia prowazekii/fisiologia , Análise de Sequência de DNA , Transcrição Gênica
4.
Nat Biotechnol ; 15(9): 855-8, 1997 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-9306399

RESUMO

The first successful recovery of ancient DNA, from quagga and human mummies inspired significant enough interest to open an entire field of research. Efforts from many research groups, often in a hunt for the oldest sequences, showed that ancient DNA was a poor substrate for the enzymes used in molecular biology; it is present in tiny amounts, hard to purify, and frequently damaged. These obstacles have been partially overcome by the use of drastic laboratory precautions and by the introduction of polymerase chain reaction and phylogenetic studies. Ancient DNA analysis now finds applications in many research domains.


Assuntos
DNA/genética , Paleontologia , Animais , Arqueologia , Doenças Transmissíveis/genética , DNA/química , DNA Bacteriano/química , DNA Bacteriano/genética , DNA de Plantas/química , DNA de Plantas/genética , Doenças Genéticas Inatas/genética , Humanos , Reação em Cadeia da Polimerase , Moldes Genéticos
5.
Ann N Y Acad Sci ; 1063: 26-34, 2005 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-16481487

RESUMO

Rickettsia felis has been recently cultured in XTC2 cells. This allows production of enough bacteria to create a genomic bank and to sequence it. The chromosome of R. felis is longer than that of previously sequenced rickettsiae and it possess 2 plasmids. Microscopically, this bacterium exhibits two forms of pili: one resembles a conjugative pilus and another forms hair-like projections that may play a role in pathogenicity. R. felis also exhibits several copies of ankyrin-repeat genes and tetratricopeptide encoding gene that are specifically linked to pathogenic host-associated bacteria. It also contains toxin-antitoxin system encoding genes that are extremely rare in intracellular bacteria and may be linked to plasmid maintenance.


Assuntos
DNA Bacteriano/análise , Genoma Bacteriano , Rickettsia felis/crescimento & desenvolvimento , Rickettsia felis/genética , Análise de Sequência de DNA , Animais , Técnicas Bacteriológicas , Humanos , Rickettsia felis/patogenicidade , Rickettsia felis/ultraestrutura
6.
Pharmacogenomics ; 2(4): 361-72, 2001 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-11722286

RESUMO

In the post-genomic era, the new discipline of functional genomics is now facing the challenge of associating a function (as well as estimating its relevance to industrial applications) to about 100,000 microbial, plant or animal genes of known sequence but unknown function. Besides the design of databases, computational methods are increasingly becoming intimately linked with the various experimental approaches. Consequently, bioinformatics is rapidly evolving into independent fields addressing the specific problems of interpreting i) genomic sequences, ii) protein sequences and 3D-structures, as well as iii) transcriptome and macromolecular interaction data. It is thus increasingly difficult for the biologist to choose the computational approaches that perform best in these various areas. This paper attempts to review the most useful developments of the last 2 years.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Biologia Computacional/tendências , Bases de Dados Factuais , Humanos , Internet , Setor Privado , Proteoma , Análise de Sequência de Proteína , Design de Software
8.
Comput Appl Biosci ; 11(5): 489-95, 1995 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-8590171

RESUMO

We describe a new technique for the automatic detection and characterization, in terms of location, size and intensity, of positive hybridization spots on hybridization filter images. We will also discuss how this information can be used to reconstruct the spot deposition pattern on the filter and thus identify the spots. The part of the algorithm dedicated to the detection of spots is derived from a technique used in astronomy for the automatic recognition of galaxies on photographic plates. This technique has been modified so that it becomes able to cope with the strongly varying background typical of hybridization filter images. The technique does not require any human intervention.


Assuntos
Técnicas Genéticas , Processamento de Imagem Assistida por Computador/métodos , Hibridização de Ácido Nucleico , Software , Algoritmos , DNA/genética , DNA/isolamento & purificação , Sondas de DNA
9.
Comput Chem ; 21(4): 223-7, 1997.
Artigo em Inglês | MEDLINE | ID: mdl-9440929

RESUMO

Eukaryotic promoters are among the most important functional domains yet to be characterized in a satisfactory manner in genomic sequences. Most current detection methods rely on the recognition of individual transcription elements using position-weight matrices (PWM) or consensus sequences. Here, we study a simple promoter detection algorithm based on Markov transition matrices built from sequences upward from proven transcription initiation sites. The performances have been evaluated on the training set and on a test set of promoter-containing sequences. The results on the training set are surprisingly good, given that the algorithm does not incorporate any specific knowledge about promoters. Yet, the program exhibits the pathological behaviour typical of all training set-based methods: a significant decline in performance when confronted with previously unseen sequences. Thus, the Markov algorithm, like the others presently available, does not truly capture the essence of eukaryotic promoters. A detection program based on a Markov model is likely to be blind to categories of promoters without close representatives in the training set.


Assuntos
Cadeias de Markov , Regiões Promotoras Genéticas , Animais , Sequência de Bases , Simulação por Computador , Sequência Consenso , Células Eucarióticas , Técnicas Genéticas , Modelos Genéticos , TATA Box , Transcrição Gênica
10.
Comput Appl Biosci ; 12(5): 431-9, 1996 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-8996792

RESUMO

MOTIVATION: To improve the detection of nucleotide sequence signals (e.g. promoter elements) by position-weight matrices (PWM) using the concept of statistically significant matches. RESULTS: The Mksite program was originally developed for analyzing protein sequences. We report NMksite, a new version adapted to the processing of nucleotide sequences. NMksite creates PWM from nucleotide sequence block alignments or occurrence tables using three weight computation schemes. An original feature of NMksite is the numerical computation of the statistical significance of PWM matches. The utility of this concept is demonstrated in the context of the prediction of splice sites and promoter regions.


Assuntos
Computação Matemática , Regiões Promotoras Genéticas , Software , Sequência de Bases , Reações Falso-Positivas , Reconhecimento Automatizado de Padrão , Alinhamento de Sequência , TATA Box , Transcrição Gênica
11.
Proc Natl Acad Sci U S A ; 95(17): 10026-31, 1998 Aug 18.
Artigo em Inglês | MEDLINE | ID: mdl-9707594

RESUMO

A new method for predicting protein-coding regions in microbial genomic DNA sequences is presented. It uses an ab initio iterative Markov modeling procedure to automatically perform the partition of genomic sequences into three subsets shown to correspond to coding, coding on the opposite strand, and noncoding segments. In contrast to current methods, such as GENEMARK [Borodovsky, M. & McIninch, J. D. (1993) Comput. Chem. 17, 123-133], no training set or prior knowledge of the statistical properties of the studied genome are required. This new method tolerates error rates of 1-2% and can process unassembled sequences. It is thus ideal for the analysis of genome survey and/or fragmented sequence data from uncharacterized microorganisms. The method was validated on 10 complete bacterial genomes (from four major phylogenetic lineages). The results show that protein-coding regions can be identified with an accuracy of up to 90% with a totally automated and objective procedure.


Assuntos
Proteínas de Bactérias/genética , Técnicas Genéticas , Genoma Bacteriano , Algoritmos , Bactérias/genética , Composição de Bases , DNA Bacteriano/química , DNA Bacteriano/genética , Cadeias de Markov , Modelos Genéticos , Reprodutibilidade dos Testes
12.
Genome Res ; 7(10): 986-95, 1997 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-9331369

RESUMO

Genes differentially expressed in different tissues, during development, or during specific pathologies are of foremost interest to both basic and pharmaceutical research. "Transcript profiles" or "digital Northerns" are generated routinely by partially sequencing thousands of randomly selected clones from relevant cDNA libraries. Differentially expressed genes can then be detected from variations in the counts of their cognate sequence tags. Here we present the first systematic study on the influence of random fluctuations and sampling size on the reliability of this kind of data. We establish a rigorous significance test and demonstrate its use on publicly available transcript profiles. The theory links the threshold of selection of putatively regulated genes (e.g., the number of pharmaceutical leads) to the fraction of false positive clones one is willing to risk. Our results delineate more precisely and extend the limits within which digital Northern data can be used.


Assuntos
DNA Complementar/genética , Expressão Gênica , Modelos Genéticos , Modelos Estatísticos , Actinas/genética , Teorema de Bayes , Encéfalo , Intervalos de Confiança , Dimetil Sulfóxido/farmacologia , Humanos , Fígado , Pâncreas , Probabilidade , Tamanho da Amostra , Acetato de Tetradecanoilforbol/farmacologia , Células Tumorais Cultivadas/efeitos dos fármacos
13.
Genome Res ; 8(5): 524-30, 1998 May.
Artigo em Inglês | MEDLINE | ID: mdl-9582195

RESUMO

Alternate polyadenylation is an important post-transcriptional regulatory process now open to large-scale analysis by use of cDNA databases. We clustered 164,000 expressed sequence tags (ESTs) into approximately 15,000 groups and aligned each group to a putative mRNA 3' end. By use of stringent criteria to discard artifactual mRNA extremities, clear evidence for alternate polyadenylation was obtained in 189 of the 1000 EST clusters studied. A number of previously unreported polyadenylation sites were identified, together with possible instances of tissue-specific differential polyadenylation. This study demonstrates that, besides quantitative aspects of gene expression, the distribution of alternate mRNA forms can be analyzed through EST sampling.


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Expressão Gênica/genética , Poli A/metabolismo , RNA Mensageiro/metabolismo , Humanos , Biossíntese de Proteínas , RNA Mensageiro/química , Sequências Reguladoras de Ácido Nucleico
14.
Proteins ; 29(2): 252-7, 1997 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-9329089

RESUMO

SAmBA is a new software for the design of minimal experimental protocols using the notion of orthogonal arrays of strength 2. The main application of SAmBA is the search of protein crystallization conditions. Given a user input defining the relevant effectors/variables (e.g., pH, temperature, salts) and states (e.g., pH: 5, 6, 7 and 8), this software proposes an optimal set of experiments in which all tested variables and the pairwise interactions between them are symmetrically sampled. No a priori restrictions on the number and range of experimental variables is imposed. SAmBA consists of two complementary programs, SAm and BA, using a simulated annealing approach and a backtracking algorithm, respectively. The software is freely available as C code or as an interactive JAVA applet at http:/(/)igs-server.cnrs-mrs.fr.


Assuntos
Proteínas/química , Software , Algoritmos , Cristalização , Concentração de Íons de Hidrogênio , Temperatura
15.
Genome Res ; 9(10): 950-9, 1999 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-10523523

RESUMO

Large, publicly available collections of expressed sequence tags (ESTs) have been generated from Arabidopsis thaliana and rice (Oryza sativa). A potential, but relatively unexplored application of this data is in the study of plant gene expression. Other EST data, mainly from human and mouse, have been successfully used to point out genes exhibiting tissue- or disease-specific expression, as well as for identification of alternative transcripts. In this report, we go a step further in showing that computer analyses of plant EST data can be used to generate evidence of correlated expression patterns of genes across various tissues. Furthermore, tissue types and organs can be classified with respect to one another on the basis of their global gene expression patterns. As in previous studies, expression profiles are first estimated from EST counts. By clustering gene expression profiles or whole cDNA library profiles, we show that genes with similar functions, or cDNA libraries expected to share patterns of gene expression, are grouped together. Promising uses of this technique include functional genomics, in which evidence of correlated expression might complement (or substitute for) those of sequence similarity in the annotation of anonymous genes and identification of surrogate markers. The analysis presented here combines the application of a correlation-based clustering method with a graphical color map allowing intuitive visualization of patterns within a large table of expression measurements.


Assuntos
Etiquetas de Sequências Expressas , Regulação da Expressão Gênica de Plantas , Oryza/genética , Mapeamento de Sequências Contíguas , DNA Complementar/análise , Bases de Dados Factuais , Modelos Genéticos , Modelos Estatísticos , Família Multigênica
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa