Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
1.
Mol Biol Evol ; 40(5)2023 05 02.
Artigo em Inglês | MEDLINE | ID: mdl-37139943

RESUMO

The formation of new genes during evolution is an important motor of functional innovation, but the rate at which new genes originate and the likelihood that they persist over longer evolutionary periods are still poorly understood questions. Two important mechanisms by which new genes arise are gene duplication and de novo formation from a previously noncoding sequence. Does the mechanism of formation influence the evolutionary trajectories of the genes? Proteins arisen by gene duplication retain the sequence and structural properties of the parental protein, and thus they may be relatively stable. Instead, de novo originated proteins are often species specific and thought to be more evolutionary labile. Despite these differences, here we show that both types of genes share a number of similarities, including low sequence constraints in their initial evolutionary phases, high turnover rates at the species level, and comparable persistence rates in deeper branchers, in both yeast and flies. In addition, we show that putative de novo proteins have an excess of substitutions between charged amino acids compared with the neutral expectation, which is reflected in the rapid loss of their initial highly basic character. The study supports high evolutionary dynamics of different kinds of new genes at the species level, in sharp contrast with the stability observed at later stages.


Assuntos
Evolução Molecular , Proteínas , Proteínas/genética , Duplicação Gênica , Saccharomyces cerevisiae/genética , Filogenia
2.
Int J Mol Sci ; 24(3)2023 Feb 03.
Artigo em Inglês | MEDLINE | ID: mdl-36769294

RESUMO

A large part of the genome is known to be transcribed as non-coding DNA including some tandem repeats (satellites) such as telomeric/centromeric satellites in different species. However, there has been no detailed study on the eventual transcription of the interspersed satellites found in many species. In the present paper, we studied the transcription of the abundant DNA satellites in the nematode Caenorhabditis elegans using available RNA-Seq results. We found that many of them have been transcribed, but usually in an irregular manner; different regions of a satellite have been transcribed with variable efficiency. Satellites with a similar repeat sequence also have a different transcription pattern depending on their position in the genome. We also describe the peculiar features of satellites associated with Helitron transposons in C. elegans. Our demonstration that some satellite RNAs are transcribed adds a new family of non-coding RNAs, a new element in the world of RNA interference, with new paths for the control of mRNA translation. This is a field that requires further investigation and will provide a deeper understanding of gene expression and control.


Assuntos
Caenorhabditis elegans , DNA Satélite , Animais , DNA Satélite/genética , Caenorhabditis elegans/genética , Sequências Repetitivas de Ácido Nucleico , DNA , Centrômero
3.
Int J Mol Sci ; 22(10)2021 May 20.
Artigo em Inglês | MEDLINE | ID: mdl-34065296

RESUMO

Little is known about DNA tandem repeats across prokaryotes. We have recently described an enigmatic group of tandem repeats in bacterial genomes with a constant repeat size but variable sequence. These findings strongly suggest that tandem repeat size in some bacteria is under strong selective constraints. Here, we extend these studies and describe tandem repeats in a large set of Bacillus. Some species have very few repeats, while other species have a large number. Most tandem repeats have repeats with a constant size (either 52 or 20-21 nt), but a variable sequence. We characterize in detail these intriguing tandem repeats. Individual species have several families of tandem repeats with the same repeat length and different sequence. This result is in strong contrast with eukaryotes, where tandem repeats of many sizes are found in any species. We discuss the possibility that they are transcribed as small RNA molecules. They may also be involved in the stabilization of the nucleoid through interaction with proteins. We also show that the distribution of tandem repeats in different species has a taxonomic significance. The data we present for all tandem repeats and their families in these bacterial species will be useful for further genomic studies.


Assuntos
Bacillus/genética , Sequências de Repetição em Tandem/genética , Bactérias/genética , Eucariotos/genética , Genoma Bacteriano/genética , Genômica/métodos , Células Procarióticas/fisiologia , Especificidade da Espécie
4.
J Bacteriol ; 202(21)2020 10 08.
Artigo em Inglês | MEDLINE | ID: mdl-32839174

RESUMO

DNA tandem repeats, or satellites, are well described in eukaryotic species, but little is known about their prevalence across prokaryotes. Here, we performed the most complete characterization to date of satellites in bacteria. We identified 121,638 satellites from 12,233 fully sequenced and assembled bacterial genomes with a very uneven distribution. We also determined the families of satellites which have a related sequence. There are 85 genomes that are particularly satellite rich and contain several families of satellites of yet unknown function. Interestingly, we only found two main types of noncoding satellites, depending on their repeat sizes, 22/44 or 52 nucleotides (nt). An intriguing feature is the constant size of the repeats in the genomes of different species, whereas their sequences show no conservation. Individual species also have several families of satellites with the same repeat length and different sequences. This result is in marked contrast with previous findings in eukaryotes, where noncoding satellites of many sizes are found in any species investigated. We describe in greater detail these noncoding satellites in the spirochete Leptospira interrogans and in several bacilli. These satellites undoubtedly play a specific role in the species which have acquired them. We discuss the possibility that they represent binding sites for transcription factors not previously described or that they are involved in the stabilization of the nucleoid through interaction with proteins.IMPORTANCE We found an enigmatic group of noncoding satellites in 85 bacterial genomes with a constant repeat size but variable sequence. This pattern of DNA organization is unique and had not been previously described in bacteria. These findings strongly suggest that satellite size in some bacteria is under strong selective constraints and thus that satellites are very likely to play a fundamental role. We also provide a list and properties of all satellites in 12,233 genomes, which may be used for further genomic analysis.


Assuntos
Bactérias/genética , DNA Bacteriano/genética , DNA Satélite/genética , Genoma Bacteriano , Leptospira interrogans/genética , Bases de Dados Genéticas , Genômica
5.
BMC Evol Biol ; 19(1): 181, 2019 09 18.
Artigo em Inglês | MEDLINE | ID: mdl-31533616

RESUMO

BACKGROUND: Satellites or tandem repeats are very abundant in many eukaryotic genomes. Occasionally they have been reported to be present in some prokaryotes, but to our knowledge there is no general comparative study on their occurrence. For this reason we present here an overview of the distribution and properties of satellites in a set of representative species. Our results provide novel insights into the evolutionary relationship between eukaryotes, Archaea and Bacteria. RESULTS: We have searched all possible satellites present in the NCBI reference group of genomes in Archaea (142 species) and in Bacteria (119 species), detecting 2735 satellites in Archaea and 1067 in Bacteria. We have found that the distribution of satellites is very variable in different organisms. The archaeal Methanosarcina class stands out for the large amount of satellites in their genomes. Satellites from a few species have similar characteristics to those in eukaryotes, but most species have very few satellites: only 21 species in Archaea and 18 in Bacteria have more than 4 satellites/Mb. The distribution of satellites in these species is reminiscent of what is found in eukaryotes, but we find two significant differences: most satellites have a short length and many of them correspond to segments of genes coding for amino acid repeats. Transposition of non-coding satellites throughout the genome occurs rarely: only in the bacteria Leptospira interrogans and the archaea Methanocella conradii we have detected satellite families of transposed satellites with long repeats. CONCLUSIONS: Our results demonstrate that the presence of satellites in the genome is not an exclusive feature of eukaryotes. We have described a few prokaryotes which do contain satellites. We present a discussion on their eventual evolutionary significance.


Assuntos
DNA Satélite/genética , Células Procarióticas/metabolismo , Archaea/genética , Bactérias/genética , Repetições de Microssatélites/genética , Filogenia
6.
BMC Evol Biol ; 15: 218, 2015 Oct 05.
Artigo em Inglês | MEDLINE | ID: mdl-26438045

RESUMO

BACKGROUND: The high density of tandem repeat sequences (satellites) in nematode genomes and the availability of genome sequences from several species in the group offer a unique opportunity to better understand the evolutionary dynamics and the functional role of these sequences. We take advantage of the previously developed SATFIND program to study the satellites in four Caenorhabditis species and investigate these questions. METHODS: The identification and comparison of satellites is carried out in three steps. First we find all the satellites present in each species with the SATFIND program. Each satellite is defined by its length, number of repeats, and repeat sequence. Only satellites with at least ten repeats are considered. In the second step we build satellite families with a newly developed alignment program. Satellite families are defined by a consensus sequence and the number of satellites in the family. Finally we compare the consensus sequence of satellite families in different species. RESULTS: We give a catalog of individual satellites in each species. We have also identified satellite families with a related sequence and compare them in different species. We analyze the turnover of satellites: they increased in size through duplications of fragments of 100-300 bases. It appears that in many cases they have undergone an explosive expansion. In C. elegans we have identified a subset of large satellites that have strong affinity for the centromere protein CENP-A. We have also compared our results with those obtained from other species, including one nematode and three mammals. CONCLUSIONS: Most satellite families found in Caenorhabditis are species-specific; in particular those with long repeats. A subset of these satellites may facilitate the formation of kinetochores in mitosis. Other satellite families in C. elegans are either related to Helitron transposons or to meiotic pairing centers.


Assuntos
Caenorhabditis/classificação , Caenorhabditis/genética , DNA de Helmintos/genética , Animais , Autoantígenos/genética , Evolução Biológica , Caenorhabditis elegans/genética , Centrômero , Proteína Centromérica A , Proteínas Cromossômicas não Histona/genética , DNA Satélite/genética , Sequências Repetitivas de Ácido Nucleico , Especificidade da Espécie
7.
Nucleic Acids Res ; 38(4): 1172-81, 2010 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-19966278

RESUMO

The purpose of this work is to determine the most frequent short sequences in non-coding DNA. They may play a role in maintaining the structure and function of eukaryotic chromosomes. We present a simple method for the detection and analysis of such sequences in several genomes, including Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. We also study two chromosomes of man and mouse with a length similar to the whole genomes of the other species. We provide a list of the most common sequences of 9-14 bases in each genome. As expected, they are present in human Alu sequences. Our programs may also give a graph and a list of their position in the genome. Detection of clusters is also possible. In most cases, these sequences contain few alternating regions. Their intrinsic structure and their influence on nucleosome formation are not known. In particular, we have found new features of short sequences in C. elegans, which are distributed in heterogeneous clusters. They appear as punctuation marks in the chromosomes. Such clusters are not found in either A. thaliana or D. melanogaster. We discuss the possibility that they play a role in centromere function and homolog recognition in meiosis.


Assuntos
DNA Intergênico/química , Elementos Alu , Animais , Arabidopsis/genética , Sequência de Bases , Caenorhabditis elegans/genética , Genômica , Humanos , Camundongos , Repetições de Microssatélites , Elementos Nucleotídeos Curtos e Dispersos
8.
J Theor Biol ; 283(1): 28-34, 2011 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-21635904

RESUMO

There are general features of chromosome dynamics, such as homologue recognition in early meiosis, which are expected to involve related sequence motifs in non-coding DNA, with a similar distribution in different species. A search for such motifs is presented here. It has been carried out with the CONREPP programme. It has been found that short alternating AT sequences (10-20 bases) have a similar distribution in most eukaryotic organisms, with some exceptions related to unique meiotic features. All other microsatellite and repeat sequences vary significantly in different organisms. It is concluded that the unique structural features and uniform distribution of alternating AT sequences indicate that they may facilitate homologous chromosome pairing in the early preleptotene stage of meiosis. They may also play a role in the compaction of DNA in mitotic chromosomes.


Assuntos
Eucariotos/genética , Meiose/genética , Repetições de Microssatélites/genética , Animais , Sequência de Bases , Caenorhabditis elegans/genética , Pareamento Cromossômico/genética , Bases de Dados de Ácidos Nucleicos , Genoma , Humanos , Dados de Sequência Molecular , Nucleossomos/genética , Proteínas de Saccharomyces cerevisiae/genética , Especificidade da Espécie
9.
Genes (Basel) ; 12(11)2021 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-34828257

RESUMO

It has been shown in recent years that many repeated sequences in the genome are expressed as RNA transcripts, although the role of such RNAs is poorly understood. Some isolated and tandem repeats (satellites) have been found to be transcribed, such as mammalian Alu sequences and telomeric/centromeric satellites in different species. However, there is no detailed study on the eventual transcription of the interspersed satellites found in many species. Therefore, we decided to study for the first time the transcription of the abundant DNA satellites in the bacterium Bacillus coagulans and in the nematode Caenorhabditis elegans. We have updated the data for C. elegans satellites using the latest version of the genome. We analyzed the transcription of satellites in both species in available RNA-seq results and found that they are widely transcribed. Our demonstration that satellite RNAs are transcribed adds a new family of non-coding RNAs. This is a field that requires further investigation and will provide a deeper understanding of gene expression and control.


Assuntos
Bacillus coagulans/genética , Caenorhabditis elegans/genética , DNA Satélite/genética , Animais , Bactérias/genética , Eucariotos/genética , Regulação da Expressão Gênica , Genoma/genética , RNA não Traduzido/genética , Sequências Repetitivas de Ácido Nucleico/genética , Análise de Sequência de DNA , Análise de Sequência de RNA , Transcrição Gênica
10.
Nat Commun ; 12(1): 604, 2021 01 27.
Artigo em Inglês | MEDLINE | ID: mdl-33504782

RESUMO

De novo gene origination has been recently established as an important mechanism for the formation of new genes. In organisms with a large genome, intergenic and intronic regions provide plenty of raw material for new transcriptional events to occur, but little is know about how de novo transcripts originate in more densely-packed genomes. Here, we identify 213 de novo originated transcripts in Saccharomyces cerevisiae using deep transcriptomics and genomic synteny information from multiple yeast species grown in two different conditions. We find that about half of the de novo transcripts are expressed from regions which already harbor other genes in the opposite orientation; these transcripts show similar expression changes in response to stress as their overlapping counterparts, and some appear to translate small proteins. Thus, a large fraction of de novo genes in yeast are likely to co-evolve with already existing genes.


Assuntos
Genes Fúngicos , Saccharomyces cerevisiae/genética , Transcriptoma/genética , Sequência Conservada/genética , Regulação Fúngica da Expressão Gênica , Redes Reguladoras de Genes , Fases de Leitura Aberta/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
11.
Gene ; 408(1-2): 124-32, 2008 Jan 31.
Artigo em Inglês | MEDLINE | ID: mdl-18022767

RESUMO

We present an analysis of tandem repeats of short sequence motifs (microsatellites) in twelve eukaryotes for which a large part of the genome has been sequenced and assembled. The pattern of motif abundance varies significantly in different species, but it is very similar in different chromosomes of the same species. The most abundant repeats can be classified in two main families. The first family has a rigid conformation, with purines in one strand and pyrimidines in the complementary strand, mainly A(n)/T(n) and (AG)(n)/(CT)(n). The second family has alternating, flexible sequences, such as (AT)(n), (AC)(n) and related sequences. In the pluricellular organisms the relative frequency of both families is rather constant. These observations indicate that microsatellites have structural information and may be involved in the organization of chromatin fibers and in chromosome architecture in general. An additional intriguing finding is the absence of microsatellites with sequences which appear to be forbidden, such as (AATT)(n).


Assuntos
Genoma , Repetições de Microssatélites , Animais , Sequência de Bases , Humanos , Especificidade da Espécie
12.
Nucleic Acids Res ; 34(Database issue): D63-7, 2006 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-16381947

RESUMO

Information about the genomic coordinates and the sequence of experimentally identified transcription factor binding sites is found scattered under a variety of diverse formats. The availability of standard collections of such high-quality data is important to design, evaluate and improve novel computational approaches to identify binding motifs on promoter sequences from related genes. ABS (http://genome.imim.es/datasets/abs2005/index.html) is a public database of known binding sites identified in promoters of orthologous vertebrate genes that have been manually curated from bibliography. We have annotated 650 experimental binding sites from 68 transcription factors and 100 orthologous target genes in human, mouse, rat or chicken genome sequences. Computational predictions and promoter alignment information are also provided for each entry. A simple and easy-to-use web interface facilitates data retrieval allowing different views of the information. In addition, the release 1.0 of ABS includes a customizable generator of artificial datasets based on the known sites contained in the collection and an evaluation tool to aid during the training and the assessment of motif-finding programs.


Assuntos
Bases de Dados de Ácidos Nucleicos , Regiões Promotoras Genéticas , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , Galinhas/genética , Genômica , Humanos , Internet , Camundongos , Ratos , Interface Usuário-Computador
13.
Genes (Basel) ; 9(10)2018 Oct 16.
Artigo em Inglês | MEDLINE | ID: mdl-30332836

RESUMO

Repetitive genome regions have been difficult to sequence, mainly because of the comparatively small size of the fragments used in assembly. Satellites or tandem repeats are very abundant in nematodes and offer an excellent playground to evaluate different assembly methods. Here, we compare the structure of satellites found in three different assemblies of the Caenorhabditis elegans genome: the original sequence obtained by Sanger sequencing, an assembly based on PacBio technology, and an assembly using Nanopore sequencing reads. In general, satellites were found in equivalent genomic regions, but the new long-read methods (PacBio and Nanopore) tended to result in longer assembled satellites. Important differences exist between the assemblies resulting from the two long-read technologies, such as the sizes of long satellites. Our results also suggest that the lengths of some annotated genes with internal repeats which were assembled using Sanger sequencing are likely to be incorrect.

14.
Nat Ecol Evol ; 2(5): 890-896, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29556078

RESUMO

Accumulating evidence indicates that some protein-coding genes have originated de novo from previously non-coding genomic sequences. However, the processes underlying de novo gene birth are still enigmatic. In particular, the appearance of a new functional protein seems highly improbable unless there is already a pool of neutrally evolving peptides that are translated at significant levels and that can at some point acquire new functions. Here, we use deep ribosome-profiling sequencing data, together with proteomics and single nucleotide polymorphism information, to search for these peptides. We find hundreds of open reading frames that are translated and that show no evolutionary conservation or selective constraints. These data suggest that the translation of these neutrally evolving peptides may be facilitated by the chance occurrence of open reading frames with a favourable codon composition. We conclude that the pervasive translation of the transcriptome provides plenty of material for the evolution of new functional proteins.


Assuntos
Evolução Molecular , Peptídeos/química , Polimorfismo de Nucleotídeo Único , Ribossomos/química , Animais , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Camundongos , Camundongos Endogâmicos BALB C , Proteômica
15.
BMC Bioinformatics ; 8: 138, 2007 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-17456238

RESUMO

BACKGROUND: The analysis of the promoter sequence of genes with similar expression patterns is a basic tool to annotate common regulatory elements. Multiple sequence alignments are on the basis of most comparative approaches. The characterization of regulatory regions from co-expressed genes at the sequence level, however, does not yield satisfactory results in many occasions as promoter regions of genes sharing similar expression programs often do not show nucleotide sequence conservation. RESULTS: In a recent approach to circumvent this limitation, we proposed to align the maps of predicted transcription factors (referred as TF-maps) instead of the nucleotide sequence of two related promoters, taking into account the label of the corresponding factor and the position in the primary sequence. We have now extended the basic algorithm to permit multiple promoter comparisons using the progressive alignment paradigm. In addition, non-collinear conservation blocks might now be identified in the resulting alignments. We have optimized the parameters of the algorithm in a small, but well-characterized collection of human-mouse-chicken-zebrafish orthologous gene promoters. CONCLUSION: Results in this dataset indicate that TF-map alignments are able to detect high-level regulatory conservation at the promoter and the 3'UTR gene regions, which cannot be detected by the typical sequence alignments. Three particular examples are introduced here to illustrate the power of the multiple TF-map alignments to characterize conserved regulatory elements in absence of sequence similarity. We consider this kind of approach can be extremely useful in the future to annotate potential transcription factor binding sites on sets of co-regulated genes from high-throughput expression experiments.


Assuntos
Regiões Promotoras Genéticas/genética , Alinhamento de Sequência/métodos , Fatores de Transcrição/genética , Animais , Sequência de Bases/genética , Galinhas , Humanos , Camundongos , Mapeamento de Nucleotídeos/métodos , Mapeamento de Nucleotídeos/estatística & dados numéricos , Alinhamento de Sequência/estatística & dados numéricos , Peixe-Zebra
16.
PLoS Comput Biol ; 2(5): e49, 2006 May.
Artigo em Inglês | MEDLINE | ID: mdl-16733547

RESUMO

We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels--to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human-mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments.


Assuntos
Biologia Computacional/métodos , Regiões Promotoras Genéticas , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Algoritmos , Animais , Mapeamento Cromossômico , Humanos , Camundongos , Modelos Estatísticos , Mapeamento por Restrição/métodos , Software , Especificidade da Espécie
17.
Methods Mol Biol ; 396: 135-52, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-18025691

RESUMO

During the course of evolution, genomes can undergo large-scale mutation events such as rearrangement and lateral transfer. Such mutations can result in significant variations in gene order and gene content among otherwise closely related organisms. The Mauve genome alignment system can successfully identify such rearrangement and lateral transfer events in comparisons of multiple microbial genomes even under high levels of recombination. This chapter outlines the main features of Mauve and provides examples that describe how to use Mauve to conduct a rigorous multiple genome comparison and study evolutionary patterns.


Assuntos
Evolução Biológica , Genoma Bacteriano , Alinhamento de Sequência , Transferência Genética Horizontal , Filogenia , Yersinia pestis/genética
18.
Genes (Basel) ; 8(12)2017 Nov 28.
Artigo em Inglês | MEDLINE | ID: mdl-29182550

RESUMO

The availability of the genome sequence of the unisexual (male-female) Caenorhabditis nigoni offers an opportunity to compare its non-coding features with the related hermaphroditic species Caenorhabditis briggsae; to understand the evolutionary dynamics of their tandem repeat sequences (satellites), as a result of evolution from the unisexual ancestor. We take advantage of the previously developed SATFIND program to build satellite families defined by a consensus sequence. The relative number of satellites (satellites/Mb) in C. nigoni is 24.6% larger than in C. briggsae. Some satellites in C. nigoni have developed from a proto-repeat present in the ancestor species and are conserved as an isolated sequence in C. briggsae. We also identify unique satellites which occur only once and joint satellite families with a related sequence in both species. Some of these families are only found in C. nigoni, which indicates a recent appearance; they contain conserved adjacent 5' and 3' regions, which may favor transposition. Our results show that the number, length and turnover of satellites are restricted in the hermaphrodite C. briggsae when compared with the unisexual C. nigoni. We hypothesize that this results from differences in unequal recombination during meiotic chromosome pairing, which limits satellite turnover in hermaphrodites.

19.
BMC Bioinformatics ; 7: 433, 2006 Oct 05.
Artigo em Inglês | MEDLINE | ID: mdl-17022809

RESUMO

BACKGROUND: Due to recent advances in whole genome shotgun sequencing and assembly technologies, the financial cost of decoding an organism's DNA has been drastically reduced, resulting in a recent explosion of genomic sequencing projects. This increase in related genomic data will allow for in depth studies of evolution in closely related species through multiple whole genome comparisons. RESULTS: To facilitate such comparisons, we present an interactive multiple genome comparison and alignment tool, M-GCAT, that can efficiently construct multiple genome comparison frameworks in closely related species. M-GCAT is able to compare and identify highly conserved regions in up to 20 closely related bacterial species in minutes on a standard computer, and as many as 90 (containing 75 cloned genomes from a set of 15 published enterobacterial genomes) in an hour. M-GCAT also incorporates a novel comparative genomics data visualization interface allowing the user to globally and locally examine and inspect the conserved regions and gene annotations. CONCLUSION: M-GCAT is an interactive comparative genomics tool well suited for quickly generating multiple genome comparisons frameworks and alignments among closely related species. M-GCAT is freely available for download for academic and non-commercial use at: http://alggen.lsi.upc.es/recerca/align/mgcat/intro-mgcat.html.


Assuntos
Algoritmos , Bactérias/genética , Mapeamento Cromossômico/métodos , Genoma Bacteriano/genética , Análise de Sequência de DNA/métodos , Software , Interface Usuário-Computador , Bactérias/classificação , Sequência de Bases , Dados de Sequência Molecular , Alinhamento de Sequência/métodos , Especificidade da Espécie
20.
Nucleic Acids Res ; 31(13): 3651-3, 2003 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-12824386

RESUMO

In this paper we present several web-based tools to identify conserved patterns in sequences. In particular we present details on the functionality of PROMO version 2.0, a program for the prediction of transcription factor binding site in a single sequence or in a group of related sequences and, of MALGEN, a tool to visualize sequence correspondences among long DNA sequences. The web tools and associated documentation can be accessed at http://www.lsi.upc.es/~alggen (RESEARCH link).


Assuntos
Sequências Reguladoras de Ácido Nucleico , Análise de Sequência de DNA/métodos , Software , Animais , Sítios de Ligação , Gráficos por Computador , Sequência Conservada , Genoma , Humanos , Internet , Alinhamento de Sequência , Fatores de Transcrição
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA