Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 75
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
2.
Bioinformatics ; 24(17): 1935-41, 2008 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-18593717

RESUMO

MOTIVATION: Biomedical literature is the principal repository of biomedical knowledge, with PubMed being the most complete database collecting, organizing and analyzing such textual knowledge. There are numerous efforts that attempt to exploit this information by using text mining and machine learning techniques. We developed a novel approach, called PuReD-MCL (Pubmed Related Documents-MCL), which is based on the graph clustering algorithm MCL and relevant resources from PubMed. METHODS: PuReD-MCL avoids using natural language processing (NLP) techniques directly; instead, it takes advantage of existing resources, available from PubMed. PuReD-MCL then clusters documents efficiently using the MCL graph clustering algorithm, which is based on graph flow simulation. This process allows users to analyse the results by highlighting important clues, and finally to visualize the clusters and all relevant information using an interactive graph layout algorithm, for instance BioLayout Express 3D. RESULTS: The methodology was applied to two different datasets, previously used for the validation of the document clustering tool TextQuest. The first dataset involves the organisms Escherichia coli and yeast, whereas the second is related to Drosophila development. PuReD-MCL successfully reproduces the annotated results obtained from TextQuest, while at the same time provides additional insights into the clusters and the corresponding documents. AVAILABILITY: Source code in perl and R are available from http://tartara.csd.auth.gr/~theodos/


Assuntos
Inteligência Artificial , Análise por Conglomerados , Armazenamento e Recuperação da Informação/métodos , Processamento de Linguagem Natural , Reconhecimento Automatizado de Padrão/métodos , PubMed , Software , Algoritmos , Sistemas de Gerenciamento de Base de Dados
3.
J Cell Biol ; 121(4): 847-53, 1993 May.
Artigo em Inglês | MEDLINE | ID: mdl-8491777

RESUMO

The cDNA coding for calf filensin, a membrane-associated protein of the lens fiber cells, has been cloned and sequenced. The predicted 755-amino acid-long open reading frame shows primary and secondary structure similarity to intermediate filament (IF) proteins. Filensin can be divided into an NH2-terminal domain (head) of 38 amino acids, a middle domain (rod) of 279 amino acids, and a COOH-terminal domain (tail) of 438 amino acids. The head domain contains a di-arginine/aromatic amino acid motif which is also found in the head domains of various intermediate filament proteins and includes a potential protein kinase A phosphorylation site. By multiple alignment to all known IF protein sequences, the filensin rod, which is the shortest among IF proteins, can be subdivided into three subdomains (coils 1a, 1b, and 2). A 29 amino acid truncation in the coil 2 region accounts for the smaller size of this domain. The filensin tail contains 6 1/2 tandem repeats which match analogous motifs of mammalian neurofilament M and H proteins. We suggest that filensin is a novel IF protein which does not conform to any of the previously described classes. Purified filensin fails to form regular filaments in vitro (Merdes, A., M. Brunkener, H. Horstmann, and S. D. Georgatos. 1991. J. Cell Biol. 115:397-410), probably due to the missing segment in the coil 2 region. Participation of filensin in a filamentous network in vivo may be facilitated by an assembly partner.


Assuntos
Proteínas do Olho/química , Proteínas de Filamentos Intermediários/química , Estrutura Secundária de Proteína , Sequência de Aminoácidos , Animais , Sequência de Bases , Bovinos , Clonagem Molecular , DNA , Proteínas do Olho/genética , Dados de Sequência Molecular , Especificidade de Órgãos , Testes de Precipitina , RNA Mensageiro/metabolismo , Homologia de Sequência de Aminoácidos
5.
Nucleic Acids Res ; 29(7): 1608-15, 2001 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-11266564

RESUMO

The global amino acid compositions as deduced from the complete genomic sequences of six thermophilic archaea, two thermophilic bacteria, 17 mesophilic bacteria and two eukaryotic species were analysed by hierarchical clustering and principal components analysis. Both methods showed an influence of several factors on amino acid composition. Although GC content has a dominant effect, thermophilic species can be identified by their global amino acid compositions alone. This study presents a careful statistical analysis of factors that affect amino acid composition and also yielded specific features of the average amino acid composition of thermophilic species. Moreover, we introduce the first example of a 'compositional tree' of species that takes into account not only homologous proteins, but also proteins unique to particular species. We expect this simple yet novel approach to be a useful additional tool for the study of phylogeny at the genome level.


Assuntos
Aminoácidos/genética , Genoma , Aminoácidos/química , Animais , Archaea/genética , Bactérias/genética , Caenorhabditis elegans/genética , Bases de Dados Factuais , Genoma Arqueal , Genoma Bacteriano , Genoma Fúngico , Filogenia , Saccharomyces cerevisiae/genética , Especificidade da Espécie , Temperatura
6.
Nucleic Acids Res ; 27(22): 4405-8, 1999 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-10536149

RESUMO

In the process of analysing the four available complete archaeal genomes, we have noted that certain regions characterised as 'non-coding' exhibit significant sequence similarity to other protein sequences from Archaea and other species. Using established technology, we have identified a number of potential protein coding regions in these putative 'non-coding' regions. We have detected 524 such cases, of which 113 regions appear to code for proteins present in archaeal or other species, while the remaining 411 regions are mostly start/stop definition conflicts. Of the 113 protein coding regions, only 21 code for proteins with homologues of known function. The number of novel coding sequences identified herein amounts to 1. 5% of the total genome entries, while the conflicting cases represent an additional 5%. The observed differences between the four complete archaeal genomes seem to reflect disparate approaches to genome annotation. Genome sequence collections should be regularly checked to improve gene prediction by sequence similarity and greater effort is required to make gene definitions consistent across related species.


Assuntos
Proteínas Arqueais/genética , Genoma Arqueal , Fases de Leitura Aberta , Proteínas Arqueais/química , DNA Arqueal/análise , DNA Arqueal/química , Bases de Dados Factuais , Fases de Leitura Aberta/genética , Homologia de Sequência do Ácido Nucleico
7.
Nucleic Acids Res ; 29(21): 4395-404, 2001 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-11691927

RESUMO

Whole-genome clustering of the two available genome sequences of Helicobacter pylori strains 26695 and J99 allows the detection of 110 and 52 strain-specific genes, respectively. This set of strain-specific genes was compared with the sets obtained with other computational approaches of direct genome comparison as well as experimental data from microarray analysis. A considerable number of novel function assignments is possible using database-driven sequence annotation, although the function of the majority of the identified genes remains unknown. Using whole-genome clustering, it is also possible to detect species-specific genes by comparing the two H.pylori strains against the genome sequence of Campylobacter jejuni. It is interesting that the majority of strain-specific genes appear to be species specific. Finally, we introduce a novel approach to gene position analysis by employing measures from directional statistics. We show that although the two strains exhibit differences with respect to strain-specific gene distributions, this is due to the extensive genome rearrangements. If these are taken into account, a common pattern for the genome dynamics of the two Helicobacter strains emerges, suggestive of certain spatial constraints that may act as control mechanisms of gene flux.


Assuntos
Evolução Molecular , Genes Bacterianos/genética , Genoma Bacteriano , Genômica , Helicobacter pylori/classificação , Helicobacter pylori/genética , Sequência de Aminoácidos , Proteínas de Bactérias/química , Proteínas de Bactérias/classificação , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Campylobacter jejuni/genética , Biologia Computacional , Bases de Dados de Proteínas , Ordem dos Genes/genética , Internet , Modelos Genéticos , Dados de Sequência Molecular , Alinhamento de Sequência , Especificidade da Espécie
8.
Nucleic Acids Res ; 30(7): 1575-84, 2002 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-11917018

RESUMO

Detection of protein families in large databases is one of the principal research objectives in structural and functional genomics. Protein family classification can significantly contribute to the delineation of functional diversity of homologous proteins, the prediction of function based on domain architecture or the presence of sequence motifs as well as comparative genomics, providing valuable evolutionary insights. We present a novel approach called TRIBE-MCL for rapid and accurate clustering of protein sequences into families. The method relies on the Markov cluster (MCL) algorithm for the assignment of proteins into families based on precomputed sequence similarity information. This novel approach does not suffer from the problems that normally hinder other protein sequence clustering algorithms, such as the presence of multi-domain proteins, promiscuous domains and fragmented proteins. The method has been rigorously tested and validated on a number of very large databases, including SwissProt, InterPro, SCOP and the draft human genome. Our results indicate that the method is ideally suited to the rapid and accurate detection of protein families on a large scale. The method has been used to detect and categorise protein families within the draft human genome and the resulting families have been used to annotate a large proportion of human proteins.


Assuntos
Algoritmos , Bases de Dados de Proteínas , Proteínas/genética , Sequência de Aminoácidos , Genoma Humano , Humanos , Internet , Dados de Sequência Molecular , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos , Fator de Transcrição TFIIB , Fatores de Transcrição/genética
9.
Nucleic Acids Res ; 28(22): 4573-6, 2000 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-11071948

RESUMO

The proliferation of genome sequence data has led to the development of a number of tools and strategies that facilitate computational analysis. These methods include the identification of motif patterns, membership of the query sequences in family databases, metabolic pathway involvement and gene proximity. We re-examined the completely sequenced genome of Thermotoga maritima by employing the combined use of the above methods. By analyzing all 1877 proteins encoded in this genome, we identified 193 cases of conflicting annotations (10%), of which 164 are new function predictions and 29 are amendments of previously proposed assignments. These results suggest that the combined use of existing computational tools can resolve inconclusive sequence similarities and significantly improve the prediction of protein function from genome sequence.


Assuntos
Genoma Bacteriano , Alinhamento de Sequência/métodos , Thermotoga maritima/genética , Biologia Computacional , Genes Bacterianos/genética , Fases de Leitura Aberta , Análise de Sequência
10.
Biochim Biophys Acta ; 1199(1): 52-8, 1994 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-8280753

RESUMO

Unlike most of the hydrolytic enzymes that participate in endosperm mobilization, beta-glucosidase of barley (Hordeum vulgare) seeds does not increase during germination, even in the presence of exogenously added gibberellic acid. However, the germination process affects the physical properties of beta-glucosidase in terms of charge and apparent molecular weight. Analysis of developing barley grains shows that the enzyme is synthesized two weeks before maturation and is stored in the endosperm of the dry dormant seed. Partial amino acid sequencing of the purified beta-glucosidase demonstrates significant similarity between the barley enzyme and beta-glycosidases that belong to family 1 of glycosyl hydrolases.


Assuntos
Hordeum/enzimologia , Sementes/enzimologia , beta-Glucosidase/genética , Sequência de Aminoácidos , Expressão Gênica , Dados de Sequência Molecular , Alinhamento de Sequência
11.
J Mol Biol ; 232(3): 805-25, 1993 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-8355272

RESUMO

The problem of protein structure prediction is formulated here as that of evaluating how well an amino acid sequence fits a hypothetical structure. The simplest and most complicated approaches, secondary structure prediction and all-atom free energy calculations, can be viewed as sequence-structure fitness problems. Here, an approach of intermediate complexity is described, which involves; (1) description of a protein structure in terms of contact interface vectors, with both intra-protein and protein-solvent contacts counted, (2) derivation of sequence preferences for 2 up to 29 contact interface types, (3) generation of numerous hypothetical model structures by placing the input sequence into a large set of known three-dimensional structures in all possible alignments, (4) evaluation of these models by summing the sequence preferences over all structural positions and (5) choice of predicted three-dimensional structure as that with the best sequence-structure fitness. Evolutionary information is incorporated by using position-dependent core weights derived from multiple sequence alignments. A number of tests of the method are performed: (1) evaluation of cyclic shifts of a sequence in its native structure; (2) alignment of a sequence in its native structure, allowing gaps; (3) alignment search with a sequence or sequence fragment in a database of structures; and (4) alignment search with a structure in a database of sequences. The main results are: (1) a native sequence can very well find its native structure among a large number of alternatives, in correct alignment; (2) substructures, such as (beta alpha)n units, can be detected in spite of very low sequence similarity; (3) remote homologous can be detected, with some dependence on the set of parameters used; (4) contact interface parameters are clearly superior to classical secondary structure parameters; (5) a simple interface description in terms of just two states, protein-protein and protein-water contacts, performs surprisingly well; (6) the use of core weights considerably improves accuracy in detection of remote homologues; (7) based on a sequence database search with a myoglobin contact profile, the C-terminal domain of a viral origin of replication binding protein is predicted to have an all-helical fold. The sequence-structure fitness concept is sufficiently general to accommodate a large variety of protein structure prediction methods, including new models of intermediate complexity currently being developed.


Assuntos
Conformação Proteica , Proteínas/química , Análise de Sequência/métodos , Sequência de Aminoácidos , Bases de Dados Factuais , Modelos Químicos , Dados de Sequência Molecular , Dobramento de Proteína , Alinhamento de Sequência , Relação Estrutura-Atividade
12.
Trends Biotechnol ; 14(8): 280-5, 1996 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-8987458

RESUMO

Complete genomes from model organisms provide new challenges for computational molecular biology. Novel questions emerge from the genome data obtained from the functional prediction of thousands of gene products. In this review, we present some approaches to the computational comparison of genomes, based on sequence and text analysis, and comparisons of genome composition and gene order.


Assuntos
Biologia Computacional/tendências , Genoma , Modelos Genéticos , Biotecnologia/tendências , Escherichia coli/genética , Genoma Bacteriano , Haemophilus influenzae/genética , Família Multigênica , Mycoplasma/genética , Especificidade da Espécie
13.
Protein Sci ; 4(11): 2424-8, 1995 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-8563640

RESUMO

The analysis of the 269 open reading frames of yeast chromosome VIII by computational methods has yielded 24 new significant sequence similarities to proteins of known function. The resulting predicted functions include three particularly interesting cases of translation-associated proteins: peptidyl-tRNA hydrolase, a ribosome recycling factor homologue, and a protein similar to cytochrome b translational activator CBS2. The methodological limits of the meaningful transfer of functional information between distant homologues are discussed.


Assuntos
Cromossomos Fúngicos , Proteínas Fúngicas/genética , Proteínas Fúngicas/fisiologia , Proteínas Mitocondriais , Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae/química , Oxirredutases do Álcool , Sequência de Aminoácidos , Hidrolases de Éster Carboxílico/química , Hidrolases de Éster Carboxílico/genética , Hidrolases de Éster Carboxílico/fisiologia , Proteínas Fúngicas/química , Dados de Sequência Molecular , Fases de Leitura Aberta , Saccharomyces cerevisiae/genética , Homologia de Sequência
14.
Protein Sci ; 1(12): 1691-8, 1992 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-1304898

RESUMO

The availability of fast and robust algorithms for protein structure comparison provides an opportunity to produce a database of three-dimensional comparisons, called families of structurally similar proteins (FSSP). The database currently contains an extended structural family for each of 154 representative (below 30% sequence identity) protein chains. Each data set contains: the search structure; all its relatives with 70-30% sequence identity, aligned structurally; and all other proteins from the representative set that contain substructures significantly similar to the search structure. Very close relatives (above 70% sequence identity) rarely have significant structural differences and are excluded. The alignments of remote relatives are the result of pairwise all-against-all structural comparisons in the set of 154 representative protein chains. The comparisons were carried out with each of three novel automatic algorithms that cover different aspects of protein structure similarity. The user of the database has the choice between strict rigid-body comparisons and comparisons that take into account interdomain motion or geometrical distortions; and, between comparisons that require strictly sequential ordering of segments and comparisons, which allow altered topology of loop connections or chain reversals. The data sets report the structurally equivalent residues in the form of a multiple alignment and as a list of matching fragments to facilitate inspection by three-dimensional graphics. If substructures are ignored, the result is a database of structure alignments of full-length proteins, including those in the twilight zone of sequence similarity.(ABSTRACT TRUNCATED AT 250 WORDS)


Assuntos
Sequência de Aminoácidos , Bases de Dados Factuais , Conformação Proteica , Dobramento de Proteína , Estrutura Secundária de Proteína , Proteínas/química , Algoritmos , Enzimas/química , Dados de Sequência Molecular , Homologia de Sequência de Aminoácidos , Difração de Raios X
15.
Protein Sci ; 1(12): 1677-90, 1992 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-1304897

RESUMO

With the completion of the first phase of the European yeast genome sequencing project, the complete DNA sequence of chromosome III of Saccharomyces cerevisiae has become available (Oliver, S. G., et al., 1992, Nature 357, 38-46). We have tested the predictive power of computer sequence analysis of the 176 probable protein products of this chromosome, after exclusion of six problem cases. When the results of database similarity searches are pooled with prior knowledge, a likely function can be assigned to 42% of the proteins, and a predicted three-dimensional structure to a third of these (14% of the total). The function of the remaining 58% remains to be determined. Of these, about one-third have one or more probable transmembrane segments. Among the most interesting proteins with predicted functions are a new member of the type X polymerase family, a transcription factor with an N-terminal DNA-binding domain related to GAL4, a "fork head" DNA-binding domain previously known only in Drosophila and in mammals, and a putative methyltransferase. Our analysis increased the number of known significant sequence similarities on chromosome III by 13, to now 67. Although the near 40% success rate of identifying unknown protein function by sequence analysis is surprisingly high, the information gap between known protein sequences and unknown function is expected to widen and become a major bottleneck of genome projects in the near future. Based on the experience gained in this test study, we suggest that the development of an automated computer workbench for protein sequence analysis must be an important item in genome projects.


Assuntos
Cromossomos Fúngicos , DNA Fúngico/genética , Enzimas/genética , Proteínas Fúngicas/genética , Fases de Leitura Aberta , Saccharomyces cerevisiae/genética , Acetolactato Sintase/genética , Sequência de Aminoácidos , Animais , Bovinos , Mapeamento Cromossômico , DNA Polimerase Dirigida por DNA/genética , Humanos , Dados de Sequência Molecular , Ratos , Homologia de Sequência de Aminoácidos
16.
FEBS Lett ; 279(1): 73-8, 1991 Feb 11.
Artigo em Inglês | MEDLINE | ID: mdl-1995346

RESUMO

A structure-based approach to the definition of sequence patterns characteristic of protein domains is presented by example. The approach requires a multiple sequence alignment of a family (or set of related families) as well as at least one three-dimensional structure. The pattern derived does not merely summarize the information in the known sequences but attempts to generalize the pattern specifications based on structural insight. In this example, the pattern-driven database search identified correctly most of the known type I copper-binding domains and detected the presence of a homologous domain in a previously unknown case (CopA protein). The significance of these results is discussed.


Assuntos
Cobre/metabolismo , Sequência de Aminoácidos , Ascorbato Oxidase/química , Ascorbato Oxidase/genética , Azurina/análogos & derivados , Azurina/química , Azurina/genética , Sítios de Ligação , Dados de Sequência Molecular , Plastocianina/química , Plastocianina/genética , Conformação Proteica , Pseudomonas/metabolismo , Alinhamento de Sequência , Software
17.
FEBS Lett ; 390(2): 119-23, 1996 Jul 22.
Artigo em Inglês | MEDLINE | ID: mdl-8706840

RESUMO

The phylogenetic distribution of divergently related protein families into the three domains of life (archaea, bacteria and eukaryotes) can signify the presence or absence of entire cellular processes in these domains and their ancestors. We can thus study the emergence of the major transitions during cellular evolution, and resolve some of the controversies surrounding the evolutionary status of archaea and the origins of the eukaryotic cell. In view of the ongoing projects that sequence the complete genomes of several Archaea, this work forms a testable prediction when the genome sequences become available. Using the presence of the protein families as taxonomic traits, and linking them to biochemical pathways, we are able to reason about the presence of the corresponding cellular processes in the last universal ancestor of contemporary cells. The analysis shows that metabolism was already a complex network of reactions which included amino acid, nucleotide, fatty acid, sugar and coenzyme metabolism. In addition, genetic processes such as translation are conserved and close to the original form. However, other processes such as DNA replication and repair or transcription are exceptional and seem to be associated with the structural changes that drove eukaryotes and bacteria away from their common ancestor. There are two major hypotheses in the present work: first, that archaea are probably closer to the last universal ancestor than any other extant life form, and second, that the major cellular processes were in place before the major splitting. The last universal ancestor had metabolism and translation very similar to the contemporary ones, while having an operonic genome organization and archaean-like transcription. Evidently, all cells today contain remnants of the primordial genome of the last universal ancestor.


Assuntos
Evolução Molecular , Modelos Genéticos , Archaea/genética , Archaea/metabolismo , Bactérias/genética , Bactérias/metabolismo , Células Eucarióticas , Mosaicismo , Filogenia , Proteínas/genética , Transcrição Gênica
18.
FEBS Lett ; 322(2): 159-64, 1993 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-8482384

RESUMO

Iterative profile sequence analysis reveals a remote homology of peroxisomal serine-pyruvate aminotransferases from mammals to the small subunit of soluble hydrogenases from cyanobacteria, an isopenicillin N epimerase, the NifS gene products from bacteria and yeast, and the phosphoserine aminotransferase family. All members of this new class whose function is known are pyridoxal phosphate-dependent enzymes, yet they have distinct catalytic activities. Upon alignment, a lysine around position 200 remains invariant and is predicted to be the pyridoxal phosphate-binding residue. Based on the detected homology, it is predicted that NifS has also a pyridoxal phosphate-dependent serine (or related) aminotransferase function associated with nitrogen economy and/or protection during nitrogen fixation.


Assuntos
Proteínas de Bactérias/química , Fosfato de Piridoxal/fisiologia , Transaminases/classificação , Sequência de Aminoácidos , Animais , Bases de Dados Factuais , Humanos , Dados de Sequência Molecular , Fixação de Nitrogênio/genética , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos , Transaminases/química
19.
FEBS Lett ; 480(1): 42-8, 2000 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-10967327

RESUMO

Computational genomics is a subfield of computational biology that deals with the analysis of entire genome sequences. Transcending the boundaries of classical sequence analysis, computational genomics exploits the inherent properties of entire genomes by modelling them as systems. We review recent developments in the field, discuss in some detail a number of novel approaches that take into account the genomic context and argue that progress will be made by novel knowledge representation and simulation technologies.


Assuntos
Biologia Computacional/métodos , Biologia Computacional/tendências , Genes , Genoma , Animais , Simulação por Computador , Bases de Dados como Assunto , Genes/genética , Genes/fisiologia , Humanos , Família Multigênica/genética , Proteínas Recombinantes de Fusão/genética , Alinhamento de Sequência
20.
FEBS Lett ; 389(1): 96-101, 1996 Jun 24.
Artigo em Inglês | MEDLINE | ID: mdl-8682215

RESUMO

The functional composition of organisms can be analysed for the first time with the appearance of complete or sizeable parts of various genomes. We have reduced the problem of protein function classification to a simple scheme with three classes of protein function: energy-, information- and communication-associated proteins. Finer classification schemes can be easily mapped to the above three classes. To deal with the vast amount of information, a system for automatic function classification using database annotations has been developed. The system is able to classify correctly about 80% of the query sequences with annotations. Using this system, we can analyse samples from the genomes of the most represented species in sequence databases and compare their genomic composition. The similarities and differences for different taxonomic groups are strikingly intuitive. Viruses have the highest proportion of proteins involved in the control and expression of genetic information. Bacteria have the highest proportion of their genes dedicated to the production of proteins associated with small molecule transformations and transport. Animals have a very large proportion of proteins associated with intra- and intercellular communication and other regulatory processes. In general, the proportion of communication-related proteins increases during evolution, indicating trends that led to the emergence of the eukaryotic cell and later the transition from unicellular to multicellular organisms.


Assuntos
Genoma , Proteínas/fisiologia , Animais , Evolução Molecular , Genes de Plantas , Genoma Bacteriano , Genoma Viral , Humanos , Proteínas/classificação , Proteínas/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA