RESUMO
Sperm storage organs are common and broadly distributed among animal taxa. However, little is known about how these organs function at the molecular level. Additionally, there is a paucity of knowledge about the evolution of genes expressed in these organs. This investigation is an evolutionary expressed sequence tag (EST) study of genes expressed in the seminal receptacle, one of the sperm storage organs in Drosophila. The incidence of positive selection is higher for the seminal receptacle genes than Drosophila reproductive genes as a whole, but lower than genes associated with the spermatheca, a second type of Drosophila sperm storage organ. By identifying overrepresented classes of proteins and classes for which sperm storage function is suggested by the nature of the proteins, candidate genes were discovered. These candidates belong to protein classes such as muscle contraction, odorant binding and odorant receptor, protease inhibitor and immunity.
Assuntos
Drosophila melanogaster/genética , Etiquetas de Sequências Expressas , Genes de Insetos/genética , Genitália Feminina/metabolismo , Animais , Sequência de Bases , DNA Complementar/genética , Drosophila melanogaster/metabolismo , Evolução Molecular , Feminino , Funções Verossimilhança , Modelos Genéticos , Dados de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNARESUMO
We compared the rates of synonymous (silent) substitution among various genes in a number of species of Drosophila. First, we found that even for a particular gene, the rate of synonymous substitution varied considerably with Drosophila lineages. Second, we showed a large variation in synonymous substitution rates among nuclear genes in Drosophila. These rates of synonymous substitution were correlated negatively with C content and positively with A content at the third codon positions. Nucleotide sequences were also compared between pseudogenes and their functional homologs. The C content of the pseudogenes was lower than that of the functional genes and the A content of the former was higher than that of the latter. Because the synonymous substitution for functional genes and the nucleotide substitution for pseudogenes are exempted from any selective constraint at the protein level, these observations could be explained by a biased pattern of mutation in the Drosophila nuclear genome. Such a bias in the mutation pattern may affect the molecular clock (local clock) of each nuclear gene of each species. Finally, we obtained the average rates of synonymous substitution for three gene groups in Drosophila; 11.0 x 10(-9), 17.5 x 10(-9) and 27.1 x 10(-9)/site/year.
Assuntos
Composição de Bases/genética , Drosophila/genética , Genes , Mutação/genética , Animais , Evolução Biológica , Variação Genética/genética , Nucleotídeos , Especificidade da EspécieRESUMO
The nuclear genes of Drosophila evolve at various rates. This variation seems to correlate with codon-usage bias. In order to elucidate the determining factors of the various evolutionary rates and codon-usage bias in the Drosophila nuclear genome, we compared patterns of codon-usage bias with base compositions of exons and introns. Our results clearly show the existence of selective constraints at the translational level for synonymous (silent) sites and, on the other hand, the neutrality or near neutrality of long stretches of nucleotide sequence within noncoding regions. These features were found for comparisons among nuclear genes in a particular species (Drosophila melanogaster, Drosophila pseudoobscura and Drosophila virilis) as well as in a particular gene (alcohol dehydrogenase) among different species in the genus Drosophila. The patterns of evolution of synonymous sites in Drosophila are more similar to those in the prokaryotes than they are to those in mammals. If a difference in the level of expression of each gene is a main reason for the difference in the degree of selective constraint, the evolution of synonymous sites of Drosophila genes would be sensitive to the level of expression among genes and would change as the level of expression becomes altered in different species. Our analysis verifies these predictions and also identifies additional selective constraints at the translational level in Drosophila.
Assuntos
Composição de Bases , Evolução Biológica , Códon , Drosophila/genética , Animais , Núcleo Celular , Biblioteca Genômica , ÍntronsRESUMO
The patterns of nonrandom usage of synonymous codons (codon bias) in enteric bacteria were analyzed. Poisson random field (PRF) theory was used to derive the expected distribution of frequencies of nucleotides differing from the ancestral state at aligned sites in a set of DNA sequences. This distribution was applied to synonymous nucleotide polymorphisms and amino acid polymorphisms in the gnd and putP genes of Escherichia coli. For the gnd gene, the average intensity of selection against disfavored synonymous codons was estimated as approximately 7.3 x 10(-9); this value is significantly smaller than the estimated selection intensity against selectively disfavored amino acids in observed polymorphisms (2.0 x 10(-8)), but it is approximately of the same order of magnitude. The selection coefficients for optimal synonymous codons estimated from PRF theory were consistent with independent estimates based on codon usage for threonine and glycine. Across 118 genes in E. coli and Salmonella typhimurium, the distribution of estimated selection coefficients, expressed as multiples of the effective population size, has a mean and standard deviation of 0.5 +/- 0.4. No significant differences were found in the degree of codon bias between conserved positions and replacement positions, suggesting that translational misincorporation is not an important selective constraint among synonymous polymorphic codons in enteric bacteria. However, across the first 100 codons of the genes, conserved amino acids with identical codons have significantly greater codon bias than that of either synonymous or nonidentical codons, suggesting that there are unique selective constraints, perhaps including mRNA secondary structures, in this part of the coding region.
Assuntos
Códon/genética , Modelos Genéticos , Seleção Genética , DNA Bacteriano/genética , Escherichia coli/genética , Genes Bacterianos , Distribuição de Poisson , Polimorfismo Genético , Salmonella typhimurium/genéticaRESUMO
We designed a universal primer (UNIPOL) for DNA amplification of AIDS-related viruses. The phylogenetic tree constructed from the presumed sequences amplified with UNIPOL was representative of the tree calculated from whole pol gene sequences so far reported. UNIPOL was able to amplify the sequences of all four major groups of primate lentiviruses and also that of a distinct virus from a Ghanaian patient with an AIDS-related complex, designated GH-2. This strain scarcely hybridizes with known HIV/simian immunodeficiency virus (SIV) DNA probes. Sequence analysis of the only amplified fragment revealed rapidly that GH-2 was quite similar to the recently reported HIV-2ALT(D205) and that these two viruses form a new subgroup distint from known HIV-2 and SIVmac/SIVsm in the large HIV-2 group. This system will be useful for further phylogenetic study of various primate lentiviruses.
Assuntos
Síndrome da Imunodeficiência Adquirida/microbiologia , HIV-1/classificação , HIV-2/classificação , Lentivirus/classificação , Filogenia , Sequência de Bases , Southern Blotting , Genes pol , HIV-1/genética , HIV-2/genética , Humanos , Lentivirus/genética , Dados de Sequência Molecular , Reação em Cadeia da Polimerase , Homologia de Sequência do Ácido Nucleico , Moldes GenéticosRESUMO
Human immunodeficiency virus (HIV) exhibits immunological hypervariability, which has been an obstacle to successful production of effective anti-HIV vaccines. In this study, we estimated patterns of nucleotide and amino acid substitutions in the env gene of HIVs, with the aim of finding characteristics of the mechanism which generates the immunological diversity of the env protein of HIVs. We found that nucleotide changes between A and G are predominant compared to those between other nucleotides. Since this feature is consistent with the pattern of nucleotide substitutions of other retroviral genes but is quite different from those of most eukaryotic genes, a high rate of nucleotide substitution between A and G appears to be specific for retroviruses including HIVs. We discuss the biological relationship between this biased substitution and the mechanism generating hypervariability of epitopes on the env protein of HIVs.
Assuntos
Variação Antigênica , Antígenos HIV/imunologia , HIV/imunologia , Nucleotídeos/imunologia , Aminoácidos/imunologia , HIV/genética , Humanos , MutaçãoRESUMO
The phylogenetic relationships among 18 species of Triatominae were inferred based on mitochondrial DNA (mtDNA) sequences. The species of Triatoma included 11 belonging to the infestans complex [T. infestans (Klug), T. guasayana Wygodzinsky & Abalos, T. sordida (Stål), T. platensis Neiva, T. brasiliensis Neiva, T. rubrovaria (Blanchard), T. vitticeps (Stål), T. delpontei Romaña & Abalos, T. maculata (Erichson), T. patagonica Del Ponte, and T. matogrossensis Leite & Barbosa] and four others of the same genus but of different complexes [T. circummaculata (Stål), T. protracta (Uhler), T. dimidiata (Latreille), and T. mazzottii Usinger]. As possible outgroups we used Mepraia spinolai Mazza, Panstrongylus megistus (Burmeister), and Rhodnius prolixus Stål. We analyzed mtDNA fragments of the 12S and 16S ribosomal RNA genes from each of the 18 species, as well as of the cytochrome oxidase I (COI) gene from nine. The 12S, 16S, and COI gene sequences were analyzed individually and combined. All of the phylogenetic analyses unambiguously supported two clusters: one including T. infestans, T. platensis, and T. delpontei, and the other T. sordida and T. mutagrossensis. Inclusion of T. circummaculata into the infestans complex was confirmed, although this is in disagreement with the morphological classification. On the other hand, our analyses showed that T. dimidiata is closely related to a phylosoma complex species, T. mazzottii. This is consistent with the tentative classification previously made based on morphological characters. The issue of the monophyly of the genus Triatoma remains unresolved.
Assuntos
DNA Mitocondrial/análise , Triatoma/genética , Animais , Composição de Bases , Sequência de Bases , Genes de Insetos , Variação Genética , Dados de Sequência Molecular , Filogenia , RNA Ribossômico/análise , RNA Ribossômico 16S/análise , Análise de Sequência de DNA , Triatoma/classificação , Triatominae/classificação , Triatominae/genéticaRESUMO
Cyclophilins, which possess peptidyl-prolyl isomerase activity, are cellular targets of immunosuppressant drugs and involved in a wide variety of functions. While the Arabidopsis thaliana genome contains the largest number of cyclophilins, the number of plant cyclophilins available in databases is small compared to that of other organisms. It implies that many cyclophilins are yet to be identified in plants. In order to identify cyclophilin candidates from available plant sequence data, we examined alignment-free methods based on Partial Least Squares (PLS). PLS classifier performed better than profile hidden Markov models and PSI-BLAST in identifying cyclophilins from the Arabidopsis and rice genomes.
Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Ciclofilinas/classificação , Ciclofilinas/genética , Genoma de Planta , Oryza/genética , Proteínas de Arabidopsis/classificação , Cadeias de MarkovRESUMO
We have summarized and analyzed all available nuclear DNA sequence polymorphism studies for three species of Drosophila, D. melanogaster (24 loci), D. simulans (12 loci), and D. pseudoobscura (5 loci). Our major findings are: (1) The average nucleotide heterozygosity ranges from about 0.4% to 2% depending upon species and function of the region, i.e., coding or noncoding. (2) Compared to D. simulans and D. pseudoobscura (which are about equally variable), D. melanogaster displays a low degree of DNA polymorphism. (3) Noncoding introns and 3' and 5' flanking DNA shows less polymorphism than silent sites within coding DNA. (4) X-linked genes are less variable than autosomal genes. (5) Transition (Ts) and transversion (Tv) polymorphisms are about equally frequent in non-coding DNA and at fourfold degenerate sites in coding DNA while Ts polymorphisms outnumber Tv polymorphisms by about 2:1 in total coding DNA. The increased Ts polymorphism in coding regions is likely due to the structure of the genetic code: silent changes are more often Ts's than are replacement substitutions. (6) The proportion of replacement polymorphisms is significantly higher in D. melanogaster than in D. simulans. (7) The level of variation in coding DNA and the adjacent noncoding DNA is significantly correlated indicating regional effects, most notably recombination. (8) Surprisingly, the level of polymorphism at silent coding sites in D. melanogaster is positively correlated with degree of codon usage bias. (9) Three proposed tests of the neutral theory of DNA polymorphisms have been performed on the data: Tajima's test, the HKA test, and the McDonald-Kreitman test. About half of the loci fail to conform to the expectations of neutral theory by one of the tests. We conclude that many variables are affecting levels of DNA polymorphism in Drosophila, from properties of nucleotides to population history and, perhaps, mating structure. No simple, all encompassing explanation satisfactorily accounts for the data.
Assuntos
Drosophila/genética , Variação Genética , Animais , DNA/genética , Modelos Teóricos , Dados de Sequência Molecular , Polimorfismo Genético , Análise de SequênciaRESUMO
We first review what is known about patterns of codon usage bias in Drosophila and make the following points: (i) Drosophila genes are as biased or more biased than those in microorganisms. (ii) The level of bias of genes and even the particular pattern of codon bias can remain phylogenetically invariant for very long periods of evolution. (iii) However, some genes, even very tightly linked genes, can change very greatly in codon bias across species. (iv) Generally G and especially C are favored at synonymous sites in biased genes. (v) With the exception of aspartic acid, all amino acids contribute significantly and about equally to the codon usage bias of a gene. (vi) While most individual amino acids that can use G or C at synonymous sites display a preference for C, there are exceptions: valine and leucine, which prefer G. (vii) Finally, smaller genes tend to be more biased than longer genes. We then examine possible causes of these patterns and discount mutation bias on three bases: there is little evidence of regional mutation bias in Drosophila, mutation bias is likely toward A+T (the opposite of codon usage bias), and not all amino acids display the preference for the same nucleotide in the wobble position. Two lines of evidence support a selection hypothesis based on tRNA pools: highly biased genes tend to be highly and/or rapidly expressed, and the preferred codons in highly biased genes optimally bind the most abundant isoaccepting tRNAs. Finally, we examine the effect of bias on DNA evolution and confirm that genes with high codon usage bias have lower rates of synonymous substitution between species than do genes with low codon usage bias. Surprisingly, we find that genes with higher codon usage bias display higher levels of intraspecific synonymous polymorphism. This may be due to opposing effects of recombination.
Assuntos
Códon , Drosophila/genética , Evolução Molecular , Animais , Recombinação GenéticaRESUMO
Synonymous substitution rates in mitochondrial and nuclear genes of Drosophila were compared. To make accurate comparisons, we considered the following: (1) relative synonymous rates, which do not require divergence time estimates, should be used; (2) methods estimating divergence should take into account base composition; (3) only very closely related species should be used to avoid effects of saturation; (4) the heterogeneity of rates should be examined. We modified the methods estimating synonymous substitution numbers to account for base composition bias. By using these methods, we found that mitochondrial genes have 1.7-3.4 times higher synonymous substitution rates than the fastest nuclear genes or 4.5-9.0 times higher rates than the average nuclear genes. The average rate of synonymous transversions was 2.7 (estimated from the melanogaster species subgroup) or 2.9 (estimated from the obscura group) times higher in mitochondrial genes than in nuclear genes. Synonymous transversions in mitochondrial genes occurred at an approximately equivalent rate to those in the fastest nuclear genes. This last result is not consistent with the hypothesis that the difference in turnover rates between mitochondrial and nuclear genomes is the major factor determining higher synonymous substitution rates in mtDNA. We conclude that the difference in synonymous substitution rates is due to a combination of two factors: a higher transitional mutation rate in mtDNA and constraints on nuclear genes due to selection for codon usage.
Assuntos
Drosophila/genética , Genes de Insetos , Animais , Núcleo Celular/genética , DNA Mitocondrial/genética , Drosophila melanogaster/genética , Evolução Molecular , Modelos Genéticos , Mutação , Fatores de TempoRESUMO
One of the pupal cuticle protein (PCP) genes has been found within an intron of a Drosophila housekeeping gene (the Gart locus) that encodes three enzymes involved in the purine pathway. This intronic gene has been described as a gene within a gene, and the gene is now called a "nested" gene. Because the intronic PCP gene has sequence similarity with the larval cuticle protein (LCP) gene, it may have been derived from one of the LCP genes or their ancestral gene. We have studied possible phylogenetic relationships among these five genes by comparing nucleotide sequences of four LCP genes with that of the PCP gene. The results obtained suggest that the PCP gene may have originated from an ancestral gene before duplication of the LCP genes occurred. Using the number of synonymous (silent) substitutions, we then estimated the divergence time between the PCP gene and the LCP genes to be about 70 million years (Myr). The divergence time estimated is much larger than that for the sibling species of D. melanogaster (about 2.5 Myr), indicating that the "nested" gene structure can be seen not only in Drosophila melanogaster, but also in other distantly related Drosophila species.
Assuntos
Drosophila melanogaster/genética , Genes , Proteínas de Insetos , Íntrons , Filogenia , Sequência de Aminoácidos , Animais , Sequência de Bases , Drosophila/genética , Dados de Sequência Molecular , Proteínas/genética , Homologia de Sequência do Ácido NucleicoRESUMO
The relationship between gene length and synonymous codon usage bias was investigated in Drosophila melanogaster, Escherichia coli and Saccharomyces cerevisiae. Simulation studies indicate that the correlations observed in the three organisms are unlikely to be due to sampling errors or any potential bias in the methods used to measure codon usage bias. The correlation was significantly positive in E.coli genes, whereas negative correlations were obtained for D. melanogaster and S.cerevisiae genes. When only ribosomal protein genes were used, whose expression levels are assumed to be similar, E.coli and S.cerevisiae showed significantly positive correlations. For the two eukaryotes, the distribution of effective number of codons was different in short genes (300-500 bp) compared with longer genes; this was not observed in E.coli. Both positive and negative correlations can be explained by translational selection. Energetically costly longer genes have higher codon usage bias to maximize translational efficiency. Selection may also be acting to reduce the size of highly expressed proteins, and the effect is particularly pronounced in eukaryotes. The different relationships between codon usage bias and gene length observed in prokaryotes and eukaryotes may be the consequence of these different types of selection.
Assuntos
Códon , Drosophila melanogaster/genética , Escherichia coli/genética , Saccharomyces cerevisiae/genética , AnimaisRESUMO
Codon usage bias of 1,117 Drosophila melanogaster genes, as well as fewer D. pseudoobscura and D. virilis genes, was examined from the perspective of relative abundance of isoaccepting tRNAs and their changes during development. We found that each amino acid contributes about equally and highly significantly to overall codon usage bias, with the exception of Asp which had very low contribution to overall bias. Asp was also the only amino acid that did not show a clear preference for one of its synonymous codons. Synonymous codon usage in Drosophila was consistent with "optimal" codons deduced from the isoaccepting tRNA availability. Interestingly, amino acids whose major isoaccepting tRNAs change during development did not show as strong bias as those with developmentally unchanged tRNA pools. Asp is the only amino acid for which the major isoaccepting tRNAs change between larval and adult stages. We conclude that synonymous codon usage in Drosophila is well explained by tRNA availability and is probably influenced by developmental changes in relative abundance.
Assuntos
Códon , Drosophila/genética , RNA de Transferência/genética , Aminoácidos/genética , Animais , Evolução Biológica , Regulação da Expressão Gênica no Desenvolvimento , Genes de Insetos , Modelos GenéticosRESUMO
Evolution of viral genes is characterized by enormously high speed compared with that of nuclear genes of eukaryotic organisms. In this paper, the evolutionary rates and patterns of base substitutions are examined for retroviral oncogenes, human immunodeficiency viruses (HIV), hepatitis B viruses (HBV), and influenza A viruses. Our results show that the evolutionary process of these viral genes can readily be explained by the neutral theory of molecular evolution. In particular, the neutral theory is supported by our observation that synonymous substitutions always much predominate over nonsynonymous substitutions, even though the substitution rate varies considerably among the viruses. Furthermore, the exact correspondence between the high rates of evolutionary base substitutions and the high rates of production of mutants in RNA viruses fits very nicely to the prediction of the theory. The linear relationship between substitution numbers and time was examined to evaluate the clock-like property of viral evolution. The clock appears to be quite accurate in the influenza A viruses in man.
Assuntos
Evolução Biológica , Genes Virais , Modelos Genéticos , Vírus/genética , DNA Viral/genética , HIV/genética , Vírus da Hepatite B/genética , Vírus da Hepatite B/isolamento & purificação , Humanos , Filogenia , Retroviridae/genéticaRESUMO
Drosophila melanogaster belongs to a closely related group of eight species collectively known as the melanogaster subgroup; all are native to sub-Saharan Africa and islands off the east coast of Africa. The phylogenetic relationships of most species in this subgroup have been well documented; however, the three most closely related species, D. simulans, D. sechellia, and D. mauritiana, have remained problematic from a phylogenetic standpoint as no data set has unambiguously resolved them. We present new DNA sequence data on the nullo and Serendipity-alpha genes and combine them with all available nuclear DNA sequence data; the total data encompass 12 genes and the ITS of rDNA. A methodological problem arose because nine of the genes had information on intraspecific polymorphisms in at least one species. We explored the effect of inclusion/exclusion of polymorphic sites and found that it had very little effect on phylogenetic inferences, due largely to the fact that 82% of polymorphisms are autapomorphies (unique to one species). We have also reanalyzed our previous DNA-DNA hybridization data with a bootstrap procedure. The combined sequence data set and the DNA-DNA hybridization data strongly support the sister status of the two island species, D. sechellia and D. mauritiana. This at least partially resolves what had been a paradox of parallel evolution in these two species.
Assuntos
Drosophila melanogaster/genética , Filogenia , Polimorfismo Genético , Animais , DNA/química , Drosophila/classificação , Drosophila/genética , Drosophila melanogaster/classificação , Genes de Insetos , Hibridização In Situ/métodos , Modelos Biológicos , Dados de Sequência Molecular , Proteínas Nucleares/genética , RNA RibossômicoRESUMO
Horizontal transmission has been well documented as a major mechanism for the dissemination of mariner-like elements (MLEs) among species. Less well understood are mechanisms that limit vertical transmission of MLEs resulting in the "spotty" or discontinuous distribution observed in closely related species. In this article we present evidence that the genome of the common ancestor of the melanogaster species subgroup of Drosophila contained an MLE related to the mellifera (honey bee) subfamily. Horizontal transmission, approximately 3-10 MYA, is strongly suggested by the observation that the sequence of the MLE in Drosophila erecta is 97% identical in nucleotide sequence with that of an MLE in the cat flea, Ctenocephalides felis. The D. erecta MLE has a spotty distribution among species in the melanogaster subgroup. The element has a high copy number in D. erecta and D. orena, a moderate copy number in D. teissieri and D. yakuba, and was apparently lost ("stochastic loss") in the lineage leading to D. melanogaster, D. simulans, D. mauritiana, and D. sechellia. In D. erecta, most copies are concentrated in the heterochromatin. Two copies from D. erecta, denoted De12 and De19, were cloned and sequenced, and they appear to be nonfunctional ("vertical inactivation"). It therefore appears that the predominant mode of MLE evolution is vertical inactivation and stochastic loss balanced against occasional reinvasion of lineages by horizontal transmission.
Assuntos
Evolução Biológica , Elementos de DNA Transponíveis , Drosophila/genética , Insetos/genética , Nucleotidiltransferases/genética , Sequência de Aminoácidos , Animais , Southern Blotting , Gatos , DNA/análise , DNA/genética , Drosophila melanogaster/genética , Variação Genética , Hibridização In Situ , Dados de Sequência Molecular , Nucleotidiltransferases/química , Homologia de Sequência de Aminoácidos , Sifonápteros/genética , Processos Estocásticos , TransposasesRESUMO
Drosophila virilis is a prominent reference species for comparison with Drosophila melanogaster in regard to patterns and mechanisms of molecular and genomic evolution. Sequences were determined for 11 Adh genes from 8 species of the D. virilis species group, including species from both the virilis phylad and the montana subphylad. The genome of D. virilis contains a 6-kb duplication that includes the entire Adh coding region. The pattern of sequence identity within the duplication strongly suggests a recent gene-conversion event bordered by 36-bp indels. As in other Drosophila, the amino-acid coding region of Adh is encoded by three exons interrupted by two short introns. The promoter region includes 16 blocks of sequence that are well conserved in D. virilis, Drosophila hydei, and D. melanogaster. The developmental profile of Adh transcription suggests a distal/proximal promoter switch analogous to that in D. melanogaster. Duplicate Adh genes were also found in Drosophila montana and Drosophila lacicola, which apparently originated independently of that in D. virilis. The Adh genes in all species of the D. virilis group have among the lowest codon bias of any Adh genes so far reported in the genus Drosophila. Taking the low codon bias into account, we estimate the time of divergence between the virilis and montana clades as 9.0 +/- 0.7 Mya and the approximate time of divergence of D. virilis from other members of the virilis phylad as 2.6 +/- 0.4 Mya. The region of the D. virilis genome containing Adh, as well as the chromosome as a whole, gives evidence of extensive rearrangements relative to the genome of D. melanogaster.
Assuntos
Álcool Desidrogenase/genética , Drosophila/genética , Evolução Molecular , Sequência de Aminoácidos , Animais , Sequência de Bases , Drosophila/enzimologia , Genoma , Dados de Sequência Molecular , Filogenia , Alinhamento de SequênciaRESUMO
Human immunodeficiency viruses (HIVs) show extensive genetic variation. This feature is the fundamental cause of pathogenicity of HIVs and thwarts efforts to develop effective vaccines. To understand the mutation mechanism of these viruses, we analysed nucleotide sequences of env and gag genes of the viruses by use of molecular evolutionary methods and estimated the direction and frequency of nucleotide substitutions. Results obtained showed that the frequency of changes between A and G was extremely high and the mutation pattern of HIVs was distinct from those of nuclear genes of their host cells. This distinction may be caused by the characteristics of the reverse transcription of HIVs. The mutation pattern obtained would be helpful to construct effective antiviral drugs.