RESUMO
Analyses of ancient DNA typically involve sequencing the surviving short oligonucleotides and aligning to genome assemblies from related, modern species. Here, we report that skin from a female woolly mammoth (Mammuthus primigenius) that died 52,000 years ago retained its ancient genome architecture. We use PaleoHi-C to map chromatin contacts and assemble its genome, yielding 28 chromosome-length scaffolds. Chromosome territories, compartments, loops, Barr bodies, and inactive X chromosome (Xi) superdomains persist. The active and inactive genome compartments in mammoth skin more closely resemble Asian elephant skin than other elephant tissues. Our analyses uncover new biology. Differences in compartmentalization reveal genes whose transcription was potentially altered in mammoths vs. elephants. Mammoth Xi has a tetradic architecture, not bipartite like human and mouse. We hypothesize that, shortly after this mammoth's death, the sample spontaneously freeze-dried in the Siberian cold, leading to a glass transition that preserved subfossils of ancient chromosomes at nanometer scale.
Assuntos
Genoma , Mamutes , Pele , Animais , Mamutes/genética , Genoma/genética , Feminino , Elefantes/genética , Cromatina/genética , Fósseis , DNA Antigo/análise , Camundongos , Humanos , Cromossomo X/genéticaRESUMO
The illegal poaching of lions for their body parts poses a severe threat to lion populations across Africa. Poaching accounts for 35% of all human-caused lion deaths, with 51% attributed to retaliatory killings following livestock predation. In nearly half of the retaliatory killings, lion body parts are removed, suggesting that high demand for lion body parts may fuel killings attributed to human-lion conflict. Trafficked items are often confiscated in transit or destination countries far from their country of origin. DNA from lion parts may in some cases be the only available means for examining their geographic origins. In this paper, we present the Lion Localizer, a full-stack software tool that houses a comprehensive database of lion mitochondrial DNA (mtDNA) sequences sourced from previously published studies. The database covers 146 localities from across the African continent and India, providing information on the potential provenance of seized lion body parts. Lion mtDNA sequences of 350 or 1,140 bp corresponding to the cytochrome b region can be generated from lion products and queried against the Lion Localizer database. Using the query sequence, the Lion Localizer generates a listing of exact or partial matches, which are displayed on an interactive map of Africa. This allows for the rapid identification of potential regions and localities where lions have been or are presently being targeted by poachers. By examining the potential provenance of lion samples, the Lion Localizer serves as a valuable resource in the fight against lion poaching. The software is available at https://lionlocalizer.org.
Assuntos
DNA Mitocondrial , Leões , Animais , Humanos , DNA Mitocondrial/genética , Leões/genética , África , SoftwareRESUMO
Cape lions (Panthera leo melanochaitus) formerly ranged throughout the grassland plains of the "Cape Flats" in what is today known as the Western Cape Province, South Africa. Cape lions were likely eradicated because of overhunting and habitat loss after European colonization. European naturalists originally described Cape lions as "black-maned lions" and claimed that they were phenotypically distinct. However, other depictions and historical descriptions of lions from the Cape report mixed or light coloration and without black or extensively developed manes. These findings suggest that, rather than forming a distinct population, Cape lions may have had phenotypic and genotypic variation similar to other African lions. Here we investigate Cape lion genome characteristics, population dynamics, and genetic distinctiveness prior to their extinction. We generated genomic data from 2 historic Cape lions to compare to 118 existing high-coverage mitogenomes, and low-coverage nuclear genomes of 53 lions from 13 African countries. We show that, before their eradication, lions from the Cape Flats had diverse mitogenomes and nuclear genomes that clustered with lions from both southern and eastern Africa. Cape lions had high genome-wide heterozygosity and low inbreeding coefficients, indicating that populations in the Cape Flats went extinct so rapidly that genomic effects associated with long-term small population size and isolation were not detectable. Our findings do not support the characterization of Cape lions as phylogeographically distinct, as originally put forth by some European naturalists, and illustrates how alternative knowledge systems, for example, Indigenous perspectives, could potentially further inform interpretations of species histories.
Assuntos
Leões , Animais , Leões/genética , Genômica , África do Sul , Genoma , Dinâmica PopulacionalRESUMO
Suids, both domesticated and wild, are found on all continents except for Antarctica and provide valuable food resources for humans in addition to serving as important models for biomedical research. Continuing advances in genome sequencing have allowed researchers to compare the genomes from diverse populations of suids helping to clarify their evolution and dispersal. Further analysis of these samples may provide clues to improve disease resistance/resilience and productivity in domestic suids as well as better ways of classifying and conserving genetic diversity within wild and captive suids. Collecting samples from diverse populations of suids is resource intensive and may negatively impact endangered populations. Here we catalog extensive tissue and DNA samples from suids in collections in both Europe and North America. We include samples that have previously been used for whole genome sequencing, targeted DNA sequencing, RNA sequencing, and reduced representation bisulfite sequencing (RRBS). This work provides an important centralized resource for researchers who wish to access published databases.
Assuntos
Genoma , Genômica , Humanos , Suínos , Animais , Genoma/genética , Análise de Sequência de DNA , Sequenciamento Completo do Genoma , DNARESUMO
Nuclear mitochondrial pseudogenes (numts) may hinder the reconstruction of mtDNA genomes and affect the reliability of mtDNA datasets for phylogenetic and population genetic comparisons. Here, we present the program Numt Parser, which allows for the identification of DNA sequences that likely originate from numt pseudogene DNA. Sequencing reads are classified as originating from either numt or true cytoplasmic mitochondrial (cymt) DNA by direct comparison against cymt and numt reference sequences. Classified reads can then be parsed into cymt or numt datasets. We tested this program using whole genome shotgun-sequenced data from 2 ancient Cape lions (Panthera leo), because mtDNA is often the marker of choice for ancient DNA studies and the genus Panthera is known to have numt pseudogenes. Numt Parser decreased sequence disagreements that were likely due to numt pseudogene contamination and equalized read coverage across the mitogenome by removing reads that likely originated from numts. We compared the efficacy of Numt Parser to 2 other bioinformatic approaches that can be used to account for numt contamination. We found that Numt Parser outperformed approaches that rely only on read alignment or Basic Local Alignment Search Tool (BLAST) properties, and was effective at identifying sequences that likely originated from numts while having minimal impacts on the recovery of cymt reads. Numt Parser therefore improves the reconstruction of true mitogenomes, allowing for more accurate and robust biological inferences.
Assuntos
Genoma Mitocondrial , Panthera , Animais , Pseudogenes , Panthera/genética , Filogenia , Reprodutibilidade dos Testes , DNA Mitocondrial/genética , Núcleo Celular/genética , Análise de Sequência de DNARESUMO
Cervids are distinguished by the shedding and regrowth of antlers. Furthermore, they provide insights into prion and other diseases. Genomic resources can facilitate studies of the genetic underpinnings of deer phenotypes, behavior, and disease resistance. Widely distributed in North America, the white-tailed deer (Odocoileus virginianus) has recreational, commercial, and food source value for many households. We present a genome generated using DNA from a single Illinois white-tailed sequenced on the PacBio Sequel II platform and assembled using Wtdbg2. Omni-C chromatin conformation capture sequencing was used to scaffold the genome contigs. The final assembly was 2.42 Gb, consisting of 508 scaffolds with a contig N50 of 21.7 Mb, a scaffold N50 of 52.4 Mb, and a BUSCO complete score of 93.1%. Thirty-six chromosome pseudomolecules comprised 93% of the entire sequenced genome length. A total of 20 651 predicted genes using the BRAKER pipeline were validated using InterProScan. Chromosome length assembly sequences were aligned to the genomes of related species to reveal corresponding chromosomes.
Assuntos
Cervos , Animais , Cromossomos/genética , Cervos/genética , Genoma , Anotação de Sequência MolecularRESUMO
Endogenous retroviruses (ERVs) are proviral sequences that result from colonization of the host germ line by exogenous retroviruses. The majority of ERVs represent defective retroviral copies. However, for most ERVs, endogenization occurred millions of years ago, obscuring the stages by which ERVs become defective and the changes in both virus and host important to the process. The koala retrovirus, KoRV, only recently began invading the germ line of the koala (Phascolarctos cinereus), permitting analysis of retroviral endogenization on a prospective basis. Here, we report that recombination with host genomic elements disrupts retroviruses during the earliest stages of germ-line invasion. One type of recombinant, designated recKoRV1, was formed by recombination of KoRV with an older degraded retroelement. Many genomic copies of recKoRV1 were detected across koalas. The prevalence of recKoRV1 was higher in northern than in southern Australian koalas, as is the case for KoRV, with differences in recKoRV1 prevalence, but not KoRV prevalence, between inland and coastal New South Wales. At least 15 additional different recombination events between KoRV and the older endogenous retroelement generated distinct recKoRVs with different geographic distributions. All of the identified recombinant viruses appear to have arisen independently and have highly disrupted ORFs, which suggests that recombination with existing degraded endogenous retroelements may be a means by which replication-competent ERVs that enter the germ line are degraded.
Assuntos
Retrovirus Endógenos/genética , Phascolarctidae/genética , Recombinação Genética , Animais , Feminino , Masculino , New South WalesRESUMO
Elephantids are the world's most iconic megafaunal family, yet there is no comprehensive genomic assessment of their relationships. We report a total of 14 genomes, including 2 from the American mastodon, which is an extinct elephantid relative, and 12 spanning all three extant and three extinct elephantid species including an â¼120,000-y-old straight-tusked elephant, a Columbian mammoth, and woolly mammoths. Earlier genetic studies modeled elephantid evolution via simple bifurcating trees, but here we show that interspecies hybridization has been a recurrent feature of elephantid evolution. We found that the genetic makeup of the straight-tusked elephant, previously placed as a sister group to African forest elephants based on lower coverage data, in fact comprises three major components. Most of the straight-tusked elephant's ancestry derives from a lineage related to the ancestor of African elephants while its remaining ancestry consists of a large contribution from a lineage related to forest elephants and another related to mammoths. Columbian and woolly mammoths also showed evidence of interbreeding, likely following a latitudinal cline across North America. While hybridization events have shaped elephantid history in profound ways, isolation also appears to have played an important role. Our data reveal nearly complete isolation between the ancestors of the African forest and savanna elephants for â¼500,000 y, providing compelling justification for the conservation of forest and savanna elephants as separate species.
Assuntos
Elefantes/genética , Mamutes/genética , Mastodontes/genética , Animais , Elefantes/classificação , Evolução Molecular , Extinção Biológica , Fósseis , Fluxo Gênico , Genoma , Genômica/história , História Antiga , Mamutes/classificação , Mastodontes/classificação , FilogeniaRESUMO
Chronic wasting disease (CWD) is a fatal, highly transmissible spongiform encephalopathy caused by an infectious prion protein. CWD is spreading across North American cervids. Studies of the prion protein gene (PRNP) in white-tailed deer (WTD; Odocoileus virginianus) have identified non-synonymous substitutions associated with reduced CWD frequency. Because CWD is spreading rapidly geographically, it may impact cervids of conservation concern. Here, we examined the genetic vulnerability to CWD of 2 subspecies of WTD: the endangered Florida Key deer (O. v. clavium) and the threatened Columbian WTD (O. v. leucurus). In Key deer (n = 48), we identified 3 haplotypes formed by 5 polymorphisms, of which 2 were non-synonymous. The polymorphism c.574G>A, unique to Key deer (29 of 96 chromosomes), encodes a non-synonymous substitution from valine to isoleucine at codon 192. In 91 of 96 chromosomes, Key deer carried c.286G>A (G96S), previously associated with substantially reduced susceptibility to CWD. Key deer may be less genetically susceptible to CWD than many mainland WTD populations. In Columbian WTD (n = 13), 2 haplotypes separated by one synonymous substitution (c.438C>T) were identified. All of the Columbian WTD carried alleles that in other mainland populations are associated with relatively high susceptibility to CWD. While larger sampling is needed, future management plans should consider that Columbian WTD are likely to be genetically more vulnerable to CWD than many other WTD populations. Finally, we suggest that genetic vulnerability to CWD be assessed by sequencing PRNP across other endangered cervids, both wild and in captive breeding facilities.
Assuntos
Cervos/genética , Polimorfismo Genético , Proteínas Priônicas/genética , Doença de Emaciação Crônica/genética , Alelos , Animais , Espécies em Perigo de Extinção , Florida , Predisposição Genética para Doença , HaplótiposRESUMO
Illegal hunting is a major threat to the elephants of Africa, with more elephants killed by poachers than die from natural causes. DNA from tusks has been used to infer the source populations for confiscated ivory, relying on nuclear genetic markers. However, mitochondrial DNA (mtDNA) sequences can also provide information on the geographic origins of elephants due to female elephant philopatry. Here, we introduce the Loxodonta Localizer (LL; www.loxodontalocalizer.org), an interactive software tool that uses a database of mtDNA sequences compiled from previously published studies to provide information on the potential provenance of confiscated ivory. A 316 bp control region sequence, which can be readily generated from DNA extracted from ivory, is used as a query. The software generates a listing of haplotypes reported among 1917 African elephants in 24 range countries, sorted in order of similarity to the query sequence. The African locations from which haplotype sequences have been previously reported are shown on a map. We demonstrate examples of haplotypes reported from only a single locality or country, examine the utility of the program in identifying elephants from countries with varying degrees of sampling, and analyze batches of confiscated ivory. The LL allows for the source of confiscated ivory to be assessed within days, using widely available molecular methods that do not depend on a particular platform or laboratory. The program enables identification of potential regions or localities from which elephants are being poached, with capacity for rapid identification of populations newly or consistently targeted by poachers.
Assuntos
DNA Mitocondrial , Elefantes/genética , Software , Navegador , África , Animais , Animais Selvagens , Biologia Computacional/métodos , Conservação dos Recursos Naturais , Elefantes/classificação , Genética Forense , Marcadores Genéticos , Haplótipos , Dinâmica PopulacionalRESUMO
The mammalian order Eulipotyphla includes four extant families of insectivorans: Solenodontidae (solenodons); Talpidae (moles); Soricidae (shrews); and Erinaceidae (hedgehogs). Of these, Solenodontidae includes only two extant species, which are endemic to the largest islands of the Greater Antilles: Cuba and Hispaniola. Most molecular studies suggest that eulipotyphlan families diverged from each other across several million years, with the basal split between Solenodontidae and other families occurring in the Late Cretaceous. By contrast, Sato et al. (2016) suggest that eulipotyphlan families diverged from each other in a polytomy â¼58.6â¯million years ago (Mya). This more recent divergence estimate for Solenodontidae versus other extant eulipotyphlans suggests that solenodons must have arrived in the Greater Antilles via overwater dispersal rather than vicariance. Here, we show that the young timetree estimates for eulipotyphlan families and the polytomy are due to an inverted ingroup-outgroup arrangement of the tree, the result of using Tracer rather than TreeAnnotator to compile interfamilial divergence times, and of not enforcing the monophly of well-established clades such as Laurasiatheria and Eulipotyphla. Finally, Sato et al.'s (2016) timetree includes several zombie lineages where estimated divergence times are much younger than minimum ages that are implied by the fossil record. We reanalyzed Sato et al.'s (2016) original data with enforced monophyly for well-established clades and updated fossil calibrations that eliminate the inference of zombie lineages. Our resulting timetrees, which were compiled with TreeAnnotator rather than Tracer, produce dates that are in good agreement with other recent studies and place the basal split between Solenodontidae and other eulipotyphlans in the Late Cretaceous.
Assuntos
Fósseis , Mamíferos/classificação , Filogenia , Animais , Calibragem , Cuba , Evolução Molecular , Fatores de TempoRESUMO
The Sumatran rhinoceros (Dicerorhinus sumatrensis), once widespread across Southeast Asia, now consists of as few as 30 individuals within Sumatra and Borneo. To aid in conservation planning, we sequenced 218 bp of control region mitochondrial (mt) DNA, identifying 17 distinct mitochondrial haplotypes across modern (N = 13) and museum (N = 26) samples. Museum specimens from Laos and Myanmar had divergent mtDNA, consistent with the placement of western mainland rhinos into the distinct subspecies D. s. lasiotis (presumed extinct). Haplotypes from Bornean rhinos were highly diverse, but dissimilar from those of other regions, supporting the distinctiveness of the subspecies D. s. harrissoni. Rhinos from Sumatra and Peninsular Malaysia shared mtDNA haplotypes, consistent with their traditional placement into a single subspecies D. s sumatrensis. Modern samples of D. s. sumatrensis were genotyped at 18 microsatellite loci. Rhinos within Sumatra formed 2 sub-populations, likely separated by the Barisan Mountains, though with only modest genetic differentiation between them. There are so few remaining Sumatran rhinoceros that separate management strategies for subspecies or subpopulations may not be viable, while each surviving rhino pedigree is likely to retain alleles found in no other individuals. Given the low population size and low reproductive potential of Sumatran rhinos, rapid genetic erosion is inevitable, though an under-appreciated concern is the potential for fixation of harmful genetic variants. Both concerns underscore 2 overriding priorities for the species: 1) translocation of wild rhinos to ex situ facilities, and 2) collection and storage of gametes and cell lines from every surviving captive and wild individual.
Assuntos
Variação Genética , Perissodáctilos/genética , Animais , Conservação dos Recursos Naturais , DNA Mitocondrial/genética , Feminino , Haplótipos , Indonésia , Malásia , Masculino , FilogeografiaRESUMO
UNLABELLED: Gibbon ape leukemia viruses (GALVs) are part of a larger group of pathogenic gammaretroviruses present across phylogenetically diverse host species of Australasian mammals. Despite the biomedical utility of GALVs as viral vectors and in cancer gene therapy, full genome sequences have not been determined for all of the five identified GALV strains, nor has a comprehensive evolutionary analysis been performed. We therefore generated complete genomic sequences for each GALV strain using hybridization capture and high-throughput sequencing. The four strains of GALV isolated from gibbons formed a monophyletic clade that was closely related to the woolly monkey virus (WMV), which is a GALV strain that likely originated in a gibbon host. The GALV-WMV clade in turn formed a sister group to the koala retroviruses (KoRVs). Genomic signatures of episodic diversifying selection were detected among the gammaretroviruses with concentration in the env gene across the GALV strains that were particularly oncogenic and KoRV strains that were potentially exogenous, likely reflecting their adaptation to the host immune system. In vitro studies involving vectors chimeric between GALV and KoRV-B established that variable regions A and B of the surface unit of the envelope determine which receptor is used by a viral strain to enter host cells. IMPORTANCE: The gibbon ape leukemia viruses (GALVs) are among the most medically relevant retroviruses due to their use as viral vectors for gene transfer and in cancer gene therapy. Despite their importance, full genome sequences have not been determined for the majority of primate isolates, nor has comprehensive evolutionary analysis been performed, despite evidence that the viruses are facing complex selective pressures associated with cross-species transmission. Using hybridization capture and high-throughput sequencing, we report here the full genome sequences of all the GALV strains and demonstrate that diversifying selection is acting on them, particularly in the envelope gene in functionally important domains, suggesting that host immune pressure is shaping GALV evolution.
Assuntos
Evolução Molecular , Hylobates/virologia , Vírus da Leucemia do Macaco Gibão/genética , Seleção Genética , Animais , Australásia , Análise por Conglomerados , Produtos do Gene env/genética , Vetores Genéticos , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala , Dados de Sequência Molecular , Phascolarctidae , Filogenia , RNA Viral/genética , Recombinação Genética , Análise de Sequência de DNA , Homologia de Sequência , Internalização do VírusRESUMO
UNLABELLED: Gibbon ape leukemia virus (GALV) and koala retrovirus (KoRV) most likely originated from a cross-species transmission of an ancestral retrovirus into koalas and gibbons via one or more intermediate as-yet-unknown hosts. A virus highly similar to GALV has been identified in an Australian native rodent (Melomys burtoni) after extensive screening of Australian wildlife. GALV-like viruses have also been discovered in several Southeast Asian species, although screening has not been extensive and viruses discovered to date are only distantly related to GALV. We therefore screened 26 Southeast Asian rodent species for KoRV- and GALV-like sequences, using hybridization capture and high-throughput sequencing, in the attempt to identify potential GALV and KoRV hosts. Only the individuals belonging to a newly discovered subspecies of Melomys burtoni from Indonesia were positive, yielding an endogenous provirus very closely related to a strain of GALV. The sequence of the critical receptor domain for GALV infection in the Indonesian M. burtoni subsp. was consistent with the susceptibility of the species to GALV infection. The second record of a GALV in M. burtoni provides further evidence that M. burtoni, and potentially other lineages within the widespread subfamily Murinae, may play a role in the spread of GALV-like viruses. The discovery of a GALV in the most western part of the Australo-Papuan distribution of M. burtoni, specifically in a transitional zone between Asia and Australia (Wallacea), may be relevant to the cross-species transmission to gibbons in Southeast Asia and broadens the known distribution of GALVs in wild rodents. IMPORTANCE: Gibbon ape leukemia virus (GALV) and the koala retrovirus (KoRV) are very closely related, yet their hosts neither are closely related nor overlap geographically. Direct cross-species infection between koalas and gibbons is unlikely. Therefore, GALV and KoRV may have arisen via a cross-species transfer from an intermediate host whose range overlaps those of both gibbons and koalas. Using hybridization capture and high-throughput sequencing, we have screened a wide range of rodent candidate hosts from Southeast Asia for KoRV- and GALV-like sequences. Only a Melomys burtoni subspecies from Wallacea (Indonesia) was positive for GALV. We report the genome sequence of this newly identified GALV, the critical domain for infection of its potential cellular receptor, and its phylogenetic relationships with the other previously characterized GALVs. We hypothesize that Melomys burtoni, and potentially related lineages with an Australo-Papuan distribution, may have played a key role in cross-species transmission to other taxa.
Assuntos
Vírus da Leucemia do Macaco Gibão/isolamento & purificação , Murinae/virologia , Infecções por Retroviridae/veterinária , Doenças dos Roedores/virologia , Animais , Sequenciamento de Nucleotídeos em Larga Escala , Indonésia , Vírus da Leucemia do Macaco Gibão/genética , Hibridização de Ácido Nucleico , Provírus/genética , Provírus/isolamento & purificação , Infecções por Retroviridae/virologia , Análise de Sequência de DNARESUMO
Toll-like receptor 2 (TLR2) plays an important role in recognition by the innate immune system of Gram-positive bacteria. As Gram-positive bacteria cause mastitis, we examined variations in the region of the TLR2 gene that codes for the extracellular domain. Samples of forty goats from a single dairy herd were collected, half with low SCC (≤200,000 cells/mL), and half with higher SCC. Two synonymous single nucleotide polymorphisms (SNPs) were identified: 840G > A and 1083A > G. One nonsynonymous SNP 739G > A was identified. This coded for valine or isoleucine, which have similar physiochemical properties, and was not in a region coding for a known functional domain. Surprisingly, the least square mean SCC of the heterozygous goats (146,220) was significantly lower than the SCC of homozygous GG goats (537,700; p = 0.004), although these two groups were similar in days in milk (p = 0.984), and there was no significant difference by breed (p = 0.941). Because factors other than mastitis can affect SCC and our sample sizes were limited, additional studies are needed to corroborate an association between TLR2 genotype and SCC or mastitis in goats.
Assuntos
Contagem de Células/veterinária , Cabras/genética , Cabras/metabolismo , Leite/citologia , Leite/fisiologia , Polimorfismo de Nucleotídeo Único/genética , Receptor 2 Toll-Like/genética , Animais , Feminino , Estudos de Associação Genética , Predisposição Genética para Doença/genéticaRESUMO
BACKGROUND: The Toll-like receptor (TLR) signaling pathway constitutes an essential component of the innate immune system. Highly conserved proteins, indicative of their critical roles in host survival, characterize this pathway. Selective constraints could vary depending on the gene's position within the pathway as TLR signaling is a sequential process and that genes downstream of the TLRs may be more selectively constrained to ensure efficient immune responses given the important role of downstream genes in the signaling process. Thus, we investigated whether gene position influenced protein evolution in the TLR signaling pathway of the Suidae. The members of the Suidae examined included the European Sus scrofa (wild boar), Asian Sus scrofa (wild boar), Sus verrucosus, Sus celebensis, Sus scebifrons, Sus barbatus, Babyrousa babyrussa, Potamochoerus larvatus, Potamochoerus porcus and Phacochoerus africanus. RESULTS: A total of 33 TLR signaling pathway genes in the Suidae were retrieved from resequencing data. The evolutionary parameter ω (dn/ds) had an overall mean of 0.1668 across genes, indicating high functional conservation within the TLR signaling pathway. A significant relationship was inferred for the network parameters gene position, number of protein-protein interactions, protein length and the evolutionary parameter dn (nonsynonymous substitutions) such that downstream genes had lower nonsynonymous substitution rates, more interactors and shorter protein length than upstream genes. Gene position was significantly correlated with the number of protein-protein interactions and protein length. Thus, the polarity in the selective constraint along the TLR signaling pathway was due to the number of molecules a protein interacted with and the protein's length. CONCLUSION: Results indicate that the level of selective constraints on genes within the TLR signaling pathway of the Suidae is dependent on the gene's position and network parameters. In particular, downstream genes evolve more slowly as a result of being highly connected and having shorter protein lengths. These findings highlight the critical role of gene network parameters in gene evolution.
Assuntos
Evolução Molecular , Transdução de Sinais/genética , Suínos/genética , Receptores Toll-Like/genética , Sequência de Aminoácidos , Substituição de Aminoácidos , Animais , Sequência Conservada , Redes Reguladoras de Genes , Proteínas/genética , Análise de Sequência de DNA , Receptores Toll-Like/químicaRESUMO
Endogenous retroviruses (ERVs) comprise 8% of the human genome and are common in all vertebrate genomes. The only retrovirus known to be currently transitioning from exogenous to endogenous form is the koala retrovirus (KoRV), making koalas (Phascolarctos cinereus) ideal for examining the early stages of retroviral endogenization. To distinguish endogenous from exogenous KoRV proviruses, we isolated koala genomic regions flanking KoRV integration sites. In three wild southern Australian koalas, there were fewer KoRV loci than in three captive Queensland koalas, consistent with reports that southern Australian koalas carry fewer KoRVs. Of 39 distinct KoRV proviral loci examined in a sire-dam-progeny triad, all proved to be vertically transmitted and endogenous; none was exogenous. Of the 39 endogenous KoRVs (enKoRVs), only one was present in the genomes of both the sire and the dam, suggesting that, at this early stage in the retroviral invasion of a host germ line, very large numbers of ERVs have proliferated at very low frequencies in the koala population. Sequence divergence between the 5'- and 3'-long terminal repeats (LTRs) of a provirus can be used as a molecular clock. Within each of ten enKoRVs, the 5'-LTR sequence was identical to the 3'-LTR sequence, suggesting a maximum age for enKoRV invasion of the koala germ line of approximately 22,200-49,900 years ago, although a much younger age is possible. Across the ten proviruses, seven LTR haplotypes were detected, indicating that at least seven different retroviral sequences had entered the koala germ line.
Assuntos
Retrovirus Endógenos/genética , Células Germinativas/virologia , Phascolarctidae/virologia , Provírus/genética , Integração Viral , Animais , Austrália , Retrovirus Endógenos/fisiologia , Evolução Molecular , Genoma , Genoma Viral , Haplótipos , Humanos , Phascolarctidae/classificação , Phascolarctidae/genética , Filogenia , Provírus/fisiologia , Sequências Repetidas TerminaisRESUMO
The koala (Phascolarctos cinereus) suffered population declines and local extirpation due to hunting in the early 20th century, especially in southern Australia. Koalas were subsequently reintroduced to the Brisbane Ranges (BR) and Stony Rises (SR) by translocating individuals from a population on French Island descended from a small number of founders. To examine genetic diversity and north-south differentiation, we genotyped 13 microsatellite markers in 46 wild koalas from the BR and SR, and 27 Queensland koalas kept at the US zoos. The Queensland koalas displayed much higher heterozygosity (H O = 0.73) than the 2 southern Australian koala populations examined: H O = 0.49 in the BR, whereas H O = 0.41 in the SR. This is consistent with the historical accounts of bottlenecks and founder events affecting the southern populations and contrasts with reports of high genetic diversity in some southern populations. The 2 southern Australian koala populations were genetically similar (F ST = 0.018, P = 0.052). By contrast, northern and southern Australian koalas were highly differentiated (F ST = 0.27, P < 0.001), thereby suggesting that geographic structuring should be considered in the conservation management of koalas. Sequencing of 648bp of the mtDNA control region in Queensland koalas found 8 distinct haplotypes, one of which had not been previously detected among koalas. Queensland koalas displayed high mitochondrial haplotype diversity (H = 0.753) and nucleotide diversity (π = 0.0072), indicating along with the microsatellite data that North American zoos have maintained high levels of genetic diversity among their Queensland koalas.
Assuntos
Variação Genética , Genética Populacional , Phascolarctidae/classificação , Phascolarctidae/genética , Animais , DNA Mitocondrial , Genótipo , Haplótipos , Repetições de Microssatélites , Filogenia , Queensland , Análise de Sequência de DNA , VitóriaRESUMO
BACKGROUND: Endogenous murine leukemia retroviruses (MLVs) are high copy number proviral elements difficult to comprehensively characterize using standard low throughput sequencing approaches. However, high throughput approaches generate data that is challenging to process, interpret and present. RESULTS: Next generation sequencing (NGS) data was generated for MLVs from two wild caught Mus musculus domesticus (from mainland France and Corsica) and for inbred laboratory mouse strains C3H, LP/J and SJL. Sequence reads were grouped using a novel sequence clustering approach as applied to retroviral sequences. A Markov cluster algorithm was employed, and the sequence reads were queried for matches to specific xenotropic (Xmv), polytropic (Pmv) and modified polytropic (Mpmv) viral reference sequences. CONCLUSIONS: Various MLV subtypes were more widespread than expected among the mice, which may be due to the higher coverage of NGS, or to the presence of similar sequence across many different proviral loci. The results did not correlate with variation in the major MLV receptor Xpr1, which can restrict exogenous MLVs, suggesting that endogenous MLV distribution may reflect gene flow more than past resistance to infection.