Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 73
Filtrar
1.
Nucleic Acids Res ; 2024 Jun 08.
Artigo em Inglês | MEDLINE | ID: mdl-38850156

RESUMO

Alu elements are non-autonomous Short INterspersed Elements (SINEs) derived from the 7SL RNA gene that are present at over one million copies in human genomic DNA. Alu mobilizes by a mechanism known as retrotransposition, which requires the Long INterspersed Element-1 (LINE-1) ORF2-encoded protein (ORF2p). Here, we demonstrate that HeLa strains differ in their capacity to support Alu retrotransposition. Human Alu elements retrotranspose efficiently in HeLa-HA and HeLa-CCL2 (Alu-permissive) strains, but not in HeLa-JVM or HeLa-H1 (Alu-nonpermissive) strains. A similar pattern of retrotransposition was observed for other 7SL RNA-derived SINEs and tRNA-derived SINEs. In contrast, mammalian LINE-1s, a zebrafish LINE, a human SINE-VNTR-Alu (SVA) element, and an L1 ORF1-containing mRNA can retrotranspose in all four HeLa strains. Using an in vitro reverse transcriptase-based assay, we show that Alu RNAs associate with ORF2p and are converted into cDNAs in both Alu-permissive and Alu-nonpermissive HeLa strains, suggesting that 7SL- and tRNA-derived SINEs use strategies to 'hijack' L1 ORF2p that are distinct from those used by SVA elements and ORF1-containing mRNAs. These data further suggest ORF2p associates with the Alu RNA poly(A) tract in both Alu-permissive and Alu-nonpermissive HeLa strains, but that Alu retrotransposition is blocked after this critical step in Alu-nonpermissive HeLa strains.

2.
Genome Res ; 2022 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-35961775

RESUMO

Gene retrocopies arise from the reverse transcription and insertion into the genome of processed mRNA transcripts. Although many retrocopies have acquired mutations that render them functionally inactive, most mammals retain active LINE-1 sequences capable of producing new retrocopies. New retrocopies, referred to as retro copy number variants (retroCNVs), may not be identified by standard variant calling techniques in high-throughput sequencing data. Although multiple functional FGF4 retroCNVs have been associated with skeletal dysplasias in dogs, the full landscape of canid retroCNVs has not been characterized. Here, retroCNV discovery was performed on a whole-genome sequencing data set of 293 canids from 76 breeds. We identified retroCNV parent genes via the presence of mRNA-specific 30-mers, and then identified retroCNV insertion sites through discordant read analysis. In total, we resolved insertion sites for 1911 retroCNVs from 1179 parent genes, 1236 of which appeared identical to their parent genes. Dogs had on average 54.1 total retroCNVs and 1.4 private retroCNVs. We found evidence of expression in testes for 12% (14/113) of the retroCNVs identified in six Golden Retrievers, including four chimeric transcripts, and 97 retroCNVs also had significantly elevated F ST across dog breeds, possibly indicating selection. We applied our approach to a subset of human genomes and detected an average of 4.2 retroCNVs per sample, highlighting a 13-fold relative increase of retroCNV frequency in dogs. Particularly in canids, retroCNVs are a largely unexplored source of genetic variation that can contribute to genome plasticity and that should be considered when investigating traits and diseases.

3.
Cell ; 141(7): 1159-70, 2010 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-20602998

RESUMO

Highly active (i.e., "hot") long interspersed element-1 (LINE-1 or L1) sequences comprise the bulk of retrotransposition activity in the human genome; however, the abundance of hot L1s in the human population remains largely unexplored. Here, we used a fosmid-based, paired-end DNA sequencing strategy to identify 68 full-length L1s that are differentially present among individuals but are absent from the human genome reference sequence. The majority of these L1s were highly active in a cultured cell retrotransposition assay. Genotyping 26 elements revealed that two L1s are only found in Africa and that two more are absent from the H952 subset of the Human Genome Diversity Panel. Therefore, these results suggest that hot L1s are more abundant in the human population than previously appreciated, and that ongoing L1 retrotransposition continues to be a major source of interindividual genetic variation.


Assuntos
Genoma Humano , Elementos Nucleotídeos Longos e Dispersos , Sequência de Bases , Frequência do Gene , Genética Populacional , Humanos , Dados de Sequência Molecular , Filogenia
4.
Cell ; 143(5): 837-47, 2010 Nov 24.
Artigo em Inglês | MEDLINE | ID: mdl-21111241

RESUMO

Understanding the prevailing mutational mechanisms responsible for human genome structural variation requires uniformity in the discovery of allelic variants and precision in terms of breakpoint delineation. We develop a resource based on capillary end sequencing of 13.8 million fosmid clones from 17 human genomes and characterize the complete sequence of 1054 large structural variants corresponding to 589 deletions, 384 insertions, and 81 inversions. We analyze the 2081 breakpoint junctions and infer potential mechanism of origin. Three mechanisms account for the bulk of germline structural variation: microhomology-mediated processes involving short (2-20 bp) stretches of sequence (28%), nonallelic homologous recombination (22%), and L1 retrotransposition (19%). The high quality and long-range continuity of the sequence reveals more complex mutational mechanisms, including repeat-mediated inversions and gene conversion, that are most often missed by other methods, such as comparative genomic hybridization, single nucleotide polymorphism microarrays, and next-generation sequencing.


Assuntos
Genoma Humano , Variação Estrutural do Genoma , Mutação , Sequência de Bases , Conversão Gênica , Humanos , Dados de Sequência Molecular , Análise de Sequência de DNA
5.
Proc Natl Acad Sci U S A ; 118(11)2021 03 16.
Artigo em Inglês | MEDLINE | ID: mdl-33836575

RESUMO

Technological advances have allowed improvements in genome reference sequence assemblies. Here, we combined long- and short-read sequence resources to assemble the genome of a female Great Dane dog. This assembly has improved continuity compared to the existing Boxer-derived (CanFam3.1) reference genome. Annotation of the Great Dane assembly identified 22,182 protein-coding gene models and 7,049 long noncoding RNAs, including 49 protein-coding genes not present in the CanFam3.1 reference. The Great Dane assembly spans the majority of sequence gaps in the CanFam3.1 reference and illustrates that 2,151 gaps overlap the transcription start site of a predicted protein-coding gene. Moreover, a subset of the resolved gaps, which have an 80.95% median GC content, localize to transcription start sites and recombination hotspots more often than expected by chance, suggesting the stable canine recombinational landscape has shaped genome architecture. Alignment of the Great Dane and CanFam3.1 assemblies identified 16,834 deletions and 15,621 insertions, as well as 2,665 deletions and 3,493 insertions located on secondary contigs. These structural variants are dominated by retrotransposon insertion/deletion polymorphisms and include 16,221 dimorphic canine short interspersed elements (SINECs) and 1,121 dimorphic long interspersed element-1 sequences (LINE-1_Cfs). Analysis of sequences flanking the 3' end of LINE-1_Cfs (i.e., LINE-1_Cf 3'-transductions) suggests multiple retrotransposition-competent LINE-1_Cfs segregate among dog populations. Consistent with this conclusion, we demonstrate that a canine LINE-1_Cf element with intact open reading frames can retrotranspose its own RNA and that of a SINEC_Cf consensus sequence in cultured human cells, implicating ongoing retrotransposon activity as a driver of canine genetic variation.


Assuntos
Cães/genética , Sequência Rica em GC , Genoma , Sequências Repetitivas Dispersas , Animais , Cães/classificação , Elementos Nucleotídeos Longos e Dispersos , Elementos Nucleotídeos Curtos e Dispersos , Especificidade da Espécie
7.
Nucleic Acids Res ; 48(6): e36, 2020 04 06.
Artigo em Inglês | MEDLINE | ID: mdl-32067044

RESUMO

Alu retrotransposons account for more than 10% of the human genome, and insertions of these elements create structural variants segregating in human populations. Such polymorphic Alus are powerful markers to understand population structure, and they represent variants that can greatly impact genome function, including gene expression. Accurate genotyping of Alus and other mobile elements has been challenging. Indeed, we found that Alu genotypes previously called for the 1000 Genomes Project are sometimes erroneous, which poses significant problems for phasing these insertions with other variants that comprise the haplotype. To ameliorate this issue, we introduce a new pipeline - TypeTE - which genotypes Alu insertions from whole-genome sequencing data. Starting from a list of polymorphic Alus, TypeTE identifies the hallmarks (poly-A tail and target site duplication) and orientation of Alu insertions using local re-assembly to reconstruct presence and absence alleles. Genotype likelihoods are then computed after re-mapping sequencing reads to the reconstructed alleles. Using a high-quality set of PCR-based genotyping of >200 loci, we show that TypeTE improves genotype accuracy from 83% to 92% in the 1000 Genomes dataset. TypeTE can be readily adapted to other retrotransposon families and brings a valuable toolbox addition for population genomics.


Assuntos
Sequências Repetitivas Dispersas/genética , Mutagênese Insercional/genética , Software , Sequenciamento Completo do Genoma/métodos , Bases de Dados Genéticas , Frequência do Gene/genética , Loci Gênicos , Genética Populacional , Genoma Humano , Genótipo , Humanos
8.
Nucleic Acids Res ; 48(3): 1146-1163, 2020 02 20.
Artigo em Inglês | MEDLINE | ID: mdl-31853540

RESUMO

Long Interspersed Element-1 (LINE-1) retrotransposition contributes to inter- and intra-individual genetic variation and occasionally can lead to human genetic disorders. Various strategies have been developed to identify human-specific LINE-1 (L1Hs) insertions from short-read whole genome sequencing (WGS) data; however, they have limitations in detecting insertions in complex repetitive genomic regions. Here, we developed a computational tool (PALMER) and used it to identify 203 non-reference L1Hs insertions in the NA12878 benchmark genome. Using PacBio long-read sequencing data, we identified L1Hs insertions that were absent in previous short-read studies (90/203). Approximately 81% (73/90) of the L1Hs insertions reside within endogenous LINE-1 sequences in the reference assembly and the analysis of unique breakpoint junction sequences revealed 63% (57/90) of these L1Hs insertions could be genotyped in 1000 Genomes Project sequences. Moreover, we observed that amplification biases encountered in single-cell WGS experiments led to a wide variation in L1Hs insertion detection rates between four individual NA12878 cells; under-amplification limited detection to 32% (65/203) of insertions, whereas over-amplification increased false positive calls. In sum, these data indicate that L1Hs insertions are often missed using standard short-read sequencing approaches and long-read sequencing approaches can significantly improve the detection of L1Hs insertions present in individual genomes.


Assuntos
Elementos Nucleotídeos Longos e Dispersos , Análise de Sequência de DNA/métodos , Linhagem Celular , Genoma Humano , Humanos , Polimorfismo Genético , Análise de Célula Única , Software , Sequenciamento Completo do Genoma
9.
PLoS Pathog ; 15(10): e1007903, 2019 10.
Artigo em Inglês | MEDLINE | ID: mdl-31584995

RESUMO

HIV-1 gene expression is regulated by host and viral factors that interact with viral motifs and is influenced by proviral integration sites. Here, expression variation among integrants was followed for hundreds of individual proviral clones within polyclonal populations throughout successive rounds of virus and cultured cell replication, with limited findings using CD4+ cells from donor blood consistent with observations in immortalized cells. Tracking clonal behavior by proviral "zip codes" indicated that mutational inactivation during reverse transcription was rare, while clonal expansion and proviral expression states varied widely. By sorting for provirus expression using a GFP reporter in the nef open reading frame, distinct clone-specific variation in on/off proportions were observed that spanned three orders of magnitude. Tracking GFP phenotypes over time revealed that as cells divided, their progeny alternated between HIV transcriptional activity and non-activity. Despite these phenotypic oscillations, the overall GFP+ population within each clone was remarkably stable, with clones maintaining clone-specific equilibrium mixtures of GFP+ and GFP- cells. Integration sites were analyzed for correlations between genomic features and the epigenetic phenomena described here. Integrants inserted in the sense orientation of genes were more frequently found to be GFP negative than those in the antisense orientation, and clones with high GFP+ proportions were more distal to repressive H3K9me3 peaks than low GFP+ clones. Clones with low frequencies of GFP positivity appeared to expand more rapidly than clones for which most cells were GFP+, even though the tested proviruses were Vpr-. Thus, much of the increase in the GFP- population in these polyclonal pools over time reflected differential clonal expansion. Together, these results underscore the temporal and quantitative variability in HIV-1 gene expression among proviral clones that are conferred in the absence of metabolic or cell-type dependent variability, and shed light on cell-intrinsic layers of regulation that affect HIV-1 population dynamics.


Assuntos
Linfócitos T CD4-Positivos/virologia , Infecções por HIV/virologia , HIV-1/fisiologia , Provírus/genética , Integração Viral/genética , Replicação Viral , Linfócitos T CD4-Positivos/metabolismo , Infecções por HIV/genética , Ensaios de Triagem em Larga Escala , Humanos , Células Jurkat , Transdução Genética
10.
PLoS Biol ; 16(3): e2003067, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29505568

RESUMO

Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5' untranslated region (UTR) and encode two proteins, (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5'UTR or 5'UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5'UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5'UTR and 5'UTR/ORF1 SpIREs lack Cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5'UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5'UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generates a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, SpIREs generally represent evolutionary "dead-ends" in the L1 retrotransposition process, mutations within the L1 5'UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation.


Assuntos
Evolução Molecular , Genoma Humano , Elementos Nucleotídeos Longos e Dispersos/genética , Retroelementos/genética , Células HeLa , Humanos , Polimorfismo Genético , Regiões Promotoras Genéticas , Splicing de RNA , RNA Mensageiro/metabolismo
11.
Nature ; 526(7571): 75-81, 2015 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-26432246

RESUMO

Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.


Assuntos
Variação Genética/genética , Genoma Humano/genética , Mapeamento Físico do Cromossomo , Sequência de Aminoácidos , Predisposição Genética para Doença , Genética Médica , Genética Populacional , Estudo de Associação Genômica Ampla , Genômica , Genótipo , Haplótipos/genética , Homozigoto , Humanos , Dados de Sequência Molecular , Taxa de Mutação , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Análise de Sequência de DNA , Deleção de Sequência/genética
12.
Trends Genet ; 38(5): 514-515, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-35232612
13.
Retrovirology ; 16(1): 6, 2019 03 07.
Artigo em Inglês | MEDLINE | ID: mdl-30845962

RESUMO

BACKGROUND: Vertebrate genomes contain a record of retroviruses that invaded the germlines of ancestral hosts and are passed to offspring as endogenous retroviruses (ERVs). ERVs can impact host function since they contain the necessary sequences for expression within the host. Dogs are an important system for the study of disease and evolution, yet no substantiated reports of infectious retroviruses in dogs exist. Here, we utilized Illumina whole genome sequence data to assess the origin and evolution of a recently active gammaretroviral lineage in domestic and wild canids. RESULTS: We identified numerous recently integrated loci of a canid-specific ERV-Fc sublineage within Canis, including 58 insertions that were absent from the reference assembly. Insertions were found throughout the dog genome including within and near gene models. By comparison of orthologous occupied sites, we characterized element prevalence across 332 genomes including all nine extant canid species, revealing evolutionary patterns of ERV-Fc segregation among species as well as subpopulations. CONCLUSIONS: Sequence analysis revealed common disruptive mutations, suggesting a predominant form of ERV-Fc spread by trans complementation of defective proviruses. ERV-Fc activity included multiple circulating variants that infected canid ancestors from the last 20 million to within 1.6 million years, with recent bursts of germline invasion in the sublineage leading to wolves and dogs.


Assuntos
Canidae , Retrovirus Endógenos/classificação , Retrovirus Endógenos/genética , Evolução Molecular , Infecções por Retroviridae/veterinária , Animais , Biologia Computacional , Sequenciamento de Nucleotídeos em Larga Escala , Provírus/classificação , Provírus/genética , Infecções por Retroviridae/virologia
14.
Nature ; 499(7459): 471-5, 2013 Jul 25.
Artigo em Inglês | MEDLINE | ID: mdl-23823723

RESUMO

Most great ape genetic variation remains uncharacterized; however, its study is critical for understanding population history, recombination, selection and susceptibility to disease. Here we sequence to high coverage a total of 79 wild- and captive-born individuals representing all six great ape species and seven subspecies and report 88.8 million single nucleotide polymorphisms. Our analysis provides support for genetically distinct populations within each species, signals of gene flow, and the split of common chimpanzees into two distinct groups: Nigeria-Cameroon/western and central/eastern populations. We find extensive inbreeding in almost all wild populations, with eastern gorillas being the most extreme. Inferred effective population sizes have varied radically over time in different lineages and this appears to have a profound effect on the genetic diversity at, or close to, genes in almost all species. We discover and assign 1,982 loss-of-function variants throughout the human and great ape lineages, determining that the rate of gene loss has not been different in the human branch compared to other internal branches in the great ape phylogeny. This comprehensive catalogue of great ape genome diversity provides a framework for understanding evolution and a resource for more effective management of wild and captive great ape populations.


Assuntos
Variação Genética , Hominidae/genética , África , Animais , Animais Selvagens/genética , Animais de Zoológico/genética , Sudeste Asiático , Evolução Molecular , Fluxo Gênico/genética , Genética Populacional , Genoma/genética , Gorilla gorilla/classificação , Gorilla gorilla/genética , Hominidae/classificação , Humanos , Endogamia , Pan paniscus/classificação , Pan paniscus/genética , Pan troglodytes/classificação , Pan troglodytes/genética , Filogenia , Polimorfismo de Nucleotídeo Único/genética , Densidade Demográfica
15.
Proc Natl Acad Sci U S A ; 113(16): E2326-34, 2016 Apr 19.
Artigo em Inglês | MEDLINE | ID: mdl-27001843

RESUMO

Endogenous retroviruses (ERVs) have contributed to more than 8% of the human genome. The majority of these elements lack function due to accumulated mutations or internal recombination resulting in a solitary (solo) LTR, although members of one group of human ERVs (HERVs), HERV-K, were recently active with members that remain nearly intact, a subset of which is present as insertionally polymorphic loci that include approximately full-length (2-LTR) and solo-LTR alleles in addition to the unoccupied site. Several 2-LTR insertions have intact reading frames in some or all genes that are expressed as functional proteins. These properties reflect the activity of HERV-K and suggest the existence of additional unique loci within humans. We sought to determine the extent to which other polymorphic insertions are present in humans, using sequenced genomes from the 1000 Genomes Project and a subset of the Human Genome Diversity Project panel. We report analysis of a total of 36 nonreference polymorphic HERV-K proviruses, including 19 newly reported loci, with insertion frequencies ranging from <0.0005 to >0.75 that varied by population. Targeted screening of individual loci identified three new unfixed 2-LTR proviruses within our set, including an intact provirus present at Xq21.33 in some individuals, with the potential for retained infectivity.


Assuntos
Alelos , Retrovirus Endógenos/genética , Loci Gênicos , Mutagênese Insercional , Polimorfismo Genético , Sequências Repetidas Terminais , Feminino , Humanos , Masculino
16.
Proc Natl Acad Sci U S A ; 113(4): E440-9, 2016 Jan 26.
Artigo em Inglês | MEDLINE | ID: mdl-26712023

RESUMO

The Out-of-Africa (OOA) dispersal ∼ 50,000 y ago is characterized by a series of founder events as modern humans expanded into multiple continents. Population genetics theory predicts an increase of mutational load in populations undergoing serial founder effects during range expansions. To test this hypothesis, we have sequenced full genomes and high-coverage exomes from seven geographically divergent human populations from Namibia, Congo, Algeria, Pakistan, Cambodia, Siberia, and Mexico. We find that individual genomes vary modestly in the overall number of predicted deleterious alleles. We show via spatially explicit simulations that the observed distribution of deleterious allele frequencies is consistent with the OOA dispersal, particularly under a model where deleterious mutations are recessive. We conclude that there is a strong signal of purifying selection at conserved genomic positions within Africa, but that many predicted deleterious mutations have evolved as if they were neutral during the expansion out of Africa. Under a model where selection is inversely related to dominance, we show that OOA populations are likely to have a higher mutation load due to increased allele frequencies of nearly neutral variants that are recessive or partially recessive.


Assuntos
Etnicidade/genética , Genoma Humano , Migração Humana , Mutação , África Subsaariana , Alelos , Animais , Povo Asiático/genética , População Negra/genética , Simulação por Computador , Sequência Conservada , Evolução Molecular , Efeito Fundador , Fluxo Gênico , Doenças Genéticas Inatas/genética , Deriva Genética , Genótipo , Comportamento de Retorno ao Território Vital , Humanos , Indígenas Centro-Americanos/genética , Modelos Genéticos , Seleção Genética
17.
BMC Biol ; 16(1): 64, 2018 06 28.
Artigo em Inglês | MEDLINE | ID: mdl-29950181

RESUMO

BACKGROUND: Domesticated from gray wolves between 10 and 40 kya in Eurasia, dogs display a vast array of phenotypes that differ from their ancestors, yet mirror other domesticated animal species, a phenomenon known as the domestication syndrome. Here, we use signatures persisting in dog genomes to identify genes and pathways possibly altered by the selective pressures of domestication. RESULTS: Whole-genome SNP analyses of 43 globally distributed village dogs and 10 wolves differentiated signatures resulting from domestication rather than breed formation. We identified 246 candidate domestication regions containing 10.8 Mb of genome sequence and 429 genes. The regions share haplotypes with ancient dogs, suggesting that the detected signals are not the result of recent selection. Gene enrichments highlight numerous genes linked to neural crest and central nervous system development as well as neurological function. Read depth analysis suggests that copy number variation played a minor role in dog domestication. CONCLUSIONS: Our results identify genes that act early in embryogenesis and can confer phenotypes distinguishing domesticated dogs from wolves, such as tameness, smaller jaws, floppy ears, and diminished craniofacial development as the targets of selection during domestication. These differences reflect the phenotypes of the domestication syndrome, which can be explained by alterations in the migration or activity of neural crest cells during development. We propose that initial selection during early dog domestication was for behavior, a trait influenced by genes which act in the neural crest, which secondarily gave rise to the phenotypes of modern dogs.


Assuntos
Cães/genética , Domesticação , Crista Neural/fisiologia , Lobos/genética , Animais , Variações do Número de Cópias de DNA , Variação Genética , Genoma , Haplótipos/genética , Fenótipo , Seleção Genética
18.
BMC Genomics ; 19(1): 350, 2018 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-29747566

RESUMO

BACKGROUND: Most genetic analyses of ancient and modern dogs have focused on variation in the autosomes or on the mitochondria. Mitochondrial DNA is more easily obtained from ancient samples than nuclear DNA and mitochondrial analyses have revealed important insights into the evolutionary history of canids. Utilizing a recently published dog Y-chromosome reference, we analyzed Y-chromosome sequence across a diverse collection of canids and determined the Y haplogroup of three ancient European dogs. RESULTS: We identified 1121 biallelic Y-chromosome SNVs using whole-genome sequences from 118 canids and defined variants diagnostic to distinct dog Y haplogroups. Similar to that of the mitochondria and previous more limited studies of Y diversity, we observe several deep splits in the Y-chromosome tree which may be the result of retained Y-chromosome diversity which predates dog domestication or post-domestication admixture with wolves. We find that Y-chromosomes from three ancient European dogs (4700-7000 years old) belong to distinct clades. CONCLUSIONS: We estimate that the time to the most recent comment ancestor of dog Y haplogroups is 68-151 thousand years ago. Analysis of three Y-chromosomes from the Neolithic confirms long stranding population structure among European dogs.


Assuntos
Coiotes/genética , Cães/genética , Evolução Molecular , Haplótipos , Filogenia , Análise de Sequência de DNA/métodos , Lobos/genética , Cromossomo Y , Animais , Coiotes/classificação , DNA Mitocondrial/genética , Cães/classificação , Variação Genética , Genoma , Masculino , Lobos/classificação
19.
Mol Biol Evol ; 33(4): 928-45, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-26671457

RESUMO

We present three linkage-disequilibrium (LD)-based recombination maps generated using whole-genome sequence data from 10 Nigerian chimpanzees, 13 bonobos, and 15 western gorillas, collected as part of the Great Ape Genome Project (Prado-Martinez J, et al. 2013. Great ape genetic diversity and population history. Nature 499:471-475). We also identified species-specific recombination hotspots in each group using a modified LDhot framework, which greatly improves statistical power to detect hotspots at varying strengths. We show that fewer hotspots are shared among chimpanzee subspecies than within human populations, further narrowing the time scale of complete hotspot turnover. Further, using species-specific PRDM9 sequences to predict potential binding sites (PBS), we show higher predicted PRDM9 binding in recombination hotspots as compared to matched cold spot regions in multiple great ape species, including at least one chimpanzee subspecies. We found that correlations between broad-scale recombination rates decline more rapidly than nucleotide divergence between species. We also compared the skew of recombination rates at centromeres and telomeres between species and show a skew from chromosome means extending as far as 10-15 Mb from chromosome ends. Further, we examined broad-scale recombination rate changes near a translocation in gorillas and found minimal differences as compared to other great ape species perhaps because the coordinates relative to the chromosome ends were unaffected. Finally, on the basis of multiple linear regression analysis, we found that various correlates of recombination rate persist throughout the African great apes including repeats, diversity, and divergence. Our study is the first to analyze within- and between-species genome-wide recombination rate variation in several close relatives.


Assuntos
Evolução Molecular , Hominidae/genética , Desequilíbrio de Ligação/genética , Recombinação Genética , Animais , Mapeamento Cromossômico , Cromossomos/genética , Variação Genética , Gorilla gorilla/genética , Humanos , Pan troglodytes/genética , Papio/genética , Especificidade da Espécie
20.
Nature ; 469(7331): 529-33, 2011 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-21270892

RESUMO

'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.


Assuntos
Variação Genética , Genoma/genética , Pongo abelii/genética , Pongo pygmaeus/genética , Animais , Centrômero/genética , Cerebrosídeos/metabolismo , Cromossomos , Evolução Molecular , Feminino , Rearranjo Gênico/genética , Especiação Genética , Genética Populacional , Humanos , Masculino , Filogenia , Densidade Demográfica , Dinâmica Populacional , Especificidade da Espécie
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA