RESUMO
Technological advances have allowed improvements in genome reference sequence assemblies. Here, we combined long- and short-read sequence resources to assemble the genome of a female Great Dane dog. This assembly has improved continuity compared to the existing Boxer-derived (CanFam3.1) reference genome. Annotation of the Great Dane assembly identified 22,182 protein-coding gene models and 7,049 long noncoding RNAs, including 49 protein-coding genes not present in the CanFam3.1 reference. The Great Dane assembly spans the majority of sequence gaps in the CanFam3.1 reference and illustrates that 2,151 gaps overlap the transcription start site of a predicted protein-coding gene. Moreover, a subset of the resolved gaps, which have an 80.95% median GC content, localize to transcription start sites and recombination hotspots more often than expected by chance, suggesting the stable canine recombinational landscape has shaped genome architecture. Alignment of the Great Dane and CanFam3.1 assemblies identified 16,834 deletions and 15,621 insertions, as well as 2,665 deletions and 3,493 insertions located on secondary contigs. These structural variants are dominated by retrotransposon insertion/deletion polymorphisms and include 16,221 dimorphic canine short interspersed elements (SINECs) and 1,121 dimorphic long interspersed element-1 sequences (LINE-1_Cfs). Analysis of sequences flanking the 3' end of LINE-1_Cfs (i.e., LINE-1_Cf 3'-transductions) suggests multiple retrotransposition-competent LINE-1_Cfs segregate among dog populations. Consistent with this conclusion, we demonstrate that a canine LINE-1_Cf element with intact open reading frames can retrotranspose its own RNA and that of a SINEC_Cf consensus sequence in cultured human cells, implicating ongoing retrotransposon activity as a driver of canine genetic variation.
Assuntos
Cães/genética , Sequência Rica em GC , Genoma , Sequências Repetitivas Dispersas , Animais , Cães/classificação , Elementos Nucleotídeos Longos e Dispersos , Elementos Nucleotídeos Curtos e Dispersos , Especificidade da EspécieRESUMO
Genomic resources for the domestic dog have improved with the widespread adoption of a 173k SNP array platform and updated reference genome. SNP arrays of this density are sufficient for detecting genetic associations within breeds but are underpowered for finding associations across multiple breeds or in mixed-breed dogs, where linkage disequilibrium rapidly decays between markers, even though such studies would hold particular promise for mapping complex diseases and traits. Here we introduce an imputation reference panel, consisting of 365 diverse, whole-genome sequenced dogs and wolves, which increases the number of markers that can be queried in genome-wide association studies approximately 130-fold. Using previously genotyped dogs, we show the utility of this reference panel in identifying potentially novel associations, including a locus on CFA20 significantly associated with cranial cruciate ligament disease, and fine-mapping for canine body size and blood phenotypes, even when causal loci are not in strong linkage disequilibrium with any single array marker. This reference panel resource will improve future genome-wide association studies for canine complex diseases and other phenotypes.
Assuntos
Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Sequenciamento Completo do Genoma/métodos , Animais , Cruzamento , Mapeamento Cromossômico/métodos , Cães/genética , Genoma/genética , Genótipo , Desequilíbrio de Ligação/genética , Fenótipo , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Consumer genomics enables genetic discovery on an unprecedented scale by linking very large databases of personal genomic data with phenotype information voluntarily submitted via web-based surveys. These databases are having a transformative effect on human genomics research, yielding insights on increasingly complex traits, behaviors, and disease by including many thousands of individuals in genome-wide association studies (GWAS). The promise of consumer genomic data is not limited to human research, however. Genomic tools for dogs are readily available, with hundreds of causal Mendelian variants already characterized, because selection and breeding have led to dramatic phenotypic diversity underlain by a simple genetic structure. Here, we report the results of the first consumer genomics study ever conducted in a non-human model: a GWAS of blue eyes based on more than 3,000 customer dogs with validation panels including nearly 3,000 more, the largest canine GWAS to date. We discovered a novel association with blue eyes on chromosome 18 (P = 1.3x10-68) and used both sequence coverage and microarray probe intensity data to identify the putative causal variant: a 98.6-kb duplication directly upstream of the Homeobox gene ALX4, which plays an important role in mammalian eye development. This duplication is largely restricted to Siberian Huskies, is strongly associated with the blue-eyed phenotype (chi-square P = 5.2x10-290), and is highly, but not completely, penetrant. These results underscore the power of consumer-data-driven discovery in non-human species, especially dogs, where there is intense owner interest in the personal genomic information of their pets, a high level of engagement with web-based surveys, and an underlying genetic architecture ideal for mapping studies.
Assuntos
Cães/genética , Cor de Olho/genética , Doenças da Íris/genética , Transtornos da Pigmentação/genética , Animais , Duplicação Cromossômica/genética , DNA , Triagem e Testes Direto ao Consumidor , Genoma , Estudo de Associação Genômica Ampla , Genômica/métodos , Genótipo , Fenótipo , Análise de Sequência de DNARESUMO
Inbreeding poses a real or potential threat to nearly every species of conservation concern. Inbreeding leads to loss of diversity at the individual level, which can cause inbreeding depression, and at the population level, which can hinder ability to respond to a changing environment. In closed populations such as endangered species and ex situ breeding programs, some degree of inbreeding is inevitable. It is therefore vital to understand how different patterns of breeding and inbreeding can affect fitness in real animals. Domestic dogs provide an excellent model, showing dramatic variation in degree of inbreeding and in lifespan, an important aspect of fitness that is known to be impacted by inbreeding in other species. There is a strong negative correlation between body size and lifespan in dogs, but it is unknown whether the higher rate of aging in large dogs is due to body size per se or some other factor associated with large size. We used dense genome-wide SNP array data to calculate average inbreeding for over 100 dog breeds based on autozygous segment length and found that large breeds tend to have higher coefficients of inbreeding than small breeds. We then used data from the Veterinary medical Database and other published sources to estimate life expectancies for pure and mixed breed dogs. When controlling for size, variation in inbreeding was not associated with life expectancy across breeds. When comparing mixed versus purebred dogs, however, mixed breed dogs lived about 1.2 years longer on average than size-matched purebred dogs. Furthermore, individual pedigree coefficients of inbreeding and lifespans for over 9000 golden retrievers showed that inbreeding does negatively impact lifespan at the individual level. Registration data from the American Kennel Club suggest that the molecular inbreeding patterns observed in purebred dogs result from specific breeding practices and/or founder effects and not the current population size. Our results suggest that recent inbreeding, as reflected in variation within a breed, is more likely to affect fitness than historic inbreeding, as reflected in variation among breeds. Our results also indicate that occasional outcrosses, as in mixed breed dogs, can have a substantial positive effect on fitness.
RESUMO
Inbreeding depression has been demonstrated to impact vital rates, productivity, and performance in human populations, wild and endangered species, and in recent years, the domestic species. In all cases, standardized, high-quality phenotype data on all individuals are invaluable for longitudinal analyses such as those required to evaluate vital rates of a study cohort. Further, many investigators agree upon the preference for and utility of genomic measures of inbreeding in lieu of pedigree-based estimates of inbreeding. We evaluated the association of measures of reproductive fitness in 93 Golden Retrievers enrolled in the Golden Retriever Lifetime Study with a genomic measurement of inbreeding, FROH. We demonstrate a statistically significant negative correlation between fecundity and FROH. This work sets the stage for larger scale analyses to investigate genomic regions associated with fecundity and other measures of fitness.
Assuntos
Fertilidade/fisiologia , Depressão por Endogamia , Animais , Cães/genética , Cães/fisiologia , Feminino , Fertilidade/genética , Genoma/genética , Genótipo , Homozigoto , Depressão por Endogamia/genética , Fenótipo , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers.
Assuntos
Genética Populacional , Genômica , Metabolismo dos Lipídeos/genética , Seleção Genética , Animais , Demografia , Cães , Genoma , Polimorfismo de Nucleotídeo ÚnicoRESUMO
BACKGROUND: Domesticated from gray wolves between 10 and 40 kya in Eurasia, dogs display a vast array of phenotypes that differ from their ancestors, yet mirror other domesticated animal species, a phenomenon known as the domestication syndrome. Here, we use signatures persisting in dog genomes to identify genes and pathways possibly altered by the selective pressures of domestication. RESULTS: Whole-genome SNP analyses of 43 globally distributed village dogs and 10 wolves differentiated signatures resulting from domestication rather than breed formation. We identified 246 candidate domestication regions containing 10.8 Mb of genome sequence and 429 genes. The regions share haplotypes with ancient dogs, suggesting that the detected signals are not the result of recent selection. Gene enrichments highlight numerous genes linked to neural crest and central nervous system development as well as neurological function. Read depth analysis suggests that copy number variation played a minor role in dog domestication. CONCLUSIONS: Our results identify genes that act early in embryogenesis and can confer phenotypes distinguishing domesticated dogs from wolves, such as tameness, smaller jaws, floppy ears, and diminished craniofacial development as the targets of selection during domestication. These differences reflect the phenotypes of the domestication syndrome, which can be explained by alterations in the migration or activity of neural crest cells during development. We propose that initial selection during early dog domestication was for behavior, a trait influenced by genes which act in the neural crest, which secondarily gave rise to the phenotypes of modern dogs.
Assuntos
Cães/genética , Domesticação , Crista Neural/fisiologia , Lobos/genética , Animais , Variações do Número de Cópias de DNA , Variação Genética , Genoma , Haplótipos/genética , Fenótipo , Seleção GenéticaRESUMO
Dogs were the first domesticated species, originating at least 15,000 y ago from Eurasian gray wolves. Dogs today consist primarily of two specialized groups--a diverse set of nearly 400 pure breeds and a far more populous group of free-ranging animals adapted to a human commensal lifestyle (village dogs). Village dogs are more genetically diverse and geographically widespread than purebred dogs making them vital for unraveling dog population history. Using a semicustom 185,805-marker genotyping array, we conducted a large-scale survey of autosomal, mitochondrial, and Y chromosome diversity in 4,676 purebred dogs from 161 breeds and 549 village dogs from 38 countries. Geographic structure shows both isolation and gene flow have shaped genetic diversity in village dog populations. Some populations (notably those in the Neotropics and the South Pacific) are almost completely derived from European stock, whereas others are clearly admixed between indigenous and European dogs. Importantly, many populations--including those of Vietnam, India, and Egypt-show minimal evidence of European admixture. These populations exhibit a clear gradient of short--range linkage disequilibrium consistent with a Central Asian domestication origin.
Assuntos
Cães/genética , Genética Populacional , Animais , Animais Domésticos , ÁsiaRESUMO
To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-divergence gene flow. In dogs, the domestication bottleneck involved at least a 16-fold reduction in population size, a much more severe bottleneck than estimated previously. A sharp bottleneck in wolves occurred soon after their divergence from dogs, implying that the pool of diversity from which dogs arose was substantially larger than represented by modern wolf populations. We narrow the plausible range for the date of initial dog domestication to an interval spanning 11-16 thousand years ago, predating the rise of agriculture. In light of this finding, we expand upon previous work regarding the increase in copy number of the amylase gene (AMY2B) in dogs, which is believed to have aided digestion of starch in agricultural refuse. We find standing variation for amylase copy number variation in wolves and little or no copy number increase in the Dingo and Husky lineages. In conjunction with the estimated timing of dog origins, these results provide additional support to archaeological finds, suggesting the earliest dogs arose alongside hunter-gathers rather than agriculturists. Regarding the geographic origin of dogs, we find that, surprisingly, none of the extant wolf lineages from putative domestication centers is more closely related to dogs, and, instead, the sampled wolves form a sister monophyletic clade. This result, in combination with dog-wolf admixture during the process of domestication, suggests that a re-evaluation of past hypotheses regarding dog origins is necessary.
Assuntos
Amilases/genética , Animais Domésticos/genética , Variações do Número de Cópias de DNA/genética , Evolução Molecular , Animais , DNA Mitocondrial/genética , Dieta , Cães , Variação Genética , Filogenia , Densidade Demográfica , Lobos/classificação , Lobos/genéticaRESUMO
Selective breeding of dogs has resulted in repeated artificial selection on breed-specific morphological phenotypes. A number of quantitative trait loci associated with these phenotypes have been identified in genetic mapping studies. We analysed the population genomic signatures observed around the causal mutations for 12 of these loci in 25 dog breeds, for which we genotyped 25 individuals in each breed. By measuring the population frequencies of the causal mutations in each breed, we identified those breeds in which specific mutations most likely experienced positive selection. These instances were then used as positive controls for assessing the performance of popular statistics to detect selection from population genomic data. We found that artificial selection during dog domestication has left characteristic signatures in the haplotype and nucleotide polymorphism patterns around selected loci that can be detected in the genotype data from a single population sample. However, the sensitivity and accuracy at which such signatures were detected varied widely between loci, the particular statistic used and the choice of analysis parameters. We observed examples of both hard and soft selective sweeps and detected strong selective events that removed genetic diversity almost entirely over regions >10 Mbp. Our study demonstrates the power and limitations of selection scans in populations with high levels of linkage disequilibrium due to severe founder effects and recent population bottlenecks.
Assuntos
Cruzamento , Cães/genética , Locos de Características Quantitativas , Seleção Genética , Animais , Cães/classificação , Frequência do Gene , Variação Genética , Genética Populacional , Genótipo , Haplótipos , Desequilíbrio de Ligação , Fenótipo , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity.
Assuntos
Animais Domésticos/genética , Cães/genética , Genoma/genética , Haplótipos/genética , Polimorfismo de Nucleotídeo Único/genética , Animais , Animais Domésticos/classificação , Animais Selvagens/classificação , Animais Selvagens/genética , Cruzamento , Biologia Computacional , Cães/classificação , Evolução Molecular , Ásia Oriental/etnologia , Oriente Médio/etnologia , Fenótipo , Filogenia , Lobos/classificação , Lobos/genéticaRESUMO
The identification of the H3K4 trimethylase, PRDM9, as the gene responsible for recombination hotspot localization has provided considerable insight into the mechanisms by which recombination is initiated in mammals. However, uniquely amongst mammals, canids appear to lack a functional version of PRDM9 and may therefore provide a model for understanding recombination that occurs in the absence of PRDM9, and thus how PRDM9 functions to shape the recombination landscape. We have constructed a fine-scale genetic map from patterns of linkage disequilibrium assessed using high-throughput sequence data from 51 free-ranging dogs, Canis lupus familiaris. While broad-scale properties of recombination appear similar to other mammalian species, our fine-scale estimates indicate that canine highly elevated recombination rates are observed in the vicinity of CpG rich regions including gene promoter regions, but show little association with H3K4 trimethylation marks identified in spermatocytes. By comparison to genomic data from the Andean fox, Lycalopex culpaeus, we show that biased gene conversion is a plausible mechanism by which the high CpG content of the dog genome could have occurred.
Assuntos
Evolução Molecular , Conversão Gênica , Regiões Promotoras Genéticas , Recombinação Genética , Animais , Mapeamento Cromossômico , Ilhas de CpG , Cães , Estudos de Associação Genética , Genoma , Histona-Lisina N-Metiltransferase/genética , Desequilíbrio de LigaçãoRESUMO
With the assistance of their human companions, dogs have dispersed into new environments during the expansion of human civilization. Tibetan Mastiff (TM), a native of the Tibetan Plateau, was derived from the domesticated Chinese native dog and, like Tibetans, has adapted to the extreme environment of high altitude. Here, we genotyped genome-wide single-nucleotide polymorphisms (SNPs) from 32 TMs and compared them with SNPs from 20 Chinese native dogs and 14 gray wolves (Canis lupus). We identified 16 genes with signals of positive selection in the TM, with 12 of these candidate genes associated with functions that have roles in adaptation to high-altitude adaptation, such as EPAS1, SIRT7, PLXNA4, and MAFG that have roles in responses to hypoxia. This study provides important information on the genetic diversity of the TM and potential mechanisms for adaptation to hypoxia.
Assuntos
Aclimatação/genética , Altitude , Cães/genética , Cães/fisiologia , Aclimatação/fisiologia , Animais , China , Cães/classificação , Evolução Molecular , Variação Genética , Genética Populacional , Genoma , Humanos , Modelos Genéticos , Animais de Estimação/classificação , Animais de Estimação/genética , Animais de Estimação/fisiologia , Filogenia , Polimorfismo de Nucleotídeo Único , Tibet , Lobos/genéticaRESUMO
Since the beginnings of domestication, the craniofacial architecture of the domestic dog has morphed and radiated to human whims. By beginning to define the genetic underpinnings of breed skull shapes, we can elucidate mechanisms of morphological diversification while presenting a framework for understanding human cephalic disorders. Using intrabreed association mapping with museum specimen measurements, we show that skull shape is regulated by at least five quantitative trait loci (QTLs). Our detailed analysis using whole-genome sequencing uncovers a missense mutation in BMP3. Validation studies in zebrafish show that Bmp3 function in cranial development is ancient. Our study reveals the causal variant for a canine QTL contributing to a major morphologic trait.
Assuntos
Proteína Morfogenética Óssea 3/genética , Craniossinostoses/genética , Cães/genética , Variação Genética , Locos de Características Quantitativas , Crânio/metabolismo , Animais , Evolução Biológica , Cruzamento , Mapeamento Cromossômico , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Mutação de Sentido Incorreto , Animais de Estimação , Fenótipo , Crânio/anatomia & histologia , Peixe-Zebra/genéticaRESUMO
BACKGROUND: Equine recurrent laryngeal neuropathy (RLN) is a bilateral mononeuropathy with an unknown pathogenesis that significantly affects performance in Thoroughbreds. A genetic contribution to the pathogenesis of RLN is suggested by the higher prevalence of the condition in offspring of RLN-affected than unaffected stallions. To better understand RLN pathogenesis and its genetic basis, we performed a genome-wide association (GWAS) of 282 RLN-affected and 268 control Thoroughbreds. RESULTS: We found a significant association of RLN with the LCORL/NCAPG locus on ECA3 previously shown to affect body size in horses. Using height at the withers of 505 of these horses, we confirmed the strong association of this locus with body size, and demonstrated a significant phenotypic and genetic correlation between height and RLN grade in this cohort. Secondary genetic associations for RLN on ECA18 and X did not correlate with withers height in our cohort, but did contain candidate genes likely influencing muscle physiology and growth: myostatin (MSTN) and integral membrane protein 2A (ITM2A). CONCLUSIONS: This linkage between body size and RLN suggests that selective breeding to reduce RLN prevalence would likely reduce adult size in this population. However, our results do not preclude the possibility of modifier loci that attenuate RLN risk without reducing size or performance, or that the RLN risk allele is distinct but tightly linked to the body size locus on ECA3. This study is both the largest body size GWAS and the largest RLN GWAS within Thoroughbred horses to date, and suggests that improved understanding of the relationship between genetics, equine growth rate, and RLN prevalence may significantly advance our understanding and management of this disease.
Assuntos
Estudo de Associação Genômica Ampla , Doenças dos Cavalos/genética , Característica Quantitativa Herdável , Alelos , Animais , Tamanho Corporal/genética , Estudos de Casos e Controles , Mapeamento Cromossômico , Feminino , Frequência do Gene , Marcadores Genéticos , Genótipo , Haplótipos , Cavalos , Desequilíbrio de Ligação , Masculino , Fenótipo , Locos de Características QuantitativasRESUMO
Domesticated dogs have many unique behaviors not found in gray wolves that have augmented their interaction and communication with humans. The genetic basis of such unique behaviors in dogs remains poorly understood. We found that genes within regions highly differentiated between outbred Chinese native dogs (CNs) and wolves show high bias for expression localized to brain tissues, particularly the prefrontal cortex, a specific region responsible for complex cognitive behaviors. In contrast, candidate genes showing high population differentiation between CNs and German Shepherd dogs (GSs) did not demonstrate significant expression bias. These observations indicate that these candidate genes highly expressed in the brain have rapidly evolved. This rapid evolution was probably driven by artificial selection during the primary transition from wolves to ancient dogs and was consistent with the evolution of dog-specific characteristics, such as behavior transformation, for thousands of years.
Assuntos
Animais Domésticos , Evolução Biológica , Encéfalo/metabolismo , Transcriptoma , Animais , Biologia Computacional/métodos , Cães , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Anotação de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Seleção Genética , LobosRESUMO
High-throughput genotyping technologies developed for model species can potentially increase the resolution of demographic history and ancestry in wild relatives. We use a SNP genotyping microarray developed for the domestic dog to assay variation in over 48K loci in wolf-like species worldwide. Despite the high mobility of these large carnivores, we find distinct hierarchical population units within gray wolves and coyotes that correspond with geographic and ecologic differences among populations. Further, we test controversial theories about the ancestry of the Great Lakes wolf and red wolf using an analysis of haplotype blocks across all 38 canid autosomes. We find that these enigmatic canids are highly admixed varieties derived from gray wolves and coyotes, respectively. This divergent genomic history suggests that they do not have a shared recent ancestry as proposed by previous researchers. Interspecific hybridization, as well as the process of evolutionary divergence, may be responsible for the observed phenotypic distinction of both forms. Such admixture complicates decisions regarding endangered species restoration and protection.
Assuntos
Evolução Biológica , Canidae/genética , Genoma , Animais , Coiotes/genética , Cães/genética , Evolução Molecular , Genótipo , Haplótipos , Hibridização Genética , Fenótipo , Polimorfismo de Nucleotídeo Único , Lobos/genéticaRESUMO
Quantifying the number of deleterious mutations per diploid human genome is of crucial concern to both evolutionary and medical geneticists. Here we combine genome-wide polymorphism data from PCR-based exon resequencing, comparative genomic data across mammalian species, and protein structure predictions to estimate the number of functionally consequential single-nucleotide polymorphisms (SNPs) carried by each of 15 African American (AA) and 20 European American (EA) individuals. We find that AAs show significantly higher levels of nucleotide heterozygosity than do EAs for all categories of functional SNPs considered, including synonymous, non-synonymous, predicted 'benign', predicted 'possibly damaging' and predicted 'probably damaging' SNPs. This result is wholly consistent with previous work showing higher overall levels of nucleotide variation in African populations than in Europeans. EA individuals, in contrast, have significantly more genotypes homozygous for the derived allele at synonymous and non-synonymous SNPs and for the damaging allele at 'probably damaging' SNPs than AAs do. For SNPs segregating only in one population or the other, the proportion of non-synonymous SNPs is significantly higher in the EA sample (55.4%) than in the AA sample (47.0%; P < 2.3 x 10(-37)). We observe a similar proportional excess of SNPs that are inferred to be 'probably damaging' (15.9% in EA; 12.1% in AA; P < 3.3 x 10(-11)). Using extensive simulations, we show that this excess proportion of segregating damaging alleles in Europeans is probably a consequence of a bottleneck that Europeans experienced at about the time of the migration out of Africa.
Assuntos
Genoma Humano/genética , Polimorfismo de Nucleotídeo Único/genética , África/etnologia , Alelos , Biologia Computacional , Emigração e Imigração , Europa (Continente)/etnologia , Éxons/genética , Heterozigoto , Homozigoto , Humanos , Reação em Cadeia da Polimerase , Estados UnidosRESUMO
Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation and suggest the potential to use large samples to uncover variation among closely spaced populations. Here we characterize genetic variation in a sample of 3,000 European individuals genotyped at over half a million variable DNA sites in the human genome. Despite low average levels of genetic differentiation among Europeans, we find a close correspondence between genetic and geographic distances; indeed, a geographical map of Europe arises naturally as an efficient two-dimensional summary of genetic variation in Europeans. The results emphasize that when mapping the genetic basis of a disease phenotype, spurious associations can arise if genetic structure is not properly accounted for. In addition, the results are relevant to the prospects of genetic ancestry testing; an individual's DNA can be used to infer their geographic origin with surprising accuracy-often to within a few hundred kilometres.
Assuntos
Variação Genética/genética , Genética Populacional , Geografia , Emigração e Imigração , Europa (Continente)/etnologia , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Filogenia , Polimorfismo de Nucleotídeo Único , Análise de Componente Principal , Característica Quantitativa Herdável , Tamanho da AmostraRESUMO
Asian rice, Oryza sativa, is one of world's oldest and most important crop species. Rice is believed to have been domesticated â¼9,000 y ago, although debate on its origin remains contentious. A single-origin model suggests that two main subspecies of Asian rice, indica and japonica, were domesticated from the wild rice O. rufipogon. In contrast, the multiple independent domestication model proposes that these two major rice types were domesticated separately and in different parts of the species range of wild rice. This latter view has gained much support from the observation of strong genetic differentiation between indica and japonica as well as several phylogenetic studies of rice domestication. We reexamine the evolutionary history of domesticated rice by resequencing 630 gene fragments on chromosomes 8, 10, and 12 from a diverse set of wild and domesticated rice accessions. Using patterns of SNPs, we identify 20 putative selective sweeps on these chromosomes in cultivated rice. Demographic modeling based on these SNP data and a diffusion-based approach provide the strongest support for a single domestication origin of rice. Bayesian phylogenetic analyses implementing the multispecies coalescent and using previously published phylogenetic sequence datasets also point to a single origin of Asian domesticated rice. Finally, we date the origin of domestication at â¼8,200-13,500 y ago, depending on the molecular clock estimate that is used, which is consistent with known archaeological data that suggests rice was first cultivated at around this time in the Yangtze Valley of China.