RESUMEN
A major initiative to create a global human haplotype map has recently been launched as a tool to improve the efficiency of disease gene mapping. The 'HapMap' project will study common variants in depth in four (and to a lesser degree in up to 12) populations to catalogue haplotypes that are expected to be common to all populations. A hope of the 'HapMap' project is that much of the genome occurs in regions of limited diversity such that only a few of the SNPs in each region will capture the diversity and be relevant around the world. In order to explore the implications of studying only a limited number of populations, we have analyzed linkage disequilibrium (LD) patterns of three 175-320 kb genomic regions in 16 diverse populations with an emphasis on African and European populations. Analyses of these three genomic regions provide empiric demonstration of marked differences in frequencies of the same few haplotypes, resulting in differences in the amount of LD and very different sets of haplotype frequencies. These results highlight the distinction between the statistical concept of LD and the biological reality of haplotypes and their frequencies. The significant quantitative and qualitative variation in LD among populations, even for populations within a geographic region, emphasizes the importance of studying diverse populations in the HapMap project to assure broad applicability of the results.
Asunto(s)
Genoma Humano , Haplotipos , Desequilibrio de Ligamiento , Pueblo Asiatico/genética , Población Negra/genética , Mapeo Cromosómico , Frecuencia de los Genes , Humanos , Polimorfismo de Nucleótido Simple , Población Blanca/genéticaRESUMEN
We have constructed haplotypes based on normal variation at six polymorphic sites-five single nucleotide polymorphisms (SNPs) and one short tandem repeat polymorphism (STRP)-at the RET locus for samples of normal individuals from 32 populations distributed across the major continental regions of the world. The haplotyped system spans 41.6 kilobases and encompasses most of the coding region of the gene. All of the markers are polymorphic in all regions of the world and in most individual populations. Expected heterozygosities for the six-site haplotypes range from 82 to 94% for all populations studied except for two Amerindian groups from the Amazon basin at 61 and 76%. Individual populations had from four to eight haplotypes with frequencies exceeding 5%. In general, African, southwest Asian and European groups have the highest numbers of total and of commonly occurring haplotypes; the lowest numbers are observed in Amerindian populations. Overall linkage disequilibrium (LD) for the five SNP sites was very significant (P=0.001) for all the non-African populations, but significant at that level for only one of the seven African populations. In general, the permutation-based xi coefficient that quantifies overall LD tends to increase the farther the population is from Africa, but variability of this measure of LD is often large within geographic regions. Pairwise LD measures among the SNPs also show considerable variation among populations. Association of STRP alleles with the SNP-defined background haplotypes is generally higher outside of Africa than in Africa, but is highly variable.
Asunto(s)
Haplotipos , Desequilibrio de Ligamiento , Proteínas Proto-Oncogénicas/genética , Proteínas Tirosina Quinasas Receptoras/genética , Alelos , Exones , Frecuencia de los Genes , Variación Genética , Genética de Población , Heterocigoto , Humanos , Modelos Genéticos , Datos de Secuencia Molecular , Mapeo Físico de Cromosoma , Polimorfismo Genético , Polimorfismo de Nucleótido Simple , Proteínas Proto-Oncogénicas c-ret , Secuencias Repetidas en TándemRESUMEN
The ADH1B Arg47His polymorphism has been convincingly associated with alcoholism in numerous studies of several populations in Asia and Europe. In a review of literature from the past 30 years, we have identified studies that report allele frequencies of this polymorphism for 131 population samples from many different parts of the world. The derived ADH1B*47His allele reaches high frequencies only in western and eastern Asia. To pursue this pattern, we report here new frequency data for 37 populations. Most of our data are from South and Southeast Asia and confirm that there is a low frequency of this allele in the region between eastern and western Asia. The distribution suggests that the derived allele increased in frequency independently in western and eastern Asia after humans had spread across Eurasia.
Asunto(s)
Alcohol Deshidrogenasa/genética , Alelos , Asia , Asia Occidental , Pueblo Asiatico/genética , Secuencia de Bases , Cartilla de ADN/genética , Flujo Génico , Frecuencia de los Genes , Flujo Genético , Humanos , Selección GenéticaRESUMEN
Recent studies suggest that haplotypes tend to have block-like structures throughout the human genome. Several methods were proposed for haplotype block partitioning and for tagging single-nucleotide polymorphism (SNP) identification. In population genetics studies, several research groups compared block structures across human populations. However, the measures used to quantify population similarity are either less than satisfactory or nonexistent. In this article, we propose several similarity measures to facilitate the comparisons of haplotype structures, namely block boundaries and tagging SNPs, across populations. With these measures, we can more objectively compare haplotype block structures and tagging SNP sets between different populations. In addition, these measures allow us to compare the results of different methods for block partition and tagging SNP identification. When we applied these measures to a real data set on chromosome 10 in 16 worldwide populations, we found that in this genome region: 1) haplotype block boundaries vary among populations, with European and some African populations showing similar boundaries but other populations showing other patterns; 2) tagging SNP sets are generally similar for populations with similar haplotype block structures but differ if the block structures differ; and 3) all but one of the block finding methods we tested yield consistent results, although variations exist regarding consistency. Our tentative results show that at least in the genome region studied, it is unlikely that a common haplotype pattern exists for all human populations: many populations, even in the same geographical region, may have different haplotype patterns.
Asunto(s)
Cromosomas Humanos Par 10/genética , Predisposición Genética a la Enfermedad/epidemiología , Variación Genética , Genética de Población , Haplotipos , Modelos Genéticos , Polimorfismo de Nucleótido Simple/genética , Mapeo Cromosómico/métodos , Genoma Humano , Humanos , Modelos EstadísticosRESUMEN
We report a comprehensive statistical analysis of data on 58 DNA markers (mitochondrial [mt], Y-chromosomal, and autosomal) and sequence data of the mtHVS1 from a large number of ethnically diverse populations of India. Our results provide genomic evidence that (1) there is an underlying unity of female lineages in India, indicating that the initial number of female settlers may have been small; (2) the tribal and the caste populations are highly differentiated; (3) the Austro-Asiatic tribals are the earliest settlers in India, providing support to one anthropological hypothesis while refuting some others; (4) a major wave of humans entered India through the northeast; (5) the Tibeto-Burman tribals share considerable genetic commonalities with the Austro-Asiatic tribals, supporting the hypothesis that they may have shared a common habitat in southern China, but the two groups of tribals can be differentiated on the basis of Y-chromosomal haplotypes; (6) the Dravidian tribals were possibly widespread throughout India before the arrival of the Indo-European-speaking nomads, but retreated to southern India to avoid dominance; (7) formation of populations by fission that resulted in founder and drift effects have left their imprints on the genetic structures of contemporary populations; (8) the upper castes show closer genetic affinities with Central Asian populations, although those of southern India are more distant than those of northern India; (9) historical gene flow into India has contributed to a considerable obliteration of genetic histories of contemporary populations so that there is at present no clear congruence of genetic and geographical or sociocultural affinities.