RESUMO
Four single nucleotide polymorphism (SNP)-based human leukocyte antigen (HLA) imputation methods (e-HLA, HIBAG, HLA*IMP:02 and MAGPrediction) were trained using 1000 Genomes SNP and HLA genotypes and assessed for their ability to accurately impute molecular HLA-A, -B, -C and -DRB1 genotypes in the Human Genome Diversity Project cell panel. Imputation concordance was high (>89%) across all methods for both HLA-A and HLA-C, but HLA-B and HLA-DRB1 proved generally difficult to impute. Overall, <27.8% of subjects were correctly imputed for all HLA loci by any method. Concordance across all loci was not enhanced via the application of confidence thresholds; reliance on confidence scores across methods only led to noticeable improvement (+3.2%) for HLA-DRB1. As the HLA complex is highly relevant to the study of human health and disease, a standardized assessment of SNP-based HLA imputation methods is crucial for advancing genomic research. Considerable room remains for the improvement of HLA-B and especially HLA-DRB1 imputation methods, and no imputation method is as accurate as molecular genotyping. The application of large, ancestrally diverse HLA and SNP reference data sets and multiple imputation methods has the potential to make SNP-based HLA imputation methods a tractable option for determining HLA genotypes.
Assuntos
Genoma Humano/genética , Antígenos HLA/genética , Haplótipos , Polimorfismo de Nucleotídeo Único/genética , Alelos , Variação Genética , Estudo de Associação Genômica Ampla , Genótipo , Antígenos HLA/classificação , Antígenos HLA-A/genética , Antígenos HLA-B/genética , Antígenos HLA-C/genética , Cadeias HLA-DRB1/genética , Humanos , População BrancaRESUMO
Knowledge of an individual's human leukocyte antigen (HLA) genotype is essential for modern medical genetics, and is crucial for hematopoietic stem cell and solid-organ transplantation. However, the high levels of polymorphism known for the HLA genes make it difficult to generate an HLA genotype that unambiguously identifies the alleles that are present at a given HLA locus in an individual. For the last 20 years, the histocompatibility and immunogenetics community has recorded this HLA genotyping ambiguity using allele codes developed by the National Marrow Donor Program (NMDP). While these allele codes may have been effective for recording an HLA genotyping result when initially developed, their use today results in increased ambiguity in an HLA genotype, and they are no longer suitable in the era of rapid allele discovery and ultra-high allele polymorphism. Here, we present a text string format capable of fully representing HLA genotyping results. This Genotype List (GL) String format is an extension of a proposed standard for reporting killer-cell immunoglobulin-like receptor (KIR) genotype data that can be applied to any genetic data that use a standard nomenclature for identifying variants. The GL String format uses a hierarchical set of operators to describe the relationships between alleles, lists of possible alleles, phased alleles, genotypes, lists of possible genotypes, and multilocus unphased genotypes, without losing typing information or increasing typing ambiguity. When used in concert with appropriate tools to create, exchange, and parse these strings, we anticipate that GL Strings will replace NMDP allele codes for reporting HLA genotypes.
Assuntos
Algoritmos , Técnicas de Genotipagem/normas , Antígenos HLA/imunologia , Transplante de Células-Tronco Hematopoéticas , Teste de Histocompatibilidade/normas , Transplante de Órgãos , Receptores KIR/imunologia , Alelos , Frequência do Gene , Genótipo , Técnicas de Genotipagem/estatística & dados numéricos , Antígenos HLA/genética , Teste de Histocompatibilidade/estatística & dados numéricos , Humanos , Polimorfismo Genético , Receptores KIR/genética , Análise de Sequência de DNA , Terminologia como Assunto , Doadores não RelacionadosRESUMO
We have updated the catalogue of common and well-documented (CWD) human leukocyte antigen (HLA) alleles to reflect current understanding of the prevalence of specific allele sequences. The original CWD catalogue designated 721 alleles at the HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, and -DPB1 loci in IMGT (IMmunoGeneTics)/HLA Database release 2.15.0 as being CWD. The updated CWD catalogue designates 1122 alleles at the HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, -DPA1 and -DPB1 loci as being CWD, and represents 14.3% of the HLA alleles in IMGT/HLA Database release 3.9.0. In particular, we identified 415 of these alleles as being 'common' (having known frequencies) and 707 as being 'well-documented' on the basis of ~140,000 sequence-based typing observations and available HLA haplotype data. Using these allele prevalence data, we have also assigned CWD status to specific G and P designations. We identified 147/151 G groups and 290/415 P groups as being CWD. The CWD catalogue will be updated on a regular basis moving forward, and will incorporate changes to the IMGT/HLA Database as well as empirical data from the histocompatibility and immunogenetics community. This version 2.0.0 of the CWD catalogue is available online at cwd.immunogenomics.org, and will be integrated into the Allele Frequencies Net Database, the IMGT/HLA Database and National Marrow Donor Program's bioinformatics web pages.
Assuntos
Alelos , Antígenos HLA/classificação , Antígenos HLA/imunologia , Histocompatibilidade/imunologia , Bases de Dados Genéticas , Frequência do Gene , Loci Gênicos/imunologia , Genética Populacional , Antígenos HLA/genética , Histocompatibilidade/genética , Teste de Histocompatibilidade , Humanos , Terminologia como AssuntoRESUMO
SUMMARY: The goal of the immunogenomic data analysis working group (IDAWG) is to facilitate the consistent analysis of HLA and KIR data, and the sharing of those data among the immunogenomic and larger genomic communities. However, the data management approaches currently applied by immunogenomic researchers are not widely discussed or reported in the literature, and the effect of different approaches on data analyses is not known. With ASHI's support, the IDAWG developed a 45 question survey on HLA and KIR data generation, data management and data analysis practices. Survey questions detailed the loci genotyped, typing systems used, nomenclature versions reported, computer operating systems and software used to manage and transmit data, the approaches applied to resolve HLA ambiguity and the methods used for basic population-level analyses. Respondents were invited to demonstrate their HLA ambiguity resolution approaches in simulated data sets. By May 2012, 156 respondents from 35 nations had completed the survey. These survey respondents represent a broad sampling of the Immunogenomic community; 52% were European, 30% North American, 10% Asian, 4% South American and 4% from the Pacific. The project will continue in conjunction with the 17th Workshop, with the aim of developing community data sharing standards, ambiguity resolution documentation formats, single-task data Management tools and novel data analysis methods and applications. While additional project details and plans for the 17th IHIW will be forthcoming, we welcome the input and participation in these projects from the histocompatibility and immunogenetics community.
Assuntos
Etnicidade/genética , Genoma Humano/imunologia , Antígenos HLA , Receptores KIR/genética , Coleta de Dados , Genética Populacional , Antígenos HLA/genética , Antígenos HLA/imunologia , Humanos , Inquéritos e QuestionáriosRESUMO
Continuing a project presented at the 15th International HLA and Immunogenetics Workshop (IHIWS) on the rarity of HLA alleles, we sought to expand the number of data sources and bioinformatics tools available in the Allele Frequencies Net Database website (AFND, www.allelefrequencies.net). In this 16th IHIWS Rare Alleles project, HLA alleles described in the latest IMGT/HLA Database (release 3.8.0) were queried against different sources including data from registries (stem cell) and from 74 different laboratories around the world. We demonstrated that approximately 40% of the alleles officially named in the IMGT/HLA Database have been reported only once across all different sources. To facilitate the large-scale analysis of rare alleles, we have produced an online tool called the Rare Allele Detector that simplifies the detection of alleles that are considered to be 'very rare', 'rare' or 'frequent'. Tools and associated data can be accessed via the www.allelefrequencies.net website.
Assuntos
Alelos , Antígenos HLA , Imunogenética , Biologia Computacional , Bases de Dados Factuais , Frequência do Gene , Antígenos HLA/genética , Antígenos HLA/imunologia , Humanos , Internet , Grupos Populacionais/genéticaRESUMO
Modern high-throughput HLA and KIR typing technologies are generating a wealth of immunogenomic data with the potential to revolutionize the fields of histocompatibility and immune-related disease association and population genetic research, much as SNP-based approaches have revolutionized association research. The STrengthening the REporting of Genetic Association studies (STREGA) statement provides community-based data reporting and analysis standards for genomic disease-association studies, identifying specific areas in which adoption of reporting guidelines can improve the consistent interpretation of genetic studies. While aspects of STREGA can be applied to immunogenomic studies, HLA and KIR research requires additional consideration, as the high levels of polymorphism associated with immunogenomic data pose unique methodological and computational challenges to the synthesis of information across datasets. Here, we outline the principle challenges to consistency in immunogenomic studies, and propose that an immunogenomic-specific analog to the STREGA statement, a STrengthening the REporting of Immunogenomic Studies (STREIS) statement, be developed as part of the 16th International HLA and Immunogenetics Workshop. We propose that STREIS extends at least four of the 22 elements of the STREGA statement to specifically address issues pertinent to immunogenomic data: HLA and KIR nomenclature, data-validation, ambiguity resolution, and the analysis of highly polymorphic genetic systems. As with the STREGA guidelines, the intent behind STREIS is not to dictate the design of immunogenomic studies, but to ensure consistent and transparent reporting of research, facilitating the synthesis of HLA and KIR data across studies.
Assuntos
Genética Populacional/métodos , Genômica/métodos , Projetos de Pesquisa/normas , Estudo de Associação Genômica Ampla , Genômica/estatística & dados numéricos , Antígenos HLA/imunologia , HumanosRESUMO
In this brief communication, we describe the Allele Name Translation Tool (antt) and Update NomenCLature (uncl), free programs developed to facilitate the translation of human leukocyte antigen (HLA) allele names recorded using the December 2002 version of the HLA allele nomenclature (e.g. A*01010101) to those recorded using the colon-delimited version of the HLA allele nomenclature (e.g. A*01:01:01:01) that was adopted in April 2010. In addition, the antt and uncl translate specific HLA allele-name changes (e.g. DPB1*0502 is translated to DPB1*104:01), as well as changes to the locus prefix for HLA-C (i.e. Cw* is translated to C*). The antt and uncl will also translate allele names that have been truncated to two, four, or six digits, as well as ambiguous allele strings. The antt is a locally installed and run application, while uncl is a web-based tool that requires only an Internet connection and a modern browser. The antt accepts a variety of HLA data-presentation and allele-name formats. In addition, the antt can translate using user-defined conversion settings (e.g. the names of alleles that encode identical peptide binding domains can be translated to a common 'P-code'), and can serve as a preliminary data-sanity tool. The antt is available for download, and uncl for use, at www.igdawg.org/software.
Assuntos
Alelos , Biologia Computacional/métodos , Antígenos HLA/química , Automação , Genômica , Humanos , Sistema Imunitário , Software , Terminologia como AssuntoRESUMO
The Type I Diabetes Genetics Consortium (T1DGC) has collected thousands of multiplex and simplex families with type I diabetes (T1D) with the goal of identifying genes involved in T1D susceptibility. These families have all been genotyped for the HLA class I and class II loci and a subset of samples has been typed for an major histocompatibility complex (MHC) single-nucleotide polymorphism (SNP) panel. In addition, the T1DGC has genotyped SNPs in candidate genes to evaluate earlier reported T1D associations. Individual SNPs and SNP haplotypes in IL4R, which encodes the alpha-chain of the IL4 and IL13 receptors, have been associated with T1D in some reports, but not in others. In this study, 38 SNPs in IL4R were genotyped using the Sequenom iPLEX Gold MassARRAY technology in 2042 multiplex families from nine cohorts. Association analyses (transmission-disequilibrium test and parental-disequilibrium test) were performed on individual SNPs and on three-SNP haplotypes. Analyses were also stratified on the high-risk HLA DR3/DR4-DQB1*0302 genotype. A modest T1D association in HBDI families (n=282) was confirmed in this larger collection of HBDI families (n=424). The variant alleles at the non-synonymous SNPs (rs1805011 (E400A), rs1805012 (C431R), and rs1801275 (Q576R)), which are in strong linkage disequilibrium, were negatively associated with T1D risk. These SNPs were more associated with T1D among non-DR3/DR4-DQB1*0302 genotypes than DR3/DR4-DQB1*0302 genotypes. This association was stronger, both in terms of odds ratio and P-values, than the initial report of the smaller collection of HBDI families. However, the IL4R SNPs and the three-SNP haplotype containing the variant alleles were not associated with T1D in the total data. Thus, in the overall families, these results do not show evidence for an association of SNPs in IL4R with T1D.
Assuntos
Diabetes Mellitus Tipo 1/genética , Loci Gênicos , Predisposição Genética para Doença , Subunidade alfa de Receptor de Interleucina-4/análise , Polimorfismo de Nucleotídeo Único , Alelos , Diabetes Mellitus Tipo 1/imunologia , Genótipo , Humanos , Subunidade alfa de Receptor de Interleucina-4/genética , Subunidade alfa de Receptor de Interleucina-4/imunologia , Fatores de RiscoRESUMO
Sequence-based typing was used to identify human leukocyte antigen (HLA)-A, -B, -C, and -DRB1 alleles from 558 consecutively recruited US volunteers with Eastern European ancestry for an unrelated hematopoietic stem cell registry. Four of 31 HLA-A alleles, 29 HLA-C alleles, 59 HLA-B alleles, and 42 HLA-DRB1 alleles identified (A*0325, B*440204, Cw*0332, and *0732N) are novel. The HLA-A*02010101g allele was observed at a frequency of 0.28. Two-, three-, and four-locus haplotypes were estimated using the expectation-maximization algorithm. The highest frequency extended haplotypes (A*010101g-Cw*070101g-B*0801g-DRB1*0301 and A*03010101g-Cw*0702-B*0702-DRB1*1501) were observed at frequencies of 0.04 and 0.03, respectively. Linkage disequilibrium values (Dij') of the constituent two-locus haplotypes were highly significant for both extended haplotypes (P values were less than 8 x 10(-10)) but were consistently higher for the more frequent haplotype. Balancing selection was inferred to be acting on all the four loci, with the strongest evidence of balancing selection observed for the HLA-C locus. Comparisons of the A-C-B haplotypes and DRB1 frequencies in this population with those for African, European, and western Asian populations showed high degrees of identity with Czech, Polish, and Slovenian populations and significant differences from the general European American population.
Assuntos
Frequência do Gene/genética , Antígenos HLA/genética , Haplótipos/genética , População Branca/genética , Alelos , Europa (Continente)/etnologia , Europa Oriental/etnologia , Variação Genética , Antígenos HLA-A/genética , Antígenos HLA-B/genética , Antígenos HLA-C/genética , Antígenos HLA-DR/genética , Cadeias HLA-DRB1 , HumanosRESUMO
The HLA allele frequency distributions have been characterized for the HLA class I and class II loci of the Golla pastoral caste, from Southeast India, subdivided into the subcastes (Puja, Punugu, Kurava, Pokanati, Karnam, and Doddi). Genetic distances, neighbor-joining, correspondence, and haplotype analyses all indicate that the subcastes exhibit a high haplotype variability and that their genetic substratum may be the result of European-Middle East/Asian admixture with the autochthonous populations. The Karnam subcaste seems to be the one that has undergone a higher degree of admixture, when compared with the other subcastes. The Golla speak an old Indian Dravidian language and should theoretically represent the basic Indian substratum that existed before the postulated "Aryan" invasion.
Assuntos
Variação Genética , Antígenos HLA/genética , Classe Social , Adulto , Alelos , Frequência do Gene/genética , Antígenos HLA-A/genética , Antígenos HLA-B/genética , Antígenos HLA-DR , Cadeias HLA-DRB1 , Humanos , Índia , Desequilíbrio de Ligação/genéticaRESUMO
Hereditary muscular disease is well described in racehorses, yet little is known about traumatic muscle disease associated with unaccustomed exercise or training. The objective of the study was to compare sedentary horses, racehorses undergoing training for the first time (unaccustomed exercise), and experienced racehorses during a training season (accustomed exercise) to investigate the effect of exercise and training on serum muscle enzyme activities and other variables. Horses were sampled prior to exercise for serum activities of aspartate amino transferase (AST), creatine kinase and other variables γ glutamyl transferase (GGT) and serum amyloid A (SAA) in a three-part study. Serum activities of AST and GGT were higher in fit racehorses (n=47) compared with sedentary horses (n=57) at a single time point (P<0.05). The monthly serum activity of AST in two-year-old racehorses (n=10) increased from month 1 to 4 of unaccustomed training (P<0.05). The serum activities of AST and GGT in three-year-old racehorses (n=12) previously accustomed to exercise sampled fortnightly to monthly showed a linear increase (P<0.05) with cumulative training days over the seven months of training, but showed minimal increase during the first four months of accustomed training. SAA was weakly correlated to cumulative training days. In conclusion, AST activity was increased by unaccustomed exercise and cumulative training stress in the racehorse. GGT appeared to be correlated to cumulative training load. Mild to moderate elevations in serum AST in racehorses may be associated with cumulative muscle damage from training or trauma associated with unaccustomed exercise.
Assuntos
Aspartato Aminotransferases/sangue , Creatina Quinase/sangue , Condicionamento Físico Animal/fisiologia , Proteína Amiloide A Sérica/metabolismo , gama-Glutamiltransferase/sangue , Animais , Feminino , Doenças dos Cavalos/sangue , Cavalos , Masculino , Doenças Musculares/sangue , Doenças Musculares/veterinária , Condicionamento Físico Animal/estatística & dados numéricos , Aptidão Física/fisiologia , Comportamento Sedentário , EsportesRESUMO
The Biostatistics Component of the 13th International Histocompatibility Workshop (IHWS) developed the PyPop (Python for Population Genomics) software framework for high-throughput analysis and quality control (QC) assessments of highly polymorphic genotype data. Since its initial release, the software has had several new analysis modules added to it. These additions, combined with improved data filtering and QC modules, facilitate analyses of data at different levels (allele, haplotype, amino acid sequence, and nucleotide sequence). Since the 13th IHWS, much of the human leukocyte antigen (HLA) data from the workshop, QCed via PyPop and other methods, have been made publicly available through the Major Histocompatibility Complex database web site at the National Center for Biotechnology Information (http://ncbi.nih.gov/mhc/). The Anthropology/Human Genetic Diversity component (AHGDC) data have been used in a variety of studies. Prugnolle et al. used this data to corroborate a model of pathogen-driven selection as a factor related to high levels of diversity at HLA loci. Using a comparative genomics approach contrasting results for HLA and non-HLA markers, Meyer et al. analyzed a subset of the 13th IHWS AHGDC data and showed that HLA loci show detectable signs of both natural selection and the demographic history of populations.
Assuntos
Bases de Dados como Assunto , Antígenos HLA/genética , Teste de Histocompatibilidade/métodos , Imunogenética , Complexo Principal de Histocompatibilidade/imunologia , Coleta de Dados , Genética Populacional , Antígenos HLA/imunologia , Humanos , InternetRESUMO
The 14th International HLA (human leukocyte antigen) Immunogenetics Workshop (14th-IHIWS) Biostatistics and Anthropology/Human Genetic Diversity project continues the population sampling, genotype data generation, and biostatistic analyses of the 13th International Histocompatibility Workshop Anthropology/Human Genetic Diversity Component, with the overall goal of further characterizing global HLA allele and haplotype diversity and better describing the relationships between major histocompatibility complex diversity, geography, linguistics, and population history. Since the 13th Workshop, new investigators have and continue to be recruited to the project and new high-resolution class I and class II genotype data are being generated for 112 population samples from around the world.
Assuntos
Técnicas Genéticas , Genética Populacional , Antígenos HLA/genética , Haplótipos/genética , Antígenos de Histocompatibilidade Classe II/genética , Antígenos de Histocompatibilidade Classe I/genética , Grupos Populacionais/genética , Genótipo , HumanosRESUMO
Sequence-based typing was used to identify human leukocyte antigen (HLA)-A, -B, -C, and -DRB1 alleles from 564 consecutively recruited African American volunteers for an unrelated hematopoietic stem cell registry. The number of known alleles identified at each locus was 42 for HLA-A, HLA-B 67, HLA-C 33, and HLA-DRB1 44. Six novel alleles (A*260104, A*7411, Cw*0813, Cw*1608, Cw*1704, and DRB1*130502) not observed in the initial sequence-specific oligonucleotide probe testing were characterized. The action of balancing selection, shaping more 'even' than expected allele frequency distributions, was inferred for all four loci and significantly so for the HLA-A and DRB1 loci. Two-, three-, and four-locus haplotypes were estimated using the expectation maximization algorithm. Comparisons with other populations from Africa and Europe suggest that the degree of European admixture in the African American population described here is lower than that in other African American populations previously reported, although HLA-A:B haplotype frequencies similar to those in previous studies of African American individuals were also noted.
Assuntos
Negro ou Afro-Americano/genética , Frequência do Gene , Antígenos HLA-DR/genética , Haplótipos/genética , Antígenos de Histocompatibilidade Classe I/genética , Antígenos HLA-A/genética , Antígenos HLA-B/genética , Antígenos HLA-C/genética , Cadeias HLA-DRB1 , HumanosRESUMO
The alleles at the HLA class II loci HLA-DRB1, DQA1, DQB1 and DPB1 were determined for 49 individuals of the Ticuna, a Native South American population living in Brazil, using PCR/SSO probe hybridization and DNA sequencing. A newly described DRB1*08 variant, DRB1*0807, which has previously been reported only in native Colombians and contemporary Brazilians of African and Caucasian descent, was identified in the Ticuna at a high frequency (f=0.225). Because *0807 has been observed only in South American populations, we propose that it was generated from a parental *0802 allele recently, after the isolation of various Native South American populations, and infer that the DRB1*0807 allele was generated by a C to T change at codon 57 (Asp-->Val, GAT-->GTT) from the ancestral *0802. This inference is supported by the sequence of a complex VNTR in the second intron of the DRB1 gene. The DPB1 alleles *0401, *0402 and *1401 constituted 76% of the observed Ticuna DPB1 alleles (f=0.166, 0.427 and 0.166 respectively). In addition, the DPB1 allele *3501, which has been observed in a few other Native South American groups, was observed at a frequency of 0.053 and may have been generated from the putative ancestral allele *1401 allele in South America. The DRB1 and DPB1 allele frequencies for the Ticuna deviate from expected Hardy-Weinberg equilibrium proportions, while DQA1 and DQB1 allele frequencies do not. When this deviation, which involves an observed excess of DRB1*0807 heterozygotes, is considered with the high frequency of the DRB1*0807 and DPB1*1401 alleles, we infer that native South American populations may have been under selection pressure for increased allele diversity.
Assuntos
Alelos , Evolução Molecular , Antígenos HLA-DR/genética , Indígenas Sul-Americanos/genética , Polimorfismo Genético , África , Brasil/etnologia , Mapeamento Cromossômico , DNA Mitocondrial , Frequência do Gene , Antígenos HLA-DP/genética , Cadeias beta de HLA-DP , Antígenos HLA-DQ/genética , Cadeias alfa de HLA-DQ , Cadeias beta de HLA-DQ , Cadeias HLA-DRB1 , Haplótipos , Homozigoto , Humanos , Desequilíbrio de Ligação , Modelos GenéticosRESUMO
The analysis of HLA allele frequencies in various Amerindian populations may shed light on the history of human migrations in the Americas; the overall reduction in the number of alleles relative to non-Amerindian populations and the observation that the same alleles and allelic lineages are "missing" in all Amerindian groups suggests an "Into America" population bottleneck. The identification of previously unreported (and presumably newly arisen) HLA-DRB1 alleles among isolated Amerindian groups, (DRB*0417 in Argentina, *08042 in Ecuador, DRB1*0807 in Brazil, and *0811 in Canada) suggests that these alleles may have been generated since the colonization of the Americas (about 20-30,000 years ago). These observations are difficult to reconcile with the notion, based on the analysis of exon-2 sequences, that most of the human DRB1 alleles are "ancient", that is, predate the divergence of the hominoids (4-7 myr). Recent analyses of DRB1 intron sequences, however, indicate that, although most of the allelic lineages are ancient, the alleles within a lineage (> 90% of the DRB1 alleles) have arisen relatively recently. For DRB1*0807, presumably generated by an Asp to Val change (GAT to GTT) at codon 57, strong selective pressures appear to be in operation, based on the high frequency (23%), and linkage disequilibrium patterns of this allele. The analysis of a complex microsatellite in the second intron in the Ticuna is consistent with the notion that the new Amerindian DR8 alleles arose from DRB1*0802, the only DR8 allele observed in most Amerindian populations.
Assuntos
Alelos , Evolução Molecular , Efeito Fundador , Genes MHC da Classe II , Antígenos HLA-DR/genética , Indígenas Norte-Americanos/genética , América , Ásia/etnologia , Emigração e Imigração , Frequência do Gene , Genótipo , Cadeias HLA-DRB1 , História Antiga , Humanos , Indígenas Norte-Americanos/história , Indígenas Sul-Americanos , FilogeniaRESUMO
To examine the genetic diversity in west Africa, class I HLA-A and HLA-B alleles of 92 unrelated individuals from two areas in the Cameroon, the capital Yaoundé and the village of Etoa, were identified by direct automated DNA sequencing of exons 2 and 3 of the HLA-B locus alleles and sequence-specific oligonucleotide probe (SSOP) and/or sequencing of the HLA-A locus alleles. HLA-A*2301 (18.7%), A*2902 (10.4%), B*5301 (10.9%), and B*5802 (10.9%) were the most frequently detected alleles, present in at least 10% of the population. A total of 30 HLA-A locus and 33 HLA-B locus alleles, including six novel alleles, were detected. The novel alleles were HLA-A*03012, A*2612, A*3006 and HLA-B*1403, B*4016, and B*4703. HLA-B*4703 contains a novel amino acid sequence that is a combination of the first 5 amino acids of the Bw6 epitope and the last 2 residues of the Bw4 epitope. The addition of 6 alleles to the ever-expanding number of known class I HLA alleles supports our hypothesis that extensive genetic diversity, including previously undescribed alleles, would be observed in this African population. In the Yaoundé population, the allele frequency distribution at the HLA-A locus is consistent with distributions indicative of balancing selection. Extensive HLA-A-B haplotypes were observed in this population suggesting that only a fraction of the Cameroon HLA-A-B haplotype diversity has been observed.
Assuntos
Variação Genética , Antígenos HLA-A/genética , Antígenos HLA-B/genética , Alelos , Sequência de Bases , Camarões , Antígenos HLA-A/imunologia , Antígenos HLA-B/imunologia , Humanos , Dados de Sequência MolecularRESUMO
The distributions of HLA-A alleles in six Pacific/Asian populations, Malay, Papua New Guinea (PNG) Highlands, two Indonesian groups, and two PNG Lowland groups, as well as the distribution of the HLA-B alleles in the PNG Highlands population, were determined using polymerase chain reaction (PCR) immobilized sequence-specific oligonucleotide (SSO) probe typing methods. The allele frequency distributions at the HLA class II loci, DRB1, DQB1 and DPB1 were also determined by PCR/SSO methods in an additional study of the same populations. In most of these populations, the HLA-A*2402 allele was the most frequent, attaining a frequency of 0.78 in the PNG Highlands. A*1101 was the next most frequent allele, followed in frequency by the *3401 allele. The HLA-B*1506, *4001, *5601 and *5602 alleles comprised 73% of the allele diversity at the B-locus in the PNG Highlands. Two previously unreported HLA-A alleles were identified in Indonesians and Malays, based on novel probe reactivity patterns. Cloning and sequencing identified these as A*1104 and *2410. Sequence comparisons show that these new alleles differ at codon 187 from their putative parental alleles (*1101 and *2403) by dinucleotide changes in the first two codon positions. These changes involve a Thr to Arg (CG to AC) and an Arg to Thr substitution (AC to CG) at position 187; residues at this position participate in pocket A of the peptide binding groove. Comparison of the HLA-A allele frequency distributions indicate that Malays are the most diverse (heterozygosity (h)=0.88) and the PNG Highlanders are, by far, the least diverse (h=0.37) of the groups studied. However, the diversity of B-locus alleles in the PNG Highlanders (h=0.91) was greater than that observed at the A-locus of any of the populations reported here. The remarkably high allele frequency of A*2402 in the PNG Highlands could reflect founder effects and population bottlenecks, genetic drift, or positive directional selection. The distribution of the HLA-B locus alleles and class II alleles, as well as mtDNA sequence data in the PNG Highlands indicates a reasonably high level of diversity at other loci, arguing that the high frequency of A*2402 cannot be attributed entirely to founder effects, bottlenecks, or drift and suggests the operation of positive selection for the A*2402 allele in this population.