ABSTRACT
The SNP-HLA Reference Consortium (SHLARC), a component of the 18th International HLA and Immunogenetics Workshop, aims to collect diverse and extensive human leukocyte antigen (HLA) data to create custom reference panels and enhance HLA imputation techniques. Genome-wide association studies (GWAS) have significantly contributed to identifying genetic associations with various diseases. The HLA genomic region has emerged as the top locus in GWAS, particularly in immune-related disorders. However, the limited information provided by single nucleotide polymorphisms (SNPs), the hallmark of GWAS, poses challenges, especially in the HLA region, where strong linkage disequilibrium (LD) spans several megabases. HLA imputation techniques have been developed using statistical inference in response to these challenges. These techniques enable the prediction of HLA alleles from genotyped GWAS SNPs. Here we present the SHLARC activities, a collaborative effort to create extensive, multi-ethnic reference panels to enhance HLA imputation accuracy.
Subject(s)
Genome-Wide Association Study , Single Nucleotide Polymorphism , Humans , Immunogenetics , Alleles , HLA Antigens/genetics , Genotype
ABSTRACT
The MHC class I region contains crucial genes for the innate and adaptive immune response, playing a key role in susceptibility to many autoimmune and infectious diseases. Genome-wide association studies have identified numerous disease-associated SNPs within this region. However, these associations do not fully capture the immune-biological relevance of specific HLA alleles. HLA imputation techniques may leverage available SNP arrays by predicting allele genotypes based on the linkage disequilibrium between SNPs and specific HLA alleles. Successful imputation requires diverse and large reference panels, especially for admixed populations. This study employed a bioinformatics approach to call SNPs and HLA alleles in multi-ethnic samples from the 1000 Genomes (1KG) dataset and admixed individuals from Brazil (SABE), utilising 30X whole-genome sequencing data. Using HIBAG, we created three reference panels: 1KG (n = 2504), SABE (n = 1171), and the full model (n = 3675) encompassing all samples. In extensive cross-validation of these reference panels, the multi-ethnic 1KG reference performed better overall than the reference built only from Brazilian samples. However, the best results were achieved with the full model. Additionally, we expanded the scope of imputation by developing reference panels for non-classical, MICA, MICB and HLA-H genes, previously unavailable for multi-ethnic populations. Validation in an independent Brazilian dataset showcased the superiority of our reference panels over the Michigan Imputation Server, particularly in predicting HLA-B alleles among Brazilians. Our investigations underscored the need to enhance or adapt reference panels to encompass the target population's genetic diversity, emphasising the significance of multi-ethnic references for accurate imputation across different populations.
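As an illustration of how imputation accuracy might be scored in a validation of this kind, the Python sketch below computes allele-level concordance between imputed and sequencing-based HLA genotypes. It is not the HIBAG workflow used in the study; the function name `allele_concordance` and the toy genotypes are hypothetical and chosen only for the example.

```python
from collections import Counter

def allele_concordance(truth, imputed):
    """Fraction of alleles imputed correctly, ignoring phase.

    truth, imputed: lists of 2-tuples of allele names, one tuple per individual.
    Each individual contributes 0, 1 or 2 matching alleles.
    """
    if len(truth) != len(imputed):
        raise ValueError("truth and imputed must have the same length")
    matched = 0
    for true_pair, imputed_pair in zip(truth, imputed):
        # Multiset intersection: counts shared alleles regardless of order.
        shared = Counter(true_pair) & Counter(imputed_pair)
        matched += sum(shared.values())
    return matched / (2 * len(truth))

# Hypothetical genotypes, invented for illustration (not study data).
truth = [("B*07:02", "B*08:01"), ("B*15:01", "B*15:01"), ("B*44:03", "B*35:01")]
imputed = [("B*08:01", "B*07:02"), ("B*15:01", "B*07:02"), ("B*44:02", "B*35:01")]

accuracy = allele_concordance(truth, imputed)
print(f"{accuracy:.3f}")  # 4 of 6 alleles match -> 0.667
```

Scoring per allele rather than per genotype gives partial credit to individuals with one correct call, which is one common convention; a stricter per-genotype score would count only the first individual here as correct.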
Subject(s)
Alleles , Ethnicity , Gene Frequency , Single Nucleotide Polymorphism , Humans , Brazil , Ethnicity/genetics , HLA Antigens/genetics , Linkage Disequilibrium , Genome-Wide Association Study/methods , Genotype , Population Genetics/methods , Histocompatibility Antigens Class I/genetics , Computational Biology/methods
ABSTRACT
In his 1972 paper 'The apportionment of human diversity', Lewontin showed that, when averaged over loci, genetic diversity is predominantly attributable to differences among individuals within populations. However, selection can alter the apportionment of diversity of specific genes or genomic regions. We examine genetic diversity at the human leucocyte antigen (HLA) loci, located within the major histocompatibility complex (MHC) region. HLA genes code for proteins that are critical to adaptive immunity and are well-documented targets of balancing selection. The single-nucleotide polymorphisms (SNPs) within HLA genes show strong signatures of balancing selection on large timescales and are broadly shared among populations, displaying low FST values. However, when we analyse haplotypes defined by these SNPs (which define 'HLA alleles'), we find marked differences in frequencies between geographic regions. These differences are not reflected in the FST values because of the extreme polymorphism at HLA loci, illustrating challenges in interpreting FST. Differences in the frequency of HLA alleles among geographic regions are relevant to bone-marrow transplantation, which requires genetic identity at HLA loci between patient and donor. We discuss the case of Brazil's bone marrow registry, where a deficit of enrolled volunteers with African ancestry reduces the chance of finding donors for individuals with an MHC region of African ancestry. This article is part of the theme issue 'Celebrating 50 years since Lewontin's apportionment of human diversity'.
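The point about FST and extreme polymorphism can be made concrete with a small calculation. The Python sketch below uses the heterozygosity-based definition FST = (H_T - H_S) / H_T with equal population weights; this estimator, the function names and the toy allele frequencies are assumptions made for illustration and are not taken from the paper.

```python
def expected_heterozygosity(freqs):
    """Expected heterozygosity 1 - sum(p_i^2) for one set of allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)

def fst(pop_freqs):
    """FST = (H_T - H_S) / H_T for equally weighted populations.

    pop_freqs: list of allele-frequency lists, all over the same allele set.
    """
    n_pops = len(pop_freqs)
    n_alleles = len(pop_freqs[0])
    mean_freqs = [sum(pop[i] for pop in pop_freqs) / n_pops for i in range(n_alleles)]
    h_total = expected_heterozygosity(mean_freqs)
    h_within = sum(expected_heterozygosity(pop) for pop in pop_freqs) / n_pops
    return (h_total - h_within) / h_total

# Toy biallelic SNP with a fixed difference between two populations.
fst_snp = fst([[1.0, 0.0], [0.0, 1.0]])

# Toy highly polymorphic locus: 20 equally frequent alleles per population,
# with no allele shared between the two populations.
k = 20
pop_a = [1 / k] * k + [0.0] * k
pop_b = [0.0] * k + [1 / k] * k
fst_multi = fst([pop_a, pop_b])

print(f"{fst_snp:.3f}")    # 1.000
print(f"{fst_multi:.3f}")  # 0.026
```

In the second case the two populations share no alleles at all, yet FST is about 0.026, because within-population heterozygosity (0.95) is already close to its maximum and leaves little room for H_T to exceed it.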
Subject(s)
Major Histocompatibility Complex , Single Nucleotide Polymorphism , Alleles , Gene Frequency , Haplotypes , Humans , Major Histocompatibility Complex/genetics
ABSTRACT
Background: Although aging correlates with a worse prognosis for Covid-19, unvaccinated super-elderly individuals presenting mild or no symptoms have been reported worldwide. Most of the reported genetic variants responsible for increased disease susceptibility are associated with immune response, involving type I IFN immunity and modulation; HLA cluster genes; inflammasome activation; interleukin genes; and chemokine receptors. On the other hand, little is known about the resistance mechanisms against SARS-CoV-2 infection. Here, we addressed polymorphisms in the MHC region associated with Covid-19 outcome in super elderly resilient patients as compared to younger patients with a severe outcome. Methods: SARS-CoV-2 infection was confirmed by RT-PCR test. Aiming to identify candidate genes associated with host resistance, we investigated 87 individuals older than 90 years who recovered from Covid-19 with mild symptoms or who remained asymptomatic following a positive test for SARS-CoV-2 as compared to 55 individuals younger than 60 years who had severe disease or died due to Covid-19, as well as to the general elderly population from the same city. Whole-exome sequencing and an in-depth analysis of the MHC region were performed. All samples were collected in early 2020 and before the local vaccination programs started. Results: We found that the resilient super elderly group displayed a higher frequency of some missense variants in the MUC22 gene (a member of the mucin family) as one of the strongest signals in the MHC region as compared to the severe Covid-19 group and the general elderly control population. For example, the missense variant rs62399430 at MUC22 is twice as frequent among the resilient super elderly (p = 0.00002, OR = 2.24).
Conclusion: Since the pro-inflammatory basal state in the elderly may enhance the susceptibility to severe Covid-19, we hypothesized that MUC22 might play an important protective role against severe Covid-19, by reducing overactive immune responses in the senior population.
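For readers unfamiliar with the odds ratio (OR) reported above, the Python sketch below shows how an OR and an approximate 95% confidence interval can be derived from a 2x2 table using the standard Woolf (log-normal) approximation. The counts are invented for illustration and are not the study's data; the function name `odds_ratio_ci` is hypothetical, and the abstract does not state which test the authors used.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Woolf confidence interval for a 2x2 table.

    a: variant carriers in group 1      b: non-carriers in group 1
    c: variant carriers in group 2      d: non-carriers in group 2
    z: normal quantile (1.96 gives an approximate 95% interval)
    All four counts must be positive.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts, NOT taken from the study.
or_value, ci_low, ci_high = odds_ratio_ci(a=40, b=60, c=20, d=80)
print(f"OR = {or_value:.2f}")  # (40 * 80) / (60 * 20) -> OR = 2.67
```

An interval that excludes 1 indicates that the difference in carrier odds between the two groups is unlikely to be due to sampling alone at the chosen level.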
Subject(s)
COVID-19 , Aged , Humans , Brazil/epidemiology , COVID-19/epidemiology , COVID-19/genetics , MHC Class II Genes , HLA-A Antigens , SARS-CoV-2/genetics
ABSTRACT
HLA-G is a promiscuous immune checkpoint molecule. The HLA-G gene presents substantial nucleotide variability in its regulatory regions. However, it encodes a limited number of proteins compared to classical HLA class I genes. We characterized the HLA-G genetic variability in 4640 individuals from 88 different population samples across the globe by using a state-of-the-art method to characterize polymorphisms and haplotypes from high-coverage next-generation sequencing data. We also provide insights regarding the HLA-G genetic diversity and a resource for future studies evaluating HLA-G polymorphisms in different populations and association studies. Despite the great haplotype variability, we demonstrated that: (1) most of the HLA-G polymorphisms are in introns and regulatory sequences, and these are the sites with evidence of balancing selection, (2) linkage disequilibrium is high throughout the gene, extending up to HLA-A, (3) there are few proteins frequently observed in worldwide populations, with lack of variation in residues associated with major HLA-G biological properties (dimer formation, interaction with leukocyte receptors). These observations corroborate the role of HLA-G as an immune checkpoint molecule rather than as an antigen-presenting molecule. Understanding HLA-G variability across populations is relevant for disease association and functional studies.
Subject(s)
HLA-G Antigens/genetics , Genetic Polymorphism , 3' Untranslated Regions , Alleles , Computational Biology , Dimerization , Molecular Evolution , Gene Frequency , MHC Class I Genes , Genetic Variation , Population Genetics , Genotype , Global Health , Haplotypes , High-Throughput Nucleotide Sequencing , Humans , Immune Checkpoint Proteins/genetics , Immunogenetics , Introns , Linkage Disequilibrium , Single Nucleotide Polymorphism
ABSTRACT
Despite the high number of individuals infected by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) who develop coronavirus disease 2019 (COVID-19) symptoms worldwide, many exposed individuals remain asymptomatic and/or uninfected and seronegative. This could be explained by a combination of environmental (exposure), immunological (previous infection), epigenetic, and genetic factors. Aiming to identify genetic factors involved in immune response in symptomatic COVID-19 as compared to asymptomatic exposed individuals, we analyzed 83 Brazilian couples where one individual was infected and symptomatic while the partner remained asymptomatic and seronegative for at least 6 months despite sharing the same bedroom during the infection. We refer to these as "discordant couples". We performed whole-exome sequencing followed by a state-of-the-art method to call genotypes and haplotypes across the highly polymorphic major histocompatibility complex (MHC) region. The discordant partners had comparable ages and genetic ancestry, but women were overrepresented (65%) in the asymptomatic group. In the antigen-presentation pathway, we observed an association between HLA-DRB1 alleles encoding Lys at residue 71 (mostly DRB1*03:01 and DRB1*04:01) and DOB*01:02 with symptomatic infections and HLA-A alleles encoding 144Q/151R with asymptomatic seronegative women. Among the genes related to immune modulation, we detected variants in MICA and MICB associated with symptomatic infections. These variants are related to higher expression of soluble MICA and low expression of MICB. Thus, quantitative differences in these molecules that modulate natural killer (NK) activity could contribute to susceptibility to COVID-19 by downregulating NK cell cytotoxic activity in infected individuals but not in the asymptomatic partners.