RESUMO
Spontaneous clearance of acute hepatitis C virus (HCV) infection is associated with single nucleotide polymorphisms (SNPs) on the MHC class II. We fine-mapped the MHC region in European (n = 1,600; 594 HCV clearance/1,006 HCV persistence) and African (n = 1,869; 340 HCV clearance/1,529 HCV persistence) ancestry individuals and evaluated HCV peptide binding affinity of classical alleles. In both populations, HLA-DQß1Leu26 (p valueMeta = 1.24 × 10-14) located in pocket 4 was negatively associated with HCV spontaneous clearance and HLA-DQß1Pro55 (p valueMeta = 8.23 × 10-11) located in the peptide binding region was positively associated, independently of HLA-DQß1Leu26. These two amino acids are not in linkage disequilibrium (r2 < 0.1) and explain the SNPs and classical allele associations represented by rs2647011, rs9274711, HLA-DQB1∗03:01, and HLA-DRB1∗01:01. Additionally, HCV persistence classical alleles tagged by HLA-DQß1Leu26 had fewer HCV binding epitopes and lower predicted binding affinities compared to clearance alleles (geometric mean of combined IC50 nM of persistence versus clearance; 2,321 nM versus 761.7 nM, p value = 1.35 × 10-38). In summary, MHC class II fine-mapping revealed key amino acids in HLA-DQß1 explaining allelic and SNP associations with HCV outcomes. This mechanistic advance in understanding of natural recovery and immunogenetics of HCV might set the stage for much needed enhancement and design of vaccine to promote spontaneous clearance of HCV infection.
Assuntos
Cadeias beta de HLA-DQ/genética , Hepacivirus/patogenicidade , Hepatite C/genética , Interações Hospedeiro-Patógeno/genética , Polimorfismo de Nucleotídeo Único , Doença Aguda , Alelos , Substituição de Aminoácidos , População Negra , Feminino , Expressão Gênica , Estudo de Associação Genômica Ampla , Genótipo , Cadeias beta de HLA-DQ/imunologia , Hepacivirus/crescimento & desenvolvimento , Hepacivirus/imunologia , Hepatite C/etnologia , Hepatite C/imunologia , Hepatite C/virologia , Interações Hospedeiro-Patógeno/imunologia , Humanos , Leucina/imunologia , Leucina/metabolismo , Masculino , Prolina/imunologia , Prolina/metabolismo , Isoformas de Proteínas/genética , Isoformas de Proteínas/imunologia , Remissão Espontânea , População BrancaRESUMO
BACKGROUND AND PURPOSE: Multiple sclerosis (MS) is a complex autoimmune disease of the central nervous system, with numerous therapeutic options, but a lack of biomarkers to support a mechanistic approach to precision medicine. A computational approach to precision medicine could proceed from clinical decision support systems (CDSSs). They are digital tools aiming to empower physicians through the clinical applications of information technology and massive data. However, the process of their clinical development is still maturing; we aimed to review it in the field of MS. METHODS: For this scoping review, we screened systematically the PubMed database. We identified 24 articles reporting 14 CDSS projects and compared their technical and software development aspects. RESULTS: The projects position themselves in various contexts of usage with various algorithmic approaches: expert systems, CDSSs based on similar patients' data visualization, and model-based CDSSs implementing mathematical predictive models. So far, no project has completed its clinical development up to certification for clinical use with global release. Some CDSSs have been replaced at subsequent project iterations. The most advanced projects did not necessarily report every step of clinical development in a dedicated article (proof of concept, offline validation, refined prototype, live clinical evaluation, comparative prospective evaluation). They seek different software distribution options to integrate into health care: internal usage, "peer-to-peer," and marketing distribution. CONCLUSIONS: This review illustrates the potential of clinical applications of information technology and massive data to support MS management and helps clarify the roadmap for future projects as a multidisciplinary and multistep process.
RESUMO
Severe aplastic anemia (SAA) is a rare disorder characterized by hypoplastic bone marrow and progressive pancytopenia. The etiology of acquired SAA is not understood but is likely related to abnormal immune responses and environmental exposures. We conducted a genome-wide association study of individuals with SAA genetically matched to healthy controls in discovery (359 cases, 1,396 controls) and validation sets (175 cases, 1,059 controls). Combined analyses identified linked SNPs in distinct blocks within the major histocompatibility complex on 6p21. The top SNP encodes p.Met76Val in the P4 binding pocket of the HLA class II gene HLA-DPB1 (rs1042151A>G, odds ratio [OR] 1.75, 95% confidence interval [CI] 1.50-2.03, p = 1.94 × 10-13) and was associated with HLA-DP cell surface expression in healthy individuals (p = 2.04 × 10-6). Phylogenetic analyses indicate that Val76 is not monophyletic and likely occurs in conjunction with different HLA-DP binding groove conformations. Imputation of HLA-DPB1 alleles revealed increased risk of SAA associated with Val76-encoding alleles DPB1∗03:01, (OR 1.66, p = 1.52 × 10-7), DPB1∗10:01 (OR 2.12, p = 0.0003), and DPB1∗01:01 (OR 1.60, p = 0.0008). A second SNP near HLA-B, rs28367832G>A, reached genome-wide significance (OR 1.49, 95% CI 1.22-1.78, p = 7.27 × 10-9) in combined analyses; the association remained significant after excluding cases with clonal copy-neutral loss-of-heterozygosity affecting class I HLA genes (8.6% of cases and 0% of controls). SNPs in the HLA class II gene HLA-DPB1 and possibly class I (HLA-B) are associated with SAA. The replacement of Met76 to Val76 in certain HLA-DPB1 alleles might influence risk of SAA through mechanisms involving DP peptide binding specificity, expression, and/or other factors affecting DP function.
Assuntos
Anemia Aplástica/etiologia , Marcadores Genéticos , Predisposição Genética para Doença , Cadeias beta de HLA-DP/genética , Polimorfismo de Nucleotídeo Único , Adolescente , Adulto , Idoso , Anemia Aplástica/patologia , Estudos de Casos e Controles , Criança , Pré-Escolar , Feminino , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Lactente , Masculino , Pessoa de Meia-Idade , Filogenia , Fatores de Risco , Índice de Gravidade de Doença , Adulto JovemRESUMO
RATIONALE & OBJECTIVE: Focal segmental glomerulosclerosis (FSGS) is a major cause of pediatric nephrotic syndrome, and African Americans exhibit an increased risk for developing FSGS compared with other populations. Predisposing genetic factors have previously been described in adults. Here we performed genomic screening of primary FSGS in a pediatric African American population. STUDY DESIGN: Prospective cohort with case-control genetic association study design. SETTING & PARTICIPANTS: 140 African American children with chronic kidney disease from the Chronic Kidney Disease in Children (CKiD) cohort, including 32 cases with FSGS. PREDICTORS: Over 680,000 common single-nucleotide polymorphisms (SNPs) were tested for association. We also ran a pathway enrichment analysis and a human leucocyte antigen (HLA)-focused association study. OUTCOME: Primary biopsy-proven pediatric FSGS. ANALYTICAL APPROACH: Multivariate logistic regression models. RESULTS: The genome-wide association study revealed 169 SNPs from 14 independent loci significantly associated with FSGS (false discovery rate [FDR]<5%). We observed notable signals for genetic variants within the APOL1 (P=8.6×10-7; OR, 25.8 [95% CI, 7.1-94.0]), ALMS1 (P=1.3×10-7; 13.0% in FSGS cases vs 0% in controls), and FGFR4 (P=4.3×10-6; OR, 24.8 [95% CI, 6.3-97.7]) genes, all of which had previously been associated with adult FSGS, kidney function, or chronic kidney disease. We also highlighted novel, functionally relevant genes, including GRB2 (which encodes a slit diaphragm protein promoting podocyte structure through actin polymerization) and ITGB1 (which is linked to renal injuries). Our results suggest a major role for immune responses and antigen presentation in pediatric FSGS through (1) associations with SNPs in PTPRJ (or CD148, P=3.5×10-7), which plays a role in T-cell receptor signaling, (2) HLA-DRB1∗11:01 association (P=6.1×10-3; OR, 4.5 [95% CI, 1.5-13.0]), and (3) signaling pathway enrichment (P=1.3×10-6). LIMITATIONS: Sample size and no independent replication cohort with genomic data readily available. CONCLUSIONS: Our genetic study has identified functionally relevant risk factors and the importance of immune regulation for pediatric primary FSGS, which contributes to a better description of its molecular pathophysiological mechanisms. PLAIN-LANGUAGE SUMMARY: We assessed the genetic risk factors for primary focal segmental glomerulosclerosis (FSGS) by simultaneously testing over 680,000 genetic markers spread across the genome in 140 children, including 32 with FSGS lesions. Fourteen independent genetic regions were significantly associated with pediatric FSGS, including APOL1 and ALMS1-NAT8, which were previously found to be associated with FSGS and chronic kidney diseases in adults. Novel genes with relevant biological functions were also highlighted, such as GRB2 and FGFR4, which play a role in the kidney filtration barrier and in kidney cell differentiation, respectively. Finally, we revealed the importance of immune regulation in pediatric FSGS through associations involving cell surface proteins presenting antigens to the immune system and interacting with T-cell receptors.
Assuntos
Glomerulosclerose Segmentar e Focal , Insuficiência Renal Crônica , Adulto , Humanos , Criança , Glomerulosclerose Segmentar e Focal/patologia , Apolipoproteína L1/genética , Estudo de Associação Genômica Ampla , Estudos Prospectivos , Fatores de Risco , Insuficiência Renal Crônica/epidemiologia , Insuficiência Renal Crônica/genéticaRESUMO
Human immunoglobulin G (IgG) molecules, IgG1, IgG2 and IgG3, exhibit substantial inter-individual variation in their constant heavy chain regions, as discovered by serological methods. This polymorphism is encoded by the IGHG1, IGHG2, and IGHG3 genes and may influence antibody function. We sequenced the coding fragments of these genes in 95 European Americans, 94 African Americans, and 94 Black South Africans. Striking differences were observed between the population groups, including extremely low amino acid sequence variation in IGHG1 among South Africans, and higher IGHG2 and IGHG3 diversity in individuals of African descent compared to individuals of European descent. Molecular definition of the loci illustrates a greater level of allelic polymorphism than previously described, including the presence of common IGHG2 and IGHG3 variants that were indistinguishable serologically. Comparison of our data with the 1000 Genome Project sequences indicates overall agreement between the datasets, although some inaccuracies in the 1000 Genomes Project are likely. These data represent the most comprehensive analysis of IGHG polymorphisms across major populations, which can now be applied to deciphering their functional impact.
Assuntos
Imunoglobulina G , Cadeias Pesadas de Imunoglobulinas , Alelos , Genes de Imunoglobulinas , Humanos , Imunoglobulina G/genética , Cadeias Pesadas de Imunoglobulinas/genética , Polimorfismo GenéticoRESUMO
Genome-wide associations studies have repeatedly identified the major histocompatibility complex genomic region (6p21.3) as key in immune pathologies. Researchers have also aimed to extend the biological interpretation of associations by focusing directly on human leukocyte antigen (HLA) polymorphisms and their combination as haplotypes. To circumvent the effort and high costs of HLA typing, statistical solutions have been developed to infer HLA alleles from single-nucleotide polymorphism (SNP) genotyping data. Though HLA imputation methods have been developed, no unified effort has yet been undertaken to share large and diverse imputation models, or to improve methods. By training the HIBAG software on SNP + HLA data generated by the Consortium on Asthma among African-ancestry Populations in the Americas (CAAPA) to create reference panels, we highlighted the importance of (a) the number of individuals in reference panels, with a twofold increase in accuracy (from 10 to 100 individuals) and (b) the number of SNPs, with a 1.5-fold increase in accuracy (from 500 to 24,504 SNPs). Results showed improved accuracy with CAAPA compared to the African American models available in HIBAG, highlighting the need for precise population-matching. The SNP-HLA Reference Consortium is an international endeavor to gather data, enhance HLA imputation and broaden access to highly accurate imputation models for the immunogenomics community.
Assuntos
Negro ou Afro-Americano/genética , Genoma Humano/genética , Antígenos HLA/genética , Polimorfismo de Nucleotídeo Único/genética , Alelos , Asma/genética , Frequência do Gene/genética , Genômica , Genótipo , Haplótipos/genética , Humanos , Disseminação de Informação , Modelos Genéticos , População Branca/genéticaRESUMO
MOTIVATION: The HLA system plays a pivotal role in both clinical applications and immunology research. Typing HLA genes in patient and donor is indeed required in hematopoietic stem cell and solid-organ transplantation, and the histocompatibility complex region exhibits countless genetic associations with immune-related pathologies. Since the discovery of HLA antigens, the HLA system nomenclature and typing methods have constantly evolved, which leads to difficulties in using data generated with older methodologies. RESULTS: Here, we present Easy-HLA, a web-based software suite designed to facilitate analysis and gain knowledge from HLA typing, regardless of nomenclature or typing method. Easy-HLA implements a computational and statistical method of HLA haplotypes inference based on published reference populations containing over 600 000 haplotypes to upgrade missing or partial HLA information: 'HLA-Upgrade' tool infers high-resolution HLA typing and 'HLA-2-Haplo' imputes haplotype pairs and provides additional functional annotations (e.g. amino acids and KIR ligands). We validated both tools using two independent cohorts (total n = 2500). For HLA-Upgrade, we reached a prediction accuracy of 92% from low- to high-resolution of European genotypes. We observed a 96% call rate and 76% accuracy with HLA-2-Haplo European haplotype pairs prediction. In conclusion, Easy-HLA tools facilitate large-scale immunogenetic analysis and promotes the multi-faceted HLA expertise beyond allelic associations by providing new functional immunogenomics parameters. AVAILABILITY AND IMPLEMENTATION: Easy-HLA is a web application freely available (free account) at: https://hla.univ-nantes.fr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Antígenos HLA , Alelos , Genótipo , Haplótipos , Teste de Histocompatibilidade , HumanosRESUMO
BACKGROUND: Asthma is a complex chronic inflammatory disease of the airways. Association studies between HLA and asthma were first reported in the 1970s, and yet, the precise role of HLA alleles in asthma is not fully understood. Numerous genome-wide association studies were recently conducted on asthma, but were always limited to simple genetic markers (single nucleotide polymorphisms) and not complex HLA gene polymorphisms (alleles/haplotypes), therefore not capturing the biological relevance of this complex locus for asthma pathogenesis. OBJECTIVE: To run the first HLA-centric association study with asthma and specific asthma-related phenotypes in a large cohort of African-ancestry individuals. METHODS: We collected high-density genomics data for the Consortium on Asthma among African-ancestry Populations in the Americas (N = 4993) participants. Using computer-intensive machine-learning attribute bagging methods to infer HLA alleles, and Easy-HLA to infer HLA 5-gene haplotypes, we conducted a high-throughput HLA-centric association study of asthma susceptibility and total serum IgE (tIgE) levels in subjects with and without asthma. RESULTS: Among the 1607 individuals with asthma, 972 had available tIgE levels, with a mean tIgE level of 198.7 IU/mL. We could not identify any association with asthma susceptibility. However, we showed that HLA-DRB1∗09:01 was associated with increased tIgE levels (P = 8.5 × 10-4; weighted effect size, 0.51 [0.15-0.87]). CONCLUSIONS: We identified for the first time an HLA allele associated with tIgE levels in African-ancestry individuals with asthma. Our report emphasizes that by leveraging powerful computational machine-learning methods, specific/extreme phenotypes, and population diversity, we can explore HLA gene polymorphisms in depth and reveal the full extent of complex disease associations.
Assuntos
Alelos , Negro ou Afro-Americano/genética , Cadeias HLA-DRB1/genética , Imunoglobulina E/imunologia , Polimorfismo de Nucleotídeo Único , Asma , Feminino , Cadeias HLA-DRB1/imunologia , Humanos , MasculinoRESUMO
Differential HLA-C levels influence several human diseases, but the mechanisms responsible are incompletely characterized. Using a validated prediction algorithm, we imputed HLA-C cell surface levels in 228 individuals from the 1000 Genomes dataset. We tested 68,726 SNPs within the MHC for association with HLA-C level. The HLA-C promoter region variant, rs2395471, 800 bp upstream of the transcription start site, gave the most significant association with HLA-C levels (p = 4.2 × 10-66). This imputed expression quantitative trait locus, termed impeQTL, was also shown to associate with HLA-C expression in a genome-wide association study of 273 donors in which HLA-C mRNA expression levels were determined by quantitative PCR (qPCR) (p = 1.8 × 10-20) and in two cohorts where HLA-C cell surface levels were determined directly by flow cytometry (n = 369 combined, p < 10-15). rs2395471 is located in an Oct1 transcription factor consensus binding site motif where the A allele is predicted to have higher affinity for Oct1 than the G allele. Mobility shift electrophoresis demonstrated that Oct1 binds to both alleles in vitro, but decreased HLA-C promoter activity was observed in a luciferase reporter assay for rs2395471_G relative to rs2395471_A on a fixed promoter background. The rs2395471 variant accounts for up to 36% of the explained variation of HLA-C level. These data strengthen our understanding of HLA-C transcriptional regulation and provide a basis for understanding the potential consequences of manipulating HLA-C levels therapeutically.
Assuntos
Antígenos HLA-C/biossíntese , Antígenos HLA-C/genética , Fator 1 de Transcrição de Octâmero/metabolismo , Regiões Promotoras Genéticas/genética , Algoritmos , Alelos , Sítios de Ligação/genética , Conjuntos de Dados como Assunto , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Células HeLa , Humanos , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Sítio de Iniciação de Transcrição , Transcrição GênicaRESUMO
A recent genome-wide association study (GWAS) involving patients with hemophilia A who were exposed to but uninfected with human immunodeficiency virus type 1 (HIV-1) did not reveal genetic variants associated with resistance to HIV-1 infection, beyond homozygosity for CCR5-Δ32. Since variation in HLA class I and KIR genes is not well interrogated by standard GWAS techniques, we tested whether these 2 loci were involved in protection from HIV-1 infection in the same hemophilia cohort, using controls from the general population. Our data indicate that HLA class I alleles, presence or absence of KIR genes, and functionally relevant combinations of the HLA/KIR genotypes are not involved in resistance to parenterally transmitted HIV-1 infection.
Assuntos
Resistência à Doença , Infecções por HIV/imunologia , Hemofilia A/complicações , Antígenos de Histocompatibilidade Classe I/imunologia , Receptores KIR/imunologia , Estudos de Associação Genética , HumanosRESUMO
Leukocyte immunoglobulin-like receptor (LILR)B3 and LILRA6 represent a pair of inhibitory/activating receptors with identical extracellular domains and unknown ligands. LILRB3 can mediate inhibitory signaling via immunoreceptor tyrosine-based inhibition motifs in its cytoplasmic tail whereas LILRA6 can signal through association with an activating adaptor molecule, FcRγ, which bears a cytoplasmic tail with an immunoreceptor tyrosine-based activation motif. The receptors are encoded by two highly polymorphic neighboring genes within the leukocyte receptor complex on human chromosome 19. Here, we report that the two genes display similar levels of single nucleotide polymorphisms with the majority of polymorphic sites being identical. In addition, the LILRA6 gene exhibits copy number variation (CNV) whereas LILRB3 does not. A screen of healthy Caucasians indicated that 32 % of the subjects possessed more than two copies of LILRA6, whereas 4 % have only one copy of the gene per diploid genome. Analysis of mRNA expression in the major fractions of PBMCs showed that LILRA6 is primarily expressed in monocytes, similarly to LILRB3, and its expression level correlates with copy number of the gene. We suggest that the LILRA6 CNV may influence the level of the activating receptor on the cell surface, potentially affecting signaling upon LILRB3/A6 ligation.
Assuntos
Antígenos CD/genética , Variações do Número de Cópias de DNA/genética , Polimorfismo de Nucleotídeo Único/genética , Receptores Imunológicos/genética , Criança , Estudos de Coortes , DNA/análise , DNA/genética , Família , Feminino , Humanos , Masculino , Linhagem , RNA Mensageiro/genéticaRESUMO
BACKGROUND: This study aimed to investigate the factors contributing to the variability of Multiple Sclerosis (MS) among individuals born and residing in France. Geographical variation in MS prevalence was observed in France, but the role of genetic and environmental factors in explaining this heterogeneity has not been yet elucidated. METHODS: We employed a heritability analysis on a cohort of 403 trios with an MS-affected proband in the French population. This sample was retrieved from REFGENSEP register of MS cases collected in 23 French hospital centers from 1992 to 2017. Our objective was to quantify the proportion of MS liability variability explained by genetic variability, sex, shared environment effects, region of birth and year of birth. We further considered gene x environment (GxE) interaction effects between genetic variability and region of birth. We have implemented a Bayesian liability threshold model to obtain posterior distributions for the parameters of interest adjusting for ascertainment bias. RESULTS: Our analysis revealed that GxE interaction effects between genetic variability and region of birth represent the primary significant explanatory factor for MS liability variability in French individuals (29 % [95 %CI: 5 %; 53 %]), suggesting that additive genetic effects are modified by environmental factors associated to the region of birth. The individual contributions of genetic variability and region of birth explained, respectively, ≈15 % and ≈16 % of MS variability, highlighting a significantly higher MS liability in individuals born in the Northern regions compared to the Southern region. Overall, the joint contribution of genetic variability, region of birth, and their interaction was then estimated to explain 65 % [95 %CI: 35 %; 92 %] of MS liability variability. The remaining proportion of MS variability is attributed to environmental exposures associated with the year of birth, shared within the same household, and specific to individuals. CONCLUSION: Overall, our analysis highlighted the interaction between genetic variability and environmental exposures linked to the region of birth as the main factor explaining MS variability within individuals born and residing in France. Among the environmental exposures prevalent in the Northern regions, and potentially interacting with genetic variability, lower vitamin D levels due to reduced sun exposure, higher obesity prevalence and higher pollution levels represent the main risk factors in influencing MS risk. These findings emphasize the importance of accounting for environmental factors linked to geographical location in the investigation of MS risk factors, as well as to further explore the influence of GxE interactions in modifying genetic risk.
Assuntos
Teorema de Bayes , Interação Gene-Ambiente , Esclerose Múltipla , Humanos , França/epidemiologia , Esclerose Múltipla/genética , Esclerose Múltipla/epidemiologia , Feminino , Masculino , Adulto , Predisposição Genética para Doença , Sistema de Registros , Variação GenéticaRESUMO
The SNP-HLA Reference Consortium (SHLARC), a component of the 18th International HLA and Immunogenetics Workshop, is aimed at collecting diverse and extensive human leukocyte antigen (HLA) data to create custom reference panels and enhance HLA imputation techniques. Genome-wide association studies (GWAS) have significantly contributed to identifying genetic associations with various diseases. The HLA genomic region has emerged as the top locus in GWAS, particularly in immune-related disorders. However, the limited information provided by single nucleotide polymorphisms (SNPs), the hallmark of GWAS, poses challenges, especially in the HLA region, where strong linkage disequilibrium (LD) spans several megabases. HLA imputation techniques have been developed using statistical inference in response to these challenges. These techniques enable the prediction of HLA alleles from genotyped GWAS SNPs. Here we present the SHLARC activities, a collaborative effort to create extensive, and multi-ethnic reference panels to enhance HLA imputation accuracy.
Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Humanos , Imunogenética , Alelos , Antígenos HLA/genética , GenótipoRESUMO
Human genomics has quickly evolved, powering genome-wide association studies (GWASs). SNP-based GWASs cannot capture the intense polymorphism of HLA genes, highly associated with disease susceptibility. There are methods to statistically impute HLA genotypes from SNP-genotypes data, but lack of diversity in reference panels hinders their performance. We evaluated the accuracy of the 1000 Genomes data as a reference panel for imputing HLA from admixed individuals of African and European ancestries, focusing on (a) the full dataset, (b) 10 replications from 6 populations, and (c) 19 conditions for the custom reference panels. The full dataset outperformed smaller models, with a good F1-score of 0.66 for HLA-B. However, custom models outperformed the multiethnic or population models of similar size (F1-scores up to 0.53, against up to 0.42). We demonstrated the importance of using genetically specific models for imputing populations, which are currently underrepresented in public datasets, opening the door to HLA imputation for every genetic population.
Assuntos
Genética Populacional , Estudo de Associação Genômica Ampla , Humanos , Alelos , Genótipo , Antígenos HLA-B , Polimorfismo de Nucleotídeo ÚnicoRESUMO
In multiple sclerosis (MS), alterations of the gut microbiota lead to inflammation. However, the role of other microbiomes in the body in MS has not been fully elucidated. In a pilot case-controlled study, we carried out simultaneous characterization of faecal and oral microbiota and conducted an in-depth analysis of bacterial alterations associated with MS. Using 16S rRNA sequencing and metabolic inference tools, we compared the oral/faecal microbiota and bacterial metabolism pathways in French MS patients (n = 14) and healthy volunteers (HV, n = 21). A classification model based on metabolite flux balance was established and validated in an independent German cohort (MS n = 12, HV n = 38). Our analysis revealed decreases in diversity indices and oral/faecal compartmentalization, the depletion of commensal bacteria (Aggregatibacter and Streptococcus in saliva and Coprobacter and Roseburia in faeces) and enrichment of inflammation-associated bacteria in MS patients (Leptotrichia and Fusobacterium in saliva and Enterobacteriaceae and Actinomyces in faeces). Several microbial pathways were also altered (the polyamine pathway and remodelling of bacterial surface antigens and energetic metabolism) while flux balance analysis revealed associated alterations in metabolite production in MS (nitrogen and nucleoside). Based on this analysis, we identified a specific oral metabolite signature in MS patients, that could discriminate MS patients from HV and rheumatoid arthritis patients. This signature allowed us to create and validate a discrimination model on an independent cohort, which reached a specificity of 92%. Overall, the oral and faecal microbiomes were altered in MS patients. This pilot study highlights the need to study the oral microbiota and oral health implications in patients with autoimmune diseases on a larger scale and suggests that knowledge of the salivary microbiome could help guide the identification of new pathogenic mechanisms associated with the microbiota in MS patients.
Assuntos
Microbiota , Esclerose Múltipla , Humanos , Projetos Piloto , RNA Ribossômico 16S/genética , RNA Ribossômico 16S/análise , Microbiota/genética , Bactérias/genética , InflamaçãoRESUMO
The MHC class I region contains crucial genes for the innate and adaptive immune response, playing a key role in susceptibility to many autoimmune and infectious diseases. Genome-wide association studies have identified numerous disease-associated SNPs within this region. However, these associations do not fully capture the immune-biological relevance of specific HLA alleles. HLA imputation techniques may leverage available SNP arrays by predicting allele genotypes based on the linkage disequilibrium between SNPs and specific HLA alleles. Successful imputation requires diverse and large reference panels, especially for admixed populations. This study employed a bioinformatics approach to call SNPs and HLA alleles in multi-ethnic samples from the 1000 genomes (1KG) dataset and admixed individuals from Brazil (SABE), utilising 30X whole-genome sequencing data. Using HIBAG, we created three reference panels: 1KG (n = 2504), SABE (n = 1171), and the full model (n = 3675) encompassing all samples. In extensive cross-validation of these reference panels, the multi-ethnic 1KG reference exhibited overall superior performance than the reference with only Brazilian samples. However, the best results were achieved with the full model. Additionally, we expanded the scope of imputation by developing reference panels for non-classical, MICA, MICB and HLA-H genes, previously unavailable for multi-ethnic populations. Validation in an independent Brazilian dataset showcased the superiority of our reference panels over the Michigan Imputation Server, particularly in predicting HLA-B alleles among Brazilians. Our investigations underscored the need to enhance or adapt reference panels to encompass the target population's genetic diversity, emphasising the significance of multiethnic references for accurate imputation across different populations.
Assuntos
Alelos , Etnicidade , Frequência do Gene , Polimorfismo de Nucleotídeo Único , Humanos , Brasil , Etnicidade/genética , Antígenos HLA/genética , Desequilíbrio de Ligação , Estudo de Associação Genômica Ampla/métodos , Genótipo , Genética Populacional/métodos , Antígenos de Histocompatibilidade Classe I/genética , Biologia Computacional/métodosRESUMO
The main limitation to long-term lung transplant (LT) survival is chronic lung allograft dysfunction (CLAD), which leads to irreversible lung damage and significant mortality. Individual factors can impact CLAD, but no large genetic investigation has been conducted to date. We established the multicentric Genetic COhort in Lung Transplantation (GenCOLT) biobank from a rich and homogeneous sub-part of COLT cohort. GenCOLT collected DNA, high-quality GWAS (genome-wide association study) genotyping and robust HLA data for donors and recipients to supplement COLT clinical data. GenCOLT closely mirrors the global COLT cohort without significant variations in variables like demographics, initial disease and survival rates (P > 0.05). The GenCOLT donors were 45 years-old on average, 44% women, and primarily died of stroke (54%). The recipients were 48 years-old at transplantation on average, 45% women, and the main underlying disease was chronic obstructive pulmonary disease (45%). The mean follow-up time was 67 months and survival at 5 years was 57.3% for the CLAD subgroup and 97.4% for the non-CLAD subgroup. After stringent quality controls, GenCOLT gathered more than 7.3 million SNP and HLA genotypes for 387 LT pairs, including 91% pairs composed of donor and recipient of European ancestry. Overall, GenCOLT is an accurate snapshot of LT clinical practice in France and Belgium between 2009 and 2018. It currently represents one of the largest genetic biobanks dedicated to LT with data available simultaneously for donors and recipients. This unique cohort will empower to run comprehensive GWAS investigations of CLAD and other LT outcomes.
RESUMO
OBJECTIVE: To explore and describe the basis and implications of genetic and environmental susceptibility to multiple sclerosis (MS) using the Canadian population-based data. BACKGROUND: Certain parameters of MS-epidemiology are directly observable (e.g., the recurrence-risk of MS in siblings and twins, the proportion of women among MS patients, the population-prevalence of MS, and the time-dependent changes in the sex-ratio). By contrast, other parameters can only be inferred from the observed parameters (e.g., the proportion of the population that is "genetically susceptible", the proportion of women among susceptible individuals, the probability that a susceptible individual will experience an environment "sufficient" to cause MS, and if they do, the probability that they will develop the disease). DESIGN/METHODS: The "genetically susceptible" subset (G) of the population (Z) is defined to include everyone with any non-zero life-time chance of developing MS under some environmental conditions. The value for each observed and non-observed epidemiological parameter is assigned a "plausible" range. Using both a Cross-sectional Model and a Longitudinal Model, together with established parameter relationships, we explore, iteratively, trillions of potential parameter combinations and determine those combinations (i.e., solutions) that fall within the acceptable range for both the observed and non-observed parameters. RESULTS: Both Models and all analyses intersect and converge to demonstrate that probability of genetic-susceptibitly, P(G), is limited to only a fraction of the population {i.e., P(G) ≤ 0.52)} and an even smaller fraction of women {i.e., P(GâF) < 0.32)}. Consequently, most individuals (particularly women) have no chance whatsoever of developing MS, regardless of their environmental exposure. However, for any susceptible individual to develop MS, requires that they also experience a "sufficient" environment. We use the Canadian data to derive, separately, the exponential response-curves for men and women that relate the increasing likelihood of developing MS to an increasing probability that a susceptible individual experiences an environment "sufficient" to cause MS. As the probability of a "sufficient" exposure increases, we define, separately, the limiting probability of developing MS in men (c) and women (d). These Canadian data strongly suggest that: (c < d ≤ 1). If so, this observation establishes both that there must be a "truly" random factor involved in MS pathogenesis and that it is this difference, rather than any difference in genetic or environmental factors, which primarily accounts for the penetrance difference between women and men. CONCLUSIONS: The development of MS (in an individual) requires both that they have an appropriate genotype (which is uncommon in the population) and that they have an environmental exposure "sufficient" to cause MS given their genotype. Nevertheless, the two principal findings of this study are that: P(G) ≤ 0.52)} and: (c < d ≤ 1). Threfore, even when the necessary genetic and environmental factors, "sufficient" for MS pathogenesis, co-occur for an individual, they still may or may not develop MS. Consequently, disease pathogenesis, even in this circumstance, seems to involve an important element of chance. Moreover, the conclusion that the macroscopic process of disease development for MS includes a "truly" random element, if replicated (either for MS or for other complex diseases), provides empiric evidence that our universe is non-deterministic.
Assuntos
Esclerose Múltipla , Masculino , Humanos , Feminino , Fatores de Risco , Esclerose Múltipla/etiologia , Esclerose Múltipla/genética , Estudos Transversais , Canadá/epidemiologia , Predisposição Genética para DoençaRESUMO
While nearly all computational methods operate on pseudonymized personal data, re-identification remains a risk. With personal health data, this re-identification risk may be considered a double-crossing of patients' trust. Herein, we present a new method to generate synthetic data of individual granularity while holding on to patients' privacy. Developed for sensitive biomedical data, the method is patient-centric as it uses a local model to generate random new synthetic data, called an "avatar data", for each initial sensitive individual. This method, compared with 2 other synthetic data generation techniques (Synthpop, CT-GAN), is applied to real health data with a clinical trial and a cancer observational study to evaluate the protection it provides while retaining the original statistical information. Compared to Synthpop and CT-GAN, the Avatar method shows a similar level of signal maintenance while allowing to compute additional privacy metrics. In the light of distance-based privacy metrics, each individual produces an avatar simulation that is on average indistinguishable from 12 other generated avatar simulations for the clinical trial and 24 for the observational study. Data transformation using the Avatar method both preserves, the evaluation of the treatment's effectiveness with similar hazard ratios for the clinical trial (original HR = 0.49 [95% CI, 0.39-0.63] vs. avatar HR = 0.40 [95% CI, 0.31-0.52]) and the classification properties for the observational study (original AUC = 99.46 (s.e. 0.25) vs. avatar AUC = 99.84 (s.e. 0.12)). Once validated by privacy metrics, anonymous synthetic data enable the creation of value from sensitive pseudonymized data analyses by tackling the risk of a privacy breach.
RESUMO
KiT-GENIE is a monocentric DNA biobank set up to consolidate the very rich and homogeneous DIVAT French cohort of kidney donors and recipients (D/R) in order to explore the molecular factors involved in kidney transplantation outcomes. We collected DNA samples for kidney transplantations performed in Nantes, and we leveraged GWAS genotyping data for securing high-quality genetic data with deep SNP and HLA annotations through imputations and for inferring D/R genetic ancestry. Overall, the biobank included 4217 individuals (n = 1945 D + 2,272 R, including 1969 D/R pairs), 7.4 M SNPs and over 200 clinical variables. KiT-GENIE represents an accurate snapshot of kidney transplantation clinical practice in Nantes between 2002 and 2018, with an enrichment in living kidney donors (17%) and recipients with focal segmental glomerulosclerosis (4%). Recipients were predominantly male (63%), of European ancestry (93%), with a mean age of 51yo and 86% experienced their first graft over the study period. D/R pairs were 93% from European ancestry, and 95% pairs exhibited at least one HLA allelic mismatch. The mean follow-up time was 6.7 years with a hindsight up to 25 years. Recipients experienced biopsy-proven rejection and graft loss for 16.6% and 21.3%, respectively. KiT-GENIE constitutes one of the largest kidney transplantation genetic cohorts worldwide to date. It includes homogeneous high-quality clinical and genetic data for donors and recipients, hence offering a unique opportunity to investigate immunogenetic and genetic factors, as well as donor-recipient interactions and mismatches involved in rejection, graft survival, primary disease recurrence and other comorbidities.