RESUMO
Collapsing glomerulopathy (CG) is most often associated with fast progression to kidney failure with an incidence apparently higher in Brazil than in other countries. However, the reason for this occurrence is unknown. To better understand this, we performed an integrated analysis of clinical, histological, therapeutic, causative genetic and genetic ancestry data in a highly genetically admixed cohort of 70 children and adult patients with idiopathic CG (ICG). The disease onset occurred at 23 (interquartile range: 17-31) years and approximately half of patients progressed to chronic kidney disease requiring kidney replacement therapy (CKD-KRT) 36 months after diagnosis. Causative genetic bases, assessed by targeted-gene panel or whole-exome sequencing, were identified in 58.6% of patients. Among these cases, 80.5% harbored APOL1 high-risk genotypes (HRG) and 19.5% causative Mendelian variants (MV). Self-reported non-White patients more frequently had HRG. MV was an independent risk factor for progression to CKD-KRT by 36 months and the end of follow-up, while remission was an independent protective factor. All patients with HRG manifested CG at 9-44 years of age, whereas in those with APOL1 low-risk genotype, the disease arose throughout life. HRGs were associated with higher proportion of African genetic ancestry. Novel causative MVs were identified in COL4A5, COQ2 and PLCE1 and previously described causative MVs were identified in MYH9, TRPC6, COQ2, COL4A3 and TTC21B. Three patients displayed HRG combined with a variant of uncertain significance (ITGB4, LAMA5 or PTPRO). MVs were associated with worse kidney prognosis. Thus, our data reveal that the genetic status plays a major role in ICG pathogenesis, accounting for more than half of cases in a highly admixed Brazilian population.
Assuntos
Apolipoproteína L1 , Insuficiência Renal Crônica , Adulto , Criança , Humanos , Apolipoproteína L1/genética , Genótipo , Rim/patologia , Insuficiência Renal Crônica/epidemiologia , Insuficiência Renal Crônica/genética , Insuficiência Renal Crônica/patologia , Fatores de Risco , Adolescente , Adulto JovemRESUMO
BACKGROUND: Multisystem Inflammatory Syndrome in Children (MIS-C) is a life-threatening complication of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, which manifests as a hyper inflammatory process with multiorgan involvement in predominantly healthy children in the weeks following mild or asymptomatic coronavirus disease 2019 (COVID-19). However, host monogenic predisposing factors to MIS-C remain elusive. METHODS: Herein, we used whole exome sequencing (WES) on 16 MIS-C Brazilian patients to identify single nucleotide/InDels variants as predisposition factors associated with MIS-C. RESULTS: We identified ten very rare variants in eight genes (FREM1, MPO, POLG, C6, C9, ABCA4, ABCC6, and BSCL2) as the most promising candidates to be related to a higher risk of MIS-C development. These variants may propitiate a less effective immune response to infection or trigger the inflammatory response or yet a delayed hyperimmune response to SARS-CoV-2. Protein-Protein Interactions (PPIs) among the products of the mutated genes revealed an integrated network, enriched for immune and inflammatory response mechanisms with some of the direct partners representing gene products previously associated with MIS-C and Kawasaki disease (KD). In addition, the PPIs direct partners are also enriched for COVID-19-related gene sets. HLA alleles prediction from WES data allowed the identification of at least one risk allele in 100% of the MIS-C patients. CONCLUSIONS: This study is the first to explore host MIS-C-associated variants in a Latin American admixed population. Besides expanding the spectrum of MIS-C-associated variants, our findings highlight the relevance of using WES for characterising the genetic interindividual variability associated with COVID-19 complications and ratify the presence of overlapping/convergent mechanisms among MIS-C, KD and COVID-19, crucial for future therapeutic management.
Assuntos
COVID-19 , SARS-CoV-2 , Criança , Humanos , COVID-19/complicações , COVID-19/genética , Predisposição Genética para Doença , Síndrome de Resposta Inflamatória Sistêmica/genética , Transportadores de Cassetes de Ligação de ATPRESUMO
Alcohol and other substance use disorders (AUD and SUD) are complex diseases that are postulated to have a polygenic inheritance and are often comorbid with other disorders. The comorbidities may arise partially through genetic pleiotropy. Identification of specific gene variants accounting for large parts of the variance in these disorders has yet to be accomplished. We describe a flexible strategy that takes a variant-trait association database and determines if a subset of disease/straits are potentially pleiotropic with the disorder under study. We demonstrate its usage in a study of use disorders in two independent cohorts: alcohol, stimulants, cannabis (CUD), and multi-substance use disorders (MSUD) in American Indians (AI) and AUD and CUD in Mexican Americans (MA). Using a machine learning method with variants in GWAS catalog, we identified 229 to 246 pleiotropic variants for AI and 153 to 160 for MA for each SUD. Inflammation was the most enriched for MSUD and AUD in AIs. Neurological disorder was the most significantly enriched for CUD in both cohorts, and for AUD and stimulants in AIs. Of the select pleiotropic genes shared among substances-cohorts, multiple biological pathways implicated in SUD and other psychiatric disorders were enriched, including neurotrophic factors, immune responses, extracellular matrix, and circadian regulation. Shared pleiotropic genes were significantly up-regulated in brain regions playing important roles in SUD, down-regulated in esophagus mucosa, and differentially regulated in adrenal gland. This study fills a gap for pleiotropy detection in understudied admixed populations and identifies pleiotropic variants that may be potential targets of interest for SUD.
Assuntos
Indígenas Norte-Americanos/genética , Americanos Mexicanos/genética , Transtornos Relacionados ao Uso de Substâncias/genética , Adulto , Alcoolismo/genética , Feminino , Pleiotropia Genética , Estudo de Associação Genômica Ampla , Humanos , Aprendizado de Máquina , MasculinoRESUMO
BACKGROUND: Admixed populations arise when two or more previously isolated populations interbreed. A powerful approach to addressing the genetic complexity in admixed populations is to infer ancestry. Ancestry inference including the proportion of an individual's genome coming from each population and its ancestral origin along the chromosome of an admixed population requires the use of ancestry informative markers (AIMs) from reference ancestral populations. AIMs exhibit substantial differences in allele frequency between ancestral populations. Given the huge amount of human genetic variation data available from diverse populations, a computationally feasible and cost-effective approach is becoming increasingly important to extract or filter AIMs with the maximum information content for ancestry inference, admixture mapping, forensic applications, and detecting genomic regions that have been under recent selection. RESULTS: To address this gap, we present MI-MAAP, an easy-to-use web-based bioinformatics tool designed to prioritize informative markers for multi-ancestry admixed populations by utilizing feature selection methods and multiple genomics resources including 1000 Genomes Project and Human Genome Diversity Project. Specifically, this tool implements a novel allele frequency-based feature selection algorithm, Lancaster Estimator of Independence (LEI), as well as other genotype-based methods such as Principal Component Analysis (PCA), Support Vector Machine (SVM), and Random Forest (RF). We demonstrated that MI-MAAP is a useful tool in prioritizing informative markers and accurately classifying ancestral populations. LEI is an efficient feature selection strategy to retrieve ancestry informative variants with different allele frequency/selection pressure among (or between) ancestries without requiring computationally expensive individual-level genotype data. CONCLUSIONS: MI-MAAP has a user-friendly interface which provides researchers an easy and fast way to filter and identify AIMs. MI-MAAP can be accessed at https://research.cchmc.org/mershalab/MI-MAAP/login/.
Assuntos
Genética Populacional/métodos , Software , Algoritmos , Frequência do Gene , Marcadores Genéticos , Genoma Humano , Genômica , Técnicas de Genotipagem , Humanos , Polimorfismo de Nucleotídeo Único , Análise de Componente PrincipalRESUMO
BACKGROUND: The current Brazilian population is the product of centuries of admixture between intercontinental founding groups. Although previous results have revealed a heterogeneous distribution of mitochondrial lineages in the Northeast region, the most targeted by foreign settlers during the sixteenth century, little is known about the paternal ancestry of this particular population. Considering historical records have documented a series of territorial invasions in the Northeast by various European populations, we aimed to characterize the male lineages found in Brazilian individuals in order to discover to what extent these migrations have influenced the present-day gene pool. Our approach consisted of employing four hierarchical multiplex assays for the investigation of 45 unique event polymorphisms in the non-recombining portion of the Y-chromosome of 280 unrelated men from several Northeast Brazilian states. RESULTS: Primary multiplex results allowed the identification of six major haplogroups, four of which were screened for downstream SNPs and enabled the observation of 19 additional lineages. Results reveal a majority of Western European haplogroups, among which R1b-S116* was the most common (63.9%), corroborating historical records of colonizations by Iberian populations. Nonetheless, FST genetic distances show similarities between Northeast Brazil and several other European populations, indicating multiple origins of settlers. Regarding Native American ancestry, our findings confirm a strong sexual bias against such haplogroups, which represented only 2.5% of individuals, highly contrasting previous results for maternal lineages. Furthermore, we document the presence of several Middle Eastern and African haplogroups, supporting a complex historical formation of this population and highlighting its uniqueness among other Brazilian regions. CONCLUSIONS: We performed a comprehensive analysis of the major Y-chromosome lineages that form the most dynamic migratory region from the Brazilian colonial period. This evidence suggests that the ongoing entry of European, Middle Eastern, and African males in the Brazilian Northeast, since at least 500 years, was significantly responsible for the present-day genetic architecture of this population.
Assuntos
Filogenia , Grupos Raciais , Brasil , Cromossomos Humanos Y/genética , Genética Populacional , Geografia , Haplótipos/genética , Humanos , Masculino , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
BACKGROUND: Fitness epistasis, the interaction effect of genes at different loci on fitness, makes an important contribution to adaptive evolution. Although fitness interaction evidence has been observed in model organisms, it is more difficult to detect and remains poorly understood in human populations as a result of limited statistical power and experimental constraints. Fitness epistasis is inferred from non-independence between unlinked loci. We previously observed ancestral block correlation between chromosomes 4 and 6 in African Americans. The same approach fails when examining ancestral blocks on the same chromosome due to the strong confounding effect observed in a recently admixed population. RESULTS: We developed a novel approach to eliminate the bias caused by admixture linkage disequilibrium when searching for fitness epistasis on the same chromosome. We applied this approach in 16,252 unrelated African Americans and identified significant ancestral correlations in two pairs of genomic regions (P-value< 8.11 × 10- 7) on chromosomes 1 and 10. The ancestral correlations were not explained by population admixture. Historical African-European crossover events are reduced between pairs of epistatic regions. We observed multiple pairs of co-expressed genes shared by the two regions on each chromosome, including ADAR being co-expressed with IFI44 in almost all tissues and DARC being co-expressed with VCAM1, S1PR1 and ELTD1 in multiple tissues in the Genotype-Tissue Expression (GTEx) data. Moreover, the co-expressed gene pairs are associated with the same diseases/traits in the GWAS Catalog, such as white blood cell count, blood pressure, lung function, inflammatory bowel disease and educational attainment. CONCLUSIONS: Our analyses revealed two instances of fitness epistasis on chromosomes 1 and 10, and the findings suggest a potential approach to improving our understanding of adaptive evolution.
Assuntos
Epistasia Genética , Aptidão Genética , Estudo de Associação Genômica Ampla/métodos , Negro ou Afro-Americano/genética , Cromossomos Humanos Par 1/genética , Cromossomos Humanos Par 10/genética , Simulação por Computador , Humanos , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Receptores Acoplados a Proteínas G/genéticaRESUMO
Although many genes have been shown to be associated with human pigmentary traits and forensic prediction assays exist (e.g. HIrisPlex-S), the genetic knowledge about skin colour remains incomplete. The highly admixed Brazilian population is an interesting study population for investigation of the complex genotype-phenotype architecture of human skin colour because of its large variation. Here, we compared variants in 22 pigmentary genes with quantitative skin pigmentation levels on the buttock, arm, and forehead areas of 266 genetically admixed Brazilian individuals. The genetic ancestry of each individual was estimated by typing 46 AIM-InDels. The mean proportion of genetic ancestry was 68.8% European, 20.8% Sub-Saharan African, and 10.4% Native American. A high correlation (adjusted R2 = 0.65, p < 0.05) was observed between nine SNPs and quantitative skin pigmentation using multiple linear regression analysis. The correlations were notably smaller between skin pigmentation and biogeographic ancestry (adjusted R2 = 0.45, p < 0.05), or markers in the leading forensic skin colour prediction system, the HIrisPlex-S (adjusted R2 = 0.54, p < 0.05). Four of the nine SNPs, OCA2 rs1448484 (rank 2), APBA2 rs4424881 (rank 4), MFSD12 rs10424065 (rank 8), and TYRP1 1408799 (rank 9) were not investigated as part of the HIrisPlex-S selection process, and therefore not included in the HIrisPlex-S model. Our results indicate that these SNPs account for a substantial part of the skin colour variation in individuals of admixed ancestry. Hence, we suggest that these SNPs are considered when developing future skin colour prediction models.
Assuntos
Variação Genética , Polimorfismo de Nucleotídeo Único , Pigmentação da Pele/genética , População Negra/genética , Brasil/etnologia , DNA/genética , Marcadores Genéticos , Técnicas de Genotipagem/instrumentação , Humanos , Povos Indígenas/genética , População Branca/genéticaRESUMO
TwinsMX is a national twin registry in Mexico recently created with institutional support from the Universidad Nacional Autónoma de México. It aims to serve as a platform to advance epidemiological and genetic research in the country and to disentangle the genetic and environmental contributions to health and disease in the admixed Mexican population. Here, we describe our recruitment and data collection strategies and discuss both the progress to date and future directions. More information about the registry is available on our website: https://twinsmxofficial.unam.mx/ (content in Spanish).
Assuntos
Doenças em Gêmeos/epidemiologia , Doenças em Gêmeos/genética , Interação Gene-Ambiente , Sistema de Registros/estatística & dados numéricos , Gêmeos Dizigóticos/genética , Gêmeos Monozigóticos/genética , Adolescente , Adulto , Feminino , Seguimentos , Humanos , Masculino , México/epidemiologia , Pessoa de Meia-Idade , Seleção de Pacientes , Adulto JovemRESUMO
The role played by epistasis between alleles at unlinked loci in shaping population fitness has been debated for many years and the existing evidence has been mainly accumulated from model organisms. In model organisms, fitness epistasis can be systematically inferred by detecting nonindependence of genotypic values between loci in a population and confirmed through examining the number of offspring produced in two-locus genotype groups. No systematic study has been conducted to detect epistasis of fitness in humans owing to experimental constraints. In this study, we developed a novel method to detect fitness epistasis by testing the correlation between local ancestries on different chromosomes in an admixed population. We inferred local ancestry across the genome in 16,252 unrelated African Americans and systematically examined the pairwise correlations between the genomic regions on different chromosomes. Our analysis revealed a pair of genomic regions on chromosomes 4 and 6 that show significant local ancestry correlation (P-value = 4.01 × 10-8 ) that can be potentially attributed to fitness epistasis. However, we also observed substantial local ancestry correlation that cannot be explained by systemic ancestry inference bias. To our knowledge, this study is the first to systematically examine evidence of fitness epistasis across the human genome.
Assuntos
Negro ou Afro-Americano/genética , Doenças Cardiovasculares/genética , Epistasia Genética/genética , Marcadores Genéticos/genética , Genoma Humano , Estudo de Associação Genômica Ampla , Modelos Genéticos , Doenças Cardiovasculares/etnologia , Estudos de Coortes , Genética Populacional , Genótipo , HumanosRESUMO
In this work, the YfilerPlus kit was used to investigate a sample of 258 males from Rio de Janeiro. In addition, the previous database of 760 Yfiler profiles deposited in the YHRD was updated to 1610. YfilerPlus markers showed a high haplotype diversity (0.99997), with only one haplotype shared by two individuals. When only considering the Yfiler markers, the haplotype diversity was slightly lower (0.99976), with 5 haplotypes shared by two individuals and 1 haplotype shared by three individuals. Low genetic distances were found between the Rio de Janeiro and European populations as well as the European/Hispanic American samples.
Assuntos
Cromossomos Humanos Y/genética , Etnicidade/genética , Variação Genética , Genética Populacional , Repetições de Microssatélites , Impressões Digitais de DNA , Haplótipos , Humanos , Masculino , População BrancaRESUMO
Case-control studies are a powerful strategy to identify candidate genes in complex diseases. In admixed populations, association studies can be affected by population stratification, leading to spurious genetic associations. Ancestry informative markers (AIMs) can be used to minimise this effect. The aim of this work was to select a set of AIMs to estimate population stratification in a Brazilian case-control study performed using a genome-wide array. A total of 345 single nucleotide polymorphism (SNP) AIMs, selected from the Cytoscan HD array and based on previously reported panels, was used to discriminate between European, African, and Amerindian populations. These SNP-AIMs were used to infer ancestry in systemic lupus erythematosus (SLE) patients (n = 23) and in healthy subjects (n = 110). Moderate population substructure was observed between SLE and control groups (Fst = 0.0113). Although patients and controls have shown a major European genomic contribution, significant differences in the European (P = 6.47 × 10-5 ) and African (P = 1.14 × 10-3 ) ancestries were detected between the two groups. We performed a two-step validation of the 345 SNP-AIMs panel estimating the ancestral contributions using a panel of 12 AIMs and approximately 70K SNPs from the array. Evaluation of population substructure in case-control studies, avoiding spurious genetic associations, can be performed using our panel of 345 SNP-AIMs.
Assuntos
Genética Populacional , Lúpus Eritematoso Sistêmico/genética , Polimorfismo de Nucleotídeo Único , População Negra/genética , Brasil , Estudos de Casos e Controles , Feminino , Genoma Humano , Humanos , Indígenas Sul-Americanos/genética , Lúpus Eritematoso Sistêmico/etnologia , Masculino , População Branca/genéticaRESUMO
Population stratification has long been recognized as an issue in genetic association studies because unrecognized population stratification can lead to both false-positive and false-negative findings and can obscure true association signals if not appropriately corrected. This issue can be even worse in rare variant association analyses because rare variants often demonstrate stronger and potentially different patterns of stratification than common variants. To correct for population stratification in genetic association studies, we proposed a novel method to Test the effect of an Optimally Weighted combination of variants in Admixed populations (TOWA) in which the analytically derived optimal weights can be calculated from existing phenotype and genotype data. TOWA up weights rare variants and those variants that have strong associations with the phenotype. Additionally, it can adjust for the direction of the association, and allows for local ancestry difference among study subjects. Extensive simulations show that the type I error rate of TOWA is under control in the presence of population stratification and it is more powerful than existing methods. We have also applied TOWA to a real sequencing data. Our simulation studies as well as real data analysis results indicate that TOWA is a useful tool for rare variant association analyses in admixed populations.
Assuntos
Algoritmos , Estudos de Associação Genética/métodos , Predisposição Genética para Doença , Variação Genética , Genética Populacional , Modelos Genéticos , Grupos Populacionais/genética , Estudos de Casos e Controles , Simulação por Computador , Genótipo , Hematócrito , Humanos , FenótipoRESUMO
BACKGROUND: There are different genetic patterns for cardio-metabolic parameters among different populations. Additionally, it has been found that ancestral genetic components (the proportion of Amerindian, European and African) in admixed Latin American populations influence an individual's susceptibility to cardio-metabolic disorders. The aim of this study was to evaluate the effect of ancestral genetic composition on a series of cardio-metabolic risk factors in a young admixed population from Colombia. RESULTS: In a sample of 853 Colombian youth, 10 to 18 years old, the mean European contribution was 66.6 % (range: 41-82 %), the mean African contribution was 14 % (range: 4-48 %), and the mean Amerindian contribution was 19.4 % (range: 10-35 %) using a panel of 40 autosomal ancestry-informative markers (AIMs). We assessed the degree of association between ancestral African, Amerindian and European genetic components and measures of body mass index, waist circumference, fasting glucose, fasting insulin, insulin resistance, triglycerides, high-density lipoprotein, and systolic and diastolic blood pressure. Two of the nine measures assessed presented a nominal significant association with ancestral components after adjusting for confounding variables: triglyceride levels were associated with the Amerindian component (OR = 1.06, 98.3 % CI = 1.01-1.11, P = 0.002) and systolic blood pressure was associated with the European component (OR = 0.93, 98.3 % CI = 0.87 to 0.99, P = 0.008) and the African component (OR = 1.07, CI = 1.01-1.14 P = 0.008), although it was not significant following a global Bonferroni correction. Additionally, insulin levels and insulin resistance showed associations with the African component. CONCLUSIONS: Our findings support the idea that an Amerindian ancestral component may act as a risk factor for high triglyceride levels. In addition, an African ancestral component confers a risk for high systolic blood pressure, and a European ancestry serves as a protective factor for this condition in a young admixed population from Colombia. However, these results should be confirmed in a larger population.
Assuntos
Predisposição Genética para Doença/genética , Miocárdio/metabolismo , Adolescente , Pressão Sanguínea/genética , Índice de Massa Corporal , Doenças Cardiovasculares/genética , Doenças Cardiovasculares/metabolismo , Criança , Colômbia/etnologia , Meio Ambiente , Feminino , Humanos , Insulina/metabolismo , Masculino , Fenótipo , Triglicerídeos/metabolismo , Circunferência da Cintura/genética , Adulto JovemRESUMO
The Brazil Ministry of Health maintains a Registry of Bone Marrow Donors that corresponds to approximately 12% of the Bone Marrow Donors Worldwide registry. This registry contains information on ethnicity (by self-assessment of color) and HLA-A, -B, and -DRB1 type. The self-assessment of color tool has been extensively used for admixed population characterization. In this context, Brazil represents a highly admixed population, resulting from 5 centuries of colonization and interbreeding, mainly, but not exclusively, among Native Americans, Europeans, and Africans. Here we evaluated self-assessed skin color and HLA genetic information from 71,291 bone marrow donors of southern Brazil to verify how likely is the HLA profiling correspondence within and between self-assessed color groups. We found that HLA itself was a better ancestry indicator than was self-assessed color. Therefore, self-assessment of color in highly admixed populations, such as that of Brazil, is not indicative of higher correspondence in the HLA profiles within skin color groups.
Assuntos
Alelos , Antígenos HLA/classificação , Haplótipos , Teste de Histocompatibilidade , Sistema de Registros , Doadores de Tecidos , População Negra/genética , Medula Óssea/imunologia , Transplante de Medula Óssea/estatística & dados numéricos , Brasil , Expressão Gênica , Frequência do Gene , Antígenos HLA/genética , Antígenos HLA/imunologia , Humanos , Indígenas Sul-Americanos/genética , Autoavaliação (Psicologia) , População Branca/genéticaRESUMO
Blood selenium (Se) concentrations differ substantially by population and could be influenced by genetic variants, increasing Se deficiency-related diseases. We conducted a genome-wide association study (GWAS) to identify single nucleotide polymorphisms (SNPs) associated with serum Se deficiency in 382 adults with admixed ancestry. Genotyping arrays were combined to yield 90,937 SNPs. R packages were applied to quality control and imputation. We also performed the ancestral proportion analysis. The Search Tool for the Retrieval of Interacting Genes was used to interrogate known protein-protein interaction networks (PPIs). Our ancestral proportion analysis estimated 71% of the genome was from Caucasians, 22% was from Africans, and 8% was from East Asians. We identified the SNP rs1561573 in the TraB domain containing 2B (TRABD2B), rs425664 in MAF bZIP transcription factor (MAF), rs10444656 in spermatogenesis-associated 13 (SPATA13), and rs6592284 in heat shock protein nuclear import factor (HIKESHI) genes. The PPI analysis showed functional associations of Se deficiency, thyroid hormone metabolism, NRF2-ARE and the Wnt pathway, and heat stress. Our findings show evidence of a genetic association between Se deficiency and metabolic pathways indirectly linked to Se regulation, reinforcing the complex relationship between Se intake and the endogenous factors affecting the Se requirements for optimal health.
Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Selênio , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Brasil , Predisposição Genética para Doença , Genótipo , Mapas de Interação de Proteínas/genética , Selênio/sangue , Selênio/deficiência , População Branca/genética , População Africana , População do Leste AsiáticoRESUMO
The MHC class I region contains crucial genes for the innate and adaptive immune response, playing a key role in susceptibility to many autoimmune and infectious diseases. Genome-wide association studies have identified numerous disease-associated SNPs within this region. However, these associations do not fully capture the immune-biological relevance of specific HLA alleles. HLA imputation techniques may leverage available SNP arrays by predicting allele genotypes based on the linkage disequilibrium between SNPs and specific HLA alleles. Successful imputation requires diverse and large reference panels, especially for admixed populations. This study employed a bioinformatics approach to call SNPs and HLA alleles in multi-ethnic samples from the 1000 genomes (1KG) dataset and admixed individuals from Brazil (SABE), utilising 30X whole-genome sequencing data. Using HIBAG, we created three reference panels: 1KG (n = 2504), SABE (n = 1171), and the full model (n = 3675) encompassing all samples. In extensive cross-validation of these reference panels, the multi-ethnic 1KG reference exhibited overall superior performance than the reference with only Brazilian samples. However, the best results were achieved with the full model. Additionally, we expanded the scope of imputation by developing reference panels for non-classical, MICA, MICB and HLA-H genes, previously unavailable for multi-ethnic populations. Validation in an independent Brazilian dataset showcased the superiority of our reference panels over the Michigan Imputation Server, particularly in predicting HLA-B alleles among Brazilians. Our investigations underscored the need to enhance or adapt reference panels to encompass the target population's genetic diversity, emphasising the significance of multiethnic references for accurate imputation across different populations.
Assuntos
Alelos , Etnicidade , Frequência do Gene , Polimorfismo de Nucleotídeo Único , Humanos , Brasil , Etnicidade/genética , Antígenos HLA/genética , Desequilíbrio de Ligação , Estudo de Associação Genômica Ampla/métodos , Genótipo , Genética Populacional/métodos , Antígenos de Histocompatibilidade Classe I/genética , Biologia Computacional/métodosRESUMO
To investigate the potential relationship between HLA alleles and haplotypes and the age at diagnosis of type 1 diabetes (T1DAgeD) in an admixed Brazilian population. This nationwide study was conducted in public clinics across 12 Brazilian cities. We collected demographic and genetic data from 1,600 patients with T1D. DNA samples were utilised to determine genomic ancestry (GA) and perform HLA typings for DRB1, DQA1 and DQB1. We explored allele and haplotype frequencies and GA in patients grouped by T1DAgeD categories (<6 years, ≥6-<11 years, ≥11-<19 years and ≥19 years) through univariate and multivariate analyses and primary component analyses. Additionally, we considered self-reported colour-race and identified a familiar history of T1D in first-degree relatives. The homozygosity index for DRB1~DQA1~DQB1 haplotypes exhibited the highest variation among T1DAgeD groups, and the percentages of Sub-Saharan African and European ancestries showed opposite trends in principal component analysis (PCA) analyses. Regarding the association of alleles and haplotypes with T1DAgeD, risk alleles such as HLA-DQB1*03:02g, -DQA1*03:01g, -02:01g, DRB1*04:05g and -04:02g were more frequently observed in heterozygosity or homozygosity in T1D patients with an early disease onset. Conversely, alleles such as DRB1*07:01g, -13:03g, DQB1*06:02g and DQA1*02:01 were more prevalent in older T1D patients. The combination DR3/DR4.5 was significantly associated with early disease onset. However, gender, GA, familiar history of T1D and self-reported colour-race identity did not exhibit significant associations with the onset of T1D. It is worth noting that the very common risk haplotype DRB1*03:01g~DQA1*05:01g~DQB1*02:01g did not differentiate between T1DAgeD groups. In the admixed Brazilian population, the high-risk haplotype DRB1*04:05~DQA1*03:01~DQB1*03:02 was more prevalent in individuals diagnosed before 6 years of age. In contrast, the protective alleles DQA1*01:02g, DQB1*06:02g, DRB1*07:01g and DRB1*13:03g and haplotypes DRB1*13:03g~DQA1*05:01g~DQB1*03:01g and DRB1*16:02g~DQA1*01:02g~DQB1*05:02g were more frequently observed in patients diagnosed in adulthood. Notably, these associations were independent of factors such as sex, economic status, GA, familiar history of T1D and region of birth in Brazil. These alleles and haplotypes contribute to our understanding of the disease onset heterogeneity and may have implications for early interventions when detected in association with well-known genomic risk or protection factors for T1D.
Assuntos
Alelos , Diabetes Mellitus Tipo 1 , Frequência do Gene , Haplótipos , Humanos , Brasil/epidemiologia , Diabetes Mellitus Tipo 1/genética , Diabetes Mellitus Tipo 1/epidemiologia , Diabetes Mellitus Tipo 1/diagnóstico , Masculino , Feminino , Criança , Adolescente , Adulto , Pré-Escolar , Adulto Jovem , Predisposição Genética para Doença , Cadeias HLA-DRB1/genética , Cadeias alfa de HLA-DQ/genética , Cadeias beta de HLA-DQ/genética , Idade de Início , Lactente , Pessoa de Meia-IdadeRESUMO
BACKGROUND AND AIMS: To investigate associations between Single Nucleotide Polymorphisms (SNPs) in the TAS1R and TAS2R taste receptors and diet quality, intake of alcohol, added sugar, and fat, using linear regression and machine learning techniques in a highly admixed population. METHODS: In the ISA-Capital health survey, 901 individuals were interviewed and had socioeconomic, demographic, health characteristics, along with dietary information obtained through two 24-h recalls. Data on 12 components related to food groups, nutrients, and calories was combined into a diet quality score (BHEI-R). BHEI-R, SoFAAs (calories from added sugar, saturated fat, and alcohol) and Alcohol use were tested for associations with 255 TAS2R SNPs and 73 TAS1R SNPs for 637 individuals with regression analysis and Random Forest. Significant SNPs were combined into Genetic taste scores (GTSs). RESULTS: Among 23 SNPs significantly associated either by stepwise linear/logistic regression or random forest with any possible biological functionality, the missense variants rs149217752 in TAS2R40, for SoFAAs, and rs2233997 in TAS2R4, were associated with both BHEI-R (under 4% increase in Mean Squared Error) and SoFAAs. GTSs increased the variance explanation of quantitative phenotypes and there was a moderately high AUC for alcohol use. CONCLUSIONS: The study provides insights into the genetic basis of human taste perception through the identification of missense variants in the TAS2R gene family. These findings may contribute to future strategies in precision nutrition aimed at improving food quality by reducing added sugar, saturated fat, and alcohol intake.
Assuntos
Polimorfismo de Nucleotídeo Único , Receptores Acoplados a Proteínas G , Paladar , Humanos , Receptores Acoplados a Proteínas G/genética , Feminino , Masculino , Adulto , Paladar/genética , Pessoa de Meia-Idade , Dieta , Ingestão de Alimentos/genética , Mutação de Sentido Incorreto , Adulto Jovem , Consumo de Bebidas Alcoólicas/genéticaRESUMO
Introduction: The Cuban population is genetically diverse, and information on the prevalence of genetic variants is still limited. As complex admixture processes have occurred, we hypothesized that the frequency of pharmacogenetic variants and drug responses may vary within the country. The aims of the study were to describe the frequency distribution of 43 single-nucleotide variants (SNVs) from 25 genes of pharmacogenetic interest within the Cuba population and in relation to other populations, while taking into consideration some descriptive variables such as place of birth and skin color. Materials and Methods: SNVs were analyzed in 357 unrelated healthy Cuban volunteers. Genotype, allele frequencies, and ancestry proportions were determined, and the pairwise fixation index (FST ) was evaluated. Results: Hardy-Weinberg equilibrium (HWE) deviations in six loci (rs11572103, rs2740574, rs776746, rs3025039, rs861539, and rs1762429) were identified. Minor allele frequencies (MAFs) ranged from 0.00 to 0.15 for variants in genes encoding xenobiotic metabolizing enzymes. They also ranged from 0.01 to 0.21 for variants in DNA repair, growth factors, methyltransferase, and methyl-binding proteins, while they ranged from 0.04 to 0.27 for variants in the O-6-methylguanine-DNA methyltransferase enzyme. Moderate genetic divergence was observed upon comparison to Africans (FST = 0.071 and SD 0.079), with 19 markers exhibiting moderate-to-large genetic differentiation. The average European, African, and Amerindian ancestry proportions were 67.8%, 27.2%, and 5.3%, respectively. Ancestry proportions differed by skin color and birthplace for both African and European components, with the exception of the European component, which showed no significant difference between individuals from Western and Eastern regions. Meanwhile, the statistical significance varied in comparisons by skin color and birthplace within the Amerindian component. Low genetic divergence was observed across geographical regions. We identified 12 variants showing moderate-to-large differentiation between White/Black individuals. Conclusion: Altogether, our results may support national strategies for the introduction of pharmacogenetic tools in clinical practice, contributing to the development of precision medicine in Cuba.
RESUMO
Many molecular mechanisms that lead to the host antibody response to COVID-19 vaccines remain largely unknown. In this study, we used serum antibody detection combined with whole blood RNA-based transcriptome analysis to investigate variability in vaccine response in healthy recipients of a booster (third) dose schedule of the mRNA BNT162b2 vaccine against COVID-19. The cohort was divided into two groups: (1) low-stable individuals, with antibody concentration anti-SARS-CoV IgG S1 below 0.4 percentile at 180 days after boosting vaccination; and (2) high-stable individuals, with antibody values greater than 0.6 percentile of the range in the same period (median 9525 [185-80,000] AU/mL). Differential gene expression, expressed single nucleotide variants and insertions/deletions, differential splicing events, and allelic imbalance were explored to broaden our understanding of the immune response sustenance. Our analysis revealed a differential expression of genes with immunological functions in individuals with low antibody titers, compared to those with higher antibody titers, underscoring the fundamental importance of the innate immune response for boosting immunity. Our findings also provide new insights into the determinants of the immune response variability to the SARS-CoV-2 mRNA vaccine booster, highlighting the significance of differential splicing regulatory mechanisms, mainly concerning HLA alleles, in delineating vaccine immunogenicity.