Results 1 - 20 of 64
1.
Nature ; 586(7831): 741-748, 2020 Oct.
Article in English | MEDLINE | ID: mdl-33116287

ABSTRACT

The African continent is regarded as the cradle of modern humans and African genomes contain more genetic variation than those from any other continent, yet only a fraction of the genetic diversity among African individuals has been surveyed [1]. Here we performed whole-genome sequencing analyses of 426 individuals, comprising 50 ethnolinguistic groups and including previously unsampled populations, to explore the breadth of genomic diversity across Africa. We uncovered more than 3 million previously undescribed variants, most of which were found among individuals from newly sampled ethnolinguistic groups, as well as 62 previously unreported loci that are under strong selection, which were predominantly found in genes that are involved in viral immunity, DNA repair and metabolism. We observed complex patterns of ancestral admixture and of putatively damaging and novel variation, both within and between populations, alongside evidence that Zambia was a likely intermediate site along the routes of expansion of Bantu-speaking populations. Pathogenic variants in genes that are currently characterized as medically relevant were uncommon, but in other genes, variants denoted as 'likely pathogenic' in the ClinVar database were commonly observed. Collectively, these findings refine our current understanding of continental migration, identify gene flow and the response to human disease as strong drivers of genome-level population variation, and underscore the scientific imperative for a broader characterization of the genomic diversity of African individuals to understand human ancestry and improve health.


Subject(s)
Genetic Variation, Human Genome/genetics, Genomics, Health, Human Migration, Africa/ethnology, DNA Repair/genetics, Datasets as Topic, Female, Gene Flow, Medical Genetics, Population Genetics, Health/history, Ancient History, Human Migration/history, Humans, Immunity/genetics, Language, Male, Metabolism/genetics, Genetic Selection, Whole Genome Sequencing
2.
BMC Bioinformatics ; 23(1): 498, 2022 Nov 19.
Article in English | MEDLINE | ID: mdl-36402955

ABSTRACT

BACKGROUND: Genome-wide association studies (GWAS) are a powerful method to detect associations between variants and phenotypes. A GWAS requires several complex computations with large data sets, and many steps may need to be repeated with varying parameters. Running these analyses manually can be tedious, error-prone and hard to reproduce. RESULTS: The H3AGWAS workflow from the Pan-African Bioinformatics Network for H3Africa is a powerful, scalable and portable workflow that implements pre-association analysis, various association testing methods and post-association analysis of results. CONCLUSIONS: The workflow scales from laptop to cluster to cloud (e.g., SLURM, AWS Batch, Azure). All required software is containerised and can run under Docker or Singularity.
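As a rough illustration of the kind of association test that such a workflow automates, the following Python sketch computes a basic allelic chi-square test from a 2x2 table of allele counts. It is not taken from the H3AGWAS workflow: the function name `allelic_chi_square` and its arguments are hypothetical, and a real analysis would rely on established tools with quality control and covariate adjustment.

```python
from math import erfc, sqrt
from typing import Tuple


def allelic_chi_square(
    case_alt: int, case_ref: int, ctrl_alt: int, ctrl_ref: int
) -> Tuple[float, float]:
    """Pearson chi-square test (1 degree of freedom) on allele counts.

    Assumes every row and column total is non-zero (hypothetical helper).
    """
    table = [[case_alt, case_ref], [ctrl_alt, ctrl_ref]]
    row_totals = [sum(row) for row in table]
    col_totals = [case_alt + ctrl_alt, case_ref + ctrl_ref]
    n = sum(row_totals)

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count under independence of allele and case status.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    # Survival function of the chi-square distribution with 1 df.
    p_value = erfc(sqrt(chi2 / 2))
    return chi2, p_value
```

Calling `allelic_chi_square(20, 0, 0, 20)` gives a chi-square statistic of 40 for a perfectly separated table, while a balanced table such as `allelic_chi_square(10, 10, 10, 10)` gives a statistic of 0 and a p-value of 1.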


Subject(s)
Computational Biology, Genome-Wide Association Study, Workflow, Computational Biology/methods, Software, Phenotype
4.
Pharmacogenomics J ; 21(6): 649-656, 2021 Dec.
Article in English | MEDLINE | ID: mdl-34302047

ABSTRACT

Chloroquine/hydroxychloroquine have been proposed as potential treatments for COVID-19. These drugs have warning labels for use in individuals with glucose-6-phosphate dehydrogenase (G6PD) deficiency. Analysis of whole genome sequence data of 458 individuals from sub-Saharan Africa showed significant G6PD variation across the continent. We identified nine variants, of which four are potentially deleterious to G6PD function, and one (rs1050828) that is known to cause G6PD deficiency. We supplemented data for the rs1050828 variant with genotype array data from over 11,000 Africans. Although this variant is common in Africans overall, large allele frequency differences exist between sub-populations. African sub-populations in the same country can show significant differences in allele frequency (e.g. 16.0% in Tsonga vs 0.8% in Xhosa, both in South Africa, p = 2.4 × 10⁻³). The high prevalence of variants in the G6PD gene found in this analysis suggests that it may be a significant interaction factor in clinical trials of chloroquine and hydroxychloroquine for treatment of COVID-19 in Africans.
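A comparison of allele frequencies between two sub-populations, as described above, can be sketched with a two-sided Fisher's exact test on allele counts. The abstract does not state which test was used, so this is only one plausible approach; the function name and any counts passed to it are hypothetical and are not the study's data.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows are populations; columns are variant and reference allele counts.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of x variant alleles in population 1.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    return sum(
        p for p in (prob(x) for x in range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9)
    )


# Hypothetical allele counts: 16 of 100 alleles vs 1 of 120 alleles.
p_value = fisher_exact_two_sided(16, 84, 1, 119)
```

Because G6PD lies on the X chromosome, allele counts would normally take sex into account (one allele per male, two per female); the sketch assumes that counting has already been done.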


Subject(s)
COVID-19 Drug Treatment, Chloroquine/adverse effects, Glucosephosphate Dehydrogenase Deficiency/genetics, Glucosephosphate Dehydrogenase/genetics, Hydroxychloroquine/adverse effects, Africa South of the Sahara/epidemiology, COVID-19/epidemiology, COVID-19/genetics, Genetic Databases, Genetic Variation/genetics, Glucosephosphate Dehydrogenase Deficiency/drug therapy, Glucosephosphate Dehydrogenase Deficiency/epidemiology, Humans, Missense Mutation/genetics, Risk Factors
5.
Int J Mol Sci ; 22(15), 2021 Jul 21.
Article in English | MEDLINE | ID: mdl-34360551

ABSTRACT

Pharmacogenomics aims to reveal variants associated with drug response phenotypes. Genes whose roles involve the absorption, distribution, metabolism, and excretion of drugs are highly polymorphic between populations. High coverage whole genome sequencing showed that a large proportion of the variants for these genes are rare in African populations. This study investigated the impact of such variants on protein structure to assess their functional importance. We used genetic data of CYP3A5 from 458 individuals from sub-Saharan Africa to conduct a structural bioinformatics analysis. Five missense variants were modeled and microsecond scale molecular dynamics simulations were conducted for each, as well as for the CYP3A5 wildtype and the Y53C variant, which has a known deleterious impact on enzyme activity. The binding of ritonavir and artemether to CYP3A5 variant structures was also evaluated. Our results showed different conformational characteristics between all the variants. No significant structural changes were noticed. However, the genetic variability seemed to act on the plasticity of the protein. The impact on drug binding might be drug dependent. We concluded that rare variants hold relevance in determining the pharmacogenomics properties of populations. This could have a significant impact on precision medicine applications in sub-Saharan Africa.


Subject(s)
Computer Simulation, Cytochrome P-450 CYP3A/genetics, Population Genetics, Human Genome, Phenotype, Single Nucleotide Polymorphism, Africa South of the Sahara, Genotype, Humans, Whole Genome Sequencing
6.
Hum Mol Genet ; 27(R2): R209-R218, 2018 Aug 1.
Article in English | MEDLINE | ID: mdl-29741686

ABSTRACT

Genetic variation and susceptibility to disease are shaped by human demographic history and adaptation. We can now study the genomes of extant Africans and uncover traces of population migration, admixture, assimilation and selection by applying sophisticated computational algorithms. There are four major ethnolinguistic divisions among present day Africans: Hunter-gatherer populations in southern and central Africa; Nilo-Saharan speakers from north and northeast Africa; Afro-Asiatic speakers from north and east Africa; and Niger-Congo speakers who are the predominant ethnolinguistic group spread across most of sub-Saharan Africa. The enormous ethnolinguistic diversity in sub-Saharan African populations is largely paralleled by extensive genetic diversity and until a decade ago, little was known about detailed origins and divergence of these groups. Results from large-scale population genetic studies, and more recently whole genome sequence data, are unravelling the critical role of events like migration and admixture and environmental factors including diet, infectious diseases and climatic conditions in shaping current population diversity. It is now possible to start providing quantitative estimates of divergence times, population size and dynamic processes that have affected populations and their genetic risk for disease. Finally, the availability of ancient genomes from Africa provides historical insights of unprecedented depth. In this review, we highlight some key interpretations that have emerged from recent African genome studies.


Subject(s)
Biological Adaptation/genetics, Black People/genetics, Africa/ethnology, Biological Evolution, Ethnicity/genetics, Molecular Evolution, Genetic Techniques, Genetic Variation/genetics, Genetics, Population Genetics/methods, Genomics/methods, Haplotypes/genetics, Humans, Single Nucleotide Polymorphism/genetics, Whole Genome Sequencing/methods
7.
Biochem Biophys Res Commun ; 527(3): 702-708, 2020 Jun 30.
Article in English | MEDLINE | ID: mdl-32410735

ABSTRACT

The spread of COVID-19 caused by the SARS-CoV-2 outbreak has been growing since its first identification in December 2019. The publication of the first SARS-CoV-2 genome provided a valuable source of data for studying its phylogeny, evolution, and interaction with the host. Protein-protein binding assays have confirmed that Angiotensin-converting enzyme 2 (ACE2) is more likely to be the cell receptor through which the virus invades the host cell. In the present work, we provide an insight into the interaction of the viral spike Receptor Binding Domain (RBD) from different coronavirus isolates with host ACE2 protein. By calculating the binding energy score between RBD and ACE2, we highlighted the putative jump in the affinity from a progenitor form of SARS-CoV-2 to the current virus responsible for the COVID-19 outbreak. Our result was consistent with previously reported phylogenetic analysis and corroborates the opinion that the interface segment of the spike protein RBD might be acquired by SARS-CoV-2 via a complex evolutionary process rather than a progressive accumulation of mutations. We also highlighted the relevance of Q493 and P499 amino acid residues of SARS-CoV-2 RBD for binding to human ACE2 and maintaining the stability of the interface. Moreover, we show from the structural analysis that it is unlikely for the interface residues to be the result of genetic engineering. Finally, we studied the impact of eight different variants located at the interaction surface of ACE2 on complex formation with SARS-CoV-2 RBD. We found that none of them is likely to disrupt the interaction with the viral RBD of SARS-CoV-2.


Subject(s)
Betacoronavirus/chemistry, Peptidyl-Dipeptidase A/chemistry, Coronavirus Spike Glycoprotein/chemistry, Amino Acid Sequence, Angiotensin-Converting Enzyme 2, Binding Sites, COVID-19, Coronavirus Infections, Humans, Molecular Docking Simulation, Pandemics, Phylogeny, Viral Pneumonia, Protein Domains, Tertiary Protein Structure, SARS-CoV-2
8.
Hum Genet ; 138(10): 1123-1142, 2019 Oct.
Article in English | MEDLINE | ID: mdl-31312899

ABSTRACT

The study of runs of homozygosity (ROH) can shed light on population demographic history and cultural practices. We present a fine-scale ROH analysis of 1679 individuals from 28 sub-Saharan African (SSA) populations along with 1384 individuals from 17 worldwide populations. Using high-density SNP coverage, we could accurately identify ROH > 300 kb using PLINK software. The genomic distribution of ROH was analysed through the identification of ROH islands and regions of heterozygosity (RHZ). The analyses showed a heterogeneous distribution of autozygosity across SSA, revealing complex demographic histories. They highlight differences between African groups and can differentiate the impact of consanguineous practices (e.g. among the Somali) from endogamy (e.g. among several Khoe and San groups). Homozygosity coldspots and hotspots were shown to harbour multiple protein coding genes. Studying ROH therefore not only sheds light on population history, but can also be used to study genetic variation related to adaptation and potentially to the health of extant populations.


Subject(s)
Black People/genetics, Population Genetics, Homozygote, Africa South of the Sahara, Consanguinity, Genetic Crosses, Data Analysis, Demography, Genetic Variation, Genomics/methods, Geography, Humans
9.
Hum Genet ; 138(10): 1143-1144, 2019 Oct.
Article in English | MEDLINE | ID: mdl-31359130

ABSTRACT

In the original published article, Figure 5 (Genomic distribution of ROH) was published incorrectly. The correct figure is given below.

10.
Genome Res ; 26(2): 271-7, 2016 Feb.
Article in English | MEDLINE | ID: mdl-26627985

ABSTRACT

The application of genomics technologies to medicine and biomedical research is increasing in popularity, made possible by new high-throughput genotyping and sequencing technologies and improved data analysis capabilities. Some of the greatest genetic diversity among humans, animals, plants, and microbiota occurs in Africa, yet genomic research outputs from the continent are limited. The Human Heredity and Health in Africa (H3Africa) initiative was established to drive the development of genomic research for human health in Africa, and through recognition of the critical role of bioinformatics in this process, spurred the establishment of H3ABioNet, a pan-African bioinformatics network for H3Africa. The limitations in bioinformatics capacity on the continent have been a major contributory factor to the lack of notable outputs in high-throughput biology research. Although pockets of high-quality bioinformatics teams have existed previously, the majority of research institutions lack experienced faculty who can train and supervise bioinformatics students. H3ABioNet aims to address this dire need, specifically in the area of human genetics and genomics, but knock-on effects are ensuring this extends to other areas of bioinformatics. Here, we describe the emergence of genomics research and the development of bioinformatics in Africa through H3ABioNet.


Subject(s)
Black People/genetics, Health Promotion, Africa, Computational Biology, Computer Systems, Genetic Variation, Medical Genetics, Genomics, Humans
11.
BMC Bioinformatics ; 19(1): 457, 2018 Nov 29.
Article in English | MEDLINE | ID: mdl-30486782

ABSTRACT

BACKGROUND: The Pan-African bioinformatics network, H3ABioNet, comprises 27 research institutions in 17 African countries. H3ABioNet is part of the Human Heredity and Health in Africa program (H3Africa), an African-led research consortium funded by the US National Institutes of Health and the UK Wellcome Trust, aimed at using genomics to study and improve the health of Africans. A key role of H3ABioNet is to support H3Africa projects by building bioinformatics infrastructure such as portable and reproducible bioinformatics workflows for use on heterogeneous African computing environments. Processing and analysis of genomic data is an example of a big data application requiring complex interdependent data analysis workflows. Such bioinformatics workflows take the primary and secondary input data through several computationally intensive processing steps using different software packages, where some of the outputs form inputs for other steps. Implementing scalable, reproducible, portable and easy-to-use workflows is particularly challenging. RESULTS: H3ABioNet has built four workflows to support (1) the calling of variants from high-throughput sequencing data; (2) the analysis of microbial populations from 16S rDNA sequence data; (3) genotyping and genome-wide association studies; and (4) single nucleotide polymorphism imputation. A week-long hackathon was organized in August 2016 with participants from six African bioinformatics groups, and US and European collaborators. Two of the workflows are built using the Common Workflow Language framework (CWL) and two using Nextflow. All the workflows are containerized for improved portability and reproducibility using Docker, and are publicly available for use by members of the H3Africa consortium and the international research community.
CONCLUSION: The H3ABioNet workflows have been implemented with a view to offering ease of use for the end user and high levels of reproducibility and portability, all while following modern state-of-the-art bioinformatics data processing protocols. The H3ABioNet workflows will service the H3Africa consortium projects and are currently in use. All four workflows are also publicly available for research scientists worldwide to use and adapt for their respective needs. The H3ABioNet workflows will help develop bioinformatics capacity and assist genomics research within Africa and serve to increase the scientific output of H3Africa and its Pan-African Bioinformatics Network.


Subject(s)
Computational Biology/methods, Genomics/methods, Africa, Humans, Reproducibility of Results
12.
BMC Genomics ; 19(1): 106, 2018 Jan 30.
Article in English | MEDLINE | ID: mdl-29378520

ABSTRACT

BACKGROUND: Runs of Homozygosity (ROH) are genomic regions where identical haplotypes are inherited from each parent. Since their first detection due to technological advances in the late 1990s, ROHs have been shedding light on human population history and deciphering the genetic basis of monogenic and complex traits and diseases. ROH studies have predominantly exploited SNP array data, but are gradually moving to whole genome sequence (WGS) data as it becomes available. WGS data, covering more genetic variability, can add value to ROH studies, but require additional considerations during analysis. RESULTS: Using SNP array and low coverage WGS data from 1885 individuals from 20 world populations, our aims were to compare ROH from the two datasets and to establish software conditions to get comparable results, thus providing guidelines for combining disparate datasets in joint ROH analyses. By allowing heterozygous SNPs per window, using the PLINK homozygosity function and non-parametric analysis, we were able to obtain non-significant differences in number of ROH, mean ROH size and total sum of ROH between data sets using the different technologies for almost all populations. CONCLUSIONS: By allowing 3 heterozygous SNPs per ROH when dealing with WGS low coverage data, it is possible to establish meaningful comparisons between data using SNP array and WGS low coverage technologies.
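The effect of tolerating a few heterozygous SNPs inside a run can be sketched with a simple greedy scan. This is a simplified illustration, not PLINK's sliding-window algorithm: the function `find_roh`, its parameters, and the toy positions are all hypothetical.

```python
from typing import List, Sequence, Tuple


def find_roh(
    positions: Sequence[int],
    is_het: Sequence[bool],
    min_length_bp: int,
    max_het: int,
) -> List[Tuple[int, int]]:
    """Return (start_bp, end_bp) runs containing at most max_het heterozygous SNPs."""
    runs: List[Tuple[int, int]] = []

    def record(start: int, end: int) -> None:
        # Trim heterozygous SNPs at both ends so runs are bounded by homozygous calls.
        while start <= end and is_het[start]:
            start += 1
        while end >= start and is_het[end]:
            end -= 1
        if start <= end and positions[end] - positions[start] >= min_length_bp:
            runs.append((positions[start], positions[end]))

    start = 0
    het_count = 0
    for i, het in enumerate(is_het):
        if het:
            het_count += 1
        if het_count > max_het:
            # Too many heterozygous calls: close the run before SNP i, restart after it.
            record(start, i - 1)
            start = i + 1
            het_count = 0
    record(start, len(positions) - 1)
    return runs


# Toy data: six SNPs spaced 100 kb apart with one heterozygous call in the middle.
positions = [0, 100_000, 200_000, 300_000, 400_000, 500_000]
is_het = [False, False, True, False, False, False]
strict = find_roh(positions, is_het, min_length_bp=300_000, max_het=0)
tolerant = find_roh(positions, is_het, min_length_bp=300_000, max_het=3)
```

With no heterozygous SNPs allowed, the single heterozygous call splits the region into two segments that are each shorter than the length threshold; allowing up to three keeps the region as one run, which is the kind of difference that matters when genotype calls are noisier, as in low coverage WGS data.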


Subject(s)
Genomics, Homozygote, Oligonucleotide Array Sequence Analysis, Heterozygote, Humans, Single Nucleotide Polymorphism, Software
13.
PLoS Comput Biol ; 13(6): e1005419, 2017 Jun.
Article in English | MEDLINE | ID: mdl-28570565

ABSTRACT

The H3ABioNet pan-African bioinformatics network, which is funded to support the Human Heredity and Health in Africa (H3Africa) program, has developed node-assessment exercises to gauge the ability of its participating research and service groups to analyze typical genome-wide datasets being generated by H3Africa research groups. We describe a framework for the assessment of computational genomics analysis skills, which includes standard operating procedures, training and test datasets, and a process for administering the exercise. We present the experiences of 3 research groups that have taken the exercise and the impact on their ability to manage complex projects. Finally, we discuss the reasons why many H3ABioNet nodes have declined so far to participate and potential strategies to encourage them to do so.


Subject(s)
Black People/genetics, Genetic Databases, Genomics/methods, Database Management Systems, Developing Countries, Humans, Nigeria, South Africa
14.
PLoS Comput Biol ; 12(2): e1004395, 2016 Feb.
Article in English | MEDLINE | ID: mdl-26845152

ABSTRACT

Bioinformatics is now a critical skill in many research and commercial environments as biological data are increasing in both size and complexity. South African researchers recognized this need in the mid-1990s and responded by working with the government as well as international bodies to develop initiatives to build bioinformatics capacity in the country. Significant injections of support from these bodies provided a springboard for the establishment of computational biology units at multiple universities throughout the country, which took on teaching, basic research and support roles. Several challenges were encountered, for example with unreliability of funding, lack of skills, and lack of infrastructure. However, the bioinformatics community worked together to overcome these, and South Africa is now arguably the leading country in bioinformatics on the African continent. Here we discuss how the discipline developed in the country, highlighting the challenges, successes, and lessons learnt.


Subject(s)
Computational Biology, Biotechnology, Computational Biology/education, Computational Biology/history, Computational Biology/organization & administration, 20th Century History, 21st Century History, Humans, South Africa
15.
Bioinformatics ; 31(18): 3027-34, 2015 Sep 15.
Article in English | MEDLINE | ID: mdl-25979473

ABSTRACT

MOTIVATION: Interactions between amino acids are important determinants of the structure, stability and function of proteins. Several tools have been developed for the identification and analysis of such interactions in proteins based on the extensive studies carried out on high-resolution structures from the Protein Data Bank (PDB). Although these tools allow users to identify and analyze interactions, analysis can only be performed on one structure at a time. This makes it difficult and time consuming to study the significance of these interactions on a large scale. RESULTS: SpeeDB is a web-based tool for the identification of protein structures based on structural properties. SpeeDB queries are executed on all structures in the PDB at once, quickly enough for interactive use. SpeeDB includes standard queries based on published criteria for identifying various structures: disulphide bonds, catalytic triads and aromatic-aromatic, sulphur-aromatic, cation-π and ionic interactions. Users can also construct custom queries in the user interface without any programming. Results can be downloaded in a Comma Separated Value (CSV) format for further analysis with other tools. Case studies presented in this article demonstrate how SpeeDB can be used to answer various biological questions. Analysis of human proteases revealed that disulphide bonds are the predominant type of interaction and are located close to the active site, where they promote substrate specificity. When comparing the two homologous G protein-coupled receptors and the two protein kinase paralogs analyzed, the differences in the types of interactions responsible for stability account for the differences in specificity and functionality of the structures. AVAILABILITY AND IMPLEMENTATION: SpeeDB is available at http://www.parallelcomputing.ca as a web service. CONTACT: d@drobilla.net SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
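To give a feel for what a structural query such as a disulphide bond search involves, here is a minimal Python sketch based on a distance cutoff between cysteine sulphur (SG) atoms. It does not use SpeeDB or reproduce its criteria: the function name, the input layout, and the 2.5 Å cutoff are assumptions made for illustration only.

```python
from math import dist
from typing import List, Sequence, Tuple

# (residue label, (x, y, z) coordinates in angstroms) for each cysteine SG atom.
Atom = Tuple[str, Tuple[float, float, float]]


def find_disulphide_candidates(
    sg_atoms: Sequence[Atom], max_distance: float = 2.5
) -> List[Tuple[str, str]]:
    """Return residue pairs whose SG atoms lie within max_distance of each other.

    The default cutoff is an assumed value, not a published SpeeDB criterion.
    """
    pairs: List[Tuple[str, str]] = []
    for i, (res_a, xyz_a) in enumerate(sg_atoms):
        # Compare each atom only with later ones so every pair is tested once.
        for res_b, xyz_b in sg_atoms[i + 1:]:
            if dist(xyz_a, xyz_b) <= max_distance:
                pairs.append((res_a, res_b))
    return pairs


# Toy coordinates: only the first two atoms are close enough to be bonded.
toy_atoms = [
    ("CYS22", (0.0, 0.0, 0.0)),
    ("CYS65", (2.0, 0.0, 0.0)),
    ("CYS110", (10.0, 0.0, 0.0)),
]
candidates = find_disulphide_candidates(toy_atoms)
```

This pairwise scan handles one structure at a time, which is exactly the limitation the abstract describes; running such a query over the whole PDB interactively requires indexing and parallel execution that the sketch does not attempt.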


Subject(s)
Algorithms, Computational Biology/methods, Peptide Hydrolases/chemistry, Software, Amino Acids/chemistry, Protein Databases, Humans, Hydrogen Bonding, Molecular Models, Protein Conformation, Protein Structural Homology
16.
BMC Bioinformatics ; 16: 255, 2015 Aug 14.
Article in English | MEDLINE | ID: mdl-26269100

ABSTRACT

BACKGROUND: Selective pressures at the DNA level shape genes into profiles consisting of patterns of rapidly evolving sites and sites withstanding change. These profiles remain detectable even when protein sequences become extensively diverged. A common task in molecular biology is to infer functional, structural or evolutionary relationships by querying a database using an algorithm. However, problems arise when sequence similarity is low. This study presents an algorithm that uses the evolutionary rate at codon sites, the dN/dS (ω) parameter, coupled to a substitution matrix as an alignment metric for detecting distantly related proteins. The algorithm, called BLOSUM-FIRE couples a newer and improved version of the original FIRE (Functional Inference using Rates of Evolution) algorithm with an amino acid substitution matrix in a dynamic scoring function. The enigmatic hepatitis B virus X protein was used as a test case for BLOSUM-FIRE and its associated database EvoDB. RESULTS: The evolutionary rate based approach was coupled with a conventional BLOSUM substitution matrix. The two approaches are combined in a dynamic scoring function, which uses the selective pressure to score aligned residues. The dynamic scoring function is based on a coupled additive approach that scores aligned sites based on the level of conservation inferred from the ω values. Evaluation of the accuracy of this new implementation, BLOSUM-FIRE, using MAFFT alignment as reference alignments has shown that it is more accurate than its predecessor FIRE. Comparison of the alignment quality with widely used algorithms (MUSCLE, T-COFFEE, and CLUSTAL Omega) revealed that the BLOSUM-FIRE algorithm performs as well as conventional algorithms. Its main strength lies in that it provides greater potential for aligning divergent sequences and addresses the problem of low specificity inherent in the original FIRE algorithm. 
The utility of this algorithm is demonstrated using the Hepatitis B virus X (HBx) protein, a protein of unknown function, as a test case. CONCLUSION: This study describes the utility of an evolutionary rate based approach coupled to the BLOSUM62 amino acid substitution matrix in inferring protein domain function. We demonstrate that such an approach is robust and performs as well as an array of conventional algorithms.


Subject(s)
Algorithms, Amino Acid Substitution, Molecular Evolution, Proteins/chemistry, Sequence Alignment/methods, Protein Sequence Analysis/methods, Amino Acid Sequence, Codon/genetics, Factual Databases, Humans, Molecular Sequence Data, Tertiary Protein Structure, Amino Acid Sequence Homology
17.
BMC Genomics ; 15: 437, 2014 Jun 06.
Article in English | MEDLINE | ID: mdl-24906912

ABSTRACT

BACKGROUND: Population differentiation is the result of demographic and evolutionary forces. Whole genome datasets from the 1000 Genomes Project (October 2012) provide an unbiased view of genetic variation across populations from Europe, Asia, Africa and the Americas. Common population-specific SNPs (MAF > 0.05) reflect a deep history and may have important consequences for health and wellbeing. Their interpretation is contextualised by currently available genome data. RESULTS: The identification of common population-specific (CPS) variants (SNPs and SSV) is influenced by admixture and the sample size under investigation. Nine of the populations in the 1000 Genomes Project (2 African, 2 Asian (including a merged Chinese group) and 5 European) revealed that the African populations (LWK and YRI), followed by the Japanese (JPT) have the highest number of CPS SNPs, in concordance with their histories and given the populations studied. Using two methods, sliding 50-SNP and 5-kb windows, the CPS SNPs showed distinct clustering across large genome segments and little overlap of clusters between populations. iHS enrichment score and the population branch statistic (PBS) analyses suggest that selective sweeps are unlikely to account for the clustering and population specificity. Of interest is the association of clusters close to recombination hotspots. Functional analysis of genes associated with the CPS SNPs revealed over-representation of genes in pathways associated with neuronal development, including axonal guidance signalling and CREB signalling in neurones. CONCLUSIONS: Common population-specific SNPs are non-randomly distributed throughout the genome and are significantly associated with recombination hotspots. Since the variant alleles of most CPS SNPs are the derived allele, they likely arose in the specific population after a split from a common ancestor. 
Their proximity to genes involved in specific pathways, including neuronal development, suggests evolutionary plasticity of selected genomic regions. Contrary to expectation, selective sweeps did not play a large role in the persistence of population-specific variation. This suggests a stochastic process towards population-specific variation which reflects demographic histories and may have some interesting implications for health and susceptibility to disease.
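The idea of a common population-specific SNP can be sketched as a filter over per-population minor allele frequencies. The abstract gives the MAF > 0.05 threshold but not the exact specificity rule, so this sketch assumes a variant counts as population-specific when it is observed in exactly one population; the function name and the toy frequencies are hypothetical.

```python
from typing import Dict


def common_population_specific(
    maf_by_snp: Dict[str, Dict[str, float]], threshold: float = 0.05
) -> Dict[str, str]:
    """Map each qualifying SNP to the single population that carries it.

    Assumed rule: MAF > threshold in one population and MAF == 0 in all others.
    """
    specific: Dict[str, str] = {}
    for snp, mafs in maf_by_snp.items():
        carriers = [pop for pop, maf in mafs.items() if maf > 0]
        if len(carriers) == 1 and mafs[carriers[0]] > threshold:
            specific[snp] = carriers[0]
    return specific


# Toy frequencies for three SNPs in three 1000 Genomes population codes.
toy_mafs = {
    "snpA": {"YRI": 0.12, "JPT": 0.0, "CEU": 0.0},   # common and specific to YRI
    "snpB": {"YRI": 0.02, "JPT": 0.0, "CEU": 0.0},   # specific but too rare
    "snpC": {"YRI": 0.20, "JPT": 0.10, "CEU": 0.0},  # shared by two populations
}
cps = common_population_specific(toy_mafs)
```

As the abstract notes, the outcome of any such filter depends on admixture and sample size: a variant absent from a small sample may simply be unobserved rather than truly missing from that population.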


Subject(s)
Population Genetics, Human Genome, Single Nucleotide Polymorphism, Racial Groups/genetics, Alleles, Computational Biology, Nucleic Acid Databases, Molecular Evolution, Humans, Genetic Recombination, Genetic Selection
18.
Mol Med ; 20: 341-9, 2014 Aug 14.
Article in English | MEDLINE | ID: mdl-25014791

ABSTRACT

The aim of this study was to identify genetic variants associated with rheumatoid arthritis (RA) risk in black South Africans. Black South African RA patients (n = 263) were compared with healthy controls (n = 374). Genotyping was performed using the Immunochip, and four-digit high-resolution human leukocyte antigen (HLA) typing was performed by DNA sequencing of exon 2. Standard quality control measures were implemented on the data. The strongest associations were in the intergenic region between the HLA-DRB1 and HLA-DQA1 loci. After conditioning on HLA-DRB1 alleles, the effect in the rest of the extended major histocompatibility complex (MHC) diminished. Non-HLA single nucleotide polymorphisms (SNPs) in the intergenic regions LOC389203|RBPJ, LOC100131131|IL1R1, KIAA1919|REV3L, LOC643749|TRAF3IP2, and SNPs in the intron and untranslated regions (UTR) of IRF1 and the intronic region of ICOS and KIAA1542 showed association with RA (p < 5 × 10⁻⁵). Of the SNPs previously associated with RA in Caucasians, one SNP, rs874040, located in the intergenic region LOC389203|RBPJ, was replicated in this study. None of the variants in the PTPN22 gene was significantly associated. The seropositive subgroups showed similar results to the overall cohort. The effects observed across the HLA region are most likely due to HLA-DRB1, and secondary effects in the extended MHC cannot be detected. Seven non-HLA loci are associated with RA in black South Africans. Similar to Caucasians, the intergenic region between LOC389203 and RBPJ is associated with RA in this population. The strong association of the R620W variant of the PTPN22 gene with RA in Caucasians was not replicated since this variant was monomorphic in our study, but other SNP variants of the PTPN22 gene were also not associated with RA in black South Africans, suggesting that this locus does not play a major role in RA in this population.


Subject(s)
Rheumatoid Arthritis/genetics, Black People/genetics, Genetic Predisposition to Disease, Signal Transducing Adaptor Proteins, DNA-Binding Proteins/genetics, DNA-Directed DNA Polymerase/genetics, Female, Genotype, Humans, Immunoglobulin J Recombination Signal Sequence-Binding Protein/genetics, Inducible T-Cell Co-Stimulator Protein/genetics, Interferon Regulatory Factor-1/genetics, Male, Single Nucleotide Polymorphism, Protein Tyrosine Phosphatase Non-Receptor Type 22/genetics, Interleukin-1 Type I Receptors/genetics, South Africa, Tumor Necrosis Factor Receptor-Associated Peptides and Proteins/genetics
19.
Clin Pharmacol Ther ; 115(3): 576-594, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38049200

ABSTRACT

Genetic variation in CYP2B6 and CYP2A6 is known to impact interindividual response to antiretrovirals, nicotine, and bupropion, among other drugs. However, the full catalogue of clinically relevant pharmacogenetic variants in these genes is yet to be established, especially across African populations. This study therefore aimed to characterize the star allele (haplotype) distribution in CYP2B6 and CYP2A6 across diverse and understudied sub-Saharan African (SSA) populations. We called star alleles from 961 high-depth full genomes using StellarPGx, Aldy, and PyPGx. In addition, we performed CYP2B6 and CYP2A6 star allele frequency comparisons between SSA and other global biogeographical groups represented in the new 1000 Genomes Project high-coverage dataset (n = 2,000). This study presents frequency information for star alleles in CYP2B6 (e.g., *6 and *18; frequency of 21-47% and 2-19%, respectively) and CYP2A6 (e.g., *4, *9, and *17; frequency of 0-6%, 3-10%, and 6-20%, respectively), and predicted phenotypes (for CYP2B6), across various African populations. In addition, 50 potentially novel African-ancestry star alleles were computationally predicted by StellarPGx in CYP2B6 and CYP2A6 combined. For each of these genes, over 4% of the study participants had predicted novel star alleles. Three novel star alleles in CYP2A6 (*54, *55, and *56) and CYP2B6 apiece, and several suballeles were further validated via targeted Single-Molecule Real-Time resequencing. Our findings are important for informing the design of comprehensive pharmacogenetic testing platforms, and are highly relevant for personalized medicine strategies, especially relating to antiretroviral medication and smoking cessation treatment in Africa and the African diaspora. More broadly, this study highlights the importance of sampling diverse African ethnolinguistic groups for accurate characterization of the pharmacogene variation landscape across the continent.


Subject(s)
Nicotine , Pharmacogenetics , Humans , Cytochrome P-450 CYP2B6/genetics , Cytochrome P-450 CYP2A6/genetics , Gene Frequency , Africa South of the Sahara , Genotype , Alleles
20.
medRxiv ; 2024 Apr 12.
Article in English | MEDLINE | ID: mdl-38293229

ABSTRACT

BACKGROUND: Genome-wide association studies (GWAS) have predominantly focused on populations of European and Asian ancestry, limiting our understanding of genetic factors influencing kidney disease in Sub-Saharan African (SSA) populations. This study presents the largest GWAS for urinary albumin-to-creatinine ratio (UACR) in SSA individuals, including 8,970 participants living in different African regions and an additional 9,705 non-resident individuals of African ancestry from the UK Biobank and African American cohorts. METHODS: Urine biomarkers and genotype data were obtained from two SSA cohorts (AWI-Gen and ARK), and two non-resident African-ancestry studies (UK Biobank and CKD-Gen Consortium). Association testing and meta-analyses were conducted, with subsequent fine-mapping, conditional analyses, and replication studies. Polygenic scores (PGS) were assessed for transferability across populations. RESULTS: Two genome-wide significant (P < 5 × 10⁻⁸) UACR-associated loci were identified: one in the BMP6 region on chromosome 6, in the meta-analysis of resident African individuals, and another in the HBB region on chromosome 11, in the meta-analysis of non-resident SSA individuals as well as the combined meta-analysis of all studies. Replication of previous significant results confirmed associations in known UACR-associated regions, including THB53, GATM, and ARL15. PGS estimated using previous studies from European ancestry, African ancestry, and multi-ancestry cohorts exhibited limited transferability across populations, with less than 1% of observed variance explained. CONCLUSION: This study contributes novel insights into the genetic architecture of kidney disease in SSA populations, emphasizing the need for conducting genetic research in diverse cohorts.
The identified loci provide a foundation for future investigations into the genetic susceptibility to chronic kidney disease in underrepresented African populations. Additionally, there is a need to develop integrated scores using multi-omics data and risk factors specific to the African context to improve the accuracy of predicting disease outcomes.
