Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
1.
Bioinformatics ; 40(4)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38490256

RESUMO

SUMMARY: Admixed populations, with their unique and diverse genetic backgrounds, are often underrepresented in genetic studies. This oversight not only limits our understanding but also exacerbates existing health disparities. One major barrier has been the lack of efficient tools tailored for the special challenges of genetic studies of admixed populations. Here, we present admix-kit, an integrated toolkit and pipeline for genetic analyses of admixed populations. Admix-kit implements a suite of methods to facilitate genotype and phenotype simulation, association testing, genetic architecture inference, and polygenic scoring in admixed populations. AVAILABILITY AND IMPLEMENTATION: Admix-kit package is open-source and available at https://github.com/KangchengHou/admix-kit. Additionally, users can use the pipeline designed for admixed genotype simulation available at https://github.com/UW-GAC/admix-kit_workflow.


Assuntos
Software , Genótipo , Fenótipo
2.
Cell Rep Med ; 5(2): 101430, 2024 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-38382466

RESUMO

Primary open-angle glaucoma (POAG), a leading cause of irreversible blindness globally, shows disparity in prevalence and manifestations across ancestries. We perform meta-analysis across 15 biobanks (of the Global Biobank Meta-analysis Initiative) (n = 1,487,441: cases = 26,848) and merge with previous multi-ancestry studies, with the combined dataset representing the largest and most diverse POAG study to date (n = 1,478,037: cases = 46,325) and identify 17 novel significant loci, 5 of which were ancestry specific. Gene-enrichment and transcriptome-wide association analyses implicate vascular and cancer genes, a fifth of which are primary ciliary related. We perform an extensive statistical analysis of SIX6 and CDKN2B-AS1 loci in human GTEx data and across large electronic health records showing interaction between SIX6 gene and causal variants in the chr9p21.3 locus, with expression effect on CDKN2A/B. Our results suggest that some POAG risk variants may be ancestry specific, sex specific, or both, and support the contribution of genes involved in programmed cell death in POAG pathogenesis.


Assuntos
Predisposição Genética para Doença , Glaucoma de Ângulo Aberto , Masculino , Feminino , Humanos , Predisposição Genética para Doença/genética , Glaucoma de Ângulo Aberto/genética , Glaucoma de Ângulo Aberto/epidemiologia , Polimorfismo de Nucleotídeo Único , Proliferação de Células , Biologia
3.
Nat Rev Genet ; 25(1): 8-25, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37620596

RESUMO

Polygenic risk scores (PRSs) summarize the genetic predisposition of a complex human trait or disease and may become a valuable tool for advancing precision medicine. However, PRSs that are developed in populations of predominantly European genetic ancestries can increase health disparities due to poor predictive performance in individuals of diverse and complex genetic ancestries. We describe genetic and modifiable risk factors that limit the transferability of PRSs across populations and review the strengths and weaknesses of existing PRS construction methods for diverse ancestries. Developing PRSs that benefit global populations in research and clinical settings provides an opportunity for innovation and is essential for health equity.


Assuntos
Predisposição Genética para Doença , Humanos , Fatores de Risco , Herança Multifatorial , Medicina de Precisão , Estudo de Associação Genômica Ampla
4.
bioRxiv ; 2023 Oct 02.
Artigo em Inglês | MEDLINE | ID: mdl-37873338

RESUMO

Admixed populations, with their unique and diverse genetic backgrounds, are often underrepresented in genetic studies. This oversight not only limits our understanding but also exacerbates existing health disparities. One major barrier has been the lack of efficient tools tailored for the special challenges of genetic study of admixed populations. Here, we present admix-kit, an integrated toolkit and pipeline for genetic analyses of admixed populations. Admix-kit implements a suite of methods to facilitate genotype and phenotype simulation, association testing, genetic architecture inference, and polygenic scoring in admixed populations.

5.
Cell ; 186(5): 923-939.e14, 2023 03 02.
Artigo em Inglês | MEDLINE | ID: mdl-36868214

RESUMO

We conduct high coverage (>30×) whole-genome sequencing of 180 individuals from 12 indigenous African populations. We identify millions of unreported variants, many predicted to be functionally important. We observe that the ancestors of southern African San and central African rainforest hunter-gatherers (RHG) diverged from other populations >200 kya and maintained a large effective population size. We observe evidence for ancient population structure in Africa and for multiple introgression events from "ghost" populations with highly diverged genetic lineages. Although currently geographically isolated, we observe evidence for gene flow between eastern and southern Khoesan-speaking hunter-gatherer populations lasting until ∼12 kya. We identify signatures of local adaptation for traits related to skin color, immune response, height, and metabolic processes. We identify a positively selected variant in the lightly pigmented San that influences pigmentation in vitro by regulating the enhancer activity and gene expression of PDPK1.


Assuntos
Aclimatação , Pigmentação da Pele , Humanos , Sequenciamento Completo do Genoma , Densidade Demográfica , África , Proteínas Quinases Dependentes de 3-Fosfoinositídeo
6.
Cell Genom ; 3(1): 100241, 2023 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-36777179

RESUMO

Polygenic risk scores (PRSs) have been widely explored in precision medicine. However, few studies have thoroughly investigated their best practices in global populations across different diseases. We here utilized data from Global Biobank Meta-analysis Initiative (GBMI) to explore methodological considerations and PRS performance in 9 different biobanks for 14 disease endpoints. Specifically, we constructed PRSs using pruning and thresholding (P + T) and PRS-continuous shrinkage (CS). For both methods, using a European-based linkage disequilibrium (LD) reference panel resulted in comparable or higher prediction accuracy compared with several other non-European-based panels. PRS-CS overall outperformed the classic P + T method, especially for endpoints with higher SNP-based heritability. Notably, prediction accuracy is heterogeneous across endpoints, biobanks, and ancestries, especially for asthma, which has known variation in disease prevalence across populations. Overall, we provide lessons for PRS construction, evaluation, and interpretation using GBMI resources and highlight the importance of best practices for PRS in the biobank-scale genomics era.

7.
BMC Genomics ; 24(1): 75, 2023 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-36797672

RESUMO

BACKGROUND: Exfoliation syndrome (XFS) is an age-related systemic disorder characterized by excessive production and progressive accumulation of abnormal extracellular material, with pathognomonic ocular manifestations. It is the most common cause of secondary glaucoma, resulting in widespread global blindness. The largest global meta-analysis of XFS in 123,457 multi-ethnic individuals from 24 countries identified seven loci with the strongest association signal in chr15q22-25 region near LOXL1. Expression analysis have so far correlated coding and a few non-coding variants in the region with LOXL1 expression levels, but functional effects of these variants is unclear. We hypothesize that analysis of the contribution of the genetically determined component of gene expression to XFS risk can provide a powerful method to elucidate potential roles of additional genes and clarify biology that underlie XFS. RESULTS: Transcriptomic Wide Association Studies (TWAS) using PrediXcan models trained in 48 GTEx tissues leveraging on results from the multi-ethnic and European ancestry GWAS were performed. To eliminate the possibility of false-positive results due to Linkage Disequilibrium (LD) contamination, we i) performed PrediXcan analysis in reduced models removing variants in LD with LOXL1 missense variants associated with XFS, and variants in LOXL1 models in both multiethnic and European ancestry individuals, ii) conducted conditional analysis of the significant signals in European ancestry individuals, and iii) filtered signals based on correlated gene expression, LD and shared eQTLs, iv) conducted expression validation analysis in human iris tissues. We observed twenty-eight genes in chr15q22-25 region that showed statistically significant associations, which were whittled down to ten genes after statistical validations. In experimental analysis, mRNA transcript levels for ARID3B, CD276, LOXL1, NEO1, SCAMP2, and UBL7 were significantly decreased in iris tissues from XFS patients compared to control samples. TWAS genes for XFS were significantly enriched for genes associated with inflammatory conditions. We also observed a higher incidence of XFS comorbidity with inflammatory and connective tissue diseases. CONCLUSION: Our results implicate a role for connective tissues and inflammation pathways in the etiology of XFS. Targeting the inflammatory pathway may be a potential therapeutic option to reduce progression in XFS.


Assuntos
Síndrome de Exfoliação , Humanos , Síndrome de Exfoliação/genética , Síndrome de Exfoliação/complicações , Síndrome de Exfoliação/metabolismo , Aminoácido Oxirredutases/genética , RNA Mensageiro , Mutação de Sentido Incorreto , Expressão Gênica , Polimorfismo de Nucleotídeo Único , Proteínas de Ligação a DNA/genética , Antígenos B7/genética
8.
Genome Biol ; 24(1): 35, 2023 02 24.
Artigo em Inglês | MEDLINE | ID: mdl-36829244

RESUMO

BACKGROUND: Mapping of quantitative trait loci (QTL) associated with molecular phenotypes is a powerful approach for identifying the genes and molecular mechanisms underlying human traits and diseases, though most studies have focused on individuals of European descent. While important progress has been made to study a greater diversity of human populations, many groups remain unstudied, particularly among indigenous populations within Africa. To better understand the genetics of gene regulation in East Africans, we perform expression and splicing QTL mapping in whole blood from a cohort of 162 diverse Africans from Ethiopia and Tanzania. We assess replication of these QTLs in cohorts of predominantly European ancestry and identify candidate genes under selection in human populations. RESULTS: We find the gene regulatory architecture of African and non-African populations is broadly shared, though there is a considerable amount of variation at individual loci across populations. Comparing our analyses to an equivalently sized cohort of European Americans, we find that QTL mapping in Africans improves the detection of expression QTLs and fine-mapping of causal variation. Integrating our QTL scans with signatures of natural selection, we find several genes related to immunity and metabolism that are highly differentiated between Africans and non-Africans, as well as a gene associated with pigmentation. CONCLUSION: Extending QTL mapping studies beyond European ancestry, particularly to diverse indigenous populations, is vital for a complete understanding of the genetic architecture of human traits and can reveal novel functional variation underlying human traits and disease.


Assuntos
População da África Oriental , Locos de Características Quantitativas , Humanos , Mapeamento Cromossômico , Expressão Gênica , Tanzânia , Variação Genética
9.
Cell Genom ; 2(10)2022 Oct 12.
Artigo em Inglês | MEDLINE | ID: mdl-36341024

RESUMO

The Global Biobank Meta-analysis Initiative (GBMI), through its diversity, provides a valuable opportunity to study population-wide and ancestry-specific genetic associations. However, with multiple ascertainment strategies and multi-ancestry study populations across biobanks, GBMI presents unique challenges in implementing statistical genetics methods. Transcriptome-wide association studies (TWASs) boost detection power for and provide biological context to genetic associations by integrating genetic variant-to-trait associations from genome-wide association studies (GWASs) with predictive models of gene expression. TWASs present unique challenges beyond GWASs, especially in a multi-biobank, meta-analytic setting. Here, we present the GBMI TWAS pipeline, outlining practical considerations for ancestry and tissue specificity, meta-analytic strategies, and open challenges at every step of the framework. We advise conducting ancestry-stratified TWASs using ancestry-specific expression models and meta-analyzing results using inverse-variance weighting, showing the least test statistic inflation. Our work provides a foundation for adding transcriptomic context to biobank-linked GWASs, allowing for ancestry-aware discovery to accelerate genomic medicine.

10.
Proc Natl Acad Sci U S A ; 119(21): e2123000119, 2022 05 24.
Artigo em Inglês | MEDLINE | ID: mdl-35580180

RESUMO

Human genomic diversity has been shaped by both ancient and ongoing challenges from viruses. The current coronavirus disease 2019 (COVID-19) pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has had a devastating impact on population health. However, genetic diversity and evolutionary forces impacting host genes related to SARS-CoV-2 infection are not well understood. We investigated global patterns of genetic variation and signatures of natural selection at host genes relevant to SARS-CoV-2 infection (angiotensin converting enzyme 2 [ACE2], transmembrane protease serine 2 [TMPRSS2], dipeptidyl peptidase 4 [DPP4], and lymphocyte antigen 6 complex locus E [LY6E]). We analyzed data from 2,012 ethnically diverse Africans and 15,977 individuals of European and African ancestry with electronic health records and integrated with global data from the 1000 Genomes Project. At ACE2, we identified 41 nonsynonymous variants that were rare in most populations, several of which impact protein function. However, three nonsynonymous variants (rs138390800, rs147311723, and rs145437639) were common among central African hunter-gatherers from Cameroon (minor allele frequency 0.083 to 0.164) and are on haplotypes that exhibit signatures of positive selection. We identify signatures of selection impacting variation at regulatory regions influencing ACE2 expression in multiple African populations. At TMPRSS2, we identified 13 amino acid changes that are adaptive and specific to the human lineage compared with the chimpanzee genome. Genetic variants that are targets of natural selection are associated with clinical phenotypes common in patients with COVID-19. Our study provides insights into global variation at host genes related to SARS-CoV-2 infection, which have been shaped by natural selection in some populations, possibly due to prior viral infections.


Assuntos
COVID-19 , África , Enzima de Conversão de Angiotensina 2/genética , COVID-19/genética , Variação Genética , Humanos , Fenótipo , SARS-CoV-2/genética , Seleção Genética
11.
Cell Genom ; 2(10): 100192, 2022 Oct 12.
Artigo em Inglês | MEDLINE | ID: mdl-36777996

RESUMO

Biobanks facilitate genome-wide association studies (GWASs), which have mapped genomic loci across a range of human diseases and traits. However, most biobanks are primarily composed of individuals of European ancestry. We introduce the Global Biobank Meta-analysis Initiative (GBMI)-a collaborative network of 23 biobanks from 4 continents representing more than 2.2 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWASs generated using harmonized genotypes and phenotypes from member biobanks for 14 exemplar diseases and endpoints. This strategy validates that GWASs conducted in diverse biobanks can be integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics. This collaborative effort improves GWAS power for diseases, benefits understudied diseases, and improves risk prediction while also enabling the nomination of disease genes and drug candidates by incorporating gene and protein expression data and providing insight into the underlying biology of human diseases and traits.

12.
Cell Genom ; 2(10): 100181, 2022 Oct 12.
Artigo em Inglês | MEDLINE | ID: mdl-36777997

RESUMO

The research of rare and devastating orphan diseases, such as idiopathic pulmonary fibrosis (IPF) has been limited by the rarity of the disease itself. The prognosis is poor-the prevalence of IPF is only approximately four times the incidence, limiting the recruitment of patients to trials and studies of the underlying biology. Global biobanking efforts can dramatically alter the future of IPF research. We describe a large-scale meta-analysis of IPF, with 8,492 patients and 1,355,819 population controls from 13 biobanks around the globe. Finally, we combine this meta-analysis with the largest available meta-analysis of IPF, reaching 11,160 patients and 1,364,410 population controls. We identify seven novel genome-wide significant loci, only one of which would have been identified if the analysis had been limited to European ancestry individuals. We observe notable pleiotropy across IPF susceptibility and severe COVID-19 infection and note an unexplained sex-heterogeneity effect at the strongest IPF locus MUC5B.

13.
Res Sq ; 2021 Jul 27.
Artigo em Inglês | MEDLINE | ID: mdl-34341784

RESUMO

We investigated global patterns of genetic variation and signatures of natural selection at host genes relevant to SARS-CoV-2 infection ( ACE2, TMPRSS2, DPP4 , and LY6E ). We analyzed novel data from 2,012 ethnically diverse Africans and 15,997 individuals of European and African ancestry with electronic health records, and integrated with global data from the 1000GP. At ACE2 , we identified 41 non-synonymous variants that were rare in most populations, several of which impact protein function. However, three non-synonymous variants were common among Central African hunter-gatherers from Cameroon and are on haplotypes that exhibit signatures of positive selection. We identify strong signatures of selection impacting variation at regulatory regions influencing ACE2 expression in multiple African populations. At TMPRSS2 , we identified 13 amino acid changes that are adaptive and specific to the human lineage. Genetic variants that are targets of natural selection are associated with clinical phenotypes common in patients with COVID-19.

14.
medRxiv ; 2021 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-34230933

RESUMO

We investigated global patterns of genetic variation and signatures of natural selection at host genes relevant to SARS-CoV-2 infection (ACE2, TMPRSS2, DPP4, and LY6E). We analyzed novel data from 2,012 ethnically diverse Africans and 15,997 individuals of European and African ancestry with electronic health records, and integrated with global data from the 1000GP. At ACE2, we identified 41 non-synonymous variants that were rare in most populations, several of which impact protein function. However, three non-synonymous variants were common among Central African hunter-gatherers from Cameroon and are on haplotypes that exhibit signatures of positive selection. We identify strong signatures of selection impacting variation at regulatory regions influencing ACE2 expression in multiple African populations. At TMPRSS2, we identified 13 amino acid changes that are adaptive and specific to the human lineage. Genetic variants that are targets of natural selection are associated with clinical phenotypes common in patients with COVID-19.

15.
Genome Biol ; 20(1): 204, 2019 Oct 09.
Artigo em Inglês | MEDLINE | ID: mdl-31597575

RESUMO

Following publication of the original article [1], a typographical error in the formula for calculating di in the "Scans for local adaptation" subsection in the Method section, was identified. The correct formula should be.

16.
Genome Biol ; 20(1): 82, 2019 04 26.
Artigo em Inglês | MEDLINE | ID: mdl-31023338

RESUMO

BACKGROUND: Africa is the origin of modern humans within the past 300 thousand years. To infer the complex demographic history of African populations and adaptation to diverse environments, we sequenced the genomes of 92 individuals from 44 indigenous African populations. RESULTS: Genetic structure analyses indicate that among Africans, genetic ancestry is largely partitioned by geography and language, though we observe mixed ancestry in many individuals, consistent with both short- and long-range migration events followed by admixture. Phylogenetic analysis indicates that the San genetic lineage is basal to all modern human lineages. The San and Niger-Congo, Afroasiatic, and Nilo-Saharan lineages were substantially diverged by 160 kya (thousand years ago). In contrast, the San and Central African rainforest hunter-gatherer (CRHG), Hadza hunter-gatherer, and Sandawe hunter-gatherer lineages were diverged by ~ 120-100 kya. Niger-Congo, Nilo-Saharan, and Afroasiatic lineages diverged more recently by ~ 54-16 kya. Eastern and western CRHG lineages diverged by ~ 50-31 kya, and the western CRHG lineages diverged by ~ 18-12 kya. The San and CRHG populations maintained the largest effective population size compared to other populations prior to 60 kya. Further, we observed signatures of positive selection at genes involved in muscle development, bone synthesis, reproduction, immune function, energy metabolism, and cell signaling, which may contribute to local adaptation of African populations. CONCLUSIONS: We observe high levels of genomic variation between ethnically diverse Africans which is largely correlated with geography and language. Our study indicates ancient population substructure and local adaptation of Africans.


Assuntos
Adaptação Biológica , Evolução Biológica , População Negra/genética , Filogenia , Densidade Demográfica , África , Genoma Humano , Migração Humana , Humanos , Filogeografia
17.
Proc Natl Acad Sci U S A ; 116(10): 4166-4175, 2019 03 05.
Artigo em Inglês | MEDLINE | ID: mdl-30782801

RESUMO

Anatomically modern humans arose in Africa ∼300,000 years ago, but the demographic and adaptive histories of African populations are not well-characterized. Here, we have generated a genome-wide dataset from 840 Africans, residing in western, eastern, southern, and northern Africa, belonging to 50 ethnicities, and speaking languages belonging to four language families. In addition to agriculturalists and pastoralists, our study includes 16 populations that practice, or until recently have practiced, a hunting-gathering (HG) lifestyle. We observe that genetic structure in Africa is broadly correlated not only with geography, but to a lesser extent, with linguistic affiliation and subsistence strategy. Four East African HG (EHG) populations that are geographically distant from each other show evidence of common ancestry: the Hadza and Sandawe in Tanzania, who speak languages with clicks classified as Khoisan; the Dahalo in Kenya, whose language has remnant clicks; and the Sabue in Ethiopia, who speak an unclassified language. Additionally, we observed common ancestry between central African rainforest HGs and southern African San, the latter of whom speak languages with clicks classified as Khoisan. With the exception of the EHG, central African rainforest HGs, and San, other HG groups in Africa appear genetically similar to neighboring agriculturalist or pastoralist populations. We additionally demonstrate that infectious disease, immune response, and diet have played important roles in the adaptive landscape of African history. However, while the broad biological processes involved in recent human adaptation in Africa are often consistent across populations, the specific loci affected by selective pressures more often vary across populations.


Assuntos
População Negra/genética , Etnicidade/genética , Variação Genética , Genoma Humano , Idioma , Filogenia , Feminino , Humanos , Masculino
18.
Hum Mol Genet ; 25(11): 2324-2330, 2016 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-26936823

RESUMO

Leukocyte telomere length (LTL), which reflects telomere length in other somatic tissues, is a complex genetic trait. Eleven SNPs have been shown in genome-wide association studies to be associated with LTL at a genome-wide level of significance within cohorts of European ancestry. It has been observed that LTL is longer in African Americans than in Europeans. The underlying reason for this difference is unknown. Here we show that LTL is significantly longer in sub-Saharan Africans than in both Europeans and African Americans. Based on the 11 LTL-associated alleles and genetic data in phase 3 of the 1000 Genomes Project, we show that the shifts in allele frequency within Europe and between Europe and Africa do not fit the pattern expected by neutral genetic drift. Our findings suggest that differences in LTL within Europeans and between Europeans and Africans is influenced by polygenic adaptation and that differences in LTL between Europeans and Africans might explain, in part, ethnic differences in risks for human diseases that have been linked to LTL.


Assuntos
Leucócitos/citologia , Homeostase do Telômero/genética , Encurtamento do Telômero/genética , Telômero/genética , Adolescente , Adulto , Negro ou Afro-Americano/genética , Idoso , Idoso de 80 Anos ou mais , Alelos , População Negra/genética , Criança , Feminino , Deriva Genética , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único , População Branca/genética
19.
Nucleic Acids Res ; 44(D1): D908-16, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26567549

RESUMO

Mammalian gestation and pregnancy are fast evolving processes that involve the interaction of the fetal, maternal and paternal genomes. Version 1.0 of the GEneSTATION database (http://genestation.org) integrates diverse types of omics data across mammals to advance understanding of the genetic basis of gestation and pregnancy-associated phenotypes and to accelerate the translation of discoveries from model organisms to humans. GEneSTATION is built using tools from the Generic Model Organism Database project, including the biology-aware database CHADO, new tools for rapid data integration, and algorithms that streamline synthesis and user access. GEneSTATION contains curated life history information on pregnancy and reproduction from 23 high-quality mammalian genomes. For every human gene, GEneSTATION contains diverse evolutionary (e.g. gene age, population genetic and molecular evolutionary statistics), organismal (e.g. tissue-specific gene and protein expression, differential gene expression, disease phenotype), and molecular data types (e.g. Gene Ontology Annotation, protein interactions), as well as links to many general (e.g. Entrez, PubMed) and pregnancy disease-specific (e.g. PTBgene, dbPTB) databases. By facilitating the synthesis of diverse functional and evolutionary data in pregnancy-associated tissues and phenotypes and enabling their quick, intuitive, accurate and customized meta-analysis, GEneSTATION provides a novel platform for comprehensive investigation of the function and evolution of mammalian pregnancy.


Assuntos
Bases de Dados Genéticas , Evolução Molecular , Gravidez/genética , Animais , Gatos , Bovinos , Cães , Feminino , Expressão Gênica , Genômica , Cobaias , Humanos , Camundongos , Especificidade de Órgãos , Fenótipo , Gravidez/metabolismo , Complicações na Gravidez/genética , Complicações na Gravidez/metabolismo , Coelhos , Ratos , Reprodução/genética
20.
PLoS One ; 10(12): e0144155, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26641094

RESUMO

Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB), a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta, and differentially expressed in PTB clinical subtypes (23-34%) are fast evolving, and are associated with functions that include adhesion neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.


Assuntos
Regulação da Expressão Gênica , Doenças Genéticas Inatas/genética , Modelos Genéticos , Nascimento Prematuro/genética , Feminino , Doenças Genéticas Inatas/metabolismo , Humanos , Fenótipo , Placenta/metabolismo , Gravidez , Nascimento Prematuro/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA