Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 7.247
Filtrar
1.
Sci Rep ; 14(1): 18455, 2024 08 08.
Artículo en Inglés | MEDLINE | ID: mdl-39117746

RESUMEN

Although previous studies have reported that pre-mRNA splicing factors (SFs) are involved in the repair of DNA double-strand breaks (DSBs) via homologous recombination (HR), their exact role in promoting HR remains poorly understood. Here, we showed that SART1, an SF upregulated in several types of cancer, promotes DSB end resection, an essential first step of HR. The resection-promoting function of SART1 requires phosphorylation at threonine 430 and 695 by ATM/ATR. SART1 is recruited to DSB sites in a manner dependent on transcription and its RS domain. SART1 is epistatic with BRCA1, a major HR factor, in the promotion of resection, especially transcription-associated resection in the G2 phase. SART1 and BRCA1 accumulate at DSB sites in an interdependent manner, and epistatically counteract the resection blockade posed by 53BP1 and RIF1. Furthermore, chromosome analysis demonstrated that SART1 and BRCA1 epistatically suppressed genomic alterations caused by DSB misrepair in the G2 phase. Collectively, these results indicate that SART1 and BRCA1 cooperatively facilitate resection of DSBs arising in transcriptionally active genomic regions in the G2 phase, thereby promoting faithful repair by HR, and suppressing genome instability.


Asunto(s)
Proteína BRCA1 , Roturas del ADN de Doble Cadena , Reparación del ADN por Recombinación , Proteína BRCA1/metabolismo , Proteína BRCA1/genética , Humanos , Factores de Empalme Serina-Arginina/metabolismo , Factores de Empalme Serina-Arginina/genética , Proteínas de la Ataxia Telangiectasia Mutada/metabolismo , Proteínas de la Ataxia Telangiectasia Mutada/genética , Fosforilación , Proteína 1 de Unión al Supresor Tumoral P53/metabolismo , Proteína 1 de Unión al Supresor Tumoral P53/genética , Línea Celular Tumoral , Proteínas de Unión a Telómeros/metabolismo , Proteínas de Unión a Telómeros/genética , Epistasis Genética , Fase G2/genética
2.
Proc Natl Acad Sci U S A ; 121(34): e2314999121, 2024 Aug 20.
Artículo en Inglés | MEDLINE | ID: mdl-39133844

RESUMEN

Mutations in protein active sites can dramatically improve function. The active site, however, is densely packed and extremely sensitive to mutations. Therefore, some mutations may only be tolerated in combination with others in a phenomenon known as epistasis. Epistasis reduces the likelihood of obtaining improved functional variants and dramatically slows natural and lab evolutionary processes. Research has shed light on the molecular origins of epistasis and its role in shaping evolutionary trajectories and outcomes. In addition, sequence- and AI-based strategies that infer epistatic relationships from mutational patterns in natural or experimental evolution data have been used to design functional protein variants. In recent years, combinations of such approaches and atomistic design calculations have successfully predicted highly functional combinatorial mutations in active sites. These were used to design thousands of functional active-site variants, demonstrating that, while our understanding of epistasis remains incomplete, some of the determinants that are critical for accurate design are now sufficiently understood. We conclude that the space of active-site variants that has been explored by evolution may be expanded dramatically to enhance natural activities or discover new ones. Furthermore, design opens the way to systematically exploring sequence and structure space and mutational impacts on function, deepening our understanding and control over protein activity.


Asunto(s)
Epistasis Genética , Mutación , Evolución Molecular , Proteínas/genética , Proteínas/química , Proteínas/metabolismo , Dominio Catalítico , Ingeniería de Proteínas/métodos
3.
Sci Rep ; 14(1): 18169, 2024 08 06.
Artículo en Inglés | MEDLINE | ID: mdl-39107519

RESUMEN

Epistasis is one of important genetic components for a quantitative trait in plant. Eshed and Zamir found negative epistatic interactions of quantitative trait loci in Tomato first. We detected that positive (negative) QTLs generated mostly negative (positive) epistatic interactions on heading date in rice, and then proposed the hypothese that QTL epistasis plays a role of homeostasis in one of our recent papers. In order to further provide additional evidence, the effects of QTLs and their epistatic effects on two quantitative traits of plant height (ph) and thousand kernel weight (tkw) were analyzed in this study. The same regularity was verified again. We detected that positive ph QTLs and negative tkw QTLs always generated reverse epistatic effects, respectively. Moreover, high-order epistatic effects were estimated on these two traits. The sum of all epistatic effects would partially neutralize the additive of constitutive QTL effects. This feature of epistsis would be the mechanism for bionts to maintain homeostasis while the obstacle for human to achieve the pyramiding breeding objectives. More evidences are still being collected to support our assumption.


Asunto(s)
Epistasis Genética , Sitios de Carácter Cuantitativo , Oryza/genética , Fenotipo , Modelos Genéticos
4.
Eur J Med Res ; 29(1): 404, 2024 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-39095899

RESUMEN

The supervised machine learning method is often used for biomedical relationship extraction. The disadvantage is that it requires much time and money to manually establish an annotated dataset. Based on distant supervision, the knowledge base is combined with the corpus, thus, the training corpus can be automatically annotated. As many biomedical databases provide knowledge bases for study with a limited number of annotated corpora, this method is practical in biomedicine. The clinical significance of each patient's genetic makeup can be understood based on the healthcare provider's genetic database. Unfortunately, the lack of previous biomedical relationship extraction studies focuses on gene-gene interaction. The main purpose of this study is to develop extraction methods for gene-gene interactions that can help explain the heritability of human complex diseases. This study referred to the information on gene-gene interactions in the KEGG PATHWAY database, the abstracts in PubMed were adopted to generate the training sample set, and the graph kernel method was adopted to extract gene-gene interactions. The best assessment result was an F1-score of 0.79. Our developed distant supervision method automatically finds sentences through the corpus without manual labeling for extracting gene-gene interactions, which can effectively reduce the time cost for manual annotation data; moreover, the relationship extraction method based on a graph kernel can be successfully applied to extract gene-gene interactions. In this way, the results of this study are expected to help achieve precision medicine.


Asunto(s)
Minería de Datos , Epistasis Genética , Minería de Datos/métodos , Humanos , Aprendizaje Automático , Bases de Datos Genéticas
5.
Bioinformatics ; 40(8)2024 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-39115383

RESUMEN

SUMMARY: Deep mutational scanning (DMS) experiments provide a powerful method to measure the functional effects of genetic mutations at massive scales. However, the data generated from these experiments can be difficult to analyze, with significant variation between experimental replicates. To overcome this challenge, we developed popDMS, a computational method based on population genetics theory, to infer the functional effects of mutations from DMS data. Through extensive tests, we found that the functional effects of single mutations and epistasis inferred by popDMS are highly consistent across replicates, comparing favorably with existing methods. Our approach is flexible and can be widely applied to DMS data that includes multiple time points, multiple replicates, and different experimental conditions. AVAILABILITY AND IMPLEMENTATION: popDMS is implemented in Python and Julia, and is freely available on GitHub at https://github.com/bartonlab/popDMS.


Asunto(s)
Mutación , Programas Informáticos , Epistasis Genética , Biología Computacional/métodos , Genética de Población/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis Mutacional de ADN/métodos , Humanos , Algoritmos
6.
Genome Res ; 34(7): 1081-1088, 2024 Aug 20.
Artículo en Inglés | MEDLINE | ID: mdl-39134411

RESUMEN

Studies of bacterial adaptation and evolution are hampered by the difficulty of measuring traits such as virulence, drug resistance, and transmissibility in large populations. In contrast, it is now feasible to obtain high-quality complete assemblies of many bacterial genomes thanks to scalable high-accuracy long-read sequencing technologies. To exploit this opportunity, we introduce a phenotype- and alignment-free method for discovering coselected and epistatically interacting genomic variation from genome assemblies covering both core and accessory parts of genomes. Our approach uses a compact colored de Bruijn graph to approximate the intragenome distances between pairs of loci for a collection of bacterial genomes to account for the impacts of linkage disequilibrium (LD). We demonstrate the versatility of our approach to efficiently identify associations between loci linked with drug resistance and adaptation to the hospital niche in the major human bacterial pathogens Streptococcus pneumoniae and Enterococcus faecalis.


Asunto(s)
Enterococcus faecalis , Epistasis Genética , Genoma Bacteriano , Streptococcus pneumoniae , Streptococcus pneumoniae/genética , Enterococcus faecalis/genética , Desequilibrio de Ligamiento , Humanos , Genómica/métodos
7.
Sci Rep ; 14(1): 18677, 2024 08 12.
Artículo en Inglés | MEDLINE | ID: mdl-39134575

RESUMEN

Single nucleotide polymorphism (SNP) interactions are the key to improving polygenic risk scores. Previous studies reported several significant SNP-SNP interaction pairs that shared a common SNP to form a cluster, but some identified pairs might be false positives. This study aims to identify factors associated with the cluster effect of false positivity and develop strategies to enhance the accuracy of SNP-SNP interactions. The results showed the cluster effect is a major cause of false-positive findings of SNP-SNP interactions. This cluster effect is due to high correlations between a causal pair and null pairs in a cluster. The clusters with a hub SNP with a significant main effect and a large minor allele frequency (MAF) tended to have a higher false-positive rate. In addition, peripheral null SNPs in a cluster with a small MAF tended to enhance false positivity. We also demonstrated that using the modified significance criterion based on the 3 p-value rules and the bootstrap approach (3pRule + bootstrap) can reduce false positivity and maintain high true positivity. In addition, our results also showed that a pair without a significant main effect tends to have weak or no interaction. This study identified the cluster effect and suggested using the 3pRule + bootstrap approach to enhance SNP-SNP interaction detection accuracy.


Asunto(s)
Herencia Multifactorial , Polimorfismo de Nucleótido Simple , Humanos , Herencia Multifactorial/genética , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo/métodos , Análisis por Conglomerados , Modelos Genéticos , Epistasis Genética
8.
Proc Natl Acad Sci U S A ; 121(32): e2400439121, 2024 Aug 06.
Artículo en Inglés | MEDLINE | ID: mdl-39074291

RESUMEN

Protein engineering often targets binding pockets or active sites which are enriched in epistasis-nonadditive interactions between amino acid substitutions-and where the combined effects of multiple single substitutions are difficult to predict. Few existing sequence-fitness datasets capture epistasis at large scale, especially for enzyme catalysis, limiting the development and assessment of model-guided enzyme engineering approaches. We present here a combinatorially complete, 160,000-variant fitness landscape across four residues in the active site of an enzyme. Assaying the native reaction of a thermostable ß-subunit of tryptophan synthase (TrpB) in a nonnative environment yielded a landscape characterized by significant epistasis and many local optima. These effects prevent simulated directed evolution approaches from efficiently reaching the global optimum. There is nonetheless wide variability in the effectiveness of different directed evolution approaches, which together provide experimental benchmarks for computational and machine learning workflows. The most-fit TrpB variants contain a substitution that is nearly absent in natural TrpB sequences-a result that conservation-based predictions would not capture. Thus, although fitness prediction using evolutionary data can enrich in more-active variants, these approaches struggle to identify and differentiate among the most-active variants, even for this near-native function. Overall, this work presents a large-scale testing ground for model-guided enzyme engineering and suggests that efficient navigation of epistatic fitness landscapes can be improved by advances in both machine learning and physical modeling.


Asunto(s)
Dominio Catalítico , Epistasis Genética , Triptófano Sintasa , Dominio Catalítico/genética , Triptófano Sintasa/genética , Triptófano Sintasa/metabolismo , Triptófano Sintasa/química , Ingeniería de Proteínas/métodos , Sustitución de Aminoácidos , Modelos Moleculares
9.
Gene ; 928: 148765, 2024 Nov 30.
Artículo en Inglés | MEDLINE | ID: mdl-39019098

RESUMEN

Tobacco is an economically significant industrial crop and model plant for genetic research, yet little is known about its genetic architecture. Quantitative trait loci (QTL) analysis was performed for six agronomic traits on an F_7 population of 341 genotypes, parents, and F1 plants using 1974 SSR markers across two environments. 31 QTLs contributing single-locus additive effects on 13 linkage groups (LGs) and 6 QTL pairs contributing epistatic effects on 6 LGs, were detected by the QTLNetwork 2.0 which was developed for the mixed-linear-model-based composite interval mapping (MCIM). Notably, 5 QTLs and 1 epistatic QTL pair were found to have pleiotropic effects on some genetically related traits. Moreover, the Broad sense heritability of the detected QTLs ranged from 1.05% to 43.33%, while genotype-by-environment interaction heritability spanned from 27.09% to 56.25%. Based on the results of QTL mapping, the potential superior lines for all or specific environments were designed and evaluated. Five major QTLs were finely dissected based on the tobacco reference genome of K326, and 31 candidate genes were predicted. This study offered new insights into the complicated genetic architecture and QTL resources for efficient breeding design for genetic improvement of agronomic traits in tobacco.


Asunto(s)
Mapeo Cromosómico , Genotipo , Nicotiana , Sitios de Carácter Cuantitativo , Nicotiana/genética , Mapeo Cromosómico/métodos , Epistasis Genética , Fitomejoramiento/métodos , Ligamiento Genético , Fenotipo
10.
PLoS Pathog ; 20(7): e1012359, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38980912

RESUMEN

A strong genetic predictor of outcome following untreated HIV-1 infection is the carriage of specific alleles of human leukocyte antigens (HLAs) that present viral epitopes to T cells. Residual variation in outcome measures may be attributed, in part, to viral adaptation to HLA-restricted T cell responses. Variants of the endoplasmic reticulum aminopeptidases (ERAPs) influence the repertoire of T cell epitopes presented by HLA alleles as they trim pathogen-derived peptide precursors to optimal lengths for antigen presentation, along with other functions unrelated to antigen presentation. We investigated whether ERAP variants influence HLA-associated HIV-1 adaptation with demonstrable effects on overall HIV-1 disease outcome. Utilizing host and viral data of 249 West Australian individuals with HIV-1 subtype B infection, we identified a novel association between two linked ERAP2 single nucleotide polymorphisms (SNPs; rs2248374 and rs2549782) with plasma HIV RNA concentration (viral load) (P adjusted = 0.0024 for both SNPs). Greater HLA-associated HIV-1 adaptation in the HIV-1 Gag gene correlated significantly with higher viral load, lower CD4+ T cell count and proportion; P = 0.0103, P = 0.0061, P = 0.0061, respectively). When considered together, there was a significant interaction between the two ERAP2 SNPs and HLA-associated HIV-1 adaptation on viral load (P = 0.0111). In a comprehensive multivariate model, addition of ERAP2 haplotypes and HLA associated adaptation as an interaction term to known HLA and CCR5 determinants and demographic factors, increased the explanatory variance of population viral load from 17.67% to 45.1% in this dataset. These effects were not replicated in publicly available datasets with comparably sized cohorts, suggesting that any true global epistasis may be dependent on specific HLA-ERAP allelic combinations. Our data raises the possibility that ERAP2 variants may shape peptide repertoires presented to HLA class I-restricted T cells to modulate the degree of viral adaptation within individuals, in turn contributing to disease variability at the population level. Analyses of other populations and experimental studies, ideally with locally derived ERAP genotyping and HLA-specific viral adaptations are needed to elucidate this further.


Asunto(s)
Aminopeptidasas , Epistasis Genética , Infecciones por VIH , VIH-1 , Polimorfismo de Nucleótido Simple , Humanos , Aminopeptidasas/genética , Infecciones por VIH/inmunología , Infecciones por VIH/genética , Infecciones por VIH/virología , VIH-1/inmunología , VIH-1/genética , Australia , Masculino , Femenino , Antígenos HLA/genética , Carga Viral , Adulto , Persona de Mediana Edad
11.
Elife ; 122024 Jul 30.
Artículo en Inglés | MEDLINE | ID: mdl-39078397

RESUMEN

Many membrane proteins are prone to misfolding, which compromises their functional expression at the plasma membrane. This is particularly true for the mammalian gonadotropin-releasing hormone receptor GPCRs (GnRHR). We recently demonstrated that evolutionary GnRHR modifications appear to have coincided with adaptive changes in cotranslational folding efficiency. Though protein stability is known to shape evolution, it is unclear how cotranslational folding constraints modulate the synergistic, epistatic interactions between mutations. We therefore compared the pairwise interactions formed by mutations that disrupt the membrane topology (V276T) or tertiary structure (W107A) of GnRHR. Using deep mutational scanning, we evaluated how the plasma membrane expression of these variants is modified by hundreds of secondary mutations. An analysis of 251 mutants in three genetic backgrounds reveals that V276T and W107A form distinct epistatic interactions that depend on both the severity and the mechanism of destabilization. V276T forms predominantly negative epistatic interactions with destabilizing mutations in soluble loops. In contrast, W107A forms positive interactions with mutations in both loops and transmembrane domains that reflect the diminishing impacts of the destabilizing mutations in variants that are already unstable. These findings reveal how epistasis is remodeled by conformational defects in membrane proteins and in unstable proteins more generally.


Asunto(s)
Epistasis Genética , Proteínas de la Membrana , Pliegue de Proteína , Receptores LHRH , Receptores LHRH/genética , Receptores LHRH/metabolismo , Receptores LHRH/química , Humanos , Proteínas de la Membrana/genética , Proteínas de la Membrana/metabolismo , Proteínas de la Membrana/química , Mutación , Estabilidad Proteica , Membrana Celular/metabolismo
12.
New Phytol ; 243(5): 1681-1697, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39014537

RESUMEN

Vascular bundles transport water and photosynthate to all organs, and increased bundle number contributes to crop lodging resistance. However, the regulation of vascular bundle formation is poorly understood in the Arabidopsis stem. We report a novel semi-dominant mutant with high vascular activity, hva-d, showing increased vascular bundle number and enhanced cambium proliferation in the stem. The activation of a C2H2 zinc finger transcription factor, AT5G27880/HVA, is responsible for the hva-d phenotype. Genetic, biochemical, and fluorescent microscopic analyses were used to dissect the functions of HVA. HVA functions as a repressor and interacts with TOPLESS via the conserved Ethylene-responsive element binding factor-associated Amphiphilic Repression motif. In contrast to the HVA activation line, knockout of HVA function with a CRISPR-Cas9 approach or expression of HVA fused with an activation domain VP16 (HVA-VP16) resulted in fewer vascular bundles. Further, HVA directly regulates the expression of the auxin transport efflux facilitator PIN1, as a result affecting auxin accumulation. Genetics analysis demonstrated that PIN1 is epistatic to HVA in controlling bundle number. This research identifies HVA as a positive regulator of vascular initiation through negatively modulating auxin transport and sheds new light on the mechanism of bundle formation in the stem.


Asunto(s)
Proteínas de Arabidopsis , Arabidopsis , Regulación de la Expresión Génica de las Plantas , Ácidos Indolacéticos , Tallos de la Planta , Haz Vascular de Plantas , Arabidopsis/genética , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Proteínas de Arabidopsis/genética , Ácidos Indolacéticos/metabolismo , Transporte Biológico , Haz Vascular de Plantas/metabolismo , Tallos de la Planta/metabolismo , Mutación/genética , Proteínas Represoras/metabolismo , Proteínas Represoras/genética , Fenotipo , Proteínas de Transporte de Membrana/metabolismo , Proteínas de Transporte de Membrana/genética , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , Unión Proteica , Cámbium/metabolismo , Cámbium/genética , Epistasis Genética
13.
Eur Phys J E Soft Matter ; 47(7): 49, 2024 Jul 27.
Artículo en Inglés | MEDLINE | ID: mdl-39066883

RESUMEN

The process by which adaptive evolution preserves a population threatened with extinction due to environmental changes is known as evolutionary rescue. Several factors determine the fate of those populations, including demography and genetic factors, such as standing genetic variation, gene flow, availability of de novo mutations, and so on. Despite the extensive debate about evolutionary rescue in the current literature, a study about the role of epistasis and the topography of the fitness landscape on the fate of dwindling populations is missing. In the current work, we aim to fill this gap and study the influence of epistasis on the probability of extinction of populations. We present simulation results, and analytical approximations are derived. Counterintuitively, we show that the likelihood of extinction is smaller when the degree of epistasis is higher. The reason underneath is twofold: first, higher epistasis can promote mutations of more significant phenotypic effects, but also, the incongruence between the maps genotype-phenotype and phenotype-fitness turns the fitness landscape at low epistasis more rugged, thus curbing some of its advantages.


Asunto(s)
Epistasis Genética , Modelos Genéticos , Mutación , Aptitud Genética/genética , Evolución Biológica , Evolución Molecular , Fenotipo , Extinción Biológica
14.
Science ; 385(6704): eadm8762, 2024 Jul 05.
Artículo en Inglés | MEDLINE | ID: mdl-38963845

RESUMEN

Understanding how numerous quantitative trait loci (QTL) shape phenotypic variation is an important question in genetics. To address this, we established a permanent population of 18,421 (18K) rice lines with reduced population structure. We generated reference-level genome assemblies of the founders and genotyped all 18K-rice lines through whole-genome sequencing. Through high-resolution mapping, 96 high-quality candidate genes contributing to variation in 16 traits were identified, including OsMADS22 and OsFTL1 verified as causal genes for panicle number and heading date, respectively. We identified epistatic QTL pairs and constructed a genetic interaction network with 19 genes serving as hubs. Overall, 170 masking epistasis pairs were characterized, serving as an important factor contributing to genetic background effects across diverse varieties. The work provides a basis to guide grain yield and quality improvements in rice.


Asunto(s)
Epistasis Genética , Genoma de Planta , Oryza , Sitios de Carácter Cuantitativo , Oryza/genética , Secuenciación Completa del Genoma , Mapeo Cromosómico , Genes de Plantas , Genotipo , Redes Reguladoras de Genes , Fenotipo
15.
J Affect Disord ; 361: 97-103, 2024 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-38834091

RESUMEN

BACKGROUND: Multiple genes might interact to determine the age at onset of bipolar disorder. We investigated gene-gene interactions related to age at onset of bipolar disorder in the Korean population, using genome-wide association study (GWAS) data. METHODS: The study population consisted of 303 patients with bipolar disorder. First, the top 1000 significant single-nucleotide polymorphisms (SNPs) associated with age at onset of bipolar disorder were selected through single SNP analysis by simple linear regression. Subsequently, the QMDR method was used to find gene-gene interactions. RESULTS: The best 10 SNPs from simple regression were located in chromosome 1, 2, 3, 10, 11, 14, 19, and 21. Only five SNPs were found in several genes, such as FOXN3, KIAA1217, OPCML, CAMSAP2, and PTPRS. On QMDR analyses, five pairs of SNPs showed significant interactions with a CVC exceeding 1/5 in a two-locus model. The best interaction was found for the pair of rs60830549 and rs12952733 (CVC = 1/5, P < 1E-07). In three-locus models, four combinations of SNPs showed significant associations with age at onset, with a CVC of >1/5. The best three-locus combination was rs60830549, rs12952733, and rs12952733 (CVC = 2/5, P < 1E-6). The SNPs showing significant interactions were located in the KIAA1217, RBFOX3, SDK2, CYP19A1, NTM, SMYD3, and RBFOX1 genes. CONCLUSIONS: Our analysis confirmed genetic interactions influencing the age of onset for bipolar disorder and identified several potential candidate genes. Further exploration of the functions of these promising genes, which may have multiple roles within the neuronal network, is necessary.


Asunto(s)
Edad de Inicio , Trastorno Bipolar , Epistasis Genética , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Adulto , Femenino , Humanos , Masculino , Persona de Mediana Edad , Trastorno Bipolar/genética , Predisposición Genética a la Enfermedad , República de Corea , Factores de Empalme de ARN/genética , Pueblos del Este de Asia/genética
16.
Sci Rep ; 14(1): 14579, 2024 06 25.
Artículo en Inglés | MEDLINE | ID: mdl-38918413

RESUMEN

Understanding the genetic basis of complex diseases is one of the most important challenges in current precision medicine. To this end, Genome-Wide Association Studies aim to correlate Single Nucleotide Polymorphisms (SNPs) to the presence or absence of certain traits. However, these studies do not consider interactions between several SNPs, known as epistasis, which explain most genetic diseases. Analyzing SNP combinations to detect epistasis is a major computational task, due to the enormous search space. A possible solution is to employ deep learning strategies for genomic prediction, but the lack of explainability derived from the black-box nature of neural networks is a challenge yet to be addressed. Herein, a novel, flexible, portable, and scalable framework for network interpretation based on transformers is proposed to tackle any-order epistasis. The results on various epistasis scenarios show that the proposed framework outperforms state-of-the-art methods for explainability, while being scalable to large datasets and portable to various deep learning accelerators. The proposed framework is validated on three WTCCC datasets, identifying SNPs related to genes known in the literature that have direct relationships with the studied diseases.


Asunto(s)
Epistasis Genética , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Humanos , Estudio de Asociación del Genoma Completo/métodos , Aprendizaje Profundo , Redes Neurales de la Computación , Biología Computacional/métodos , Algoritmos
17.
Nat Commun ; 15(1): 5175, 2024 Jun 18.
Artículo en Inglés | MEDLINE | ID: mdl-38890325

RESUMEN

The receptor-binding site of influenza A virus hemagglutinin partially overlaps with major antigenic sites and constantly evolves. In this study, we observe that mutations G186D and D190N in the hemagglutinin receptor-binding site have coevolved in two recent human H3N2 clades. X-ray crystallography results show that these mutations coordinately drive the evolution of the hemagglutinin receptor binding mode. Epistasis between G186D and D190N is further demonstrated by glycan binding and thermostability analyses. Immunization and neutralization experiments using mouse and human samples indicate that the evolution of receptor binding mode is accompanied by a change in antigenicity. Besides, combinatorial mutagenesis reveals that G186D and D190N, along with other natural mutations in recent H3N2 strains, alter the compatibility with a common egg-adaptive mutation in seasonal influenza vaccines. Overall, our findings elucidate the role of epistasis in shaping the recent evolution of human H3N2 hemagglutinin and substantiate the high evolvability of its receptor-binding mode.


Asunto(s)
Epistasis Genética , Evolución Molecular , Glicoproteínas Hemaglutininas del Virus de la Influenza , Subtipo H3N2 del Virus de la Influenza A , Gripe Humana , Humanos , Subtipo H3N2 del Virus de la Influenza A/genética , Subtipo H3N2 del Virus de la Influenza A/metabolismo , Glicoproteínas Hemaglutininas del Virus de la Influenza/genética , Glicoproteínas Hemaglutininas del Virus de la Influenza/química , Glicoproteínas Hemaglutininas del Virus de la Influenza/metabolismo , Animales , Ratones , Sitios de Unión , Gripe Humana/virología , Mutación , Cristalografía por Rayos X , Vacunas contra la Influenza , Unión Proteica , Receptores Virales/metabolismo , Receptores Virales/genética , Receptores Virales/química , Femenino
18.
BMC Genomics ; 25(1): 640, 2024 Jun 27.
Artículo en Inglés | MEDLINE | ID: mdl-38937661

RESUMEN

BACKGROUND: Drought adaptation is critical to many tree species persisting under climate change, however our knowledge of the genetic basis for trees to adapt to drought is limited. This knowledge gap impedes our fundamental understanding of drought response and application to forest production and conservation. To improve our understanding of the genomic determinants, architecture, and trait constraints, we assembled a reference genome and detected ~ 6.5 M variants in 432 phenotyped individuals for the foundational tree Corymbia calophylla. RESULTS: We found 273 genomic variants determining traits with moderate heritability (h2SNP = 0.26-0.64). Significant variants were predominantly in gene regulatory elements distributed among several haplotype blocks across all chromosomes. Furthermore, traits were constrained by frequent epistatic and pleiotropic interactions. CONCLUSIONS: Our results on the genetic basis for drought traits in Corymbia calophylla have several implications for the ability to adapt to climate change: (1) drought related traits are controlled by complex genomic architectures with large haplotypes, epistatic, and pleiotropic interactions; (2) the most significant variants determining drought related traits occurred in regulatory regions; and (3) models incorporating epistatic interactions increase trait predictions. Our findings indicate that despite moderate heritability drought traits are likely constrained by complex genomic architecture potentially limiting trees response to climate change.


Asunto(s)
Sequías , Epistasis Genética , Genómica , Genoma de Planta , Haplotipos , Sitios de Carácter Cuantitativo , Fenotipo , Polimorfismo de Nucleótido Simple
19.
Proc Natl Acad Sci U S A ; 121(26): e2312335121, 2024 Jun 25.
Artículo en Inglés | MEDLINE | ID: mdl-38889151

RESUMEN

Predicting the effects of one or more mutations to the in vivo or in vitro properties of a wild-type protein is a major computational challenge, due to the presence of epistasis, that is, of interactions between amino acids in the sequence. We introduce a computationally efficient procedure to build minimal epistatic models to predict mutational effects by combining evolutionary (homologous sequence) and few mutational-scan data. Mutagenesis measurements guide the selection of links in a sparse graphical model, while the parameters on the nodes and the edges are inferred from sequence data. We show, on 10 mutational scans, that our pipeline exhibits performances comparable to state-of-the-art deep networks trained on many more data, while requiring much less parameters and being hence more interpretable. In particular, the identified interactions adapt to the wild-type protein and to the fitness or biochemical property experimentally measured, mostly focus on key functional sites, and are not necessarily related to structural contacts. Therefore, our method is able to extract information relevant for one mutational experiment from homologous sequence data reflecting the multitude of structural and functional constraints acting on proteins throughout evolution.


Asunto(s)
Mutación , Proteínas , Proteínas/genética , Proteínas/metabolismo , Proteínas/química , Epistasis Genética , Evolución Molecular , Biología Computacional/métodos
20.
Beijing Da Xue Xue Bao Yi Xue Ban ; 56(3): 384-389, 2024 Jun 18.
Artículo en Chino | MEDLINE | ID: mdl-38864121

RESUMEN

OBJECTIVE: To explore the association between polymorphisms of transforming growth factor-ß (TGF-ß) signaling pathway and non-syndromic cleft lip with or without cleft palate (NSCL/P) among Asian populations, while considering gene-gene interaction and gene-environment interaction. METHODS: A total of 1 038 Asian NSCL/P case-parent trios were ascertained from an international consortium, which conducted a genome-wide association study using a case-parent trio design to investigate the genes affec-ting risk to NSCL/P. After stringent quality control measures, 343 single nucleotide polymorphism (SNP) spanning across 10 pivotal genes in the TGF-ß signaling pathway were selected from the original genome-wide association study(GWAS) dataset for further analysis. The transmission disequilibrium test (TDT) was used to test for SNP effects. The conditional Logistic regression models were used to test for gene-gene interaction and gene-environment interaction. Environmental factors collected for the study included smoking during pregnancy, passive smoking during pregnancy, alcohol intake during pregnancy, and vitamin use during pregnancy. Due to the low rates of exposure to smoking during pregnancy and alcohol consumption during pregnancy (<3%), only the interaction between maternal smoking during pregnancy and multivitamin supplementation during pregnancy was analyzed. The threshold for statistical significance was rigorously set at P =1.46×10-4, applying Bonferroni correction to account for multiple testing. RESULTS: A total of 23 SNPs in 4 genes yielded nominal association with NSCL/P (P<0.05), but none of these associations was statistically significant after Bonferroni' s multiple test correction. However, there were 6 pairs of SNPs rs4939874 (SMAD2) and rs1864615 (TGFBR2), rs2796813 (TGFB2) and rs2132298 (TGFBR2), rs4147358 (SMAD3) and rs1346907 (TGFBR2), rs4939874 (SMAD2) and rs1019855 (TGFBR2), rs4939874 (SMAD2) and rs12490466 (TGFBR2), rs2009112 (TGFB2) and rs4075748 (TGFBR2) showed statistically significant SNP-SNP interaction (P<1.46×10-4). In contrast, the analysis of gene-environment interactions did not yield any significant results after being corrected by multiple testing. CONCLUSION: The comprehensive evaluation of SNP associations and interactions within the TGF-ß signaling pathway did not yield any direct associations with NSCL/P risk in Asian populations. However, the significant gene-gene interactions identified suggest that the genetic architecture influencing NSCL/P risk may involve interactions between genes within the TGF-ß signaling pathway. These findings underscore the necessity for further investigations to unravel these results and further explore the underlying biological mechanisms.


Asunto(s)
Labio Leporino , Fisura del Paladar , Interacción Gen-Ambiente , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Transducción de Señal , Factor de Crecimiento Transformador beta , Humanos , Fisura del Paladar/genética , Labio Leporino/genética , Transducción de Señal/genética , Factor de Crecimiento Transformador beta/genética , Factor de Crecimiento Transformador beta/metabolismo , Femenino , Pueblo Asiatico/genética , Embarazo , Masculino , Predisposición Genética a la Enfermedad , Proteína smad3/genética , Factores de Riesgo , Proteína Smad2/genética , Proteína Smad2/metabolismo , Epistasis Genética , Contaminación por Humo de Tabaco/efectos adversos , Consumo de Bebidas Alcohólicas/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...