RESUMO
Previous research suggests that group IIA-secreted phospholipase A2 (sPLA2-IIA) plays a role in and predicts lethal COVID-19 disease. The current study reanalyzed a longitudinal proteomic data set to determine the temporal relationship between levels of several members of a family of sPLA2 isoforms and the severity of COVID-19 in 214 ICU patients. The levels of six secreted PLA2 isoforms, sPLA2-IIA, sPLA2-V, sPLA2-X, sPLA2-IB, sPLA2-IIC, and sPLA2-XVI, increased over the first 7 ICU days in those who succumbed to the disease but attenuated over the same time period in survivors. In contrast, a reversed pattern in sPLA2-IID and sPLA2-XIIB levels over 7 days suggests a protective role of these two isoforms. Furthermore, decision tree models demonstrated that sPLA2-IIA outperformed top-ranked cytokines and chemokines as a predictor of patient outcome. Taken together, proteomic analysis revealed temporal sPLA2 patterns that reflect the critical roles of sPLA2 isoforms in severe COVID-19 disease.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , COVID-19/mortalidade , COVID-19/sangue , Feminino , Masculino , Pessoa de Meia-Idade , Idoso , Fosfolipases A2 Secretórias/sangue , Proteômica/métodos , Índice de Gravidade de Doença , Fosfolipases A2 do Grupo II/sangue , Adulto , Isoformas de Proteínas/sangue , Citocinas/sangueRESUMO
OBJECTIVE: Recent evidence suggests that the fimbriated end of the fallopian tube harbors the precursor cells for many high-grade ovarian cancers, opening the door for development of better screening methods that directly assess the fallopian tube in women at risk for malignancy. Previously we have shown that the karyometric signature is abnormal in the fallopian tube epithelium in women at hereditary risk of ovarian cancer. In this study, we sought to determine whether the karyometric signature in serous tubal intraepithelial carcinoma (STIC) is significantly different from normal, and whether an abnormal karyometric signature can be detected in histologically normal tubal epithelial cells adjacent to STIC lesions. METHODS: The karyometric signature was measured in epithelial cells from the proximal and fimbriated portion of the fallopian tube in fallopian tube specimens removed from women at: 1) average risk for ovarian cancer undergoing surgery for benign gynecologic indications (n = 37), 2) hereditary risk of ovarian cancer (germline BRCA alterations) undergoing risk-reducing surgery (n = 44), and 3) diagnosed with fimbrial STICs (n = 17). RESULTS: The karyometric signature in tubes with fimbrial STICs differed from that of tubes with benign histology. The degree of karyometric alteration increased with increasing proximity to fimbrial STICs, ranging from moderate in the proximal portion of the tube, to greatest in both normal appearing fimbrial cells near STICs as well as in fimbrial STIC lesions. CONCLUSION: These data demonstrate an abnormal karyometric signature in STICs that may extend beyond the STIC, potentially providing an opportunity for early detection of fallopian tube neoplasia.
Assuntos
Carcinoma in Situ , Neoplasias das Tubas Uterinas , Tubas Uterinas , Humanos , Feminino , Neoplasias das Tubas Uterinas/patologia , Neoplasias das Tubas Uterinas/genética , Carcinoma in Situ/patologia , Carcinoma in Situ/genética , Tubas Uterinas/patologia , Cistadenocarcinoma Seroso/patologia , Cistadenocarcinoma Seroso/genética , Pessoa de Meia-Idade , Adulto , Neoplasias Ovarianas/patologia , Neoplasias Ovarianas/genética , CariótipoRESUMO
The reactivation of quiescent cells to proliferate is fundamental to tissue repair and homeostasis in the body. Often referred to as the G0 state, quiescence is, however, not a uniform state but with graded depth. Shallow quiescent cells exhibit a higher tendency to revert to proliferation than deep quiescent cells, while deep quiescent cells are still fully reversible under physiological conditions, distinct from senescent cells. Cellular mechanisms underlying the control of quiescence depth and the connection between quiescence and senescence are poorly characterized, representing a missing link in our understanding of tissue homeostasis and regeneration. Here we measured transcriptome changes as rat embryonic fibroblasts moved from shallow to deep quiescence over time in the absence of growth signals. We found that lysosomal gene expression was significantly up-regulated in deep quiescence, and partially compensated for gradually reduced autophagy flux. Reducing lysosomal function drove cells progressively deeper into quiescence and eventually into a senescence-like irreversibly arrested state; increasing lysosomal function, by lowering oxidative stress, progressively pushed cells into shallower quiescence. That is, lysosomal function modulates graded quiescence depth between proliferation and senescence as a dimmer switch. Finally, we found that a gene-expression signature developed by comparing deep and shallow quiescence in fibroblasts can correctly classify a wide array of senescent and aging cell types in vitro and in vivo, suggesting that while quiescence is generally considered to protect cells from irreversible arrest of senescence, quiescence deepening likely represents a common transition path from cell proliferation to senescence, related to aging.
Assuntos
Proliferação de Células , Senescência Celular , Fibroblastos/citologia , Lisossomos/metabolismo , Animais , Divisão Celular , Fibroblastos/metabolismo , Expressão Gênica , Lisossomos/genética , Estresse Oxidativo , RatosRESUMO
BACKGROUND: In this era of data science-driven bioinformatics, machine learning research has focused on feature selection as users want more interpretation and post-hoc analyses for biomarker detection. However, when there are more features (i.e., transcripts) than samples (i.e., mice or human samples) in a study, it poses major statistical challenges in biomarker detection tasks as traditional statistical techniques are underpowered in high dimension. Second and third order interactions of these features pose a substantial combinatoric dimensional challenge. In computational biology, random forest (RF) classifiers are widely used due to their flexibility, powerful performance, their ability to rank features, and their robustness to the "P > > N" high-dimensional limitation that many matrix regression algorithms face. We propose binomialRF, a feature selection technique in RFs that provides an alternative interpretation for features using a correlated binomial distribution and scales efficiently to analyze multiway interactions. RESULTS: In both simulations and validation studies using datasets from the TCGA and UCI repositories, binomialRF showed computational gains (up to 5 to 300 times faster) while maintaining competitive variable precision and recall in identifying biomarkers' main effects and interactions. In two clinical studies, the binomialRF algorithm prioritizes previously-published relevant pathological molecular mechanisms (features) with high classification precision and recall using features alone, as well as with their statistical interactions alone. CONCLUSION: binomialRF extends upon previous methods for identifying interpretable features in RFs and brings them together under a correlated binomial distribution to create an efficient hypothesis testing algorithm that identifies biomarkers' main effects and interactions. Preliminary results in simulations demonstrate computational gains while retaining competitive model selection and classification accuracies. Future work will extend this framework to incorporate ontologies that provide pathway-level feature selection from gene expression input data.
Assuntos
Algoritmos , Biomarcadores/metabolismo , Biomarcadores Tumorais/metabolismo , Neoplasias da Mama/diagnóstico , Biologia Computacional/métodos , Feminino , Humanos , Neoplasias Renais/diagnósticoRESUMO
An amendment to this paper has been published and can be accessed via the original article.
RESUMO
Meta-analysis plays an important role in summarizing and synthesizing scientific evidence derived from multiple studies. With high-dimensional data, the incorporation of variable selection into meta-analysis improves model interpretation and prediction. Existing variable selection methods require direct access to raw data, which may not be available in practical situations. We propose a new approach, sparse meta-analysis (SMA), in which variable selection for meta-analysis is based solely on summary statistics and the effect sizes of each covariate are allowed to vary among studies. We show that the SMA enjoys the oracle property if the estimated covariance matrix of the parameter estimators from each study is available. We also show that our approach achieves selection consistency and estimation consistency even when summary statistics include only the variance estimators or no variance/covariance information at all. Simulation studies and applications to high-throughput genomics studies demonstrate the usefulness of our approach.
Assuntos
Interpretação Estatística de Dados , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Metanálise como Assunto , Modelos Estatísticos , Simulação por Computador , HumanosRESUMO
MOTIVATION: Understanding dynamic, patient-level transcriptomic response to therapy is an important step forward for precision medicine. However, conventional transcriptome analysis aims to discover cohort-level change, lacking the capacity to unveil patient-specific response to therapy. To address this gap, we previously developed two N-of-1-pathways methods, Wilcoxon and Mahalanobis distance, to detect unidirectionally responsive transcripts within a pathway using a pair of samples from a single subject. Yet, these methods cannot recognize bidirectionally (up and down) responsive pathways. Further, our previous approaches have not been assessed in presence of background noise and are not designed to identify differentially expressed mRNAs between two samples of a patient taken in different contexts (e.g. cancer vs non cancer), which we termed responsive transcripts (RTs). METHODS: We propose a new N-of-1-pathways method, k-Means Enrichment (kMEn), that detects bidirectionally responsive pathways, despite background noise, using a pair of transcriptomes from a single patient. kMEn identifies transcripts responsive to the stimulus through k-means clustering and then tests for an over-representation of the responsive genes within each pathway. The pathways identified by kMEn are mechanistically interpretable pathways significantly responding to a stimulus. RESULTS: In â¼9000 simulations varying six parameters, superior performance of kMEn over previous single-subject methods is evident by: (i) improved precision-recall at various levels of bidirectional response and (ii) lower rates of false positives (1-specificity) when more than 10% of genes in the genome are differentially expressed (background noise). In a clinical proof-of-concept, personal treatment-specific pathways identified by kMEn correlate with therapeutic response (p-value<0.01). CONCLUSION: Through improved single-subject transcriptome dynamics of bidirectionally-regulated signals, kMEn provides a novel approach to identify mechanism-level biomarkers.
Assuntos
Perfilação da Expressão Gênica , Medicina de Precisão , Transcriptoma , Análise por Conglomerados , Interpretação Estatística de Dados , Humanos , RNA MensageiroRESUMO
Time-varying coefficient Cox model has been widely studied and popularly used in survival data analysis due to its flexibility for modeling covariate effects. It is of great practical interest to accurately identify the structure of covariate effects in a time-varying coefficient Cox model, i.e. covariates with null effect, constant effect and truly time-varying effect, and estimate the corresponding regression coefficients. Combining the ideas of local polynomial smoothing and group nonnegative garrote, we develop a new penalization approach to achieve such goals. Our method is able to identify the underlying true model structure with probability tending to one and simultaneously estimate the time-varying coefficients consistently. The asymptotic normalities of the resulting estimators are also established. We demonstrate the performance of our method using simulations and an application to the primary biliary cirrhosis data.
RESUMO
In clinical studies with time-to-event as a primary endpoint, one main interest is to find the best treatment strategy to maximize patients' mean survival time. Due to patient's heterogeneity in response to treatments, great efforts have been devoted to developing optimal treatment regimes by integrating individuals' clinical and genetic information. A main challenge arises in the selection of important variables that can help to build reliable and interpretable optimal treatment regimes as the dimension of predictors may be high. In this paper, we propose a robust loss-based estimation framework that can be easily coupled with shrinkage penalties for both estimation of optimal treatment regimes and variable selection. The asymptotic properties of the proposed estimators are studied. Moreover, a model-free estimator of restricted mean survival time under the derived optimal treatment regime is developed, and its asymptotic property is studied. Simulations are conducted to assess the empirical performance of the proposed method for parameter estimation, variable selection, and optimal treatment decision. An application to an AIDS clinical trial data set is given to illustrate the method.
Assuntos
Bioestatística/métodos , Taxa de Sobrevida , Síndrome da Imunodeficiência Adquirida/tratamento farmacológico , Síndrome da Imunodeficiência Adquirida/mortalidade , Fármacos Anti-HIV/administração & dosagem , Ensaios Clínicos como Assunto/estatística & dados numéricos , Simulação por Computador , Humanos , Estimativa de Kaplan-Meier , Análise dos Mínimos Quadrados , Modelos Estatísticos , Medicina de Precisão/métodos , Medicina de Precisão/estatística & dados numéricos , Análise de Regressão , Estatísticas não ParamétricasRESUMO
The causal and interplay mechanisms of Single Nucleotide Polymorphisms (SNPs) associated with complex diseases (complex disease SNPs) investigated in genome-wide association studies (GWAS) at the transcriptional level (mRNA) are poorly understood despite recent advancements such as discoveries reported in the Encyclopedia of DNA Elements (ENCODE) and Genotype-Tissue Expression (GTex). Protein interaction network analyses have successfully improved our understanding of both single gene diseases (Mendelian diseases) and complex diseases. Whether the mRNAs downstream of complex disease genes are central or peripheral in the genetic information flow relating DNA to mRNA remains unclear and may be disease-specific. Using expression Quantitative Trait Loci (eQTL) that provide DNA to mRNA associations and network centrality metrics, we hypothesize that we can unveil the systems properties of information flow between SNPs and the transcriptomes of complex diseases. We compare different conditions such as naïve SNP assignments and stringent linkage disequilibrium (LD) free assignments for transcripts to remove confounders from LD. Additionally, we compare the results from eQTL networks between lymphoblastoid cell lines and liver tissue. Empirical permutation resampling (p<0.001) and theoretic Mann-Whitney U test (p<10(-30)) statistics indicate that mRNAs corresponding to complex disease SNPs via eQTL associations are likely to be regulated by a larger number of SNPs than expected. We name this novel property mRNA hubness in eQTL networks, and further term mRNAs with high hubness as master integrators. mRNA master integrators receive and coordinate the perturbation signals from large numbers of polymorphisms and respond to the personal genetic architecture integratively. This genetic signal integration contrasts with the mechanism underlying some Mendelian diseases, where a genetic polymorphism affecting a single protein hub produces a divergent signal that affects a large number of downstream proteins. Indeed, we verify that this property is independent of the hubness in protein networks for which these mRNAs are transcribed. Our findings provide novel insights into the pleiotropy of mRNAs targeted by complex disease polymorphisms and the architecture of the information flow between the genetic polymorphisms and transcriptomes of complex diseases.
Assuntos
Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , RNA Mensageiro/genética , HumanosRESUMO
We consider model selection and estimation for partial spline models and propose a new regularization method in the context of smoothing splines. The regularization method has a simple yet elegant form, consisting of roughness penalty on the nonparametric component and shrinkage penalty on the parametric components, which can achieve function smoothing and sparse estimation simultaneously. We establish the convergence rate and oracle properties of the estimator under weak regularity conditions. Remarkably, the estimated parametric components are sparse and efficient, and the nonparametric component can be estimated with the optimal rate. The procedure also has attractive computational properties. Using the representer theory of smoothing splines, we reformulate the objective function as a LASSO-type problem, enabling us to use the LARS algorithm to compute the solution path. We then extend the procedure to situations when the number of predictors increases with the sample size and investigate its asymptotic properties in that context. Finite-sample performance is illustrated by simulations.
RESUMO
In high-dimensional data analysis, it is of primary interest to reduce the data dimensionality without loss of information. Sufficient dimension reduction (SDR) arises in this context, and many successful SDR methods have been developed since the introduction of sliced inverse regression (SIR) [Li (1991) Journal of the American Statistical Association 86, 316-327]. Despite their fast progress, though, most existing methods target on regression problems with a continuous response. For binary classification problems, SIR suffers the limitation of estimating at most one direction since only two slices are available. In this article, we develop a new and flexible probability-enhanced SDR method for binary classification problems by using the weighted support vector machine (WSVM). The key idea is to slice the data based on conditional class probabilities of observations rather than their binary responses. We first show that the central subspace based on the conditional class probability is the same as that based on the binary response. This important result justifies the proposed slicing scheme from a theoretical perspective and assures no information loss. In practice, the true conditional class probability is generally not available, and the problem of probability estimation can be challenging for data with large-dimensional inputs. We observe that, in order to implement the new slicing scheme, one does not need exact probability values and the only required information is the relative order of probability values. Motivated by this fact, our new SDR procedure bypasses the probability estimation step and employs the WSVM to directly estimate the order of probability values, based on which the slicing is performed. The performance of the proposed probability-enhanced SDR scheme is evaluated by both simulated and real data examples.
Assuntos
Algoritmos , Biometria/métodos , Interpretação Estatística de Dados , Modelos Estatísticos , Reconhecimento Automatizado de Padrão/métodos , Simulação por Computador , Análise de RegressãoRESUMO
We leverage machine learning approaches to adapt nanopore sequencing basecallers for nucleotide modification detection. We first apply the incremental learning (IL) technique to improve the basecalling of modification-rich sequences, which are usually of high biological interest. With sequence backbones resolved, we further run anomaly detection (AD) on individual nucleotides to determine their modification status. By this means, our pipeline promises the single-molecule, single-nucleotide, and sequence context-free detection of modifications. We benchmark the pipeline using control oligos, further apply it in the basecalling of densely-modified yeast tRNAs and E.coli genomic DNAs, the cross-species detection of N6-methyladenosine (m6A) in mammalian mRNAs, and the simultaneous detection of N1-methyladenosine (m1A) and m6A in human mRNAs. Our IL-AD workflow is available at: https://github.com/wangziyuan66/IL-AD .
Assuntos
Adenosina , Escherichia coli , Aprendizado de Máquina , Sequenciamento por Nanoporos , RNA Mensageiro , RNA de Transferência , Sequenciamento por Nanoporos/métodos , Humanos , Adenosina/análogos & derivados , Adenosina/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA de Transferência/genética , Escherichia coli/genética , Saccharomyces cerevisiae/genética , AnimaisRESUMO
Accurately basecalling sequence backbones in the presence of nucleotide modifications remains a substantial challenge in nanopore sequencing bioinformatics. It has been extensively demonstrated that state-of-the-art basecallers are less compatible with modification-induced sequencing signals. A precise basecalling, on the other hand, serves as the prerequisite for virtually all the downstream analyses. Here, we report that basecallers exposed to diverse training modifications gain the generalizability to analyze novel modifications. With synthesized oligos as the model system, we precisely basecall various out-of-sample RNA modifications. From the representation learning perspective, we attribute this generalizability to basecaller representation space expanded by diverse training modifications. Taken together, we conclude increasing the training data diversity as a novel paradigm for building modification-tolerant nanopore sequencing basecallers.
RESUMO
The secreted phospholipase A2 (sPLA2) isoform, sPLA2-IIA, has been implicated in a variety of diseases and conditions, including bacteremia, cardiovascular disease, COVID-19, sepsis, adult respiratory distress syndrome, and certain cancers. Given its significant role in these conditions, understanding the regulatory mechanisms impacting its levels is crucial. Genome-wide association studies (GWAS) have identified several single nucleotide polymorphisms (SNPs), including rs11573156, that are associated with circulating levels of sPLA2-IIA. The work in the manuscript leveraged 4 publicly available datasets to investigate the mechanism by which rs11573156 influences sPLA2-IIA levels via bioinformatics and modeling analysis. Through genotype-tissue expression (GTEx), 234 expression quantitative trait loci (eQTLs) were identified for the gene that encodes for sPLA2-IIA, PLA2G2A. SNP2TFBS was used to ascertain the binding affinities between transcription factors (TFs) to both the reference and alternative alleles of identified eQTL SNPs. Subsequently, candidate TF-SNP interactions were cross-referenced with the ChIP-seq results in matched tissues from ENCODE. SP1-rs11573156 emerged as the significant TF-SNP pair in the liver. Further analysis revealed that the upregulation of PLA2G2A transcript levels through the rs11573156 variant was likely affected by tissue SP1 protein levels. Using an ordinary differential equation based on Michaelis-Menten kinetic assumptions, we modeled the dependence of PLA2G2A transcription on SP1 protein levels, incorporating the SNP influence. Collectively, our analysis strongly suggests that the difference in the binding dynamics of SP1 to different rs11573156 alleles may underlie the allele-specific PLA2G2A expression in different tissues, a mechanistic model that awaits future direct experimental validation. This mechanism likely contributes to the variation in circulating sPLA2-IIA protein levels in the human population, with implications for a wide range of human diseases.
RESUMO
Nucleotide modifications deviate nanopore sequencing readouts, therefore generating artifacts during the basecalling of sequence backbones. Here, we present an iterative approach to polish modification-disturbed basecalling results. We show such an approach is able to promote the basecalling accuracy of both artificially-synthesized and real-world molecules. With demonstrated efficacy and reliability, we exploit the approach to precisely basecall therapeutic RNAs consisting of artificial or natural modifications, as the basis for quantifying the purity and integrity of vaccine mRNAs which are transcribed in vitro , and for determining modification hotspots of novel therapeutic RNA interference (RNAi) molecules which are bioengineered (BioRNA) in vivo .
RESUMO
Conflicting clinical trial results on omega-3 highly unsaturated fatty acids (n-3 HUFA) have prompted uncertainty about their cardioprotective effects. While the VITAL trial found no overall cardiovascular benefit from n-3 HUFA supplementation, its substantial African American (AfAm) enrollment provided a unique opportunity to explore racial differences in response to n-3 HUFA supplementation. The current observational study aimed to simulate randomized clinical trial (RCT) conditions by matching 3766 AfAm and 15,553 non-Hispanic White (NHW) individuals from the VITAL trial utilizing propensity score matching to address the limitations related to differences in confounding variables between the two groups. Within matched groups (3766 AfAm and 3766 NHW), n-3 HUFA supplementation's impact on myocardial infarction (MI), stroke, and cardiovascular disease (CVD) mortality was assessed. A weighted decision tree analysis revealed belonging to the n-3 supplementation group as the most significant predictor of MI among AfAm but not NHW. Further logistic regression using the LASSO method and bootstrap estimation of standard errors indicated n-3 supplementation significantly lowered MI risk in AfAm (OR 0.17, 95% CI [0.048, 0.60]), with no such effect in NHW. This study underscores the critical need for future RCT to explore racial disparities in MI risk associated with n-3 HUFA supplementation and highlights potential causal differences between supplementation health outcomes in AfAm versus NHW populations.
Assuntos
Negro ou Afro-Americano , Suplementos Nutricionais , Ácidos Graxos Ômega-3 , Aprendizado de Máquina , Infarto do Miocárdio , Idoso , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Ácidos Graxos Ômega-3/administração & dosagem , Infarto do Miocárdio/prevenção & controle , Infarto do Miocárdio/etnologia , Pontuação de Propensão , Fatores de Risco , BrancosRESUMO
Learning predictive models in new domains with scarce training data is a growing challenge in modern supervised learning scenarios. This incentivizes developing domain adaptation methods that leverage the knowledge in known domains (source) and adapt to new domains (target) with a different probability distribution. This becomes more challenging when the source and target domains are in heterogeneous feature spaces, known as heterogeneous domain adaptation (HDA). While most HDA methods utilize mathematical optimization to map source and target data to a common space, they suffer from low transferability. Neural representations have proven to be more transferable; however, they are mainly designed for homogeneous environments. Drawing on the theory of domain adaptation, we propose a novel framework, Heterogeneous Adversarial Neural Domain Adaptation (HANDA), to effectively maximize the transferability in heterogeneous environments. HANDA conducts feature and distribution alignment in a unified neural network architecture and achieves domain invariance through adversarial kernel learning. Three experiments were conducted to evaluate the performance against the state-of-the-art HDA methods on major image and text e-commerce benchmarks. HANDA shows statistically significant improvement in predictive performance. The practical utility of HANDA was shown in real-world dark web online markets. HANDA is an important step towards successful domain adaptation in e-commerce applications.
RESUMO
We leverage machine learning approaches to adapt nanopore sequencing basecallers for nucleotide modification detection. We first apply the incremental learning technique to improve the basecalling of modification-rich sequences, which are usually of high biological interests. With sequence backbones resolved, we further run anomaly detection on individual nucleotides to determine their modification status. By this means, our pipeline promises the single-molecule, single-nucleotide and sequence context-free detection of modifications. We benchmark the pipeline using control oligos, further apply it in the basecalling of densely-modified yeast tRNAs and E.coli genomic DNAs, the cross-species detection of N6-methyladenosine (m6A) in mammalian mRNAs, and the simultaneous detection of N1-methyladenosine (m1A) and m6A in human mRNAs. Our IL-AD workflow is available at: https://github.com/wangziyuan66/IL-AD.
RESUMO
The secreted phospholipase A 2 (sPLA 2 ) isoform, sPLA 2 -IIA, has been implicated in a variety of diseases and conditions, including bacteremia, cardiovascular disease, COVID-19, sepsis, adult respiratory distress syndrome, and certain cancers. Given its significant role in these conditions, understanding the regulatory mechanisms impacting its levels is crucial. Genome-wide association studies (GWAS) have identified several single nucleotide polymorphisms (SNPs), including rs11573156, that are associated with circulating levels of sPLA 2 -IIA. Through Genotype-Tissue Expression (GTEx), 234 expression quantitative trait loci (eQTLs) were identified for the gene that encodes for sPLA 2 -IIA, PLA2G2A . SNP2TFBS ( https://ccg.epfl.ch/snp2tfbs/ ) was utilized to ascertain the binding affinities between transcription factors (TFs) to both the reference and alternative alleles of identified SNPs. Subsequently, ChIP-seq peaks highlighted the TF combinations that specifically bind to the SNP, rs11573156. SP1 emerged as a significant TF/SNP pair in liver cells, with rs11573156/SP1 interaction being most prominent in liver, prostate, ovary, and adipose tissues. Further analysis revealed that the upregulation of PLA2G2A transcript levels through the rs11573156 variant was affected by tissue SP1 protein levels. By leveraging an ordinary differential equation, structured upon Michaelis-Menten enzyme kinetics assumptions, we modeled the PLA2G2A transcription's dependence on SP1 protein levels, incorporating the SNP's influence. Collectively, these data strongly suggest that the binding affinity differences of SP1 for the different rs11573156 alleles can influence PLA2G2A expression. This, in turn, can modulate sPLA2-IIA levels, impacting a wide range of human diseases.