Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 76
Filtrar
1.
Exp Cell Res ; 434(1): 113870, 2024 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-38049082

RESUMO

Previous studies have revealed that B cell activation is regulated by various microRNAs(miRNAs). However, the role of microRNA-130b regulating B cell activation and apoptosis is still unclear. In the present study, we first found that the expression of miR-130b was the lowest in Pro/Pre-B cells and the highest in immature B cells. Besides, the expression of miR-130b decreased after activation in B cells. Through the immuno-phenotypic analysis of miR-130b transgenic and knockout mice, we found that miR-130b mainly promoted the proliferation of B cells and inhibited B cell apoptosis. Furthermore, we identified that Cyld, a tumor suppressor gene was the target gene of miR-130b in B cells. Besides, the Cyld-mediated NF-κB signaling was increased in miR-130b overexpressed B cells, which further explains the enhanced proliferation of B cells. In conclusion, we propose that miR-130b promotes B cell proliferation via Cyld-mediated NF-κB signaling, which provides a new theoretical basis for the molecular regulation of B cell activation.


Assuntos
MicroRNAs , NF-kappa B , Animais , Camundongos , Apoptose/genética , Linhagem Celular Tumoral , Proliferação de Células/genética , Enzima Desubiquitinante CYLD/genética , Enzima Desubiquitinante CYLD/metabolismo , Regulação Neoplásica da Expressão Gênica , MicroRNAs/genética , MicroRNAs/metabolismo , NF-kappa B/genética , NF-kappa B/metabolismo , Transdução de Sinais/genética
2.
J Virol ; 97(6): e0068723, 2023 06 29.
Artigo em Inglês | MEDLINE | ID: mdl-37255478

RESUMO

Studies already revealed that some E3 ubiquitin ligases participated in the immune response after viral infection by regulating the type I interferon (IFN) pathway. Here, we demonstrated that type I interferon signaling enhanced the translocation of ETS1 to the nucleus and the promoter activity of E3 ubiquitin ligase DTX3L (deltex E3 ubiquitin ligase 3L) after virus infection and thus increased the expression of DTX3L. Further experiments suggested that DTX3L ubiquitinated TBK1 at K30 and K401 sites on K63-linked ubiquitination pathway. DTX3L was also necessary for mediating the phosphorylation of TBK1 through binding with the tyrosine kinase SRC: both together enhanced the activation of TBK1. Therefore, DTX3L, being an important positive-feedback regulator of type I interferon, exerted a key role in antiviral response. IMPORTANCE Our present study evaluated DTX3L as an antiviral molecule by promoting IFN production and establishing an IFN-ß-ETS1-DTX3L-TBK1 positive-feedback loop as a novel immunomodulatory step to enhance interferon signaling and inhibit respiratory syncytial virus (RSV) infection. Our finding enriches and complements the biological function of DTX3L and provides a new strategy to protect against lung diseases such as bronchiolitis and pneumonia that develop with RSV.


Assuntos
Imunidade Inata , Interferon Tipo I , Proteínas Serina-Treonina Quinases , Infecções por Vírus Respiratório Sincicial , Ubiquitina-Proteína Ligases , Interferon Tipo I/metabolismo , Fosforilação , Proteínas Serina-Treonina Quinases/metabolismo , Ubiquitina-Proteína Ligases/metabolismo , Ubiquitinação , Vírus Sinciciais Respiratórios , Infecções por Vírus Respiratório Sincicial/imunologia
3.
Brief Bioinform ; 23(2)2022 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-35039832

RESUMO

Cancer is an omics disease. The development in high-throughput profiling has fundamentally changed cancer research and clinical practice. Compared with clinical, demographic and environmental data, the analysis of omics data-which has higher dimensionality, weaker signals and more complex distributional properties-is much more challenging. Developments in the literature are often 'scattered', with individual studies focused on one or a few closely related methods. The goal of this review is to assist cancer researchers with limited statistical expertise in establishing the 'overall framework' of cancer omics data analysis. To facilitate understanding, we mainly focus on intuition, concepts and key steps, and refer readers to the original publications for mathematical details. This review broadly covers unsupervised and supervised analysis, as well as individual-gene-based, gene-set-based and gene-network-based analysis. We also briefly discuss 'special topics' including interaction analysis, multi-datasets analysis and multi-omics analysis.


Assuntos
Genômica , Neoplasias , Análise de Dados , Genômica/métodos , Humanos , Neoplasias/genética
4.
Biostatistics ; 23(2): 574-590, 2022 04 13.
Artigo em Inglês | MEDLINE | ID: mdl-33040145

RESUMO

In recent biomedical research, genome-wide association studies (GWAS) have demonstrated great success in investigating the genetic architecture of human diseases. For many complex diseases, multiple correlated traits have been collected. However, most of the existing GWAS are still limited because they analyze each trait separately without considering their correlations and suffer from a lack of sufficient information. Moreover, the high dimensionality of single nucleotide polymorphism (SNP) data still poses tremendous challenges to statistical methods, in both theoretical and practical aspects. In this article, we innovatively propose an integrative functional linear model for GWAS with multiple traits. This study is the first to approximate SNPs as functional objects in a joint model of multiple traits with penalization techniques. It effectively accommodates the high dimensionality of SNPs and correlations among multiple traits to facilitate information borrowing. Our extensive simulation studies demonstrate the satisfactory performance of the proposed method in the identification and estimation of disease-associated genetic variants, compared to four alternatives. The analysis of type 2 diabetes data leads to biologically meaningful findings with good prediction accuracy and selection stability.


Assuntos
Diabetes Mellitus Tipo 2 , Estudo de Associação Genômica Ampla , Diabetes Mellitus Tipo 2/genética , Estudo de Associação Genômica Ampla/métodos , Humanos , Modelos Lineares , Fenótipo , Polimorfismo de Nucleotídeo Único
5.
Brief Bioinform ; 22(3)2021 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-32793970

RESUMO

Gene expression data have played an essential role in many biomedical studies. When the number of genes is large and sample size is limited, there is a 'lack of information' problem, leading to low-quality findings. To tackle this problem, both horizontal and vertical data integrations have been developed, where vertical integration methods collectively analyze data on gene expressions as well as their regulators (such as mutations, DNA methylation and miRNAs). In this article, we conduct a selective review of vertical data integration methods for gene expression data. The reviewed methods cover both marginal and joint analysis and supervised and unsupervised analysis. The main goal is to provide a sketch of the vertical data integration paradigm without digging into too many technical details. We also briefly discuss potential pitfalls, directions for future developments and application notes.


Assuntos
Expressão Gênica , Análise por Conglomerados , Análise de Dados , Humanos , Aprendizado de Máquina não Supervisionado
6.
Bioinformatics ; 38(10): 2855-2862, 2022 05 13.
Artigo em Inglês | MEDLINE | ID: mdl-35561185

RESUMO

MOTIVATION: Cancer genetic heterogeneity analysis has critical implications for tumour classification, response to therapy and choice of biomarkers to guide personalized cancer medicine. However, existing heterogeneity analysis based solely on molecular profiling data usually suffers from a lack of information and has limited effectiveness. Many biomedical and life sciences databases have accumulated a substantial volume of meaningful biological information. They can provide additional information beyond molecular profiling data, yet pose challenges arising from potential noise and uncertainty. RESULTS: In this study, we aim to develop a more effective heterogeneity analysis method with the help of prior information. A network-based penalization technique is proposed to innovatively incorporate a multi-view of prior information from multiple databases, which accommodates heterogeneity attributed to both differential genes and gene relationships. To account for the fact that the prior information might not be fully credible, we propose a weighted strategy, where the weight is determined dependent on the data and can ensure that the present model is not excessively disturbed by incorrect information. Simulation and analysis of The Cancer Genome Atlas glioblastoma multiforme data demonstrate the practical applicability of the proposed method. AVAILABILITY AND IMPLEMENTATION: R code implementing the proposed method is available at https://github.com/mengyunwu2020/PECM. The data that support the findings in this paper are openly available in TCGA (The Cancer Genome Atlas) at https://portal.gdc.cancer.gov/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Glioblastoma , Software , Simulação por Computador , Genoma , Glioblastoma/genética , Humanos , Medicina de Precisão
7.
Biometrics ; 79(3): 1761-1774, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-36524727

RESUMO

Genetic interactions play an important role in the progression of complex diseases, providing explanation of variations in disease phenotype missed by main genetic effects. Comparatively, there are fewer studies on survival time, given its challenging characteristics such as censoring. In recent biomedical research, two-level analysis of both genes and their involved pathways has received much attention and been demonstrated as more effective than single-level analysis. However, such analysis is usually limited to main effects. Pathways are not isolated, and their interactions have also been suggested to have important contributions to the prognosis of complex diseases. In this paper, we develop a novel two-level Bayesian interaction analysis approach for survival data. This approach is the first to conduct the analysis of lower-level gene-gene interactions and higher-level pathway-pathway interactions simultaneously. Significantly advancing from the existing Bayesian studies based on the Markov Chain Monte Carlo (MCMC) technique, we propose a variational inference framework based on the accelerated failure time model with effective priors to accommodate two-level selection as well as censoring. Its computational efficiency is much desirable for high-dimensional interaction analysis. We examine performance of the proposed approach using extensive simulation. The application to TCGA melanoma and lung adenocarcinoma data leads to biologically sensible findings with satisfactory prediction accuracy and selection stability.


Assuntos
Teorema de Bayes , Simulação por Computador , Fenótipo , Cadeias de Markov , Método de Monte Carlo
8.
Biometrics ; 79(4): 3359-3373, 2023 12.
Artigo em Inglês | MEDLINE | ID: mdl-37098961

RESUMO

Genome-wide association studies (GWAS) have led to great successes in identifying genotype-phenotype associations for complex human diseases. In such studies, the high dimensionality of single nucleotide polymorphisms (SNPs) often makes analysis difficult. Functional analysis, which interprets SNPs densely distributed in a chromosomal region as a continuous process rather than discrete observations, has emerged as a promising avenue for overcoming the high dimensionality challenges. However, the majority of the existing functional studies continue to be individual SNP based and are unable to sufficiently account for the intricate underpinning structures of SNP data. SNPs are often found in groups (e.g., genes or pathways) and have a natural group structure. Additionally, these SNP groups can be highly correlated with coordinated biological functions and interact in a network. Motivated by these unique characteristics of SNP data, we develop a novel bi-level structured functional analysis method and investigate disease-associated genetic variants at the SNP level and SNP group level simultaneously. The penalization technique is adopted for bi-level selection and also to accommodate the group-level network structure. Both the estimation and selection consistency properties are rigorously established. The superiority of the proposed method over alternatives is shown through extensive simulation studies. A type 2 diabetes SNP data application yields some biologically intriguing results.


Assuntos
Diabetes Mellitus Tipo 2 , Estudo de Associação Genômica Ampla , Humanos , Estudo de Associação Genômica Ampla/métodos , Diabetes Mellitus Tipo 2/genética , Estudos de Associação Genética , Simulação por Computador , Polimorfismo de Nucleotídeo Único
9.
Eur J Neurol ; 30(2): 443-452, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36286605

RESUMO

BACKGROUND AND PURPOSE: The aim was to evaluate the potential of retinal nerve fiber layer thickness (RNFLT) measured with optical coherence tomography in predicting disease progression in relapsing-remitting multiple sclerosis (RRMS). METHODS: Analyses were conducted post hoc of this 24-month, phase III, double-blind study, in which RRMS patients were randomized (1:1:1) to once daily oral fingolimod 0.5 mg, 1.25 mg or placebo. The key outcomes were the association between baseline RNFLT and baseline clinical characteristics and clinical/imaging outcomes up to 24 months. Change of RNFLT with fingolimod versus placebo within 24 months and time to retinal nerve fiber layer (RNFL) thinning were evaluated. RESULTS: Altogether 885 patients were included. At baseline, lower RNFLT was correlated with higher Expanded Disability Status Scale score (r = -1.085, p = 0.018), lower brain volume (r = 0.025, p = 0.006) and deep gray matter volume (r = 0.731, p < 0.0001), worse visual acuity (r = -19.846, p < 0.0001) and longer duration since diagnosis (r = -0.258, p = 0.018). At month 12, low baseline RNFLT (<86 µm) versus high baseline RNFLT (≥99 µm) was associated with a greater brain volume loss (percentage change -0.605% vs. -0.315%, p = 0.035) in patients without optic neuritis history. At month 24, low baseline RNFLT versus high baseline RNFLT was associated with a higher number of new or newly enlarged T2 lesions (mean number 4.0 vs. 2.8, p = 0.014) and a higher risk of subsequent RNFL thinning (hazard ratio 2.55; 95% confidence interval 1.84-3.53; p < 0.001). The atrophy of the RNFL in the inferior quadrant was alleviated with fingolimod 0.5 mg versus placebo at month 24 (Δ(least squares mean) = 1.8, p = 0.047). CONCLUSION: Retinal nerve fiber layer thickness could predict disease progression in RRMS. TRIAL REGISTRATION: Clinicaltrials.gov identifier: NCT00355134, https://clinicaltrials.gov/ct2/show/NCT00355134.


Assuntos
Esclerose Múltipla Recidivante-Remitente , Esclerose Múltipla , Humanos , Esclerose Múltipla/complicações , Cloridrato de Fingolimode/uso terapêutico , Fibras Nervosas/patologia , Retina/diagnóstico por imagem , Retina/patologia , Esclerose Múltipla Recidivante-Remitente/diagnóstico por imagem , Esclerose Múltipla Recidivante-Remitente/tratamento farmacológico , Progressão da Doença , Tomografia de Coerência Óptica/métodos
10.
J Dairy Sci ; 106(6): 3856-3867, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37164860

RESUMO

Melamine (MEL), enrofloxacin (ENR), sulfamethazine (SMZ), tetracycline (TC), and aflatoxin M1 (AFM1) are the main chemical contaminants in milk. It is necessary to detect these miscellaneous chemical contaminants in milk synchronously to ensure the safety of the milk. In this study, a multiple lateral flow immunoassay (LFIA) was developed for the detection of MEL, ENR, SMZ, TC, and AFM1 in milk. Under optimal experimental conditions, the cutoff values were 25 ng/mL for MEL, 1 ng/mL for ENR, 2.5 ng/mL for SMZ, 2.5 ng/mL for TC, and 0.25 ng/mL for AFM1 in milk samples. The limits of detection of LFIA were 0.173 ng/mL for MEL, 0.078 ng/mL for ENR, 0.059 ng/mL for SMZ, 0.082 ng/mL for TC, and 0.0064 ng/mL for AFM1. The recovery rates of LFIA in milk were 83.2-104.4% for MEL, 76.5-127.3% for ENR, 96.8-113.5% for SMZ, 107.1-166.6% for TC, and 93.5-130.3% for AFM1. The coefficients of variation were all less than 15%. As a whole, the developed multiple lateral flow immunoassay showed potential as a highly reliable and excellent tool for the rapid and sensitive screening of MEL, ENR, SMZ, TC, and AFM1 in milk.


Assuntos
Leite , Sulfametazina , Animais , Leite/química , Imunoensaio/veterinária , Sulfametazina/análise , Antibacterianos , Enrofloxacina , Tetraciclina , Aflatoxina M1/análise , Contaminação de Alimentos/análise
11.
Stat Sin ; 33(2): 729-758, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38037567

RESUMO

This study has been motivated by cancer research, in which heterogeneity analysis plays an important role and can be roughly classified as unsupervised or supervised. In supervised heterogeneity analysis, the finite mixture of regression (FMR) technique is used extensively, under which the covariates affect the response differently in subgroups. High-dimensional molecular and, very recently, histopathological imaging features have been analyzed separately and shown to be effective for heterogeneity analysis. For simpler analysis, they have been shown to contain overlapping, but also independent information. In this article, our goal is to conduct the first and more effective FMR-based cancer heterogeneity analysis by integrating high-dimensional molecular and histopathological imaging features. A penalization approach is developed to regularize estimation, select relevant variables, and, equally importantly, promote the identification of independent information. Consistency properties are rigorously established. An effective computational algorithm is developed. A simulation and an analysis of The Cancer Genome Atlas (TCGA) lung cancer data demonstrate the practical effectiveness of the proposed approach. Overall, this study provides a practical and useful new way of conducting supervised cancer heterogeneity analysis.

12.
Bioinformatics ; 37(20): 3691-3692, 2021 Oct 25.
Artigo em Inglês | MEDLINE | ID: mdl-33961050

RESUMO

SUMMARY: For understanding complex diseases, gene-environment (G-E) interactions have important implications beyond main G and E effects. Most of the existing analysis approaches and software packages cannot accommodate data contamination/long-tailed distribution. We develop GEInter, a comprehensive R package tailored to robust G-E interaction analysis. For both marginal and joint analysis, for data without and with missingness, for continuous and censored survival responses, it comprehensively conducts identification, estimation, visualization and prediction. It can fill an important gap in the existing literature and enjoy broad applicability. AVAILABILITY AND IMPLEMENTATION: TCGA data is analyzed as demonstrating examples. It is well known that such data is publicly available https://cran.r-project.org/web/packages/GEInter/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

13.
Biometrics ; 78(4): 1542-1554, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-34213006

RESUMO

Multiple types of molecular (genetic, genomic, epigenetic, etc.) measurements, environmental risk factors, and their interactions have been found to contribute to the outcomes and phenotypes of complex diseases. In each of the previous studies, only the interactions between one type of molecular measurement and environmental risk factors have been analyzed. In recent biomedical studies, multidimensional profiling, in which data from multiple types of molecular measurements are collected from the same subjects, is becoming popular. A myriad of recent studies have shown that collectively analyzing multiple types of molecular measurements is not only biologically sensible but also leads to improved estimation and prediction. In this study, we conduct an M-E interaction analysis, with M standing for multidimensional molecular measurements and E standing for environmental risk factors. This can accommodate multiple types of molecular measurements and sufficiently account for their overlapping as well as independent information. Extensive simulation shows that it outperforms several closely related alternatives. In the analysis of TCGA (The Cancer Genome Atlas) data on lung adenocarcinoma and cutaneous melanoma, we make some stable biological findings and achieve stable prediction.


Assuntos
Melanoma , Neoplasias Cutâneas , Humanos , Melanoma/genética , Neoplasias Cutâneas/genética , Interação Gene-Ambiente , Genômica , Simulação por Computador
14.
Stat Med ; 41(27): 5448-5462, 2022 11 30.
Artigo em Inglês | MEDLINE | ID: mdl-36117143

RESUMO

Cancer heterogeneity plays an important role in the understanding of tumor etiology, progression, and response to treatment. To accommodate heterogeneity, cancer subgroup analysis has been extensively conducted. However, most of the existing studies share the limitation that they cannot accommodate heavy-tailed or contaminated outcomes and also high dimensional covariates, both of which are not uncommon in biomedical research. In this study, we propose a robust subgroup identification approach based on M-estimators together with concave and pairwise fusion penalties, which advances from existing studies by effectively accommodating high-dimensional data containing some outliers. The penalties are applied on both latent heterogeneity factors and covariates, where the estimation is expected to achieve subgroup identification and variable selection simultaneously, with the number of subgroups being apriori unknown. We innovatively develop an algorithm based on parallel computing strategy, with a significant advantage of capable of processing large-scale data. The convergence property of the proposed algorithm, oracle property of the penalized M-estimators, and selection consistency of the proposed BIC criterion are carefully established. Simulation and analysis of TCGA breast cancer data demonstrate that the proposed approach is promising to efficiently identify underlying subgroups in high-dimensional data.


Assuntos
Algoritmos , Neoplasias , Humanos , Simulação por Computador , Neoplasias/genética
15.
Genet Epidemiol ; 44(2): 159-196, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31724772

RESUMO

Gene-environment (G-E) interaction analysis has been extensively conducted for complex diseases. In marginal analysis, the common practice is to conduct likelihood-based (and other "standard") estimation with each marginal model, and then select significant G-E interactions and main effects based on p values and multiple comparisons adjustment. One limitation of this approach is that the identification results often do not respect the "main effects, interactions" hierarchy, which has been stressed in recent G-E interaction analyses. There is some recent effort tackling this problem, however, with very complex formulations. Another limitation of the common practice is that it may not perform well when regularization is needed, for example, because of "non-normal" distributions. In this article, we propose a marginal penalization approach which adopts a novel penalty to directly tackle the aforementioned problems. The proposed approach has a framework more coherent with that of the recently developed joint analysis methods and an intuitive formulation, and can be effectively realized. In simulation, it outperforms the popular significance-based analysis and simple penalization-based alternatives. Promising findings are made in the analysis of a single-nucleotide polymorphism and a gene expression data.


Assuntos
Interação Gene-Ambiente , Modelos Genéticos , Simulação por Computador , Diabetes Mellitus/genética , Genoma Humano , Humanos , Melanoma/genética , Polimorfismo de Nucleotídeo Único/genética , Neoplasias Cutâneas/genética
16.
Brief Bioinform ; 20(2): 624-637, 2019 03 25.
Artigo em Inglês | MEDLINE | ID: mdl-29897421

RESUMO

For the risk, progression, and response to treatment of many complex diseases, it has been increasingly recognized that genetic interactions (including gene-gene and gene-environment interactions) play important roles beyond the main genetic and environmental effects. In practical genetic interaction analyses, model mis-specification and outliers/contaminations in response variables and covariates are not uncommon, and demand robust analysis methods. Compared with their nonrobust counterparts, robust genetic interaction analysis methods are significantly less popular but are gaining attention fast. In this article, we provide a comprehensive review of robust genetic interaction analysis methods, on their methodologies and applications, for both marginal and joint analysis, and for addressing model mis-specification as well as outliers/contaminations in response variables and covariates.


Assuntos
Epistasia Genética , Interação Gene-Ambiente , Humanos , Modelos Genéticos
17.
Stat Med ; 40(29): 6619-6633, 2021 12 20.
Artigo em Inglês | MEDLINE | ID: mdl-34542187

RESUMO

Increasing evidence has shown that gene-gene interactions have important effects in biological processes of human diseases. Due to the high dimensionality of genetic measurements, interaction analysis usually suffers from a lack of sufficient information and has unsatisfactory results. Biological network information has been massively accumulated, allowing researchers to identify biomarkers while taking a system perspective, conducting network selection (of functionally related biomarkers), and accommodating network structures. In main-effect-only analysis, network information has been incorporated. However, effort has been limited in interaction analysis. Recently, link networks that describe the relationships between genetic interactions have been demonstrated as effective for revealing multiscale hierarchical organizations in networks and providing interesting findings beyond node networks. In this study, we develop a novel structured Bayesian interaction analysis approach to effectively incorporate network information. This study is among the first to identify gene-gene interactions with the assistance of network selection, while simultaneously accommodating the underlying network structures of both main effects and interactions. It innovatively respects multiple hierarchies among main effects, interactions, and networks. The Bayesian technique is adopted, which may be more informative for estimation and prediction over some other techniques. An efficient variational Bayesian expectation-maximization algorithm is developed to explore the posterior distribution. Extensive simulation studies demonstrate the practical superiority of the proposed approach. The analysis of TCGA data on melanoma and lung cancer leads to biologically sensible findings with satisfactory prediction accuracy and selection stability.


Assuntos
Redes Reguladoras de Genes , Melanoma , Algoritmos , Teorema de Bayes , Simulação por Computador , Interpretação Estatística de Dados , Humanos , Melanoma/genética
18.
Bioinformatics ; 2019 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-31730176

RESUMO

SUMMARY: Multilayer omics profiling has become a major venue for understanding complex diseases. We develop NCutYX, an R package for clustering analysis of multilayer omics data. The package and methods jointly analyze multiple layers of omics measurements and effectively accommodate their regulations. They systematically conduct a series of analysis based on the normalized cut technique, including the clusterings of subjects and omics measurements and biclustering. The package can be valuable for its timely context, novel methods, and comprehensiveness. AVAILABILITY: https://cran.r-project.org/web/packages/NCutYX/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

19.
Biometrics ; 76(1): 23-35, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31424088

RESUMO

For the etiology, progression, and treatment of complex diseases, gene-environment (G-E) interactions have important implications beyond the main G and E effects. G-E interaction analysis can be more challenging with higher dimensionality and need for accommodating the "main effects, interactions" hierarchy. In recent literature, an array of novel methods, many of which are based on the penalization technique, have been developed. In most of these studies, however, the structures of G measurements, for example, the adjacency structure of single nucleotide polymorphisms (SNPs; attributable to their physical adjacency on the chromosomes) and the network structure of gene expressions (attributable to their coordinated biological functions and correlated measurements) have not been well accommodated. In this study, we develop structured G-E interaction analysis, where such structures are accommodated using penalization for both the main G effects and interactions. Penalization is also applied for regularized estimation and selection. The proposed structured interaction analysis can be effectively realized. It is shown to have consistency properties under high-dimensional settings. Simulations and analysis of GENEVA diabetes data with SNP measurements and TCGA melanoma data with gene expression measurements demonstrate its competitive practical performance.


Assuntos
Biometria/métodos , Interação Gene-Ambiente , Simulação por Computador , Bases de Dados Genéticas/estatística & dados numéricos , Diabetes Mellitus/etiologia , Diabetes Mellitus/genética , Redes Reguladoras de Genes , Humanos , Modelos Lineares , Melanoma/etiologia , Melanoma/genética , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Neoplasias Cutâneas/etiologia , Neoplasias Cutâneas/genética
20.
Genomics ; 111(5): 1115-1123, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-30009922

RESUMO

Gene-environment (G-E) interactions have important implications for the etiology and progression of many complex diseases. Compared to continuous markers and categorical disease status, prognosis has been less investigated, with the additional challenges brought by the unique characteristics of survival outcomes. Most of the existing G-E interaction approaches for prognosis data share the limitation that they cannot accommodate long-tailed or contaminated outcomes. In this study, for prognosis data, we develop a robust G-E interaction identification approach using the censored quantile partial correlation (CQPCorr) technique. The proposed approach is built on the quantile regression technique (and hence has a solid statistical basis), uses weights to easily accommodate censoring, and adopts partial correlation to identify important interactions while properly controlling for the main genetic and environmental effects. In simulation, it outperforms multiple competitors with more accurate identification. In the analysis of TCGA data on lung cancer and melanoma, biologically sensible findings different from using the alternatives are made.


Assuntos
Interação Gene-Ambiente , Neoplasias Pulmonares/genética , Melanoma/genética , Modelos Genéticos , Simulação por Computador , Humanos , Neoplasias Pulmonares/patologia , Melanoma/patologia , Prognóstico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA