RESUMO
Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57-1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628-0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
Assuntos
Neoplasias da Mama/classificação , Neoplasias da Mama/genética , Predisposição Genética para Doença , Herança Multifatorial/genética , Adulto , Fatores Etários , Idoso , Idoso de 80 Anos ou mais , Neoplasias da Mama/diagnóstico , Neoplasias da Mama/prevenção & controle , Feminino , Humanos , Anamnese , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único/genética , Receptores de Estrogênio/metabolismo , Reprodutibilidade dos Testes , Medição de RiscoRESUMO
BACKGROUND: Cluster headache is a highly debilitating neurological disorder with considerable inter-ethnic differences. Genome-wide association studies (GWAS) recently identified replicable genomic loci for cluster headache in Europeans, but the genetic underpinnings for cluster headache in Asians remain unclear. The objective of this study is to investigate the genetic architecture and susceptibility loci of cluster headache in Han Chinese resided in Taiwan. METHODS: We conducted a two-stage genome-wide association study in a Taiwanese cohort enrolled from 2007 through 2022 to identify the genetic variants associated with cluster headache. Diagnosis of cluster headache was retrospectively ascertained with the criteria of International Classification of Headache Disorders, third edition. Control subjects were enrolled from the Taiwan Biobank. Genotyping was conducted with the Axiom Genome-Wide Array TWB chip, followed by whole genome imputation. A polygenic risk score was developed to differentiate patients from controls. Downstream analyses including gene-set and tissue enrichment, linkage disequilibrium score regression, and pathway analyses were performed. RESULTS: We enrolled 734 patients with cluster headache and 9,846 population-based controls. We identified three replicable loci, with the lead SNPs being rs1556780 in CAPN2 (odds ratio = 1.59, 95% CI 1.42â1.78, p = 7.61 × 10-16), rs10188640 in MERTK (odds ratio = 1.52, 95% CI 1.33â1.73, p = 8.58 × 10-13), and rs13028839 in STAB2 (odds ratio = 0.63, 95% CI 0.52â0.78, p = 2.81 × 10-8), with the latter two replicating the findings in European populations. Several previously reported genes also showed significant associations with cluster headache in our samples. Polygenic risk score differentiated patients from controls with an area under the receiver operating characteristic curve of 0.77. Downstream analyses implicated circadian regulation and immunological processes in the pathogenesis of cluster headache. CONCLUSIONS: This study revealed the genetic architecture and novel susceptible loci of cluster headache in Han Chinese residing in Taiwan. Our findings support the common genetic contributions of cluster headache across ethnicities and provide novel mechanistic insights into the pathogenesis of cluster headache.
Assuntos
Cefaleia Histamínica , Estudo de Associação Genômica Ampla , Humanos , Cefaleia Histamínica/genética , Predisposição Genética para Doença , Taiwan , Estudos Retrospectivos , Povo Asiático/genética , ChinaRESUMO
As the global burden of mental illness is estimated to become a severe issue in the near future, it demands the development of more effective treatments. Most psychiatric diseases are moderately to highly heritable and believed to involve many genes. Development of new treatment options demands more knowledge on the molecular basis of psychiatric diseases. Toward this end, we propose to develop new statistical methods with improved sensitivity and accuracy to identify disease-related genes specialized for psychiatric diseases. The qualitative psychiatric diagnoses such as case control often suffer from high rates of misdiagnosis and oversimplify the disease phenotypes. Our proposed method utilizes endophenotypes, the quantitative traits hypothesized to underlie disease syndromes, to better characterize the heterogeneous phenotypes of psychiatric diseases. We employ the structural equation modeling using the liability-index model to link multiple genetically regulated expressions from PrediXcan and the manifest variables including endophenotypes and case-control status. The proposed method can be considered as a general method for multivariate regression, which is particularly helpful for psychiatric diseases. We derive penalized retrospective likelihood estimators to deal with the typical small sample size issue. Simulation results demonstrate the advantages of the proposed method and the real data analysis of Alzheimer's disease illustrates the practical utility of the techniques. Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative database.
Assuntos
Doença de Alzheimer , Doença de Alzheimer/genética , Humanos , Análise de Classes Latentes , Neuroimagem , Fenótipo , Estudos RetrospectivosRESUMO
Melanoma heritability is among the highest for cancer and single nucleotide polymorphisms (SNPs) contribute to it. To date, only SNPs that reached statistical significance in genome-wide association studies or few candidate SNPs have been included in melanoma risk prediction models. We compared four approaches for building polygenic risk scores (PRS) using 12 874 melanoma cases and 23 203 controls from Melanoma Meta-Analysis Consortium as a training set, and newly genotyped 3102 cases and 2301 controls from the MelaNostrum consortium for validation. We estimated adjusted odds ratios (ORs) for melanoma risk using traditional melanoma risk factors and the PRS with the largest area under the receiver operator characteristics curve (AUC). We estimated absolute risks combining the PRS and other risk factors, with age- and sex-specific melanoma incidence and competing mortality rates from Italy as an example. The best PRS, including 204 SNPs (AUC = 64.4%; 95% confidence interval (CI) = 63-65.8%), developed using winner's curse estimate corrections, had a per-quintile OR = 1.35 (95% CI = 1.30-1.41), corresponding to a 3.33-fold increase comparing the 5th to the 1st PRS quintile. The AUC improvement by adding the PRS was up to 7%, depending on adjusted factors and country. The 20-year absolute risk estimates based on the PRS, nevus count and pigmentation characteristics for a 60-year-old Italian man ranged from 0.5 to 11.8% (relative risk = 26.34), indicating good separation.
Assuntos
Predisposição Genética para Doença , Melanoma/genética , Nevo/genética , Neoplasias Cutâneas/genética , Adulto , Idoso , Feminino , Estudo de Associação Genômica Ampla , Humanos , Itália , Masculino , Melanoma/epidemiologia , Melanoma/patologia , Pessoa de Meia-Idade , Herança Multifatorial/genética , Nevo/epidemiologia , Nevo/patologia , Polimorfismo de Nucleotídeo Único , Medição de Risco , Fatores de Risco , Neoplasias Cutâneas/epidemiologia , Neoplasias Cutâneas/patologia , Melanoma Maligno CutâneoRESUMO
We propose a model for high dimensional mediation analysis that includes latent variables. We describe our model in the context of an epidemiologic study for incident breast cancer with one exposure and a large number of biomarkers (i.e., potential mediators). We assume that the exposure directly influences a group of latent, or unmeasured, factors which are associated with both the outcome and a subset of the biomarkers. The biomarkers associated with the latent factors linking the exposure to the outcome are considered "mediators." We derive the likelihood for this model and develop an expectation-maximization algorithm to maximize an L1-penalized version of this likelihood to limit the number of factors and associated biomarkers. We show that the resulting estimates are consistent and that the estimates of the nonzero parameters have an asymptotically normal distribution. In simulations, procedures based on this new model can have significantly higher power for detecting the mediating biomarkers compared with the simpler approaches. We apply our method to a study that evaluates the relationship between body mass index, 481 metabolic measurements, and estrogen-receptor positive breast cancer.
Assuntos
Algoritmos , Neoplasias da Mama/epidemiologia , Funções Verossimilhança , Modelos Teóricos , Animais , Biomarcadores/análise , Simulação por Computador , Feminino , HumanosRESUMO
Implementing precision medicine for complex diseases such as chronic obstructive lung disease (COPD) will require extensive use of biomarkers and an in-depth understanding of how genetic, epigenetic, and environmental variations contribute to phenotypic diversity and disease progression. A meta-analysis from two large cohorts of current and former smokers with and without COPD [SPIROMICS (N = 750); COPDGene (N = 590)] was used to identify single nucleotide polymorphisms (SNPs) associated with measurement of 88 blood proteins (protein quantitative trait loci; pQTLs). PQTLs consistently replicated between the two cohorts. Features of pQTLs were compared to previously reported expression QTLs (eQTLs). Inference of causal relations of pQTL genotypes, biomarker measurements, and four clinical COPD phenotypes (airflow obstruction, emphysema, exacerbation history, and chronic bronchitis) were explored using conditional independence tests. We identified 527 highly significant (p < 8 X 10-10) pQTLs in 38 (43%) of blood proteins tested. Most pQTL SNPs were novel with low overlap to eQTL SNPs. The pQTL SNPs explained >10% of measured variation in 13 protein biomarkers, with a single SNP (rs7041; p = 10-392) explaining 71%-75% of the measured variation in vitamin D binding protein (gene = GC). Some of these pQTLs [e.g., pQTLs for VDBP, sRAGE (gene = AGER), surfactant protein D (gene = SFTPD), and TNFRSF10C] have been previously associated with COPD phenotypes. Most pQTLs were local (cis), but distant (trans) pQTL SNPs in the ABO blood group locus were the top pQTL SNPs for five proteins. The inclusion of pQTL SNPs improved the clinical predictive value for the established association of sRAGE and emphysema, and the explanation of variance (R2) for emphysema improved from 0.3 to 0.4 when the pQTL SNP was included in the model along with clinical covariates. Causal modeling provided insight into specific pQTL-disease relationships for airflow obstruction and emphysema. In conclusion, given the frequency of highly significant local pQTLs, the large amount of variance potentially explained by pQTL, and the differences observed between pQTLs and eQTLs SNPs, we recommend that protein biomarker-disease association studies take into account the potential effect of common local SNPs and that pQTLs be integrated along with eQTLs to uncover disease mechanisms. Large-scale blood biomarker studies would also benefit from close attention to the ABO blood group.
Assuntos
Biomarcadores/sangue , Proteínas Sanguíneas/genética , Enfisema/genética , Doença Pulmonar Obstrutiva Crônica/genética , Sistema ABO de Grupos Sanguíneos/genética , Enfisema/sangue , Enfisema/patologia , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Polimorfismo de Nucleotídeo Único , Doença Pulmonar Obstrutiva Crônica/sangue , Doença Pulmonar Obstrutiva Crônica/patologia , Locos de Características Quantitativas/genéticaRESUMO
A large number of cancer drugs have been developed to target particular genes/pathways that are crucial for cancer growth. Drugs that share a molecular target may also have some common predictive omic features, e.g., somatic mutations or gene expression. Therefore, it is desirable to analyze these drugs as a group to identify the associated omic features, which may provide biological insights into the underlying drug response. Furthermore, these omic features may be robust predictors for any drug sharing the same target. The high dimensionality and the strong correlations among the omic features are the main challenges of this task. Motivated by this problem, we develop a new method for high-dimensional bilevel feature selection using a group of response variables that may share a common set of predictors in addition to their individual predictors. Simulation results show that our method has a substantially higher sensitivity and specificity than existing methods. We apply our method to two large-scale drug sensitivity studies in cancer cell lines. Both within-study and between-study validation demonstrate the good efficacy of our method.
Assuntos
Antineoplásicos/farmacologia , Modelos Biológicos , Neoplasias/tratamento farmacológico , Linhagem Celular Tumoral , HumanosRESUMO
OBJECTIVE: The Cancer Genome Atlas (TCGA) identified four integrated clusters for endometrial cancer (EC): POLE, MSI, CNL and CNH. We evaluated differences in gene expression profiles of obese and non-obese women with EC and examined the association of body mass index (BMI) within the clusters identified in TCGA. METHODS: TCGA RNAseq data was used to identify genes related to increasing BMI among ECs. The POLE, MSI and CNL clusters were composed mostly of endometrioid EC. Patient BMI was compared between these three clusters with one-way ANOVA. Association between gene expression and BMI was also assessed while adjusting for confounding effects of potential confounding factors. p-Values testing the association between gene expression and BMI were adjusted for multiple hypothesis testing over the 20,531 genes considered. RESULTS: Mean BMI was statistically different between the ECs in the CNL (35.8) versus POLE (29.8) cluster (p=0.006) and approached significance for the MSI (33.0) versus CNL (35.8) cluster (p=0.05). 181 genes were significantly up- or down-regulated with increasing BMI in endometrioid EC (q-value<0.01), including LPL, IRS-1, IGFBP4, IGFBP7 and the progesterone receptor. DAVID functional annotation analysis revealed significant enrichment in "cell cycle" (adjusted p-value=1.5E-5) and "DNA metabolic processes" (adjusted p-value=1E-3) for the identified genes. CONCLUSIONS: Obesity related genes were found to be upregulated with increasing BMI among endometrioid ECs. Patients with POLE tumors have the lowest median BMI when compared to MSI and CNL. Given the heterogeneity among endometrioid EC, consideration should be given to abandoning the Type I and II classification of EC tumors.
Assuntos
Índice de Massa Corporal , Carcinoma Endometrioide/genética , Neoplasias do Endométrio/genética , Obesidade/genética , Carcinoma Endometrioide/metabolismo , Bases de Dados Genéticas , Neoplasias do Endométrio/metabolismo , Feminino , Regulação Neoplásica da Expressão Gênica , Genômica/métodos , Humanos , Pessoa de Meia-Idade , Obesidade/metabolismo , Transcriptoma , Regulação para CimaRESUMO
The contributing genetic factors of vertigo remain poorly characterized, particularly in individuals of non-European ancestries. Here we show the genetic landscape of vertigo in an Asian population-based cohort. In a two-stage genome-wide association study (Ncase = 6199; Ncontrol = 54,587), we identify vertigo-associated genomic loci in DROSHA and ZNF91/LINC01224, with the latter replicating the findings in European ancestries. Gene-based association testing corroborates these findings. Interestingly, both genes are enriched in cerebellum, a key structure receiving sensory input from the vestibular system. Subjects carrying risk alleles from lead SNPs of DROSHA and ZNF91 incur a 1.74-fold risk of vertigo than those without. Moreover, composite clinical-polygenic risk scores allow differentiation between patients and controls, yielding an area under receiver operating characteristic curve of 0.69. This study identified novel genomic loci for vertigo in an Asian population-based cohort, which may help identifying high risk subjects and provide mechanistic insight in understanding the pathogenesis of vertigo.
Assuntos
Povo Asiático , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Vertigem , Humanos , Masculino , Feminino , Vertigem/genética , Povo Asiático/genética , Pessoa de Meia-Idade , Estudos de Coortes , Adulto , Loci Gênicos , IdosoRESUMO
Major depressive disorder (MDD) is one of the most important causes of disability worldwide. While recent work provides insights into the molecular alterations in the brain of patients with MDD, whether these molecular signatures can be associated with the expression of specific symptom domains remains unclear. Here, we identified sex-specific gene modules associated with the expression of MDD, combining differential gene expression and co-expression network analyses in six cortical and subcortical brain regions. Our results show varying levels of network homology between males and females across brain regions, although the associations between these structures and the expression of MDD remain highly sex specific. We refined these associations to several symptom domains and identified transcriptional signatures associated with distinct functional pathways, including GABAergic and glutamatergic neurotransmission, metabolic processes and intracellular signal transduction, across brain regions associated with distinct symptomatic profiles in a sex-specific fashion. In most cases, these associations were specific to males or to females with MDD, although a subset of gene modules associated with common symptomatic features in both sexes were also identified. Together, our findings suggest that the expression of distinct MDD symptom domains associates with sex-specific transcriptional structures across brain regions.
Assuntos
Transtorno Depressivo Maior , Masculino , Humanos , Feminino , Depressão/genética , Encéfalo/metabolismo , Transmissão Sináptica , Transdução de Sinais , Imageamento por Ressonância MagnéticaRESUMO
Major depressive disorder (MDD) is one of the most important causes of disability worldwide. While recent work provides insights into the molecular alterations in the brain of patients with MDD, whether these molecular signatures can be associated with the expression of specific symptom domains in males and females remains unclear. Here, we identified sex-specific gene modules associated with the expression of MDD, combining differential gene expression and co-expression network analyses in six cortical and subcortical brain regions. Our results show varying levels of network homology between males and females across brain regions, although the association between these structures and the expression of MDD remains highly sex-specific. We refined these associations to several symptom domains and identified transcriptional signatures associated with distinct functional pathways, including GABAergic and glutamatergic neurotransmission, metabolic processes, and intracellular signal transduction, across brain regions associated with distinct symptomatic profiles in a sex-specific fashion. In most cases, these associations were specific to males or to females with MDD, although a subset of gene modules associated with common symptomatic features in both sexes was also identified. Together, our findings suggest that the expression of distinct MDD symptom domains is associated with sex-specific transcriptional structures across brain regions.
RESUMO
To explore the complex genetic architecture of common diseases and traits, we conducted comprehensive PheWAS of ten diseases and 34 quantitative traits in the community-based Taiwan Biobank (TWB). We identified 995 significantly associated loci with 135 novel loci specific to Taiwanese population. Further analyses highlighted the genetic pleiotropy of loci related to complex disease and associated quantitative traits. Extensive analysis on glycaemic phenotypes (T2D, fasting glucose and HbA1c) was performed and identified 115 significant loci with four novel genetic variants (HACL1, RAD21, ASH1L and GAK). Transcriptomics data also strengthen the relevancy of the findings to metabolic disorders, thus contributing to better understanding of pathogenesis. In addition, genetic risk scores are constructed and validated for absolute risks prediction of T2D in Taiwanese population. In conclusion, our data-driven approach without a priori hypothesis is useful for novel gene discovery and validation on top of disease risk prediction for unique non-European population.
Assuntos
Diabetes Mellitus Tipo 2 , Estudo de Associação Genômica Ampla , Humanos , Polimorfismo de Nucleotídeo Único , Bancos de Espécimes Biológicos , Taiwan/epidemiologia , Glicemia/genética , Fatores de Risco , Diabetes Mellitus Tipo 2/genética , Carbono-Carbono Liases/genéticaRESUMO
Large-scale genome-wide association (GWAS) studies provide opportunities for developing genetic risk prediction models that have the potential to improve disease prevention, intervention or treatment. The key step is to develop polygenic risk score (PRS) models with high predictive performance for a given disease, which typically requires a large training data set for selecting truly associated single nucleotide polymorphisms (SNPs) and estimating effect sizes accurately. Here, we develop a comprehensive penalized regression for fitting l 1 regularized regression models to GWAS summary statistics. We propose incorporating Pleiotropy and ANnotation information into PRS (PANPRS) development through suitable formulation of penalty functions and associated tuning parameters. Extensive simulations show that PANPRS performs equally well or better than existing PRS methods when no functional annotation or pleiotropy is incorporated. When functional annotation data and pleiotropy are informative, PANPRS substantially outperforms existing PRS methods in simulations. Finally, we applied our methods to build PRS for type 2 diabetes and melanoma and found that incorporating relevant functional annotations and GWAS of genetically related traits improved prediction of these two complex diseases.
RESUMO
Background: Several studies have linked type 2 diabetes (T2D) to an increased risk of developing Alzheimer's disease (AD). This has led to an interest in using antidiabetic treatments for the prevention of AD. However, the underlying mechanisms explaining the relationship between T2D and AD have not been completely elucidated. Objective: Our objective was to examine cerebral 18F-fluorodeoxyglucose (FDG) uptake during normal aging and in AD patients in regions associated with diabetes genetic risk factor expression to highlight which genes may serve as potential targets for pharmaceutical intervention. Methods: We calculated regional glucose metabolism differences in units of standardized uptake values (SUVR) for 386 cognitively healthy adults and 335 clinically probable AD patients. We then proceeded to extract gene-expression data from the publicly available Allen Human Brain Atlas (HBA) database. We used the nearest genes to 46 AD- and T2D-associated SNPs previously identified in the literature, and mapped their expression to the same 34 cortical regions in which we calculated SUVRs. SNPs with a donor consistency of 0.40 or greater were selected for further analysis. We evaluated the associations between SUVR and gene-expression across the brain. Results: Of the 46 risk-factor genes, 15 were found to be significantly correlated with FDG-PET brain metabolism in healthy adults and probable AD patients after correction for multiple comparisons. Using multiple regression, we found that five genes explained a total of 72.5% of the SUVR variance across the healthy adult group regions, while four genes explained a total of 79.3% of the SUVR variance across the probable AD group regions. There were significant differences in whole-brain SUVR as a function of allele frequencies for two genes. Conclusions: These results highlight the association between risk factor genes for T2D and regional glucose metabolism during both normal aging and in probable AD. Highlighted genes were associated with mitochondrial stability, vascular maintenance, and glucose intolerance. Pharmacological intervention of these pathways has the potential to improve glucose metabolism during normal again as well as in AD patients.
RESUMO
BACKGROUND: Most of our knowledge of the biological basis of major depressive disorder (MDD) is derived from studies of chronic stress models in rodents. While these models capture certain aspects of the behavioral and neuroendocrine features of MDD, the extent to which they reproduce the molecular pathology of the human syndrome remains unknown. METHODS: We systematically compared transcriptional signatures in two brain regions implicated in depression-medial prefrontal cortex and nucleus accumbens-of humans with MDD and of 3 chronic stress models in mice: chronic variable stress, adult social isolation, and chronic social defeat stress. We used differential expression analysis combined with weighted gene coexpression network analysis to create interspecies gene networks and assess the capacity of each stress paradigm to recapitulate the transcriptional organization of gene networks in human MDD. RESULTS: Our results show significant overlap between transcriptional alterations in medial prefrontal cortex and nucleus accumbens in human MDD and the 3 mouse chronic stress models, with each of the chronic stress paradigms capturing distinct aspects of MDD abnormalities. Chronic variable stress and adult social isolation better reproduce differentially expressed genes, while chronic social defeat stress and adult social isolation better reproduce gene networks characteristic of human MDD. We also identified several gene networks and their constituent genes that are most significantly associated with human MDD and mouse stress models. CONCLUSIONS: This study demonstrates the ability of 3 chronic stress models in mice to recapitulate distinct aspects of the broad molecular pathology of human MDD, with no one mouse model apparently better than another.
Assuntos
Transtorno Depressivo Maior , Animais , Encéfalo , Transtorno Depressivo Maior/genética , Modelos Animais de Doenças , Camundongos , Núcleo Accumbens , Córtex Pré-FrontalRESUMO
Background: Childhood cancer survivors treated with chest-directed radiotherapy have substantially elevated risk for developing breast cancer. Although genetic susceptibility to breast cancer in the general population is well studied, large-scale evaluation of breast cancer susceptibility after chest-directed radiotherapy for childhood cancer is lacking. Methods: We conducted a genome-wide association study of breast cancer in female survivors of childhood cancer, pooling two cohorts with detailed treatment data and systematic, long-term follow-up: the Childhood Cancer Survivor Study and St. Jude Lifetime Cohort. The study population comprised 207 survivors who developed breast cancer and 2774 who had not developed any subsequent neoplasm as of last follow-up. Genotyping and subsequent imputation yielded 16 958 466 high-quality variants for analysis. We tested associations in the overall population and in subgroups stratified by receipt of lower than 10 and 10 or higher gray breast radiation exposure. We report P values and pooled per-allele risk estimates from Cox proportional hazards regression models. All statistical tests were two-sided. Results: Among survivors who received 10 or higher gray breast radiation exposure, a locus on 1q41 was associated with subsequent breast cancer risk (rs4342822, nearest gene PROX1 , risk allele frequency in control subjects [RAF controls ] = 0.46, hazard ratio = 1.92, 95% confidence interval = 1.49 to 2.44, P = 7.09 × 10 -9 ). Two rare variants also showed potentially promising associations (breast radiation ≥10 gray: rs74949440, 11q23, TAGLN , RAF controls = 0.02, P = 5.84 × 10 -8 ; <10 gray: rs17020562, 1q32.3, RPS6KC1 , RAF controls = 0.0005, P = 6.68 × 10 -8 ). Associations were restricted to these dose subgroups, with consistent findings in the two survivor cohorts. Conclusions: Our study provides strong evidence that germline genetics outside high-risk syndromes could modify the effect of radiation exposure on breast cancer risk after childhood cancer.
Assuntos
Neoplasias da Mama/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Proteínas de Homeodomínio/genética , Proteínas dos Microfilamentos/genética , Proteínas Musculares/genética , Neoplasias Induzidas por Radiação/genética , Segunda Neoplasia Primária/genética , Proteínas Quinases S6 Ribossômicas/genética , Proteínas Supressoras de Tumor/genética , Adolescente , Adulto , Mama/efeitos da radiação , Criança , Pré-Escolar , Estudos de Coortes , Feminino , Doença de Hodgkin/radioterapia , Humanos , Lactente , Leucemia/radioterapia , Pessoa de Meia-Idade , Modelos de Riscos Proporcionais , Dosagem Radioterapêutica , Estudos Retrospectivos , Sobreviventes , Adulto Jovem , Quinases raf/genéticaRESUMO
Various forms of penalty functions have been developed for regularized estimation and variable selection. Screening approaches are often used to reduce the number of covariate before penalized estimation. However, in certain problems, the number of covariates remains large after screening. For example, in genome-wide association (GWA) studies, the purpose is to identify Single Nucleotide Polymorphisms (SNPs) that are associated with certain traits, and typically there are millions of SNPs and thousands of samples. Because of the strong correlation of nearby SNPs, screening can only reduce the number of SNPs from millions to tens of thousands and the variable selection problem remains very challenging. Several penalty functions have been proposed for such high dimensional data. However, it is unclear which class of penalty functions is the appropriate choice for a particular application. In this paper, we conduct a theoretical analysis to relate the ranges of tuning parameters of various penalty functions with the dimensionality of the problem and the minimum effect size. We exemplify our theoretical results in several penalty functions. The results suggest that a class of penalty functions that bridges L0 and L1 penalties requires less restrictive conditions on dimensionality and minimum effect sizes in order to attain the two fundamental goals of penalized estimation: to penalize all the noise to be zero and to obtain unbiased estimation of the true signals. The penalties such as SICA and Log belong to this class, but they have not been used often in applications. The simulation and real data analysis using GWAS data suggest the promising applicability of such class of penalties.
RESUMO
Alcohol-related problems have traditionally been conceptualized and measured by an effect indicator model. That is, it is generally assumed that observed indicators of alcohol problems are caused by a latent variable. However, there are reasons to think that this construct is more accurately conceptualized as including at least some causal indicators, in which observed indicators cause the latent variable. The present study examined the measurement model of a well-known alcohol consequences questionnaire, the Rutgers Alcohol Problem Index. Participants were 703 students from a large public university in the Northeast mandated to an alcohol intervention. We conducted a zero tetrad test to examine a measurement model consisting solely of effect indicators and a model with both causal and effect indicators. Overall, the results suggested the hybrid model fit the data better than a model with only effect indicators. These findings have implications regarding the theoretical underpinnings of alcohol-related consequences.
Assuntos
Consumo de Bebidas Alcoólicas/psicologia , Interpretação Estatística de Dados , Estudantes/psicologia , Inquéritos e Questionários/normas , Feminino , Humanos , Masculino , Modelos Estatísticos , Universidades , Adulto JovemRESUMO
DNA damage and mutations induced by oxidative stress are associated with various different human pathologies including cancer. The facts that most human tumors are characterized by large genome rearrangements and glutathione depletion in mice results in deletions in DNA suggest that reactive oxygen species (ROS) may cause gene and chromosome mutations through DNA double strand breaks (DSBs). However, the generation of DSBs at low levels of ROS is still controversial. In the present study, we show that H2O2 at biologically-relevant levels causes a marked increase in oxidative clustered DNA lesions (OCDLs) with a significant elevation of replication-independent DSBs. Although it is frequently reported that OCDLs are fingerprint of high-energy IR, our results indicate for the first time that H2O2, even at low levels, can also cause OCDLs leading to DSBs specifically in G1 cells. Furthermore, a reverse genetic approach revealed a significant contribution of the non-homologous end joining (NHEJ) pathway in H2O2-induced DNA repair & mutagenesis. This genomic instability induced by low levels of ROS may be involved in spontaneous mutagenesis and the etiology of a wide variety of human diseases like chronic inflammation-related disorders, carcinogenesis, neuro-degeneration and aging.
Assuntos
Dano ao DNA/fisiologia , Reparo do DNA por Junção de Extremidades/fisiologia , Mutagênese/fisiologia , Estresse Oxidativo/fisiologia , Animais , Linhagem Celular , Galinhas , Peróxido de Hidrogênio/toxicidade , Oxidantes/toxicidadeRESUMO
We assessed gene expression profiles in 2,752 twins, using a classic twin design to quantify expression heritability and quantitative trait loci (eQTLs) in peripheral blood. The most highly heritable genes (â¼777) were grouped into distinct expression clusters, enriched in gene-poor regions, associated with specific gene function or ontology classes, and strongly associated with disease designation. The design enabled a comparison of twin-based heritability to estimates based on dizygotic identity-by-descent sharing and distant genetic relatedness. Consideration of sampling variation suggests that previous heritability estimates have been upwardly biased. Genotyping of 2,494 twins enabled powerful identification of eQTLs, which we further examined in a replication set of 1,895 unrelated subjects. A large number of non-redundant local eQTLs (6,756) met replication criteria, whereas a relatively small number of distant eQTLs (165) met quality control and replication standards. Our results provide a new resource toward understanding the genetic control of transcription.