Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 317
Filter
1.
Article in English | MEDLINE | ID: mdl-38958945

ABSTRACT

BACKGROUND: The stromal microenvironment (SME) is integral to breast cancer (BC) biology, impacting metastatic proclivity and treatment response. Emerging data indicate that host factors may impact the SME, but the relationship between pre-diagnostic host factors and SME phenotype remains poorly characterized, particularly among women of African ancestry. METHODS: We conducted a case-only analysis involving 792 BC patients (17-84 years) from the Ghana Breast Health Study (GBHS). High-accuracy machine-learning algorithms were applied to standard H&E-stained images to characterize SME phenotypes (including percent tumor-associated connective tissue stroma, Ta-CTS (%), and tumor-associated stromal cellular density, Ta-SCD (%)). Associations between pre-diagnostic host factors and SME phenotypes were assessed in multivariable linear regression models. RESULTS: Decreasing Ta-CTS and increasing Ta-SCD were associated with aggressive, mostly high-grade tumors (p-value<0.001). Several pre-diagnostic host factors were associated with Ta-SCD independently of tumor characteristics. Compared with nulliparous women, parous women had higher levels of Ta-SCD [mean (standard deviation, SD) = 31.3% (7.6%) vs. 28.9% (7.1%); p-value=0.01]. Similarly, women with a positive family history of breast cancer had higher levels of Ta-SCD than those without family history [mean (SD) = 33.0% (7.5%)] vs. 30.9% (7.6%); p-value=0.03]. Conversely, increasing body size was associated with decreasing Ta-SCD [mean (SD) = 32.0% (7.4%), 31.3% (7.3%), and 29.0% (8.0%) for slight, average, and large body sizes, respectively, p-value=0.005]. CONCLUSIONS: Epidemiological risk factors were associated with varying degrees of stromal cellularity in tumors, independently of clinicopathological characteristics. IMPACT: The findings raise the possibility that epidemiological risk factors may partly influence tumor biology via the SME.

2.
Cancer Res ; 2024 Jun 04.
Article in English | MEDLINE | ID: mdl-38832928

ABSTRACT

Breast cancer includes several subtypes with distinct characteristic biological, pathological, and clinical features. Elucidating subtype-specific genetic etiology could provide insights into the heterogeneity of breast cancer to facilitate development of improved prevention and treatment approaches. Here, we conducted pairwise case-case comparisons among five breast cancer subtypes by applying a case-case GWAS (CC-GWAS) approach to summary statistics data of the Breast Cancer Association Consortium. The approach identified 13 statistically significant loci and eight suggestive loci, the majority of which were identified from comparisons between triple-negative breast cancer (TNBC) and luminal A breast cancer. Associations of lead variants in 12 loci remained statistically significant after accounting for previously reported breast cancer susceptibility variants, among which two were genome-wide significant. Fine mapping implicated putative functional/causal variants and risk genes at several loci, e.g., 3q26.31/TNFSF10, 8q22.3/NACAP1/GRHL2, and 8q23.3/LINC00536/TRPS1, for TNBC as compared to luminal cancer. Functional investigation further identified rs16867605 at 8q22.3 as a SNP that modulates enhancer activity of GRHL2. Subtype-informative polygenic risk scores (PRS) were derived, and patients with a high subtype-informative PRS had an up to 2-fold increased risk of being diagnosed with TNBC instead of luminal cancers. The CC-GWAS PRS remained statistically significant after adjusting for TNBC PRS derived from traditional case-control GWAS in The Cancer Genome Atlas and the African Ancestry Breast Cancer Genetic Consortium. The CC-GWAS PRS was also associated with overall survival and disease-specific survival among breast cancer patients. Overall, these findings have advanced our understanding of the genetic etiology of breast cancer subtypes, particularly for TNBC.

3.
JAMIA Open ; 7(2): ooae055, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38938691

ABSTRACT

Objectives: Absolute risk models estimate an individual's future disease risk over a specified time interval. Applications utilizing server-side risk tooling, the R-based iCARE (R-iCARE), to build, validate, and apply absolute risk models, face limitations in portability and privacy due to their need for circulating user data in remote servers for operation. We overcome this by porting iCARE to the web platform. Materials and Methods: We refactored R-iCARE into a Python package (Py-iCARE) and then compiled it to WebAssembly (Wasm-iCARE)-a portable web module, which operates within the privacy of the user's device. Results: We showcase the portability and privacy of Wasm-iCARE through 2 applications: for researchers to statistically validate risk models and to deliver them to end-users. Both applications run entirely on the client side, requiring no downloads or installations, and keep user data on-device during risk calculation. Conclusions: Wasm-iCARE fosters accessible and privacy-preserving risk tools, accelerating their validation and delivery.

4.
Nat Commun ; 15(1): 3718, 2024 May 02.
Article in English | MEDLINE | ID: mdl-38697998

ABSTRACT

African-ancestry (AA) participants are underrepresented in genetics research. Here, we conducted a transcriptome-wide association study (TWAS) in AA female participants to identify putative breast cancer susceptibility genes. We built genetic models to predict levels of gene expression, exon junction, and 3' UTR alternative polyadenylation using genomic and transcriptomic data generated in normal breast tissues from 150 AA participants and then used these models to perform association analyses using genomic data from 18,034 cases and 22,104 controls. At Bonferroni-corrected P < 0.05, we identified six genes associated with breast cancer risk, including four genes not previously reported (CTD-3080P12.3, EN1, LINC01956 and NUP210L). Most of these genes showed a stronger association with risk of estrogen-receptor (ER) negative or triple-negative than ER-positive breast cancer. We also replicated the associations with 29 genes reported in previous TWAS at P < 0.05 (one-sided), providing further support for an association of these genes with breast cancer risk. Our study sheds new light on the genetic basis of breast cancer and highlights the value of conducting research in AA populations.


Subject(s)
Breast Neoplasms , Genetic Predisposition to Disease , Transcriptome , Adult , Aged , Female , Humans , Middle Aged , Black People/genetics , Breast Neoplasms/genetics , Case-Control Studies , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Genome-Wide Association Study , Polymorphism, Single Nucleotide , Receptors, Estrogen/genetics , Receptors, Estrogen/metabolism , Black or African American , United States
5.
Nat Genet ; 56(5): 819-826, 2024 May.
Article in English | MEDLINE | ID: mdl-38741014

ABSTRACT

We performed genome-wide association studies of breast cancer including 18,034 cases and 22,104 controls of African ancestry. Genetic variants at 12 loci were associated with breast cancer risk (P < 5 × 10-8), including associations of a low-frequency missense variant rs61751053 in ARHGEF38 with overall breast cancer (odds ratio (OR) = 1.48) and a common variant rs76664032 at chromosome 2q14.2 with triple-negative breast cancer (TNBC) (OR = 1.30). Approximately 15.4% of cases with TNBC carried six risk alleles in three genome-wide association study-identified TNBC risk variants, with an OR of 4.21 (95% confidence interval = 2.66-7.03) compared with those carrying fewer than two risk alleles. A polygenic risk score (PRS) showed an area under the receiver operating characteristic curve of 0.60 for the prediction of breast cancer risk, which outperformed PRS derived using data from females of European ancestry. Our study markedly increases the population diversity in genetic studies for breast cancer and demonstrates the utility of PRS for risk prediction in females of African ancestry.


Subject(s)
Black People , Breast Neoplasms , Genetic Predisposition to Disease , Genome-Wide Association Study , Polymorphism, Single Nucleotide , Humans , Female , Genome-Wide Association Study/methods , Breast Neoplasms/genetics , Black People/genetics , Case-Control Studies , Risk Factors , Triple Negative Breast Neoplasms/genetics , Alleles , Multifactorial Inheritance/genetics , Middle Aged , Genetic Loci , White People/genetics
6.
Breast Cancer Res Treat ; 206(2): 295-305, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38653906

ABSTRACT

PURPOSE: Mammographic density phenotypes, adjusted for age and body mass index (BMI), are strong predictors of breast cancer risk. BMI is associated with mammographic density measures, but the role of circulating sex hormone concentrations is less clear. We investigated the relationship between BMI, circulating sex hormone concentrations, and mammographic density phenotypes using Mendelian randomization (MR). METHODS: We applied two-sample MR approaches to assess the association between genetically predicted circulating concentrations of sex hormones [estradiol, testosterone, sex hormone-binding globulin (SHBG)], BMI, and mammographic density phenotypes (dense and non-dense area). We created instrumental variables from large European ancestry-based genome-wide association studies and applied estimates to mammographic density phenotypes in up to 14,000 women of European ancestry. We performed analyses overall and by menopausal status. RESULTS: Genetically predicted BMI was positively associated with non-dense area (IVW: ß = 1.79; 95% CI = 1.58, 2.00; p = 9.57 × 10-63) and inversely associated with dense area (IVW: ß = - 0.37; 95% CI = - 0.51,- 0.23; p = 4.7 × 10-7). We observed weak evidence for an association of circulating sex hormone concentrations with mammographic density phenotypes, specifically inverse associations between genetically predicted testosterone concentration and dense area (ß = - 0.22; 95% CI = - 0.38, - 0.053; p = 0.009) and between genetically predicted estradiol concentration and non-dense area (ß = - 3.32; 95% CI = - 5.83, - 0.82; p = 0.009), although results were not consistent across a range of MR approaches. CONCLUSION: Our findings support a positive causal association between BMI and mammographic non-dense area and an inverse association between BMI and dense area. Evidence was weaker and inconsistent for a causal effect of circulating sex hormone concentrations on mammographic density phenotypes. Based on our findings, associations between circulating sex hormone concentrations and mammographic density phenotypes are weak at best.


Subject(s)
Body Mass Index , Breast Density , Breast Neoplasms , Genome-Wide Association Study , Gonadal Steroid Hormones , Mendelian Randomization Analysis , Humans , Female , Breast Neoplasms/genetics , Breast Neoplasms/blood , Breast Neoplasms/diagnostic imaging , Gonadal Steroid Hormones/blood , Sex Hormone-Binding Globulin/analysis , Sex Hormone-Binding Globulin/metabolism , Sex Hormone-Binding Globulin/genetics , Middle Aged , Polymorphism, Single Nucleotide , Mammography , Estradiol/blood , Testosterone/blood , Phenotype
7.
Int J Cancer ; 2024 Apr 26.
Article in English | MEDLINE | ID: mdl-38669116

ABSTRACT

The associations of certain factors, such as age and menopausal hormone therapy, with breast cancer risk are known to differ for interval and screen-detected cancers. However, the extent to which associations of other established breast cancer risk factors differ by mode of detection is unclear. We investigated associations of a wide range of risk factors using data from a large UK cohort with linkage to the National Health Service Breast Screening Programme, cancer registration, and other health records. We used Cox regression to estimate adjusted relative risks (RRs) and 95% confidence intervals (CIs) for associations between risk factors and breast cancer risk. A total of 9421 screen-detected and 5166 interval cancers were diagnosed in 517,555 women who were followed for an average of 9.72 years. We observed the following differences in risk factor associations by mode of detection: greater body mass index (BMI) was associated with a smaller increased risk of interval (RR per 5 unit increase 1.07, 95% CI 1.03-1.11) than screen-detected cancer (RR 1.27, 1.23-1.30); having a first-degree family history was associated with a greater increased risk of interval (RR 1.81, 1.68-1.95) than screen-detected cancer (RR 1.52, 1.43-1.61); and having had previous breast surgery was associated with a greater increased risk of interval (RR 1.85, 1.72-1.99) than screen-detected cancer (RR 1.34, 1.26-1.42). As these differences in associations were relatively unchanged after adjustment for tumour grade, and are in line with the effects of these factors on mammographic density, they are likely to reflect the effects of these risk factors on screening sensitivity.

8.
Breast Cancer Res ; 25(1): 150, 2023 12 11.
Article in English | MEDLINE | ID: mdl-38082317

ABSTRACT

Epidemiologic data on insecticide exposures and breast cancer risk are inconclusive and mostly from high-income countries. Using data from 1071 invasive pathologically confirmed breast cancer cases and 2096 controls from the Ghana Breast Health Study conducted from 2013 to 2015, we investigated associations with mosquito control products to reduce the spread of mosquito-borne diseases, such as malaria. These mosquito control products were insecticide-treated nets, mosquito coils, repellent room sprays, and skin creams for personal protection against mosquitos. Multivariable and polytomous logistic regression models were used to estimate odds ratios (ORadj) and 95% confidence intervals (CI) with breast cancer risk-adjusted for potential confounders and known risk factors. Among controls, the reported use of mosquito control products were mosquito coils (65%), followed by insecticide-treated nets (56%), repellent room sprays (53%), and repellent skin creams (15%). Compared to a referent group of participants unexposed to mosquito control products, there was no significant association between breast cancer risk and mosquito coils. There was an association in breast cancer risk with reported use of insecticide-treated nets; however, that association was weak and not statistically significant. Participants who reported using repellent sprays were at elevated risks compared to women who did not use any mosquito control products, even after adjustment for all other mosquito control products (OR = 1.42, 95% CI=1.15-1.75). We had limited power to detect an association with repellent skin creams. Although only a few participants reported using repellent room sprays weekly/daily or < month-monthly, no trends were evident with increased frequency of use of repellent sprays, and there was no statistical evidence of heterogeneity by estrogen receptor (ER) status (p-het > 0.25). Our analysis was limited when determining if an association existed with repellent skin creams; therefore, we cannot conclude an association. We found limited evidence of risk associations with widely used mosquito coils and insecticide-treated nets, which are reassuring given their importance for malaria prevention. Our findings regarding specific breast cancer risk associations, specifically those observed between repellent sprays, require further study.


Subject(s)
Breast Neoplasms , Insect Repellents , Insecticides , Malaria , Animals , Humans , Female , Mosquito Control , Insecticides/adverse effects , Ghana/epidemiology , Breast Neoplasms/epidemiology , Breast Neoplasms/etiology , Breast Neoplasms/prevention & control , Malaria/prevention & control , Insect Repellents/adverse effects
9.
Bioinform Adv ; 3(1): vbad145, 2023.
Article in English | MEDLINE | ID: mdl-37868335

ABSTRACT

Motivation: Currently, the Polygenic Score (PGS) Catalog curates over 400 publications on over 500 traits corresponding to over 3000 polygenic risk scores (PRSs). To assess the feasibility of privately calculating the underlying multivariate relative risk for individuals with consumer genomics data, we developed an in-browserPRS calculator for genomic data that does not circulate any data or engage in any computation outside of the user's personal device. Results: A prototype personal risk score calculator, created for research purposes, was developed to demonstrate how the PGS Catalog can be privately and readily applied to readily available direct-to-consumer genetic testing services, such as 23andMe. No software download, installation, or configuration is needed. The PRS web calculator matches individual PGS catalog entries with an individual's 23andMe genome data composed of 600k to 1.4 M single-nucleotide polymorphisms (SNPs). Beta coefficients provide researchers with a convenient assessment of risk associated with matched SNPs. This in-browser application was tested in a variety of personal devices, including smartphones, establishing the feasibility of privately calculating personal risk scores with up to a few thousand reference genetic variations and from the full 23andMe SNP data file (compressed or not). Availability and implementation: The PRScalc web application is developed in JavaScript, HTML, and CSS and is available at GitHub repository (https://episphere.github.io/prs) under an MIT license. The datasets were derived from sources in the public domain: [PGS Catalog, Personal Genome Project].

10.
ArXiv ; 2023 Oct 13.
Article in English | MEDLINE | ID: mdl-37873020

ABSTRACT

Objective: Absolute risk models estimate an individual's future disease risk over a specified time interval. Applications utilizing server-side risk tooling, such as the R-based iCARE (R-iCARE), to build, validate, and apply absolute risk models, face serious limitations in portability and privacy due to their need for circulating user data in remote servers for operation. Our objective was to overcome these limitations. Materials and Methods: We refactored R-iCARE into a Python package (Py-iCARE) then compiled it to WebAssembly (Wasm-iCARE): a portable web module, which operates entirely within the privacy of the user's device. Results: We showcase the portability and privacy of Wasm-iCARE through two applications: for researchers to statistically validate risk models, and to deliver them to end-users. Both applications run entirely on the client-side, requiring no downloads or installations, and keeps user data on-device during risk calculation. Conclusions: Wasm-iCARE fosters accessible and privacy-preserving risk tools, accelerating their validation and delivery.

11.
BMC Med Inform Decis Mak ; 23(1): 238, 2023 10 25.
Article in English | MEDLINE | ID: mdl-37880712

ABSTRACT

BACKGROUND: Online questionnaires are commonly used to collect information from participants in epidemiological studies. This requires building questionnaires using machine-readable formats that can be delivered to study participants using web-based technologies such as progressive web applications. However, the paucity of open-source markup standards with support for complex logic make collaborative development of web-based questionnaire modules difficult. This often prevents interoperability and reusability of questionnaire modules across epidemiological studies. RESULTS: We developed an open-source markup language for presentation of questionnaire content and logic, Quest, within a real-time renderer that enables the user to test logic (e.g., skip patterns) and view the structure of data collection. We provide the Quest markup language, an in-browser markup rendering tool, questionnaire development tool and an example web application that embeds the renderer, developed for The Connect for Cancer Prevention Study. CONCLUSION: A markup language can specify both the content and logic of a questionnaire as plain text. Questionnaire markup, such as Quest, can become a standard format for storing questionnaires or sharing questionnaires across the web. Quest is a step towards generation of FAIR data in epidemiological studies by facilitating reusability of questionnaires and data interoperability using open-source tools.


Subject(s)
Software , Humans , Surveys and Questionnaires , Epidemiologic Studies
12.
Eur J Epidemiol ; 38(10): 1053-1068, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37789226

ABSTRACT

Light-at-night triggers the decline of pineal gland melatonin biosynthesis and secretion and is an IARC-classified probable breast-cancer risk factor. We applied a large-scale molecular epidemiology approach to shed light on the putative role of melatonin in breast cancer. We investigated associations between breast-cancer risk and polymorphisms at genes of melatonin biosynthesis/signaling using a study population of 44,405 women from the Breast Cancer Association Consortium (22,992 cases, 21,413 population-based controls). Genotype data of 97 candidate single nucleotide polymorphisms (SNPs) at 18 defined gene regions were investigated for breast-cancer risk effects. We calculated adjusted odds ratios (ORs) and 95% confidence intervals (CI) by logistic regression for the main-effect analysis as well as stratified analyses by estrogen- and progesterone-receptor (ER, PR) status. SNP-SNP interactions were analyzed via a two-step procedure based on logic regression. The Bayesian false-discovery probability (BFDP) was used for all analyses to account for multiple testing. Noteworthy associations (BFDP < 0.8) included 10 linked SNPs in tryptophan hydroxylase 2 (TPH2) (e.g. rs1386492: OR = 1.07, 95% CI 1.02-1.12), and a SNP in the mitogen-activated protein kinase 8 (MAPK8) (rs10857561: OR = 1.11, 95% CI 1.04-1.18). The SNP-SNP interaction analysis revealed noteworthy interaction terms with TPH2- and MAPK-related SNPs (e.g. rs1386483R ∧ rs1473473D ∧ rs3729931D: OR = 1.20, 95% CI 1.09-1.32). In line with the light-at-night hypothesis that links shift work with elevated breast-cancer risks our results point to SNPs in TPH2 and MAPK-genes that may impact the intricate network of circadian regulation.


Subject(s)
Breast Neoplasms , Melatonin , Humans , Female , Breast Neoplasms/genetics , Breast Neoplasms/epidemiology , Melatonin/genetics , Melatonin/metabolism , Bayes Theorem , Polymorphism, Single Nucleotide , Logistic Models , Case-Control Studies , Genetic Predisposition to Disease
13.
Nat Genet ; 55(10): 1757-1768, 2023 10.
Article in English | MEDLINE | ID: mdl-37749244

ABSTRACT

Polygenic risk scores (PRSs) increasingly predict complex traits; however, suboptimal performance in non-European populations raise concerns about clinical applications and health inequities. We developed CT-SLEB, a powerful and scalable method to calculate PRSs, using ancestry-specific genome-wide association study summary statistics from multiancestry training samples, integrating clumping and thresholding, empirical Bayes and superlearning. We evaluated CT-SLEB and nine alternative methods with large-scale simulated genome-wide association studies (~19 million common variants) and datasets from 23andMe, Inc., the Global Lipids Genetics Consortium, All of Us and UK Biobank, involving 5.1 million individuals of diverse ancestry, with 1.18 million individuals from four non-European populations across 13 complex traits. Results demonstrated that CT-SLEB significantly improves PRS performance in non-European populations compared with simple alternatives, with comparable or superior performance to a recent, computationally intensive method. Moreover, our simulation studies offered insights into sample size requirements and SNP density effects on multiancestry risk prediction.


Subject(s)
Multifactorial Inheritance , Population Health , Humans , Multifactorial Inheritance/genetics , Genome-Wide Association Study , Bayes Theorem , Polymorphism, Single Nucleotide/genetics , Risk Factors , Genetic Predisposition to Disease
14.
Cancers (Basel) ; 15(13)2023 Jun 23.
Article in English | MEDLINE | ID: mdl-37444426

ABSTRACT

FANCM germline protein truncating variants (PTVs) are moderate-risk factors for ER-negative breast cancer. We previously described the spectrum of FANCM PTVs in 114 European breast cancer cases. In the present, larger cohort, we report the spectrum and frequency of four common and 62 rare FANCM PTVs found in 274 carriers detected among 44,803 breast cancer cases. We confirmed that p.Gln1701* was the most common PTV in Northern Europe with lower frequencies in Southern Europe. In contrast, p.Gly1906Alafs*12 was the most common PTV in Southern Europe with decreasing frequencies in Central and Northern Europe. We verified that p.Arg658* was prevalent in Central Europe and had highest frequencies in Eastern Europe. We also confirmed that the fourth most common PTV, p.Gln498Thrfs*7, might be a founder variant from Lithuania. Based on the frequency distribution of the carriers of rare PTVs, we showed that the FANCM PTVs spectra in Southwestern and Central Europe were much more heterogeneous than those from Northeastern Europe. These findings will inform the development of more efficient FANCM genetic testing strategies for breast cancer cases from specific European populations.

15.
Cancer Med ; 12(15): 16142-16162, 2023 08.
Article in English | MEDLINE | ID: mdl-37401034

ABSTRACT

BACKGROUND: Breast cancer (BC) patients with a germline CHEK2 c.1100delC variant have an increased risk of contralateral BC (CBC) and worse BC-specific survival (BCSS) compared to non-carriers. AIM: To assessed the associations of CHEK2 c.1100delC, radiotherapy, and systemic treatment with CBC risk and BCSS. METHODS: Analyses were based on 82,701 women diagnosed with a first primary invasive BC including 963 CHEK2 c.1100delC carriers; median follow-up was 9.1 years. Differential associations with treatment by CHEK2 c.1100delC status were tested by including interaction terms in a multivariable Cox regression model. A multi-state model was used for further insight into the relation between CHEK2 c.1100delC status, treatment, CBC risk and death. RESULTS: There was no evidence for differential associations of therapy with CBC risk by CHEK2 c.1100delC status. The strongest association with reduced CBC risk was observed for the combination of chemotherapy and endocrine therapy [HR (95% CI): 0.66 (0.55-0.78)]. No association was observed with radiotherapy. Results from the multi-state model showed shorter BCSS for CHEK2 c.1100delC carriers versus non-carriers also after accounting for CBC occurrence [HR (95% CI): 1.30 (1.09-1.56)]. CONCLUSION: Systemic therapy was associated with reduced CBC risk irrespective of CHEK2 c.1100delC status. Moreover, CHEK2 c.1100delC carriers had shorter BCSS, which appears not to be fully explained by their CBC risk.


Subject(s)
Breast Neoplasms , Female , Humans , Breast Neoplasms/genetics , Breast Neoplasms/radiotherapy , Checkpoint Kinase 2/genetics , Genetic Predisposition to Disease , Germ-Line Mutation , Heterozygote , Proportional Hazards Models
16.
Cancer Prev Res (Phila) ; 16(10): 561-570, 2023 10 02.
Article in English | MEDLINE | ID: mdl-37477495

ABSTRACT

FGFR3 and PIK3CA are among the most frequently mutated genes in bladder tumors. We hypothesized that recurrent mutations in these genes might be caused by common carcinogenic exposures such as smoking and other factors. We analyzed 2,816 bladder tumors with available data on FGFR3 and/or PIK3CA mutations, focusing on the most recurrent mutations detected in ≥10% of tumors. Compared to tumors with other FGFR3/PIK3CA mutations, FGFR3-Y375C was more common in tumors from smokers than never-smokers (P = 0.009), while several APOBEC-type driver mutations were enriched in never-smokers: FGFR3-S249C (P = 0.013) and PIK3CA-E542K/PIK3CA-E545K (P = 0.009). To explore possible causes of these APOBEC-type mutations, we analyzed RNA sequencing (RNA-seq) data from 798 bladder tumors and detected several viruses, with BK polyomavirus (BKPyV) being the most common. We then performed IHC staining for polyomavirus (PyV) Large T-antigen (LTAg) in an independent set of 211 bladder tumors. Overall, by RNA-seq or IHC-LTAg, we detected PyV in 26 out of 1,010 bladder tumors with significantly higher detection (P = 4.4 × 10-5), 25 of 554 (4.5%) in non-muscle-invasive bladder cancers (NMIBC) versus 1 of 456 (0.2%) of muscle-invasive bladder cancers (MIBC). In the NMIBC subset, the FGFR3/PIK3CA APOBEC-type driver mutations were detected in 94.7% (18/19) of PyV-positive versus 68.3% (259/379) of PyV-negative tumors (P = 0.011). BKPyV tumor positivity in the NMIBC subset with FGFR3- or PIK3CA-mutated tumors was also associated with a higher risk of progression to MIBC (P = 0.019). In conclusion, our results support smoking and BKPyV infection as risk factors contributing to bladder tumorigenesis in the general patient population through distinct molecular mechanisms. PREVENTION RELEVANCE: Tobacco smoking likely causes one of the most common mutations in bladder tumors (FGFR3-Y375C), while viral infections might contribute to three others (FGFR3-S249C, PIK3CA-E542K, and PIK3CA-E545K). Understanding the causes of these mutations may lead to new prevention and treatment strategies, such as viral screening and vaccination.


Subject(s)
Urinary Bladder Neoplasms , Virus Diseases , Humans , Urinary Bladder Neoplasms/genetics , Urinary Bladder Neoplasms/pathology , Mutation , Urinary Bladder/pathology , Class I Phosphatidylinositol 3-Kinases/genetics
17.
BMC Med Res Methodol ; 23(1): 153, 2023 06 29.
Article in English | MEDLINE | ID: mdl-37386403

ABSTRACT

BACKGROUND: The rule of thumb that there is little gain in statistical power by obtaining more than 4 controls per case, is based on type-1 error α = 0.05. However, association studies that evaluate thousands or millions of associations use smaller α and may have access to plentiful controls. We investigate power gains, and reductions in p-values, when increasing well beyond 4 controls per case, for small α. METHODS: We calculate the power, the median expected p-value, and the minimum detectable odds-ratio (OR), as a function of the number of controls/case, as α decreases. RESULTS: As α decreases, at each ratio of controls per case, the increase in power is larger than for α = 0.05. For α between 10-6 and 10-9 (typical for thousands or millions of associations), increasing from 4 controls per case to 10-50 controls per case increases power. For example, a study with power = 0.2 (α = 5 × 10-8) with 1 control/case has power = 0.65 with 4 controls/case, but with 10 controls/case has power = 0.78, and with 50 controls/case has power = 0.84. For situations where obtaining more than 4 controls per case provides small increases in power beyond 0.9 (at small α), the expected p-value can decrease by orders-of-magnitude below α. Increasing from 1 to 4 controls/case reduces the minimum detectable OR toward the null by 20.9%, and from 4 to 50 controls/case reduces by an additional 9.7%, a result which applies regardless of α and hence also applies to "regular" α = 0.05 epidemiology. CONCLUSIONS: At small α, versus 4 controls/case, recruiting 10 or more controls/cases can increase power, reduce the expected p-value by 1-2 orders of magnitude, and meaningfully reduce the minimum detectable OR. These benefits of increasing the controls/case ratio increase as the number of cases increases, although the amount of benefit depends on exposure frequencies and true OR. Provided that controls are comparable to cases, our findings suggest greater sharing of comparable controls in large-scale association studies.


Subject(s)
Control Groups , Odds Ratio , Research Design , Humans
18.
Microbiol Spectr ; 11(4): e0157223, 2023 08 17.
Article in English | MEDLINE | ID: mdl-37341612

ABSTRACT

The human fecal and oral microbiome may play a role in the etiology of breast cancer through modulation of endogenous estrogen metabolism. This study aimed to investigate associations of circulating estrogens and estrogen metabolites with the fecal and oral microbiome in postmenopausal African women. A total of 117 women with fecal (N = 110) and oral (N = 114) microbiome data measured by 16S rRNA gene sequencing, and estrogens and estrogen metabolites data measured by liquid chromatography tandem mass spectrometry were included. The outcomes were measures of the microbiome and the independent variables were the estrogens and estrogen metabolites. Estrogens and estrogen metabolites were associated with the fecal microbial Shannon index (global P < 0.01). In particular, higher levels of estrone (ß = 0.36, P = 0.03), 2-hydroxyestradiol (ß = 0.30, P = 0.02), 4-methoxyestrone (ß = 0.51, P = 0.01), and estriol (ß = 0.36, P = 0.04) were associated with higher levels of the Shannon index, while 16alpha-hydroxyestrone (ß = -0.57, P < 0.01) was inversely associated with the Shannon index as indicated by linear regression. Conjugated 2-methoxyestrone was associated with oral microbial unweighted UniFrac as indicated by MiRKAT (P < 0.01) and PERMANOVA, where conjugated 2-methoxyestrone explained 2.67% of the oral microbial variability, but no other estrogens or estrogen metabolites were associated with any other beta diversity measures. The presence and abundance of multiple fecal and oral genera, such as fecal genera from families Lachnospiraceae and Ruminococcaceae, were associated with several estrogens and estrogen metabolites as indicated by zero-inflated negative binomial regression. Overall, we found several associations of specific estrogens and estrogen metabolites and the fecal and oral microbiome. IMPORTANCE Several epidemiologic studies have found associations of urinary estrogens and estrogen metabolites with the fecal microbiome. However, urinary estrogen concentrations are not strongly correlated with serum estrogens, a known risk factor for breast cancer. To better understand whether the human fecal and oral microbiome were associated with breast cancer risk via the regulation of estrogen metabolism, we conducted this study to investigate the associations of circulating estrogens and estrogen metabolites with the fecal and oral microbiome in postmenopausal African women. We found several associations of parent estrogens and several estrogen metabolites with the microbial communities, and multiple individual associations of estrogens and estrogen metabolites with the presence and abundance of multiple fecal and oral genera, such as fecal genera from families Lachnospiraceae and Ruminococcaceae, which have estrogen metabolizing properties. Future large, longitudinal studies to investigate the dynamic changes of the fecal and oral microbiome and estrogen relationship are needed.


Subject(s)
Breast Neoplasms , Lactobacillales , Microbiota , Female , Humans , Estrogens/urine , Postmenopause/physiology , RNA, Ribosomal, 16S/genetics , Ghana/epidemiology , Breast Neoplasms/epidemiology , Breast Neoplasms/urine , Lactobacillales/metabolism
19.
Article in English | MEDLINE | ID: mdl-37350888

ABSTRACT

Motivation: Epidemiological studies face two important challenges: the need to ingest ever more complex data types, and mounting concerns about participant privacy and data governance. These two challenges are compounded by the expectation that data infrastructure will eventually need to facilitate cross-registration of participants by multiple epidemiological studies. Implementation: The portable web-service epiDonate was developed using the serverless model known as FaaS (Function-as-a-Service). The reference implementation uses nodejs. The implementation relies on a simple tokenization scheme, mediated by a public API, that a) distinguishes admin from participant roles, with b) extensible permission configuration operating a read/write structure. General Features: The critical design feature of epiDonate is the absence of business logic on the server-side (the web service). The simplicity removes the need to customize virtual machines and enables ecosystems of multiple web Applications backed by one or more data donation deployments. Availability: https://episphere.github.io/donate.

20.
NPJ Breast Cancer ; 9(1): 37, 2023 May 12.
Article in English | MEDLINE | ID: mdl-37173335

ABSTRACT

We assessed the PREDICT v 2.2 for prognosis of breast cancer patients with pathogenic germline BRCA1 and BRCA2 variants, using follow-up data from 5453 BRCA1/2 carriers from the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA) and the Breast Cancer Association Consortium (BCAC). PREDICT for estrogen receptor (ER)-negative breast cancer had modest discrimination for BRCA1 carrier patients overall (Gönen & Heller unbiased concordance 0.65 in CIMBA, 0.64 in BCAC), but it distinguished clearly the high-mortality group from lower risk categories. In an analysis of low to high risk categories by PREDICT score percentiles, the observed mortality was consistently lower than the expected mortality, but the confidence intervals always included the calibration slope. Altogether, our results encourage the use of the PREDICT ER-negative model in management of breast cancer patients with germline BRCA1 variants. For the PREDICT ER-positive model, the discrimination was slightly lower in BRCA2 variant carriers (concordance 0.60 in CIMBA, 0.65 in BCAC). Especially, inclusion of the tumor grade distorted the prognostic estimates. The breast cancer mortality of BRCA2 carriers was underestimated at the low end of the PREDICT score distribution, whereas at the high end, the mortality was overestimated. These data suggest that BRCA2 status should also be taken into consideration with tumor characteristics, when estimating the prognosis of ER-positive breast cancer patients.

SELECTION OF CITATIONS
SEARCH DETAIL
...