RESUMO
Limited estimates exist on risk factors for epithelial ovarian cancer (EOC) in Asian, Hispanic, and Native Hawaiian/Pacific Islander women. Participants in this study included 1734 Asian (n = 785 case and 949 control participants), 266 Native Hawaiian/Pacific Islander (n = 99 case and 167 control participants), 1149 Hispanic (n = 505 case and 644 control participants), and 24 189 White (n = 9981 case and 14 208 control participants) from 11 studies in the Ovarian Cancer Association Consortium. Logistic regression models estimated odds ratios (ORs) and 95% CIs for risk associations by race and ethnicity. Heterogeneity in EOC risk associations by race and ethnicity (P ≤ .02) was observed for oral contraceptive (OC) use, parity, tubal ligation, and smoking. We observed inverse associations with EOC risk for OC use and parity across all groups; associations were strongest in Native Hawaiian/Pacific Islander and Asian women. The inverse association for tubal ligation with risk was most pronounced for Native Hawaiian/Pacific Islander participants (odds ratio (OR) = 0.25; 95% CI, 0.13-0.48) compared with Asian and White participants (OR = 0.68 [95% CI, 0.51-0.90] and OR = 0.78 [95% CI, 0.73-0.85], respectively). Differences in EOC risk factor associations were observed across racial and ethnic groups, which could be due, in part, to varying prevalence of EOC histotypes. Inclusion of greater diversity in future studies is essential to inform prevention strategies. This article is part of a Special Collection on Gynecological Cancers.
Assuntos
Carcinoma Epitelial do Ovário , Neoplasias Ovarianas , Adulto , Idoso , Feminino , Humanos , Pessoa de Meia-Idade , Asiático , Carcinoma Epitelial do Ovário/etnologia , Carcinoma Epitelial do Ovário/epidemiologia , Estudos de Casos e Controles , Anticoncepcionais Orais/efeitos adversos , Etnicidade , Hispânico ou Latino , Modelos Logísticos , Havaiano Nativo ou Outro Ilhéu do Pacífico , Razão de Chances , Neoplasias Ovarianas/etnologia , Neoplasias Ovarianas/epidemiologia , Paridade , Fatores de Risco , Fumar/etnologia , Fumar/epidemiologia , Esterilização Tubária/estatística & dados numéricos , Estados Unidos/epidemiologia , BrancosRESUMO
BACKGROUND: Breast density is strongly associated with breast cancer risk. Fully automated quantitative density assessment methods have recently been developed that could facilitate large-scale studies, although data on associations with long-term breast cancer risk are limited. We examined LIBRA assessments and breast cancer risk and compared results to prior assessments using Cumulus, an established computer-assisted method requiring manual thresholding. METHODS: We conducted a cohort study among 21,150 non-Hispanic white female participants of the Research Program in Genes, Environment and Health of Kaiser Permanente Northern California who were 40-74 years at enrollment, followed for up to 10 years, and had archived processed screening mammograms acquired on Hologic or General Electric full-field digital mammography (FFDM) machines and prior Cumulus density assessments available for analysis. Dense area (DA), non-dense area (NDA), and percent density (PD) were assessed using LIBRA software. Cox regression was used to estimate hazard ratios (HRs) for breast cancer associated with DA, NDA and PD modeled continuously in standard deviation (SD) increments, adjusting for age, mammogram year, body mass index, parity, first-degree family history of breast cancer, and menopausal hormone use. We also examined differences by machine type and breast view. RESULTS: The adjusted HRs for breast cancer associated with each SD increment of DA, NDA and PD were 1.36 (95% confidence interval, 1.18-1.57), 0.85 (0.77-0.93) and 1.44 (1.26-1.66) for LIBRA and 1.44 (1.33-1.55), 0.81 (0.74-0.89) and 1.54 (1.34-1.77) for Cumulus, respectively. LIBRA results were generally similar by machine type and breast view, although associations were strongest for Hologic machines and mediolateral oblique views. Results were also similar during the first 2 years, 2-5 years and 5-10 years after the baseline mammogram. CONCLUSION: Associations with breast cancer risk were generally similar for LIBRA and Cumulus density measures and were sustained for up to 10 years. These findings support the suitability of fully automated LIBRA assessments on processed FFDM images for large-scale research on breast density and cancer risk.
Assuntos
Neoplasias da Mama , Feminino , Humanos , Neoplasias da Mama/diagnóstico por imagem , Neoplasias da Mama/epidemiologia , Densidade da Mama , Estudos de Coortes , Brancos , Mama/diagnóstico por imagem , Mamografia/métodos , Fatores de Risco , Estudos de Casos e ControlesRESUMO
Previous studies using different exposure methods to assess air pollution and breast cancer risk among primarily whites have been inconclusive. Air pollutant exposures of particulate matter and oxides of nitrogen were estimated by kriging (NOx , NO2 , PM10 , PM2.5 ), land use regression (LUR, NOx , NO2 ) and California Line Source Dispersion model (CALINE4, NOx , PM2.5 ) for 57,589 females from the Multiethnic Cohort, residing largely in Los Angeles County from recruitment (1993-1996) through 2010. Cox proportional hazards models were used to examine the associations between time-varying air pollution and breast cancer incidence adjusting for confounding factors. Stratified analyses were conducted by race/ethnicity and distance to major roads. Among all women, breast cancer risk was positively but not significantly associated with NOx (per 50 parts per billion [ppb]) and NO2 (per 20 ppb) determined by kriging and LUR and with PM2.5 and PM10 (per 10 µg/m3 ) determined by kriging. However, among women who lived within 500 m of major roads, significantly increased risks were observed with NOx (hazard ratio [HR] = 1.35, 95% confidence interval [95% CI]: 1.02-1.79), NO2 (HR = 1.44, 95% CI: 1.04-1.99), PM10 (HR = 1.29, 95% CI: 1.07-1.55) and PM2.5 (HR = 1.85, 95% CI: 1.15-2.99) determined by kriging and NOx (HR = 1.21, 95% CI:1.01-1.45) and NO2 (HR = 1.26, 95% CI: 1.00-1.59) determined by LUR. No overall associations were observed with exposures assessed by CALINE4. Subgroup analyses suggested stronger associations of NOx and NO2 among African Americans and Japanese Americans. Further studies of multiethnic populations to confirm the effects of air pollution, particularly near-roadway exposures, on the risk of breast cancer is warranted.
Assuntos
Poluentes Atmosféricos/efeitos adversos , Poluição do Ar/efeitos adversos , Neoplasias da Mama/epidemiologia , Material Particulado/efeitos adversos , Negro ou Afro-Americano/estatística & dados numéricos , Idoso , Poluentes Atmosféricos/análise , Poluição do Ar/análise , Asiático/estatística & dados numéricos , Neoplasias da Mama/etiologia , California/epidemiologia , Estudos de Coortes , Feminino , Seguimentos , Humanos , Incidência , Pessoa de Meia-Idade , Material Particulado/análise , Estudos Prospectivos , Fatores de Risco , Fatores de TempoRESUMO
Women of African ancestry have lower incidence of epithelial ovarian cancer (EOC) yet worse survival compared to women of European ancestry. We conducted a genome-wide association study in African ancestry women with 755 EOC cases, including 537 high-grade serous ovarian carcinomas (HGSOC) and 1,235 controls. We identified four novel loci with suggestive evidence of association with EOC (p < 1 × 10-6 ), including rs4525119 (intronic to AKR1C3), rs7643459 (intronic to LOC101927394), rs4286604 (12 kb 3' of UGT2A2) and rs142091544 (5 kb 5' of WWC1). For HGSOC, we identified six loci with suggestive evidence of association including rs37792 (132 kb 5' of follistatin [FST]), rs57403204 (81 kb 3' of MAGEC1), rs79079890 (LOC105376360 intronic), rs66459581 (5 kb 5' of PRPSAP1), rs116046250 (GABRG3 intronic) and rs192876988 (32 kb 3' of GK2). Among the identified variants, two are near genes known to regulate hormones and diseases of the ovary (AKR1C3 and FST), and two are linked to cancer (AKR1C3 and MAGEC1). In follow-up studies of the 10 identified variants, the GK2 region SNP, rs192876988, showed an inverse association with EOC in European ancestry women (p = 0.002), increased risk of ER positive breast cancer in African ancestry women (p = 0.027) and decreased expression of GK2 in HGSOC tissue from African ancestry women (p = 0.004). A European ancestry-derived polygenic risk score showed positive associations with EOC and HGSOC in women of African ancestry suggesting shared genetic architecture. Our investigation presents evidence of variants for EOC shared among European and African ancestry women and identifies novel EOC risk loci in women of African ancestry.
Assuntos
População Negra/genética , Negro ou Afro-Americano/genética , Neoplasias da Mama/genética , Carcinoma Epitelial do Ovário/genética , População Branca/genética , Membro C3 da Família 1 de alfa-Ceto Redutase/genética , Antígenos de Neoplasias/genética , Neoplasias da Mama/epidemiologia , Carcinoma Epitelial do Ovário/epidemiologia , Feminino , Folistatina/genética , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , Humanos , Proteínas de Neoplasias/genética , Polimorfismo de Nucleotídeo Único/genética , Estados Unidos/epidemiologiaRESUMO
Genome-wide sequencing enables evaluation of associations between traits and combinations of variants in genes and pathways. But such evaluation requires multi-locus association tests with good power, regardless of the variant and trait characteristics. And since analyzing families may yield more power than analyzing unrelated individuals, we need multi-locus tests applicable to both related and unrelated individuals. Here we describe such tests, and we introduce SKAT-X, a new test statistic that uses genome-wide data obtained from related or unrelated subjects to optimize power for the specific data at hand. Simulations show that: a) SKAT-X performs well regardless of variant and trait characteristics; and b) for binary traits, analyzing affected relatives brings more power than analyzing unrelated individuals, consistent with previous findings for single-locus tests. We illustrate the methods by application to rare unclassified missense variants in the tumor suppressor gene BRCA2, as applied to combined data from prostate cancer families and unrelated prostate cancer cases and controls in the Multi-ethnic Cohort (MEC). The methods can be implemented using open-source code for public use as the R-package GATARS (Genetic Association Tests for Arbitrarily Related Subjects)
Assuntos
Estudos de Associação Genética/estatística & dados numéricos , Testes Genéticos , Software , Simulação por Computador , Variação Genética/genética , Estudo de Associação Genômica Ampla/estatística & dados numéricos , Humanos , Modelos Genéticos , Fenótipo , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
BACKGROUND: Independent validation is essential to justify use of models of breast cancer risk prediction and inform decisions about prevention options and screening. Few independent validations had been done using cohorts for common breast cancer risk prediction models, and those that have been done had small sample sizes and short follow-up periods, and used earlier versions of the prediction tools. We aimed to validate the relative performance of four commonly used models of breast cancer risk and assess the effect of limited data input on each one's performance. METHODS: In this validation study, we used the Breast Cancer Prospective Family Study Cohort (ProF-SC), which includes 18â856 women from Australia, Canada, and the USA who did not have breast cancer at recruitment, between March 17, 1992, and June 29, 2011. We selected women from the cohort who were 20-70 years old and had no previous history of bilateral prophylactic mastectomy or ovarian cancer, at least 2 months of follow-up data, and information available about family history of breast cancer. We used this selected cohort to calculate 10-year risk scores and compare four models of breast cancer risk prediction: the Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm model (BOADICEA), BRCAPRO, the Breast Cancer Risk Assessment Tool (BCRAT), and the International Breast Cancer Intervention Study model (IBIS). We compared model calibration based on the ratio of the expected number of breast cancer cases to the observed number of breast cancer cases in the cohort, and on the basis of their discriminatory ability to separate those who will and will not have breast cancer diagnosed within 10 years as measured with the concordance statistic (C-statistic). We did subgroup analyses to compare the performance of the models at 10 years in BRCA1 or BRCA2 mutation carriers (ie, BRCA-positive women), tested non-carriers and untested participants (ie, BRCA-negative women), and participants younger than 50 years at recruitment. We also assessed the effect that limited data input (eg, restriction of the amount of family history and non-genetic information included) had on the models' performance. FINDINGS: After median follow-up of 11·1 years (IQR 6·0-14·4), 619 (4%) of 15â732 women selected from the ProF-SC cohort study were prospectively diagnosed with breast cancer after recruitment, of whom 519 (84%) had histologically confirmed disease. BOADICEA and IBIS were well calibrated in the overall validation cohort, whereas BRCAPRO and BCRAT underpredicted risk (ratio of expected cases to observed cases 1·05 [95% CI 0·97-1·14] for BOADICEA, 1·03 [0·96-1·12] for IBIS, 0·59 [0·55-0·64] for BRCAPRO, and 0·79 [0·73-0·85] for BRCAT). The estimated C-statistics for the complete validation cohort were 0·70 (95% CI 0·68-0·72) for BOADICEA, 0·71 (0·69-0·73) for IBIS, 0·68 (0·65-0·70) for BRCAPRO, and 0·60 (0·58-0·62) for BCRAT. In subgroup analyses by BRCA mutation status, the ratio of expected to observed cases for BRCA-negative women was 1·02 (95% CI 0·93-1·12) for BOADICEA, 1·00 (0·92-1·10) for IBIS, 0·53 (0·49-0·58) for BRCAPRO, and 0·97 (0·89-1·06) for BCRAT. For BRCA-positive participants, BOADICEA and IBIS were well calibrated, but BRCAPRO underpredicted risk (ratio of expected to observed cases 1·17 [95% CI 0·99-1·38] for BOADICEA, 1·14 [0·96-1·35] for IBIS, and 0·80 [0·68-0·95] for BRCAPRO). We noted similar patterns of calibration for women younger than 50 years at recruitment. Finally, BOADICEA and IBIS predictive scores were not appreciably affected by limiting input data to family history for first-degree and second-degree relatives. INTERPRETATION: Our results suggest that models that include multigenerational family history, such as BOADICEA and IBIS, have better ability to predict breast cancer risk, even for women at average or below-average risk of breast cancer. Although BOADICEA and IBIS performed similarly, further improvements in the accuracy of predictions could be possible with hybrid models that incorporate the polygenic risk component of BOADICEA and the non-family-history risk factors included in IBIS. FUNDING: US National Institutes of Health, National Cancer Institute, Breast Cancer Research Foundation, Australian National Health and Medical Research Council, Victorian Health Promotion Foundation, Victorian Breast Cancer Research Consortium, Cancer Australia, National Breast Cancer Foundation, Queensland Cancer Fund, Cancer Councils of New South Wales, Victoria, Tasmania, and South Australia, and Cancer Foundation of Western Australia.
Assuntos
Neoplasias da Mama/epidemiologia , Modelos Estatísticos , Adulto , Idoso , Neoplasias da Mama/diagnóstico , Calibragem , Feminino , Seguimentos , Humanos , Pessoa de Meia-Idade , Valor Preditivo dos Testes , Estudos Prospectivos , Medição de Risco , Fatores de Risco , Adulto JovemRESUMO
As a follow-up to genome-wide association analysis of common variants associated with ovarian carcinoma (cancer), our study considers seven well-known ovarian cancer risk factors and their interactions with 28 genome-wide significant common genetic variants. The interaction analyses were based on data from 9971 ovarian cancer cases and 15,566 controls from 17 case-control studies. Likelihood ratio and Wald tests for multiplicative interaction and for relative excess risk due to additive interaction were used. The top multiplicative interaction was noted between oral contraceptive pill (OCP) use (ever vs. never) and rs13255292 (p value = 3.48 × 10-4 ). Among women with the TT genotype for this variant, the odds ratio for OCP use was 0.53 (95% CI = 0.46-0.60) compared to 0.71 (95%CI = 0.66-0.77) for women with the CC genotype. When stratified by duration of OCP use, women with 1-5 years of OCP use exhibited differential protective benefit across genotypes. However, no interaction on either the multiplicative or additive scale was found to be statistically significant after multiple testing correction. The results suggest that OCP use may offer increased benefit for women who are carriers of the T allele in rs13255292. On the other hand, for women carrying the C allele in this variant, longer (5+ years) use of OCP may reduce the impact of carrying the risk allele of this SNP. Replication of this finding is needed. The study presents a comprehensive analytic framework for conducting gene-environment analysis in ovarian cancer.
Assuntos
Exposição Ambiental/efeitos adversos , Interação Gene-Ambiente , Predisposição Genética para Doença/genética , Neoplasias Ovarianas/etiologia , Neoplasias Ovarianas/genética , Estudos de Casos e Controles , Anticoncepcionais Orais Hormonais , Meio Ambiente , Feminino , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Polimorfismo de Nucleotídeo Único/genética , RiscoRESUMO
Breast density is a modifiable factor that is strongly associated with breast cancer risk. We sought to understand the influence of newer technologies of full-field digital mammography (FFDM) on breast density research and to determine whether results are comparable across studies using FFDM and previous studies using traditional film-screen mammography. We studied 24,840 screening-age (40-74 years) non-Hispanic white women who were participants in the Research Program on Genes, Environment and Health of Kaiser Permanente Northern California and underwent screening mammography with either Hologic (Hologic, Inc., Marlborough, Massachusetts) or General Electric (General Electric Company, Boston, Massachusetts) FFDM machines between 2003 and 2013. We estimated the associations of parity, age at first birth, age at menarche, and menopausal status with percent density and dense area as measured by a single radiological technologist using Cumulus software (Canto Software, Inc., San Francisco, California). We found that associations between reproductive factors and mammographic density measured using processed FFDM images were generally similar in magnitude and direction to those from prior studies using film mammography. Estimated associations for both types of FFDM machines were in the same direction. There was some evidence of heterogeneity in the magnitude of the effect sizes by machine type, which we accounted for using random-effects meta-analysis when combining results. Our findings demonstrate the robustness of quantitative mammographic density measurements across FFDM and film mammography platforms.
Assuntos
Densidade da Mama/fisiologia , Neoplasias da Mama/epidemiologia , Mamografia/métodos , História Reprodutiva , Adulto , Idoso , Neoplasias da Mama/diagnóstico por imagem , Feminino , Humanos , Menarca/fisiologia , Menopausa/fisiologia , Pessoa de Meia-Idade , Paridade , População BrancaRESUMO
The vast majority of coding variants are rare, and assessment of the contribution of rare variants to complex traits is hampered by low statistical power and limited functional data. Improved methods for predicting the pathogenicity of rare coding variants are needed to facilitate the discovery of disease variants from exome sequencing studies. We developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP, SiPhy, phyloP, and phastCons. REVEL was trained with recently discovered pathogenic and rare neutral missense variants, excluding those previously used to train its constituent tools. When applied to two independent test sets, REVEL had the best overall performance (p < 10-12) as compared to any individual tool and seven ensemble methods: MetaSVM, MetaLR, KGGSeq, Condel, CADD, DANN, and Eigen. Importantly, REVEL also had the best performance for distinguishing pathogenic from rare neutral variants with allele frequencies <0.5%. The area under the receiver operating characteristic curve (AUC) for REVEL was 0.046-0.182 higher in an independent test set of 935 recent SwissVar disease variants and 123,935 putatively neutral exome sequencing variants and 0.027-0.143 higher in an independent test set of 1,953 pathogenic and 2,406 benign variants recently reported in ClinVar than the AUCs for other ensemble methods. We provide pre-computed REVEL scores for all possible human missense variants to facilitate the identification of pathogenic variants in the sea of rare variants discovered as sequencing studies expand in scale.
Assuntos
Doença/genética , Mutação de Sentido Incorreto/genética , Software , Área Sob a Curva , Análise Mutacional de DNA , Exoma/genética , Frequência do Gene , Humanos , Curva ROCRESUMO
PURPOSE: Racial/ethnic minorities are often assumed to be less willing to participate in and provide biospecimens for biomedical research. We examined racial/ethnic differences in enrollment of women with breast cancer (probands) and their first-degree relatives in the Northern California site of the Breast Cancer Family Registry from 1996 to 2011. METHODS: We evaluated participation in several study components, including biospecimen collection, for probands and relatives by race/ethnicity, cancer history, and other factors. RESULTS: Of 4,780 eligible probands, 76% enrolled in the family registry by completing the family history and risk factor questionnaires and 68% also provided a blood or mouthwash sample. Enrollment was highest (81%) for non-Hispanic whites (NHWs) and intermediate (73-76%) for Hispanics, African Americans, and all Asian American subgroups, except Filipina women (66%). Of 4,279 eligible relatives, 77% enrolled in the family registry, and 65% also provided a biospecimen sample. Enrollment was highest for NHWs (87%) and lowest for Chinese (68%) and Filipinas (67%). Among those enrolled, biospecimen collection rates were similar for NHW, Hispanic, and African American women, both for probands (92-95%) and relatives (82-87%), but lower for some Asian-American subgroups (probands: 72-88%; relatives: 71-88%), foreign-born Asian Americans, and probands those who were more recent immigrants or had low English language proficiency. CONCLUSIONS: These results show that racial/ethnic minority populations are willing to provide biospecimen samples for research, although some Asian American subgroups in particular may need more directed recruitment methods. To address long-standing and well-documented cancer health disparities, minority populations need equal opportunities to contribute to biomedical research.
Assuntos
Neoplasias da Mama/epidemiologia , Grupos Raciais/estatística & dados numéricos , Adolescente , Adulto , Negro ou Afro-Americano/estatística & dados numéricos , Asiático/estatística & dados numéricos , Neoplasias da Mama/etnologia , California/epidemiologia , Feminino , Hispânico ou Latino/estatística & dados numéricos , Humanos , Pessoa de Meia-Idade , Sistema de Registros , Fatores de Risco , População Branca/estatística & dados numéricos , Adulto JovemRESUMO
Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results.
Assuntos
Algoritmos , Estudos de Associação Genética/métodos , Variação Genética , Simulação por Computador , Humanos , Modelos Genéticos , Tamanho da Amostra , Estatísticas não ParamétricasRESUMO
BACKGROUND: The immune system has been implicated in the pathophysiology of cutaneous squamous cell carcinoma (cSCC) as evidenced by the substantially increased risk of cSCC in immunosuppressed individuals. Associations between cSCC risk and single nucleotide polymorphisms (SNPs) in the HLA region have been identified by genome-wide association studies (GWAS). The translation of the associated HLA SNPs to structural amino acids changes in HLA molecules has not been previously elucidated. METHODS: Using data from a GWAS that included 7238 cSCC cases and 56,961 controls of non-Hispanic white ancestry, we imputed classical alleles and corresponding amino acid changes in HLA genes. Logistic regression models were used to examine associations between cSCC risk and genotyped or imputed SNPs, classical HLA alleles, and amino acid changes. RESULTS: Among the genotyped SNPs, cSCC risk was associated with rs28535317 (OR = 1.20, p = 9.88 × 10- 11) corresponding to an amino-acid change from phenylalanine to leucine at codon 26 of HLA-DRB1 (OR = 1.17, p = 2.48 × 10- 10). An additional independent association was observed for a threonine to isoleucine change at codon 107 of HLA-DQA1 (OR = 1.14, p = 2.34 × 10- 9). Among the classical HLA alleles, cSCC was associated with DRB1*01 (OR = 1.18, p = 5.86 × 10- 10). Conditional analyses revealed additional independent cSCC associations with DQA1*05:01 and DQA1*05:05. Extended haplotype analysis was used to complement the imputed haplotypes, which identified three extended haplotypes in the HLA-DR and HLA-DQ regions. CONCLUSIONS: Associations with specific HLA-DR and -DQ alleles are likely to explain previously observed GWAS signals in the HLA region associated with cSCC risk.
Assuntos
Carcinoma de Células Escamosas/genética , Carcinoma de Células Escamosas/patologia , Genes MHC da Classe II , Polimorfismo de Nucleotídeo Único , Neoplasias Cutâneas/genética , Neoplasias Cutâneas/patologia , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Fatores de RiscoRESUMO
MOTIVATION: Interpreting genetic variation in noncoding regions of the genome is an important challenge for personal genome analysis. One mechanism by which noncoding single nucleotide variants (SNVs) influence downstream phenotypes is through the regulation of gene expression. Methods to predict whether or not individual SNVs are likely to regulate gene expression would aid interpretation of variants of unknown significance identified in whole-genome sequencing studies. RESULTS: We developed FIRE (Functional Inference of Regulators of Expression), a tool to score both noncoding and coding SNVs based on their potential to regulate the expression levels of nearby genes. FIRE consists of 23 random forests trained to recognize SNVs in cis-expression quantitative trait loci (cis-eQTLs) using a set of 92 genomic annotations as predictive features. FIRE scores discriminate cis-eQTL SNVs from non-eQTL SNVs in the training set with a cross-validated area under the receiver operating characteristic curve (AUC) of 0.807, and discriminate cis-eQTL SNVs shared across six populations of different ancestry from non-eQTL SNVs with an AUC of 0.939. FIRE scores are also predictive of cis-eQTL SNVs across a variety of tissue types. AVAILABILITY AND IMPLEMENTATION: FIRE scores for genome-wide SNVs in hg19/GRCh37 are available for download at https://sites.google.com/site/fireregulatoryvariation/. CONTACT: nilah@stanford.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Regulação da Expressão Gênica , Variação Genética , Software , Genômica , Humanos , Locos de Características QuantitativasRESUMO
BACKGROUND: Metabolomics is emerging as an important tool for detecting differences between diseased and non-diseased individuals. However, prospective studies are limited. METHODS: We examined the detectability, reliability, and distribution of metabolites measured in pre-diagnostic plasma samples in a pilot study of women enrolled in the Northern California site of the Breast Cancer Family Registry. The study included 45 cases diagnosed with breast cancer at least one year after the blood draw, and 45 controls. Controls were matched on age (within 5 years), family status, BRCA status, and menopausal status. Duplicate samples were included for reliability assessment. We used a liquid chromatography/gas chromatography mass spectrometer platform to measure metabolites. We calculated intraclass correlations (ICCs) among duplicate samples, and coefficients of variation (CVs) across metabolites. RESULTS: Of the 661 named metabolites detected, 338 (51%) were found in all samples, and 490 (74%) in more than 80% of samples. The median ICC between duplicates was 0.96 (25th - 75th percentile: 0.82-0.99). We observed a greater than 20% case-control difference in 24 metabolites (p < 0.05), although these associations were not significant after adjusting for multiple comparisons. CONCLUSIONS: These data show that assays are reproducible for many metabolites, there is a minimal laboratory variation for the same sample, and a large between-person variation. Despite small sample size, differences between cases and controls in some metabolites suggest that a well-powered large-scale study is likely to detect biological meaningful differences to provide a better understanding of breast cancer etiology.
Assuntos
Neoplasias da Mama/metabolismo , Metabolômica/métodos , Sistema de Registros/estatística & dados numéricos , Adulto , Idoso , Neoplasias da Mama/sangue , Neoplasias da Mama/epidemiologia , Neoplasias da Mama/etiologia , California/epidemiologia , Estudos de Casos e Controles , Cromatografia Líquida/métodos , Feminino , Cromatografia Gasosa-Espectrometria de Massas/métodos , Humanos , Metaboloma , Pessoa de Meia-Idade , Projetos Piloto , Estudos Prospectivos , Reprodutibilidade dos TestesRESUMO
Rare variants (RVs) have been shown to be significant contributors to complex disease risk. By definition, these variants have very low minor allele frequencies and traditional single-marker methods for statistical analysis are underpowered for typical sequencing study sample sizes. Multimarker burden-type approaches attempt to identify aggregation of RVs across case-control status by analyzing relatively small partitions of the genome, such as genes. However, it is generally the case that the aggregative measure would be a mixture of causal and neutral variants, and these omnibus tests do not directly provide any indication of which RVs may be driving a given association. Recently, Bayesian variable selection approaches have been proposed to identify RV associations from a large set of RVs under consideration. Although these approaches have been shown to be powerful at detecting associations at the RV level, there are often computational limitations on the total quantity of RVs under consideration and compromises are necessary for large-scale application. Here, we propose a computationally efficient alternative formulation of this method using a probit regression approach specifically capable of simultaneously analyzing hundreds to thousands of RVs. We evaluate our approach to detect causal variation on simulated data and examine sensitivity and specificity in instances of high RV dimensionality as well as apply it to pathway-level RV analysis results from a prostate cancer (PC) risk case-control sequencing study. Finally, we discuss potential extensions and future directions of this work.
Assuntos
Modelos Genéticos , Teorema de Bayes , Estudos de Casos e Controles , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Sequência de DNARESUMO
BACKGROUND: Pubertal milestones, such as onset of breast development and menstruation, play an important role in breast cancer etiology. It is unclear if these milestones are different in girls with a first- or second-degree breast cancer family history (BCFH). METHODS: In the LEGACY Girls Study (n = 1040), we examined whether three mother/guardian-reported pubertal milestones (having reached Tanner Stage 2 or higher (T2+) for breast and pubic hair development, and having started menstruation) differed by BCFH. We also examined whether associations between body size and race/ethnicity and pubertal milestones were modified by BCFH. We used mother/guardian reports as the primary measure of pubertal milestones, but also conducted sensitivity analyses using clinical Tanner measurements available for a subcohort (n = 204). We analyzed cross-sectional baseline data with logistic regression models for the entire cohort, and longitudinal data with Weibull survival models for the subcohort of girls that were aged 5-7 years at baseline (n = 258). RESULTS: BCFH was modestly, but not statistically significantly, associated with Breast T2+ (odds ratio (OR) = 1.36, 95% confidence interval (CI) = 0.88-2.10), with a stronger association seen in the subcohort of girls with clinical breast Tanner staging (OR = 2.20, 95% CI = 0.91-5.32). In a longitudinal analysis of girls who were aged 5-7 years at baseline, BCFH was associated with a 50% increased rate of having early breast development (hazard ratio (HR) = 1.49, 95% CI = 1.0-2.21). This association increased to twofold in girls who were not overweight at baseline (HR = 2.04, 95% CI = 1.29-3.21). BCFH was not associated with pubic hair development and post-menarche status. The median interval between onset of breast development and menarche was longer for BCFH+ than BCFH- girls (2.3 versus 1.7 years), suggesting a slower developmental tempo for BCFH+ girls. Associations between pubertal milestones and body size and race/ethnicity were similar in girls with or without a BCFH. For example, weight was positively associated with Breast T2+ in both girls with (OR = 1.06 per 1 kg, 95% CI = 1.03-1.10) and without (OR = 1.14 per 1 kg, 95% CI = 1.04-1.24) a BCFH. CONCLUSIONS: These results suggest that BCFH may be related to earlier breast development and slower pubertal tempo independent of body size and race/ethnicity.
Assuntos
Neoplasias da Mama/epidemiologia , Neoplasias da Mama/etiologia , Puberdade , Índice de Massa Corporal , Canadá/epidemiologia , Criança , Pré-Escolar , Estudos de Coortes , Feminino , Humanos , Incidência , Estimativa de Kaplan-Meier , Menarca , Razão de Chances , Vigilância da População , Risco , Estados Unidos/epidemiologiaRESUMO
Cigarette smoking is associated with an increased risk of developing mucinous ovarian tumors but whether it is associated with ovarian cancer survival overall or for the different histotypes is unestablished. Furthermore, it is unknown whether the association between cigarette smoking and survival differs according to strata of ovarian cancer stage at diagnosis. In a large pooled analysis, we evaluated the association between various measures of cigarette smoking and survival among women with epithelial ovarian cancer. We obtained data from 19 case-control studies in the Ovarian Cancer Association Consortium (OCAC), including 9,114 women diagnosed with ovarian cancer. Cox regression models were used to estimate adjusted study-specific hazard ratios (HRs), which were combined into pooled hazard ratios (pHR) with corresponding 95% confidence intervals (CIs) under random effects models. Overall, 5,149 (57%) women died during a median follow-up period of 7.0 years. Among women diagnosed with ovarian cancer, both current (pHR = 1.17, 95% CI: 1.08-1.28) and former smokers (pHR = 1.10, 95% CI: 1.02-1.18) had worse survival compared with never smoking women. In histotype-stratified analyses, associations were observed for mucinous (current smoking: pHR = 1.91, 95% CI: 1.01-3.65) and serous histotypes (current smoking: pHR = 1.11, 95% CI: 1.00-1.23; former smoking: pHR = 1.12, 95% CI: 1.04-1.20). Further, our results suggested that current smoking has a greater impact on survival among women with localized than disseminated disease. The identification of cigarette smoking as a modifiable factor associated with survival has potential clinical importance as a focus area to improve ovarian cancer prognosis.
Assuntos
Neoplasias Epiteliais e Glandulares/mortalidade , Nicotiana/efeitos adversos , Neoplasias Ovarianas/mortalidade , Fumar/efeitos adversos , Adulto , Idoso , Idoso de 80 Anos ou mais , Carcinoma Epitelial do Ovário , Estudos de Casos e Controles , Feminino , Humanos , Pessoa de Meia-Idade , Modelos de Riscos Proporcionais , Fatores de Risco , Adulto JovemRESUMO
Epidemiological studies have reported inconsistent associations between telomere length (TL) and risk for various cancers. These inconsistencies are likely attributable, in part, to biases that arise due to post-diagnostic and post-treatment TL measurement. To avoid such biases, we used a Mendelian randomization approach and estimated associations between nine TL-associated SNPs and risk for five common cancer types (breast, lung, colorectal, ovarian and prostate cancer, including subtypes) using data on 51 725 cases and 62 035 controls. We then used an inverse-variance weighted average of the SNP-specific associations to estimate the association between a genetic score representing long TL and cancer risk. The long TL genetic score was significantly associated with increased risk of lung adenocarcinoma (P = 6.3 × 10(-15)), even after exclusion of a SNP residing in a known lung cancer susceptibility region (TERT-CLPTM1L) P = 6.6 × 10(-6)). Under Mendelian randomization assumptions, the association estimate [odds ratio (OR) = 2.78] is interpreted as the OR for lung adenocarcinoma corresponding to a 1000 bp increase in TL. The weighted TL SNP score was not associated with other cancer types or subtypes. Our finding that genetic determinants of long TL increase lung adenocarcinoma risk avoids issues with reverse causality and residual confounding that arise in observational studies of TL and disease risk. Under Mendelian randomization assumptions, our finding suggests that longer TL increases lung adenocarcinoma risk. However, caution regarding this causal interpretation is warranted in light of the potential issue of pleiotropy, and a more general interpretation is that SNPs influencing telomere biology are also implicated in lung adenocarcinoma risk.
Assuntos
Predisposição Genética para Doença , Análise da Randomização Mendeliana , Neoplasias/epidemiologia , Neoplasias/genética , Homeostase do Telômero/genética , Adulto , Idoso , Estudos de Casos e Controles , Feminino , Estudos de Associação Genética , Variação Genética , Humanos , Masculino , Pessoa de Meia-Idade , Razão de Chances , Polimorfismo de Nucleotídeo Único , RiscoRESUMO
BACKGROUND: Prostate cancer prognosis is variable, and management decisions involve balancing patients' risks of recurrence and recurrence-free death. Moreover, the roles of body mass index (BMI) and race in risk of recurrence are controversial [1,2]. To address these issues, we developed and cross-validated RAPS (Risks After Prostate Surgery), a personal prediction model for biochemical recurrence (BCR) within 10 years of radical prostatectomy (RP) that includes BMI and race as possible predictors, and recurrence-free death as a competing risk. METHODS: RAPS uses a patient's risk factors at surgery to assign him a recurrence probability based on statistical learning methods applied to a cohort of 1,276 patients undergoing RP at the University of Pennsylvania. We compared the performance of RAPS to that of an existing model with respect to calibration (by comparing observed and predicted outcomes), and discrimination (using the area under the receiver operating characteristic curve (AUC)). RESULTS: RAPS' cross-validated BCR predictions provided better calibration than those of an existing model that underestimated patients' risks. Discrimination was similar for the two models, with BCR AUCs of 0.793, 95% confidence interval (0.766-0.820) for RAPS, and 0.780 (0.745-0.815) for the existing model. RAPS' most important BCR predictors were tumor grade, preoperative prostate-specific antigen (PSA) level and BMI; race was less important [3]. RAPS' predictions can be obtained online at https://predict.shinyapps.io/raps. CONCLUSION: RAPS' cross-validated BCR predictions were better calibrated than those of an existing model, and BMI information contributed substantially to these predictions. RAPS predictions for recurrence-free death were limited by lack of co-morbidity data; however the model provides a simple framework for extension to include such data. Its use and extension should facilitate decision strategies for post-RP prostate cancer management. Prostate 77:291-298, 2017. © 2016 Wiley Periodicals, Inc.
Assuntos
Recidiva Local de Neoplasia/diagnóstico , Prostatectomia/tendências , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/cirurgia , Idoso , Estudos de Coortes , Humanos , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Recidiva Local de Neoplasia/sangue , Valor Preditivo dos Testes , Antígeno Prostático Específico/sangue , Neoplasias da Próstata/sangue , Curva ROCRESUMO
Purpose To compare three metrics of breast density on full-field digital mammographic (FFDM) images as predictors of future breast cancer risk. Materials and Methods This institutional review board-approved study included 125 women with invasive breast cancer and 274 age- and race-matched control subjects who underwent screening FFDM during 2004-2013 and provided informed consent. The percentage of density and dense area were assessed semiautomatically with software (Cumulus 4.0; University of Toronto, Toronto, Canada), and volumetric percentage of density and dense volume were assessed automatically with software (Volpara; Volpara Solutions, Wellington, New Zealand). Clinical Breast Imaging Reporting and Data System (BI-RADS) classifications of breast density were extracted from mammography reports. Odds ratios and 95% confidence intervals (CIs) were estimated by using conditional logistic regression stratified according to age and race and adjusted for body mass index, parity, and menopausal status, and the area under the receiver operating characteristic curve (AUC) was computed. Results The adjusted odds ratios and 95% CIs for each standard deviation increment of the percentage of density, dense area, volumetric percentage of density, and dense volume were 1.61 (95% CI: 1.19, 2.19), 1.49 (95% CI: 1.15, 1.92), 1.54 (95% CI: 1.12, 2.10), and 1.41 (95% CI: 1.11, 1.80), respectively. Odds ratios for women with extremely dense breasts compared with those with scattered areas of fibroglandular density were 2.06 (95% CI: 0.85, 4.97) and 2.05 (95% CI: 0.90, 4.64) for BI-RADS and Volpara density classifications, respectively. Clinical BI-RADS was more accurate (AUC, 0.68; 95% CI: 0.63, 0.74) than Volpara (AUC, 0.64; 95% CI: 0.58, 0.70) and continuous measures of percentage of density (AUC, 0.66; 95% CI: 0.60, 0.72), dense area (AUC, 0.66; 95% CI: 0.60, 0.72), volumetric percentage of density (AUC, 0.64; 95% CI: 0.58, 0.70), and density volume (AUC, 0.65; 95% CI: 0.59, 0.71), although the AUC differences were not statistically significant. Conclusion Mammographic density on FFDM images was positively associated with breast cancer risk by using the computer assisted methods and BI-RADS. BI-RADS classification was as accurate as computer-assisted methods for discrimination of patients from control subjects. © RSNA, 2016.