RESUMEN
The Human Genome Variation Society (HGVS) nomenclature guidelines encourage the accurate and standard description of DNA, RNA, and protein sequence variants in public variant databases and the scientific literature. Inconsistent application of the HGVS guidelines can lead to misinterpretation of variants in clinical settings. Reliable software tools are essential to ensure consistent application of the HGVS guidelines when reporting and interpreting variants. We present the hgvs Python package, a comprehensive tool for manipulating sequence variants according to the HGVS nomenclature guidelines. Distinguishing features of the hgvs package include: (1) parsing, formatting, validating, and normalizing variants on genome, transcript, and protein sequences; (2) projecting variants between aligned sequences, including those with gapped alignments; (3) flexible installation using remote or local data (fully local installations eliminate network dependencies); (4) extensive automated tests; and (5) open source development by a community from eight organizations worldwide. This report summarizes recent and significant updates to the hgvs package since its original release in 2014, and presents results of extensive validation using clinical relevant variants from ClinVar and HGMD.
Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Variación Genética , Genoma Humano , Guías como Asunto , Humanos , Sociedades Médicas , Programas InformáticosRESUMEN
Analyses of genome-wide association study (GWAS) data have revealed that detectable genetic mosaicism involving large (>2 Mb) structural autosomal alterations occurs in a fraction of individuals. We present results for a set of 24,849 genotyped individuals (total GWAS set II [TGSII]) in whom 341 large autosomal abnormalities were observed in 168 (0.68%) individuals. Merging data from the new TGSII set with data from two prior reports (the Gene-Environment Association Studies and the total GWAS set I) generated a large dataset of 127,179 individuals; we then conducted a meta-analysis to investigate the patterns of detectable autosomal mosaicism (n = 1,315 events in 925 [0.73%] individuals). Restricting to events >2 Mb in size, we observed an increase in event frequency as event size decreased. The combined results underscore that the rate of detectable mosaicism increases with age (p value = 5.5 × 10(-31)) and is higher in men (p value = 0.002) but lower in participants of African ancestry (p value = 0.003). In a subset of 47 individuals from whom serial samples were collected up to 6 years apart, complex changes were noted over time and showed an overall increase in the proportion of mosaic cells as age increased. Our large combined sample allowed for a unique ability to characterize detectable genetic mosaicism involving large structural events and strengthens the emerging evidence of non-random erosion of the genome in the aging population.
Asunto(s)
Aberraciones Cromosómicas , Genoma Humano , Mosaicismo , Anciano , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Neoplasias/genéticaRESUMEN
There is evidence across several species for genetic control of phenotypic variation of complex traits, such that the variance among phenotypes is genotype dependent. Understanding genetic control of variability is important in evolutionary biology, agricultural selection programmes and human medicine, yet for complex traits, no individual genetic variants associated with variance, as opposed to the mean, have been identified. Here we perform a meta-analysis of genome-wide association studies of phenotypic variation using â¼170,000 samples on height and body mass index (BMI) in human populations. We report evidence that the single nucleotide polymorphism (SNP) rs7202116 at the FTO gene locus, which is known to be associated with obesity (as measured by mean BMI for each rs7202116 genotype), is also associated with phenotypic variability. We show that the results are not due to scale effects or other artefacts, and find no other experiment-wise significant evidence for effects on variability, either at loci other than FTO for BMI or at any locus for height. The difference in variance for BMI among individuals with opposite homozygous genotypes at the FTO locus is approximately 7%, corresponding to a difference of â¼0.5 kilograms in the standard deviation of weight. Our results indicate that genetic variants can be discovered that are associated with variability, and that between-person variability in obesity can partly be explained by the genotype at the FTO locus. The results are consistent with reported FTO by environment interactions for BMI, possibly mediated by DNA methylation. Our BMI results for other SNPs and our height results for all SNPs suggest that most genetic variants, including those that influence mean height or mean BMI, are not associated with phenotypic variance, or that their effects on variability are too small to detect even with samples sizes greater than 100,000.
Asunto(s)
Índice de Masa Corporal , Variación Genética , Fenotipo , Proteínas/genética , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato , Estatura/genética , Proteínas Co-Represoras , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Proteínas del Tejido Nervioso/genética , Polimorfismo de Nucleótido Simple , Proteínas Represoras/genéticaRESUMEN
Recently, common variants on human chromosome 8q24 were found to be associated with prostate cancer risk. While conducting a genome-wide association study in the Cancer Genetic Markers of Susceptibility project with 550,000 SNPs in a nested case-control study (1,172 cases and 1,157 controls of European origin), we identified a new association at 8q24 with an independent effect on prostate cancer susceptibility. The most significant signal is 70 kb centromeric to the previously reported SNP, rs1447295, but shows little evidence of linkage disequilibrium with it. A combined analysis with four additional studies (total: 4,296 cases and 4,299 controls) confirms association with prostate cancer for rs6983267 in the centromeric locus (P = 9.42 x 10(-13); heterozygote odds ratio (OR): 1.26, 95% confidence interval (c.i.): 1.13-1.41; homozygote OR: 1.58, 95% c.i.: 1.40-1.78). Each SNP remained significant in a joint analysis after adjusting for the other (rs1447295 P = 1.41 x 10(-11); rs6983267 P = 6.62 x 10(-10)). These observations, combined with compelling evidence for a recombination hotspot between the two markers, indicate the presence of at least two independent loci within 8q24 that contribute to prostate cancer in men of European ancestry. We estimate that the population attributable risk of the new locus, marked by rs6983267, is higher than the locus marked by rs1447295 (21% versus 9%).
Asunto(s)
Cromosomas Humanos Par 8/genética , Predisposición Genética a la Enfermedad/genética , Variación Genética , Neoplasias de la Próstata/genética , Negro o Afroamericano , Secuencia de Bases , Etnicidad/genética , Frecuencia de los Genes , Genómica/métodos , Genotipo , Haplotipos/genética , Humanos , Masculino , Datos de Secuencia Molecular , Oportunidad Relativa , Polimorfismo de Nucleótido Simple , Factores de Riesgo , Estados Unidos , Población BlancaRESUMEN
We conducted a genome-wide association study (GWAS) of breast cancer by genotyping 528,173 SNPs in 1,145 postmenopausal women of European ancestry with invasive breast cancer and 1,142 controls. We identified four SNPs in intron 2 of FGFR2 (which encodes a receptor tyrosine kinase and is amplified or overexpressed in some breast cancers) that were highly associated with breast cancer and confirmed this association in 1,776 affected individuals and 2,072 controls from three additional studies. Across the four studies, the association with all four SNPs was highly statistically significant (P(trend) for the most strongly associated SNP (rs1219648) = 1.1 x 10(-10); population attributable risk = 16%). Four SNPs at other loci most strongly associated with breast cancer in the initial GWAS were not associated in the replication studies. Our summary results from the GWAS are available online in a form that should speed the identification of additional risk loci.
Asunto(s)
Alelos , Neoplasias de la Mama/genética , Predisposición Genética a la Enfermedad , Genoma Humano , Posmenopausia , Receptor Tipo 2 de Factor de Crecimiento de Fibroblastos/genética , Anciano , Femenino , Humanos , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Factores de RiesgoRESUMEN
Genome-wide association studies (GWASs) have identified multiple common genetic variants associated with an increased risk of testicular germ cell tumors (TGCTs). A previous GWAS reported a possible TGCT susceptibility locus on chromosome 1q23 in the UCK2 gene, but failed to reach genome-wide significance following replication. We interrogated this region by conducting a meta-analysis of two independent GWASs including a total of 940 TGCT cases and 1559 controls for 122 single-nucleotide polymorphisms (SNPs) on chromosome 1q23 and followed up the most significant SNPs in an additional 2202 TGCT cases and 2386 controls from four case-control studies. We observed genome-wide significant associations for several UCK2 markers, the most significant of which was for rs3790665 (PCombined = 6.0 × 10(-9)). Additional support is provided from an independent familial study of TGCT where a significant over-transmission for rs3790665 with TGCT risk was observed (PFBAT = 2.3 × 10(-3)). Here, we provide substantial evidence for the association between UCK2 genetic variation and TGCT risk.
Asunto(s)
Cromosomas Humanos Par 1 , Sitios Genéticos , Predisposición Genética a la Enfermedad , Neoplasias de Células Germinales y Embrionarias/genética , Neoplasias Testiculares/genética , Uridina Quinasa/genética , Estudios de Casos y Controles , Genotipo , Humanos , Desequilibrio de Ligamiento , Masculino , Polimorfismo de Nucleótido Simple , Recombinación GenéticaRESUMEN
Genetic loci for body mass index (BMI) in adolescence and young adulthood, a period of high risk for weight gain, are understudied, yet may yield important insight into the etiology of obesity and early intervention. To identify novel genetic loci and examine the influence of known loci on BMI during this critical time period in late adolescence and early adulthood, we performed a two-stage meta-analysis using 14 genome-wide association studies in populations of European ancestry with data on BMI between ages 16 and 25 in up to 29 880 individuals. We identified seven independent loci (P < 5.0 × 10â»8) near FTO (P = 3.72 × 10⻲³), TMEM18 (P = 3.24 × 10⻹7), MC4R (P = 4.41 × 10⻹7), TNNI3K (P = 4.32 × 10⻹¹), SEC16B (P = 6.24 × 10â»9), GNPDA2 (P = 1.11 × 10â»8) and POMC (P = 4.94 × 10â»8) as well as a potential secondary signal at the POMC locus (rs2118404, P = 2.4 × 10â»5 after conditioning on the established single-nucleotide polymorphism at this locus) in adolescents and young adults. To evaluate the impact of the established genetic loci on BMI at these young ages, we examined differences between the effect sizes of 32 published BMI loci in European adult populations (aged 18-90) and those observed in our adolescent and young adult meta-analysis. Four loci (near PRKD1, TNNI3K, SEC16B and CADM2) had larger effects and one locus (near SH2B1) had a smaller effect on BMI during adolescence and young adulthood compared with older adults (P < 0.05). These results suggest that genetic loci for BMI can vary in their effects across the life course, underlying the importance of evaluating BMI at different ages.
Asunto(s)
Índice de Masa Corporal , Sitios Genéticos , Aumento de Peso/genética , Adolescente , Adulto , Factores de Edad , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Estudio de Asociación del Genoma Completo , Humanos , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Población Blanca/genética , Adulto JovenRESUMEN
Pooling genome-wide association studies (GWASs) increases power but also poses methodological challenges because studies are often heterogeneous. For example, combining GWASs of related but distinct traits can provide promising directions for the discovery of loci with small but common pleiotropic effects. Classical approaches for meta-analysis or pooled analysis, however, might not be suitable for such analysis because individual variants are likely to be associated with only a subset of the traits or might demonstrate effects in different directions. We propose a method that exhaustively explores subsets of studies for the presence of true association signals that are in either the same direction or possibly opposite directions. An efficient approximation is used for rapid evaluation of p values. We present two illustrative applications, one for a meta-analysis of separate case-control studies of six distinct cancers and another for pooled analysis of a case-control study of glioma, a class of brain tumors that contains heterogeneous subtypes. Both the applications and additional simulation studies demonstrate that the proposed methods offer improved power and more interpretable results when compared to traditional methods for the analysis of heterogeneous traits. The proposed framework has applications beyond genetic association studies.
Asunto(s)
Heterogeneidad Genética , Estudio de Asociación del Genoma Completo/métodos , Fenotipo , Estudios de Casos y Controles , Interpretación Estadística de Datos , Frecuencia de los Genes , Humanos , Modelos Logísticos , Modelos Genéticos , Modelos TeóricosRESUMEN
Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10(-8), and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19-1.40) and P= 7.63 × 10(-10). An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants.
Asunto(s)
Carcinoma de Células Escamosas/genética , Cromosomas Humanos Par 2/genética , Neoplasias Esofágicas/genética , Pueblo Asiatico/genética , China , Cromosomas Humanos Par 10/genética , Predisposición Genética a la Enfermedad , Variación Genética , Estudio de Asociación del Genoma Completo , Haplotipos , Humanos , Polimorfismo de Nucleótido Simple , Recombinación GenéticaRESUMEN
Renal cell carcinoma (RCC) is the most lethal urologic cancer. Only two common susceptibility loci for RCC have been confirmed to date. To identify additional RCC common susceptibility loci, we conducted an independent genome-wide association study (GWAS). We analyzed 533 191 single nucleotide polymorphisms (SNPs) for association with RCC in 894 cases and 1516 controls of European descent recruited from MD Anderson Cancer Center in the primary scan, and validated the top 500 SNPs in silico in 3772 cases and 8505 controls of European descent involved in the only published GWAS of RCC. We identified two common variants in linkage disequilibrium, rs718314 and rs1049380 (r(2) = 0.64, Dâ' = 0.84), in the inositol 1,4,5-triphosphate receptor, type 2 (ITPR2) gene on 12p11.23 as novel susceptibility loci for RCC (P = 8.89 × 10(-10) and P = 6.07 × 10(-9), respectively, in meta-analysis) with an allelic odds ratio of 1.19 [95% confidence interval (CI): 1.13-1.26] for rs718314 and 1.18 (95% CI: 1.12-1.25) for rs1049380. It has been recently identified that rs718314 in ITPR2 is associated with waist-hip ratio (WHR) phenotype. To our knowledge, this is the first genetic locus associated with both cancer risk and WHR.
Asunto(s)
Carcinoma de Células Renales/genética , Cromosomas Humanos Par 12 , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Neoplasias Renales/genética , HumanosRESUMEN
Retinol is one of the most biologically active forms of vitamin A and is hypothesized to influence a wide range of human diseases including asthma, cardiovascular disease, infectious diseases and cancer. We conducted a genome-wide association study of 5006 Caucasian individuals drawn from two cohorts of men: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention (ATBC) Study and the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. We identified two independent single-nucleotide polymorphisms associated with circulating retinol levels, which are located near the transthyretin (TTR) and retinol binding protein 4 (RBP4) genes which encode major carrier proteins of retinol: rs1667255 (P =2.30× 10(-17)) and rs10882272 (P =6.04× 10(-12)). We replicated the association with rs10882272 in RBP4 in independent samples from the Nurses' Health Study and the Invecchiare in Chianti Study (InCHIANTI) that included 3792 women and 504 men (P =9.49× 10(-5)), but found no association for retinol with rs1667255 in TTR among women, thus suggesting evidence for gender dimorphism (P-interaction=1.31× 10(-5)). Discovery of common genetic variants associated with serum retinol levels may provide further insight into the contribution of retinol and other vitamin A compounds to the development of cancer and other complex diseases.
Asunto(s)
Estudio de Asociación del Genoma Completo , Vitamina A/sangre , Anciano , Cromosomas Humanos Par 18/genética , Estudios de Cohortes , Femenino , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Reproducibilidad de los ResultadosRESUMEN
Genome-wide and candidate-gene association studies of bladder cancer have identified 10 susceptibility loci thus far. We conducted a meta-analysis of two previously published genome-wide scans (4501 cases and 6076 controls of European background) and followed up the most significant association signals [17 single nucleotide polymorphisms (SNPs) in 10 genomic regions] in 1382 cases and 2201 controls from four studies. A combined analysis adjusted for study center, age, sex, and smoking status identified a novel susceptibility locus that mapped to a region of 18q12.3, marked by rs7238033 (P = 8.7 × 10(-9); allelic odds ratio 1.20 with 95% CI: 1.13-1.28) and two highly correlated SNPs, rs10775480/rs10853535 (r(2)= 1.00; P = 8.9 × 10(-9); allelic odds ratio 1.16 with 95% CI: 1.10-1.22). The signal localizes to the solute carrier family 14 member 1 gene, SLC14A1, a urea transporter that regulates cellular osmotic pressure. In the kidney, SLC14A1 regulates urine volume and concentration whereas in erythrocytes it determines the Kidd blood groups. Our findings suggest that genetic variation in SLC14A1 could provide new etiological insights into bladder carcinogenesis.
Asunto(s)
Cromosomas Humanos Par 18/genética , Sitios Genéticos/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Proteínas de Transporte de Membrana/genética , Neoplasias de la Vejiga Urinaria/genética , Humanos , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Factores de Riesgo , Neoplasias de la Vejiga Urinaria/mortalidad , Transportadores de UreaRESUMEN
Prostate cancer (PrCa) is the most common non-skin cancer diagnosed among males in developed countries and the second leading cause of cancer mortality, yet little is known regarding its etiology and factors that influence clinical outcome. Genome-wide association studies (GWAS) of PrCa have identified at least 30 distinct loci associated with small differences in risk. We conducted a GWAS in 2782 advanced PrCa cases (Gleason grade ≥ 8 or tumor stage C/D) and 4458 controls with 571 243 single nucleotide polymorphisms (SNPs). Based on in silico replication of 4679 SNPs (Stage 1, P < 0.02) in two published GWAS with 7358 PrCa cases and 6732 controls, we identified a new susceptibility locus associated with overall PrCa risk at 2q37.3 (rs2292884, P= 4.3 × 10(-8)). We also confirmed a locus suggested by an earlier GWAS at 12q13 (rs902774, P= 8.6 × 10(-9)). The estimated per-allele odds ratios for these loci (1.14 for rs2292884 and 1.17 for rs902774) did not differ between advanced and non-advanced PrCa (case-only test for heterogeneity P= 0.72 and P= 0.61, respectively). Further studies will be needed to assess whether these or other loci are differentially associated with PrCa subtypes.
Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Neoplasias de la Próstata/genética , Estudios de Casos y Controles , Estudios de Cohortes , Humanos , MasculinoRESUMEN
Previous genome-wide association studies have identified two independent variants in HNF1B as susceptibility loci for prostate cancer risk. To fine-map common genetic variation in this region, we genotyped 79 single nucleotide polymorphisms (SNPs) in the 17q12 region harboring HNF1B in 10 272 prostate cancer cases and 9123 controls of European ancestry from 10 case-control studies as part of the Cancer Genetic Markers of Susceptibility (CGEMS) initiative. Ten SNPs were significantly related to prostate cancer risk at a genome-wide significance level of P < 5 × 10(-8) with the most significant association with rs4430796 (P = 1.62 × 10(-24)). However, risk within this first locus was not entirely explained by rs4430796. Although modestly correlated (r(2)= 0.64), rs7405696 was also associated with risk (P = 9.35 × 10(-23)) even after adjustment for rs4430769 (P = 0.007). As expected, rs11649743 was related to prostate cancer risk (P = 3.54 × 10(-8)); however, the association within this second locus was stronger for rs4794758 (P = 4.95 × 10(-10)), which explained all of the risk observed with rs11649743 when both SNPs were included in the same model (P = 0.32 for rs11649743; P = 0.002 for rs4794758). Sequential conditional analyses indicated that five SNPs (rs4430796, rs7405696, rs4794758, rs1016990 and rs3094509) together comprise the best model for risk in this region. This study demonstrates a complex relationship between variants in the HNF1B region and prostate cancer risk. Further studies are needed to investigate the biological basis of the association of variants in 17q12 with prostate cancer.
Asunto(s)
Sitios Genéticos/genética , Predisposición Genética a la Enfermedad , Factor Nuclear 1-beta del Hepatocito/genética , Mapeo Físico de Cromosoma/métodos , Neoplasias de la Próstata/genética , Alelos , Genoma Humano/genética , Humanos , Masculino , Modelos Genéticos , Polimorfismo de Nucleótido Simple/genética , Análisis de Regresión , Factores de RiesgoRESUMEN
Genome-wide association studies have identified prostate cancer susceptibility alleles on chromosome 11q13. As part of the Cancer Genetic Markers of Susceptibility (CGEMS) Initiative, the region flanking the most significant marker, rs10896449, was fine mapped in 10 272 cases and 9123 controls of European origin (10 studies) using 120 common single nucleotide polymorphisms (SNPs) selected by a two-staged tagging strategy using HapMap SNPs. Single-locus analysis identified 18 SNPs below genome-wide significance (P< 10(-8)) with rs10896449 the most significant (P= 7.94 × 10(-19)). Multi-locus models that included significant SNPs sequentially identified a second association at rs12793759 [odds ratio (OR) = 1.14, P= 4.76 × 10(-5), adjusted P= 0.004] that is independent of rs10896449 and remained significant after adjustment for multiple testing within the region. rs10896438, a proxy of previously reported rs12418451 (r(2)= 0.96), independent of both rs10896449 and rs12793759 was detected (OR = 1.07, P= 5.92 × 10(-3), adjusted P= 0.054). Our observation of a recombination hotspot that separates rs10896438 from rs10896449 and rs12793759, and low linkage disequilibrium (rs10896449-rs12793759, r(2)= 0.17; rs10896449-rs10896438, r(2)= 0.10; rs12793759-rs10896438, r(2)= 0.12) corroborate our finding of three independent signals. By analysis of tagged SNPs across â¼123 kb using next generation sequencing of 63 controls of European origin, 1000 Genome and HapMap data, we observed multiple surrogates for the three independent signals marked by rs10896449 (n= 31), rs10896438 (n= 24) and rs12793759 (n= 8). Our results indicate that a complex architecture underlying the common variants contributing to prostate cancer risk at 11q13. We estimate that at least 63 common variants should be considered in future studies designed to investigate the biological basis of the multiple association signals.
Asunto(s)
Mapeo Cromosómico , Cromosomas Humanos Par 11/genética , Sitios Genéticos , Predisposición Genética a la Enfermedad/genética , Desequilibrio de Ligamiento , Polimorfismo de Nucleótido Simple , Neoplasias de la Próstata/genética , Estudios de Casos y Controles , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Factores de RiesgoRESUMEN
BACKGROUND: Genomewide association studies have identified multiple genetic variants associated with breast cancer. The extent to which these variants add to existing risk-assessment models is unknown. METHODS: We used information on traditional risk factors and 10 common genetic variants associated with breast cancer in 5590 case subjects and 5998 control subjects, 50 to 79 years of age, from four U.S. cohort studies and one case-control study from Poland to fit models of the absolute risk of breast cancer. With the use of receiver-operating-characteristic curve analysis, we calculated the area under the curve (AUC) as a measure of discrimination. By definition, random classification of case and control subjects provides an AUC of 50%; perfect classification provides an AUC of 100%. We calculated the fraction of case subjects in quintiles of estimated absolute risk after the addition of genetic variants to the traditional risk model. RESULTS: The AUC for a risk model with age, study and entry year, and four traditional risk factors was 58.0%; with the addition of 10 genetic variants, the AUC was 61.8%. About half the case subjects (47.2%) were in the same quintile of risk as in a model without genetic variants; 32.5% were in a higher quintile, and 20.4% were in a lower quintile. CONCLUSIONS: The inclusion of newly discovered genetic factors modestly improved the performance of risk models for breast cancer. The level of predicted breast-cancer risk among most women changed little after the addition of currently available genetic information.
Asunto(s)
Neoplasias de la Mama/genética , Predisposición Genética a la Enfermedad , Modelos Estadísticos , Medición de Riesgo/métodos , Anciano , Área Bajo la Curva , Estudios de Casos y Controles , Estudios de Cohortes , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Modelos Logísticos , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Curva ROC , Factores de RiesgoRESUMEN
PURPOSE: Diabetes is a suspected risk factor for pancreatic cancer, but questions remain about whether it is a risk factor or a result of the disease. This study prospectively examined the association between diabetes and the risk of pancreatic adenocarcinoma in pooled data from the NCI pancreatic cancer cohort consortium (PanScan). METHODS: The pooled data included 1,621 pancreatic adenocarcinoma cases and 1,719 matched controls from twelve cohorts using a nested case-control study design. Subjects who were diagnosed with diabetes near the time (<2 years) of pancreatic cancer diagnosis were excluded from all analyses. All analyses were adjusted for age, race, gender, study, alcohol use, smoking, BMI, and family history of pancreatic cancer. RESULTS: Self-reported diabetes was associated with a forty percent increased risk of pancreatic cancer (OR = 1.40, 95 % CI: 1.07, 1.84). The association differed by duration of diabetes; risk was highest for those with a duration of 2-8 years (OR = 1.79, 95 % CI: 1.25, 2.55); there was no association for those with 9+ years of diabetes (OR = 1.02, 95 % CI: 0.68, 1.52). CONCLUSIONS: These findings provide support for a relationship between diabetes and pancreatic cancer risk. The absence of association in those with the longest duration of diabetes may reflect hypoinsulinemia and warrants further investigation.
Asunto(s)
Adenocarcinoma/etiología , Diabetes Mellitus/epidemiología , Neoplasias Pancreáticas/etiología , Adenocarcinoma/epidemiología , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Casos y Controles , Estudios de Cohortes , Complicaciones de la Diabetes/epidemiología , Femenino , Humanos , Masculino , Persona de Mediana Edad , Neoplasias Pancreáticas/epidemiología , Factores de RiesgoRESUMEN
PURPOSE: The evidence of a relation between folate intake and one-carbon metabolism (OCM) with pancreatic cancer (PanCa) is inconsistent. In this study, the association between genes and single-nucleotide polymorphisms (SNPs) related to OCM and PanCa was assessed. METHODS: Using biochemical knowledge of the OCM pathway, we identified thirty-seven genes and 834 SNPs to examine in association with PanCa. Our study included 1,408 cases and 1,463 controls nested within twelve cohorts (PanScan). The ten SNPs and five genes with lowest p values (<0.02) were followed up in 2,323 cases and 2,340 controls from eight case-control studies (PanC4) that participated in PanScan2. The correlation of SNPs with metabolite levels was assessed for 649 controls from the European Prospective Investigation into Cancer and Nutrition. RESULTS: When both stages were combined, we observed suggestive associations with PanCa for rs10887710 (MAT1A) (OR 1.13, 95 %CI 1.04-1.23), rs1552462 (SYT9) (OR 1.27, 95 %CI 1.02-1.59), and rs7074891 (CUBN) (OR 1.91, 95 %CI 1.12-3.26). After correcting for multiple comparisons, no significant associations were observed in either the first or second stage. The three suggested SNPs showed no correlations with one-carbon biomarkers. CONCLUSIONS: This is the largest genetic study to date to examine the relation between germline variations in OCM-related genes polymorphisms and the risk of PanCa. Suggestive evidence for an association between polymorphisms and PanCa was observed among the cohort-nested studies, but this did not replicate in the case-control studies. Our results do not strongly support the hypothesis that genes related to OCM play a role in pancreatic carcinogenesis.
Asunto(s)
Carbono/metabolismo , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/metabolismo , Estudios de Casos y Controles , Estudios de Cohortes , Mutación de Línea Germinal , Humanos , Neoplasias Pancreáticas/epidemiología , Polimorfismo de Nucleótido Simple , Estados Unidos/epidemiologíaRESUMEN
Cells are the singular building blocks of life, and a comprehensive understanding of morphology, among other properties, is crucial to the assessment of underlying heterogeneity. We developed Computational Sorting and Mapping of Single Cells (COSMOS), a platform based on Artificial Intelligence (AI) and microfluidics to characterize and sort single cells based on real-time deep learning interpretation of high-resolution brightfield images. Supervised deep learning models were applied to characterize and sort cell lines and dissociated primary tissue based on high-dimensional embedding vectors of morphology without the need for biomarker labels and stains/dyes. We demonstrate COSMOS capabilities with multiple human cell lines and tissue samples. These early results suggest that our neural networks embedding space can capture and recapitulate deep visual characteristics and can be used to efficiently purify unlabeled viable cells with desired morphological traits. Our approach resolves a technical gap in the ability to perform real-time deep learning assessment and sorting of cells based on high-resolution brightfield images.
Asunto(s)
Inteligencia Artificial , Aprendizaje Profundo , Humanos , Movimiento Celular , Línea Celular , Separación Celular , ColorantesRESUMEN
Four loci have been associated with pancreatic cancer through genome-wide association studies (GWAS). Pathway-based analysis of GWAS data is a complementary approach to identify groups of genes or biological pathways enriched with disease-associated single-nucleotide polymorphisms (SNPs) whose individual effect sizes may be too small to be detected by standard single-locus methods. We used the adaptive rank truncated product method in a pathway-based analysis of GWAS data from 3851 pancreatic cancer cases and 3934 control participants pooled from 12 cohort studies and 8 case-control studies (PanScan). We compiled 23 biological pathways hypothesized to be relevant to pancreatic cancer and observed a nominal association between pancreatic cancer and five pathways (P < 0.05), i.e. pancreatic development, Helicobacter pylori lacto/neolacto, hedgehog, Th1/Th2 immune response and apoptosis (P = 2.0 × 10(-6), 1.6 × 10(-5), 0.0019, 0.019 and 0.023, respectively). After excluding previously identified genes from the original GWAS in three pathways (NR5A2, ABO and SHH), the pancreatic development pathway remained significant (P = 8.3 × 10(-5)), whereas the others did not. The most significant genes (P < 0.01) in the five pathways were NR5A2, HNF1A, HNF4G and PDX1 for pancreatic development; ABO for H.pylori lacto/neolacto; SHH for hedgehog; TGFBR2 and CCL18 for Th1/Th2 immune response and MAPK8 and BCL2L11 for apoptosis. Our results provide a link between inherited variation in genes important for pancreatic development and cancer and show that pathway-based approaches to analysis of GWAS data can yield important insights into the collective role of genetic risk variants in cancer.