Results 1 - 20 of 95
1.
bioRxiv ; 2024 Sep 28.
Article in English | MEDLINE | ID: mdl-39386487

ABSTRACT

5-methylcytosine (5mC) is the most common chemical modification occurring at CpG sites across the human genome. Bisulfite conversion combined with short-read whole-genome sequencing can capture and quantify the modification at single-nucleotide resolution. However, PCR amplification can produce duplicated methylation patterns and introduce 5mC detection bias. The limited read length also restricts co-methylation analysis between distant CpG sites, and bisulfite conversion makes detecting variant-specific methylation difficult because it destroys allele information in the sequencing reads. To address these issues, we characterized human methylation profiles with nanopore long-read sequencing, aiming to demonstrate its potential for long-range co-methylation analysis with native modification calls and intact allele information. We first analyzed the nanopore demo data from an adaptive sampling sequencing run targeting all human CpG islands. We applied the linkage disequilibrium (LD) R² to calculate co-methylation in the nanopore data, and identified 27,875, 50,481, 26,542 and 51,189 methylation haplotype blocks (MHB) in the COLO829, COLO829BL, HCC1395 and HCC1395BL cell lines, respectively. While the majority of co-methylation was short-range (≤200 bp), a small portion (1-3%) spanned long distances (≥1,000 bp), suggesting potential remote regulatory mechanisms across the genome. To further characterize the epigenetic changes related to transcription factor binding, we profiled the 5mC percentage changes surrounding various motif sites in the JASPAR collection and found that CTCF and KLF5 binding sites showed reduced methylation, while FOXE1 and ZNF354A sites showed increased methylation.
To further investigate allele-specific 5mCG in the prostate genome, we designed a target region covering methylation quantitative trait loci (mQTL) and genome-wide association study (GWAS) risk germline variants and generated long reads with an adaptive sampling run in the 22Rv1 cell line. To identify allele-specific methylation, we performed long-read-based phasing and compared the 5mCG signals between the two haplotypes. We identified 6,390 haplotype-specific methylated regions in the 22Rv1 cell line (p-MWU ≤ 1e-5 and delta ≥ 50%). By examining haplotype-specific methylated regions near the phasing variants, we identified examples of allele-specific methylated regions that showed allele-specific accessibility in the ATAC-seq data. By further integrating the ATAC-seq data of 22Rv1, we found that methylation levels were negatively correlated with chromatin accessibility at the genome-wide scale. Our study provides native methylome profiling while preserving haplotype information, offering a novel approach to uncovering the regulatory mechanisms of the human prostate genome.
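
The LD-style R² used above for co-methylation can be illustrated with a minimal sketch. It assumes each read yields a binary call (1 = methylated, 0 = unmethylated) at both CpG sites; the function name and input layout are hypothetical and are not taken from the study's pipeline.

```python
def comethylation_r2(calls_a, calls_b):
    """LD-style r^2 between binary methylation calls at two CpG sites.

    calls_a[i] and calls_b[i] are the calls (1 or 0) made on the same read i.
    This is an illustrative sketch, not the authors' implementation.
    """
    if len(calls_a) != len(calls_b) or not calls_a:
        raise ValueError("need equal-length, non-empty call lists")
    n = len(calls_a)
    p_a = sum(calls_a) / n                      # methylated fraction at site A
    p_b = sum(calls_b) / n                      # methylated fraction at site B
    p_ab = sum(1 for a, b in zip(calls_a, calls_b) if a and b) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0                              # a site with no variation carries no signal
    d = p_ab - p_a * p_b                        # the usual LD coefficient D
    return d * d / denom
```

Two sites whose calls always agree across reads give a value of 1, and sites whose calls are statistically independent give 0.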

3.
medRxiv ; 2024 Jun 25.
Article in English | MEDLINE | ID: mdl-38978643

ABSTRACT

Frontotemporal lobar degeneration with neuronal inclusions of the TAR DNA-binding protein 43 (FTLD-TDP) is a fatal neurodegenerative disorder with only a limited number of risk loci identified. We report our comprehensive genome-wide association study as part of the International FTLD-TDP Whole-Genome Sequencing Consortium, including 985 cases and 3,153 controls, and meta-analysis with the Dementia-seq cohort, compiled from 26 institutions/brain banks in the United States, Europe and Australia. We confirm UNC13A as the strongest overall FTLD-TDP risk factor and identify TNIP1 as a novel FTLD-TDP risk factor. In subgroup analyses, we further identify for the first time genome-wide significant loci specific to each of the three main FTLD-TDP pathological subtypes (A, B and C), as well as enrichment of risk loci in distinct tissues, brain regions, and neuronal subtypes, suggesting distinct disease aetiologies in each of the subtypes. Rare variant analysis confirmed TBK1 and identified VIPR1, RBPJL, and L3MBTL1 as novel subtype-specific FTLD-TDP risk genes, further highlighting the role of innate and adaptive immunity and the Notch signalling pathway in FTLD-TDP, with potential diagnostic and novel therapeutic implications.

4.
Front Genet ; 13: 836841, 2022.
Article in English | MEDLINE | ID: mdl-35432445

ABSTRACT

Large genome-wide association studies have identified hundreds of single-nucleotide polymorphisms associated with increased risk of prostate cancer (PrCa), and many of these risk loci are presumed to confer regulatory effects on gene expression. While eQTL studies of long RNAs have yielded many potential risk genes, the relationship between PrCa risk genetics and microRNA expression dysregulation is understudied. We performed a microRNA transcriptome-wide association study of PrCa risk using small RNA sequencing and genome-wide genotyping data from N = 441 normal prostate epithelium tissue samples along with N = 411 prostate adenocarcinoma tumor samples from the Cancer Genome Atlas (TCGA). Genetically regulated expression prediction models were trained for all expressed microRNAs using the FUSION TWAS software. TWAS for PrCa risk was performed with both sets of models using single-SNP summary statistics from the recent PRACTICAL consortium PrCa case-control OncoArray GWAS meta-analysis. A total of 613 and 571 distinct expressed microRNAs were identified in the normal and tumor tissue datasets, respectively (overlap: 480). Among these, 79 (13%) normal tissue microRNAs demonstrated significant cis-heritability (median cis-h² = 0.15, range: 0.03-0.79) for model training. Similar results were obtained from TCGA tumor samples, with 48 (9%) microRNA expression models successfully trained (median cis-h² = 0.14, range: 0.06-0.60). Using normal tissue models, we identified two significant TWAS microRNA associations with PrCa risk: over-expression of mir-941 family microRNAs (P_TWAS = 2.9E-04) and reduced expression of miR-3617-5p (P_TWAS = 1.0E-03). The TCGA tumor TWAS also identified a significant association with miR-941 overexpression (P_TWAS = 9.7E-04). Subsequent fine-mapping of the TWAS results using a multi-tissue database indicated limited evidence of causal status for each microRNA with PrCa risk (posterior inclusion probabilities <0.05). Future work will examine downstream regulatory effects of microRNA dysregulation as well as microRNA-mediated risk mechanisms via competing endogenous RNA relationships.

5.
Am J Hum Genet ; 109(5): 900-908, 2022 05 05.
Article in English | MEDLINE | ID: mdl-35353984

ABSTRACT

Polygenic risk scores (PRSs) for a variety of diseases have recently been shown to have relative risks that depend on age, and genetic relative risks decrease with increasing age. A refined understanding of the age dependency of PRSs for a disease is important for personalized risk predictions and risk stratification. To further evaluate how the PRS relative risk for prostate cancer depends on age, we refined analyses for a validated PRS for prostate cancer by using 64,274 prostate cancer cases and 46,432 controls of diverse ancestry (82.8% European, 9.8% African American, 3.8% Latino, 2.8% Asian, and 0.8% Ghanaian). Our strategy applied a novel weighted proportional hazards model to case-control data to fully utilize age to refine how the relative risk decreased with age. We found significantly greater relative risks for younger men (age 30-55 years) compared with older men (70-88 years) for both relative risk per standard deviation of the PRS and dichotomized according to the upper 90th percentile of the PRS distribution. For the largest European ancestral group that could provide reliable resolution, the log-relative risk decreased approximately linearly from age 50 to age 75. Despite strong evidence of age-dependent genetic relative risk, our results suggest that absolute risk predictions differed little from predictions that assumed a constant relative risk over ages, from short-term to long-term predictions, simplifying implementation of risk discussions into clinical practice.


Subject(s)
Genetic Predisposition to Disease , Prostatic Neoplasms , Adult , Aged , Genome-Wide Association Study , Ghana , Humans , Male , Middle Aged , Multifactorial Inheritance/genetics , Prostatic Neoplasms/genetics , Risk Factors
6.
Eur Urol Oncol ; 4(4): 570-579, 2021 08.
Article in English | MEDLINE | ID: mdl-33436325

ABSTRACT

BACKGROUND: Germline ATM mutations are suggested to contribute to predisposition to prostate cancer (PrCa). Previous studies have had inadequate power to estimate variant effect sizes. OBJECTIVE: To precisely estimate the contribution of germline ATM mutations to PrCa risk. DESIGN, SETTING, AND PARTICIPANTS: We analysed next-generation sequencing data from 13 PRACTICAL study groups comprising 5560 cases and 3353 controls of European ancestry. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: Variant Call Format files were harmonised, annotated for rare ATM variants, and classified as tier 1 (likely pathogenic) or tier 2 (potentially deleterious). Associations with overall PrCa risk and clinical subtypes were estimated. RESULTS AND LIMITATIONS: PrCa risk was higher in carriers of a tier 1 germline ATM variant, with an overall odds ratio (OR) of 4.4 (95% confidence interval [CI]: 2.0-9.5). There was also evidence that PrCa cases with younger age at diagnosis (<65 yr) had elevated tier 1 variant frequencies (p_difference = 0.04). Tier 2 variants were also associated with PrCa risk, with an OR of 1.4 (95% CI: 1.1-1.7). CONCLUSIONS: Carriers of pathogenic ATM variants have an elevated risk of developing PrCa and are at an increased risk for earlier-onset disease presentation. These results provide information for counselling of men and their families. PATIENT SUMMARY: In this study, we estimated that men who inherit a likely pathogenic mutation in the ATM gene had approximately a fourfold risk of developing prostate cancer. In addition, they are likely to develop the disease earlier.
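
For readers unfamiliar with how an odds ratio and its confidence interval arise from carrier counts, the sketch below uses the standard Woolf (log) method on a 2x2 table. The abstract does not state which estimator the study used and does not report carrier counts, so the method choice, the function name, and any counts passed to it are assumptions for illustration only.

```python
import math


def odds_ratio_ci(case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers, z=1.96):
    """Odds ratio with a Woolf (log-scale) confidence interval.

    z = 1.96 gives an approximate 95% interval. Illustrative sketch only;
    the study's actual estimator is not described in the abstract.
    """
    a, b = case_carriers, case_noncarriers
    c, d = control_carriers, control_noncarriers
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Woolf method")
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)   # SE of log(OR)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper
```

Calling `odds_ratio_ci(10, 90, 5, 95)` with these made-up counts returns the point estimate together with an interval that is symmetric around it on the log scale.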


Subject(s)
Genetic Predisposition to Disease , Prostatic Neoplasms , Ataxia Telangiectasia Mutated Proteins/genetics , Germ-Line Mutation , Humans , Male , Prostatic Neoplasms/epidemiology , Prostatic Neoplasms/genetics
7.
Eur Urol ; 79(3): 353-361, 2021 03.
Article in English | MEDLINE | ID: mdl-32800727

ABSTRACT

BACKGROUND: Family history of prostate cancer (PCa) is a well-known risk factor, and both common and rare genetic variants are associated with the disease. OBJECTIVE: To detect new genetic variants associated with PCa, capitalizing on the role of family history and more aggressive PCa. DESIGN, SETTING, AND PARTICIPANTS: A two-stage design was used. In stage one, whole-exome sequencing was used to identify potential risk alleles among affected men with a strong family history of disease or with more aggressive disease (491 cases and 429 controls). Aggressive disease was based on a sum of scores for Gleason score, node status, metastasis, tumor stage, prostate-specific antigen at diagnosis, systemic recurrence, and time to PCa death. Genes identified in stage one were screened in stage two using a custom-capture design in an independent set of 2917 cases and 1899 controls. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: Frequencies of genetic variants (singly or jointly in a gene) were compared between cases and controls. RESULTS AND LIMITATIONS: Eleven genes previously reported to be associated with PCa were detected (ATM, BRCA2, HOXB13, FAM111A, EMSY, HNF1B, KLK3, MSMB, PCAT1, PRSS3, and TERT), as well as an additional 10 novel genes (PABPC1, QK1, FAM114A1, MUC6, MYCBP2, RAPGEF4, RNASEH2B, ULK4, XPO7, and THAP3). Of these 10 novel genes, all but PABPC1 and ULK4 were primarily associated with the risk of aggressive PCa. CONCLUSIONS: Our approach demonstrates the advantage of gene sequencing in the search for genetic variants associated with PCa and the benefits of sampling patients with a strong family history of disease or an aggressive form of disease. PATIENT SUMMARY: Multiple genes are associated with prostate cancer (PCa) among men with a strong family history of this disease or among men with an aggressive form of PCa.


Subject(s)
Prostatic Neoplasms , Genes, BRCA2 , Guanine Nucleotide Exchange Factors , Humans , Male , Neoplasm Grading , Prostatic Neoplasms/genetics , Protein Serine-Threonine Kinases , Trypsin , Exome Sequencing
8.
Cancer Epidemiol Biomarkers Prev ; 29(9): 1731-1738, 2020 09.
Article in English | MEDLINE | ID: mdl-32581112

ABSTRACT

BACKGROUND: A polygenic hazard score (PHS), the weighted sum of 54 SNP genotypes, was previously validated for association with clinically significant prostate cancer and for improved prostate cancer screening accuracy. Here, we assess the potential impact of PHS-informed screening. METHODS: United Kingdom population incidence data (Cancer Research United Kingdom) and data from the Cluster Randomized Trial of PSA Testing for Prostate Cancer were combined to estimate age-specific clinically significant prostate cancer incidence (Gleason score ≥7, stage T3-T4, PSA ≥10, or nodal/distant metastases). Using HRs estimated from the ProtecT prostate cancer trial, age-specific incidence rates were calculated for various PHS risk percentiles. Risk-equivalent age, when someone with a given PHS percentile has prostate cancer risk equivalent to an average 50-year-old man (50-year-standard risk), was derived from PHS and incidence data. Positive predictive value (PPV) of PSA testing for clinically significant prostate cancer was calculated using PHS-adjusted age groups. RESULTS: The expected age at diagnosis of clinically significant prostate cancer differs by 19 years between the 1st and 99th PHS percentiles: men with PHS in the 1st and 99th percentiles reach the 50-year-standard risk level at ages 60 and 41, respectively. PPV of PSA was higher for men with higher PHS-adjusted age. CONCLUSIONS: PHS provides individualized estimates of risk-equivalent age for clinically significant prostate cancer. Screening initiation could be adjusted by a man's PHS. IMPACT: Personalized genetic risk assessments could inform prostate cancer screening decisions.
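
The abstract defines the PHS as a weighted sum of SNP genotypes. A minimal sketch of that sum follows, assuming genotypes are coded as risk-allele counts (0, 1 or 2); the coding, the function name, and the weights in the usage note are hypothetical, and the 54 published weights are not reproduced here.

```python
def polygenic_hazard_score(genotypes, weights):
    """Weighted sum of SNP genotypes, one weight per SNP.

    genotypes: risk-allele counts per SNP (assumed coding: 0, 1 or 2).
    weights:   per-SNP coefficients (placeholders, not the published values).
    """
    if len(genotypes) != len(weights):
        raise ValueError("genotypes and weights must have the same length")
    return sum(g * w for g, w in zip(genotypes, weights))
```

For example, `polygenic_hazard_score([0, 1, 2], [0.1, 0.2, 0.3])` adds 0 x 0.1, 1 x 0.2 and 2 x 0.3.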


Subject(s)
Prostatic Neoplasms/genetics , Aged , Early Detection of Cancer , Humans , Male , Middle Aged , Neoplasm Grading , Population Control
9.
Eur J Hum Genet ; 28(10): 1467-1475, 2020 10.
Article in English | MEDLINE | ID: mdl-32514134

ABSTRACT

We determined the effect of sample size on performance of polygenic hazard score (PHS) models in prostate cancer. Age and genotypes were obtained for 40,861 men from the PRACTICAL consortium. The dataset included 201,590 SNPs per subject, and was split into training and testing sets. Established-SNP models considered 65 SNPs that had been previously associated with prostate cancer. Discovery-SNP models used stepwise selection to identify new SNPs. The performance of each PHS model was calculated for random sizes of the training set. The performance of a representative Established-SNP model was estimated for random sizes of the testing set. Mean HR98/50 (hazard ratio of top 2% to average in test set) of the Established-SNP model increased from 1.73 [95% CI: 1.69-1.77] to 2.41 [2.40-2.43] when the number of training samples was increased from 1 thousand to 30 thousand. Corresponding HR98/50 of the Discovery-SNP model increased from 1.05 [0.93-1.18] to 2.19 [2.16-2.23]. HR98/50 of a representative Established-SNP model using testing set sample sizes of 0.6 thousand and 6 thousand observations were 1.78 [1.70-1.85] and 1.73 [1.71-1.76], respectively. We estimate that a study population of 20 thousand men is required to develop Discovery-SNP PHS models while 10 thousand men should be sufficient for Established-SNP models.


Subject(s)
Genome-Wide Association Study/methods , Multifactorial Inheritance , Polymorphism, Single Nucleotide , Prostatic Neoplasms/genetics , Clinical Trials as Topic , Humans , Male , Models, Genetic , Proportional Hazards Models , Sample Size
10.
PLoS One ; 14(4): e0214588, 2019.
Article in English | MEDLINE | ID: mdl-30958860

ABSTRACT

Prostate cancer (PrCa) is highly heritable; 284 variants have been identified to date that are associated with increased prostate cancer risk, yet few genes contributing to its development are known. Expression quantitative trait loci (eQTL) studies link variants with affected genes, helping to determine how these variants might regulate gene expression and influence prostate cancer risk. In the current study, we performed eQTL analysis on 471 normal prostate epithelium samples and 249 PrCa-risk variants in 196 risk loci, utilizing RNA sequencing transcriptome data based on ENSEMBL gene definition and genome-wide variant data. We identified a total of 213 genes associated with known PrCa-risk variants, including 141 protein-coding genes, 16 lncRNAs, and 56 other non-coding RNA species with differential expression. Compared to our previous analysis, where RefSeq was used for gene annotation, we identified an additional 130 expressed genes associated with known PrCa-risk variants. We detected an eQTL signal for more than half (n = 102, 52%) of the 196 loci tested; 52 (51%) of these were Group 1 signals, indicating high linkage disequilibrium (LD) between the peak eQTL variant and the PrCa-risk variant (r² > 0.5), which may help explain how risk variants influence the development of prostate cancer.


Subject(s)
Genetic Predisposition to Disease , Linkage Disequilibrium , Prostatic Neoplasms/diagnosis , Quantitative Trait Loci , Genetic Variation , Genome-Wide Association Study , Genotype , Humans , Male , Polymorphism, Single Nucleotide , Prostate/pathology , Prostatic Neoplasms/genetics , Quality Control , Risk Factors , Sequence Analysis, RNA , Transcriptome
11.
Acta Neuropathol ; 137(6): 879-899, 2019 06.
Article in English | MEDLINE | ID: mdl-30739198

ABSTRACT

Frontotemporal lobar degeneration with neuronal inclusions of the TAR DNA-binding protein 43 (FTLD-TDP) represents the most common pathological subtype of FTLD. We established the international FTLD-TDP whole-genome sequencing consortium to thoroughly characterize the known genetic causes of FTLD-TDP and identify novel genetic risk factors. Through the study of 1131 unrelated Caucasian patients, we estimated that C9orf72 repeat expansions and GRN loss-of-function mutations account for 25.5% and 13.9% of FTLD-TDP patients, respectively. Mutations in TBK1 (1.5%) and other known FTLD genes (1.4%) were rare, and the disease in 57.7% of FTLD-TDP patients was unexplained by the known FTLD genes. To unravel the contribution of common genetic factors to the FTLD-TDP etiology in these patients, we conducted a two-stage association study comprising the analysis of whole-genome sequencing data from 517 FTLD-TDP patients and 838 controls, followed by targeted genotyping of the most associated genomic loci in 119 additional FTLD-TDP patients and 1653 controls. We identified three genome-wide significant FTLD-TDP risk loci: one new locus at chromosome 7q36 within the DPP6 gene led by rs118113626 (p value = 4.82e-08, OR = 2.12), and two known loci: UNC13A, led by rs1297319 (p value = 1.27e-08, OR = 1.50) and HLA-DQA2 led by rs17219281 (p value = 3.22e-08, OR = 1.98). While HLA represents a locus previously implicated in clinical FTLD and related neurodegenerative disorders, the association signal in our study is independent from previously reported associations. Through inspection of our whole-genome sequence data for genes with an excess of rare loss-of-function variants in FTLD-TDP patients (n ≥ 3) as compared to controls (n = 0), we further discovered a possible role for genes functioning within the TBK1-related immune pathway (e.g., DHX58, TRIM21, IRF7) in the genetic etiology of FTLD-TDP. Together, our study based on the largest cohort of unrelated FTLD-TDP patients assembled to date provides a comprehensive view of the genetic landscape of FTLD-TDP, nominates novel FTLD-TDP risk loci, and strongly implicates the immune pathway in FTLD-TDP pathogenesis.


Subject(s)
Nerve Tissue Proteins/genetics , TDP-43 Proteinopathies/genetics , Aged , DNA Repeat Expansion , Dipeptidyl-Peptidases and Tripeptidyl-Peptidases/genetics , Female , Frontal Lobe/metabolism , Frontotemporal Lobar Degeneration/genetics , Frontotemporal Lobar Degeneration/immunology , Genetic Predisposition to Disease , Genome-Wide Association Study , HLA-DQ Antigens/genetics , Humans , Intracellular Signaling Peptides and Proteins , Loss of Function Mutation , Male , Middle Aged , Nerve Tissue Proteins/physiology , Potassium Channels/genetics , Progranulins/genetics , Progranulins/physiology , Protein Serine-Threonine Kinases/genetics , Protein Serine-Threonine Kinases/physiology , Proteins/genetics , Proteins/physiology , RNA, Messenger/biosynthesis , Risk Factors , Sequence Analysis, RNA , Societies, Scientific , TDP-43 Proteinopathies/immunology , White People/genetics
12.
Genet Epidemiol ; 43(4): 440-448, 2019 06.
Article in English | MEDLINE | ID: mdl-30740785

ABSTRACT

The familial recurrence risk is the probability that a person will have disease, given a reported family history. When family histories are obtained as simple counts of disease among family members, as is often the case in cancer registries or surveys, we propose methods to estimate recurrence risks based on truncated binomial distributions. By this approach, we are able to obtain unbiased estimates of risk for a person with at least k affected relatives, where k can be specified to determine how risk varies with k. We also derive robust variances of the recurrence risk estimate, to account for correlations within families, such as those induced by shared genes or shared environment, without explicitly modeling the factors that cause familial correlations. Furthermore, we illustrate how mixture models can be used to account for a sample composed of low- and high-risk families. Using simulations, we illustrate the properties of the proposed methods. Application of our methods to a family history survey of prostate cancer shows that the recurrence risk for prostate cancer increased from 16%, when there was at least one affected relative, to 52%, when there were at least five affected relatives.


Subject(s)
Family , Medical History Taking , Models, Genetic , Prostatic Neoplasms/epidemiology , Prostatic Neoplasms/genetics , Binomial Distribution , Genetic Predisposition to Disease , Humans , Incidence , Male , Medical History Taking/statistics & numerical data , Registries , Risk , Risk Factors , Surveys and Questionnaires
13.
Appl Clin Genet ; 11: 121-127, 2018.
Article in English | MEDLINE | ID: mdl-30498369

ABSTRACT

PURPOSE: In aging adults, mitochondrial dysfunction may be an important contributor. We evaluated the association between mitochondrial DNA (mtDNA) copy number, which is a biomarker for mitochondrial function, and self-rated health status. PATIENTS AND METHODS: We conducted a cross-sectional study of patients enrolled within the Mayo Clinic Biobank. We utilized the questionnaire and sequence data from 944 patients. We examined the association between mtDNA copy number and self-rated health status with 3 collapsed categories for the latter variable (excellent/very good, good, and fair/poor). For analysis, we used proportional odds models after log-transforming mtDNA copy number, and we adjusted for age and sex. RESULTS: We found the median age at enrollment was 61 years (25th-75th percentile: 51-71), and 64% reported excellent or very good health, 31% reported good health, and 6% reported fair/poor health. Overall, the median mtDNA copy number was 88.9 (25th-75th percentile: 77.6-101.1). Higher mtDNA copy number was found for subjects reporting better self-rated health status after adjusting for age, sex, and comorbidity burden (OR = 2.3 [95% CI: 1.2-4.5] for having better self-rated health for a one-unit increase in log-transformed mtDNA copy number). CONCLUSION: We found that a higher mtDNA copy number is associated with better self-rated health status after adjustment for age, sex, and comorbidity burden. The current study implies that mtDNA copy number may serve as a biomarker for self-reported health. Further studies, potentially including cohort studies, may be required.

14.
Cell ; 174(3): 564-575.e18, 2018 07 26.
Article in English | MEDLINE | ID: mdl-30033362

ABSTRACT

The prostate cancer (PCa) risk-associated SNP rs11672691 is positively associated with aggressive disease at diagnosis. We showed that rs11672691 maps to the promoter of a short isoform of long noncoding RNA PCAT19 (PCAT19-short), which is in the third intron of the long isoform (PCAT19-long). The risk variant is associated with decreased and increased levels of PCAT19-short and PCAT19-long, respectively. Mechanistically, the risk SNP region is bifunctional with both promoter and enhancer activity. The risk variants of rs11672691 and its LD SNP rs887391 decrease binding of transcription factors NKX3.1 and YY1 to the promoter of PCAT19-short, resulting in weaker promoter but stronger enhancer activity that subsequently activates PCAT19-long. PCAT19-long interacts with HNRNPAB to activate a subset of cell-cycle genes associated with PCa progression, thereby promoting PCa tumor growth and metastasis. Taken together, these findings reveal a risk SNP-mediated promoter-enhancer switching mechanism underlying both initiation and progression of aggressive PCa.


Subject(s)
Prostatic Neoplasms/genetics , RNA, Long Noncoding/genetics , Alleles , Cell Line, Tumor , Enhancer Elements, Genetic/genetics , Gene Expression Regulation, Neoplastic/genetics , Gene Frequency/genetics , Genetic Predisposition to Disease/genetics , Homeodomain Proteins/metabolism , Humans , Male , Polymorphism, Single Nucleotide/genetics , Promoter Regions, Genetic/genetics , Protein Binding , RNA Isoforms/genetics , Risk Factors , Transcription Factors/metabolism , YY1 Transcription Factor/metabolism
15.
BMC Bioinformatics ; 19(1): 139, 2018 04 16.
Article in English | MEDLINE | ID: mdl-29661148

ABSTRACT

BACKGROUND: After decades of identifying risk factors using array-based genome-wide association studies (GWAS), genetic research of complex diseases has shifted to sequencing-based rare variant discovery. This requires large sample sizes for statistical power and has raised questions about whether current variant calling practices are adequate for large cohorts. It is well known that there are discrepancies between variants called by different pipelines, and that using a single pipeline always misses true variants exclusively identifiable by other pipelines. Nonetheless, it is common practice today to call variants with one pipeline because of computational cost and to assume that false negative calls are a small percentage of the total. RESULTS: We analyzed 10,000 exomes from the Alzheimer's Disease Sequencing Project (ADSP) using multiple analytic pipelines consisting of different read aligners and variant calling strategies. We compared variants identified by using two aligners in 50, 100, 200, 500, 1000, and 1952 samples; and compared variants identified by adding single-sample genotyping to the default multi-sample joint genotyping in 50, 100, 500, 2000, 5000 and 10,000 samples. We found that using a single pipeline missed increasing numbers of high-quality variants as sample size grew. By combining two read aligners and two variant calling strategies, we rescued 30% of pass-QC variants at a sample size of 2000, and 56% at 10,000 samples. The rescued variants had higher proportions of low-frequency (minor allele frequency [MAF] 1-5%) and rare (MAF < 1%) variants, which are the very type of variants of interest. In 660 Alzheimer's disease cases with earlier onset ages of ≤65, 4 out of 13 (31%) previously published rare pathogenic and protective mutations in the APP, PSEN1, and PSEN2 genes were undetected by the default one-pipeline approach but recovered by the multi-pipeline approach.
CONCLUSIONS: Identification of the complete variant set from sequencing data is the prerequisite of genetic association analyses. The current analytic practice of calling genetic variants from sequencing data using a single bioinformatics pipeline is no longer adequate with the increasingly large projects. The number and percentage of quality variants that passed quality filters but are missed by the one-pipeline approach rapidly increased with sample size.
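
The idea of rescuing variants by combining pipelines can be sketched with plain set operations. The abstract does not define the denominator behind its rescued percentages, so measuring rescued variants as a share of the combined call set is an assumption here, as are the function name and the (chromosome, position, ref, alt) tuple used as a variant key.

```python
def rescued_fraction(default_calls, extra_calls):
    """Share of the combined call set that the default pipeline alone missed.

    Each variant is a hashable key, e.g. (chrom, pos, ref, alt).
    Illustrative sketch; not the ADSP analysis code.
    """
    default_set = set(default_calls)
    combined = default_set | set(extra_calls)   # union across pipelines
    if not combined:
        return 0.0
    rescued = combined - default_set            # found only by the extra pipeline(s)
    return len(rescued) / len(combined)
```

With three default calls and two extra calls of which one is new, the combined set holds four variants and one of them is rescued.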


Subject(s)
Computational Biology/methods , Genetic Variation , Alzheimer Disease/genetics , Base Composition/genetics , Drug Discovery , Genome , Genotype , Genotyping Techniques , Humans , Sample Size , Sequence Alignment
16.
Oncotarget ; 8(49): 85896-85908, 2017 Oct 17.
Article in English | MEDLINE | ID: mdl-29156765

ABSTRACT

Large-scale genome-wide association studies have identified multiple single-nucleotide polymorphisms associated with risk of prostate cancer. Many of these genetic variants are presumed to be regulatory in nature; however, follow-up expression quantitative trait loci (eQTL) association studies have to date been restricted largely to cis-acting associations due to study limitations. While trans-eQTL scans suffer from high testing dimensionality, recent evidence indicates most trans-eQTL associations are mediated by cis-regulated genes, such as transcription factors. Leveraging a data-driven gene co-expression network, we conducted a comprehensive cis-mediator analysis using RNA-Seq data from 471 normal prostate tissue samples to identify downstream regulatory associations of previously identified prostate cancer risk variants. We discovered multiple trans-eQTL associations that were significantly mediated by cis-regulated transcripts, four of which involved risk locus 17q12, proximal transcription factor HNF1B, and target trans-genes with known HNF response elements (MIA2, SRC, SEMA6A, KIF12). We additionally identified evidence of cis-acting down-regulation of MSMB via rs10993994 corresponding to reduced co-expression of NDRG1. The majority of these cis-mediator relationships demonstrated trans-eQTL replicability in 87 prostate tissue samples from the Genotype-Tissue Expression Project. These findings provide further biological context to known risk loci and outline new hypotheses for investigation into the etiology of prostate cancer.

17.
Bioinformatics ; 33(24): 3895-3901, 2017 Dec 15.
Article in English | MEDLINE | ID: mdl-28961785

ABSTRACT

MOTIVATION: Interpreting genetic variation in noncoding regions of the genome is an important challenge for personal genome analysis. One mechanism by which noncoding single nucleotide variants (SNVs) influence downstream phenotypes is through the regulation of gene expression. Methods to predict whether or not individual SNVs are likely to regulate gene expression would aid interpretation of variants of unknown significance identified in whole-genome sequencing studies. RESULTS: We developed FIRE (Functional Inference of Regulators of Expression), a tool to score both noncoding and coding SNVs based on their potential to regulate the expression levels of nearby genes. FIRE consists of 23 random forests trained to recognize SNVs in cis-expression quantitative trait loci (cis-eQTLs) using a set of 92 genomic annotations as predictive features. FIRE scores discriminate cis-eQTL SNVs from non-eQTL SNVs in the training set with a cross-validated area under the receiver operating characteristic curve (AUC) of 0.807, and discriminate cis-eQTL SNVs shared across six populations of different ancestry from non-eQTL SNVs with an AUC of 0.939. FIRE scores are also predictive of cis-eQTL SNVs across a variety of tissue types. AVAILABILITY AND IMPLEMENTATION: FIRE scores for genome-wide SNVs in hg19/GRCh37 are available for download at https://sites.google.com/site/fireregulatoryvariation/. CONTACT: nilah@stanford.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
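
The AUC values quoted above measure how well a score separates two classes. As a minimal sketch of that metric (not of FIRE itself), the function below computes AUC as the probability that a randomly chosen positive example outscores a randomly chosen negative one, counting ties as one half; the function name and the scores in the usage note are made up for illustration.

```python
def auc(positive_scores, negative_scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form).

    Quadratic in the number of scores, which is fine for a small illustration.
    """
    if not positive_scores or not negative_scores:
        raise ValueError("both classes need at least one score")
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0      # positive ranked above negative
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(positive_scores) * len(negative_scores))
```

For instance, with positive scores 0.9, 0.8, 0.4 and negative scores 0.7, 0.3, 0.2, eight of the nine positive-negative pairs are ordered correctly.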


Subject(s)
Gene Expression Regulation , Genetic Variation , Software , Genomics , Humans , Quantitative Trait Loci
18.
Oncotarget ; 8(1): 1495-1507, 2017 Jan 03.
Article in English | MEDLINE | ID: mdl-27902461

ABSTRACT

Prostate cancer (PCa) susceptibility is defined by a continuum from rare, high-penetrance to common, low-penetrance alleles. Research to date has concentrated on identification of variants at the ends of that continuum. Taking an alternate approach, we focused on the important but elusive class of low-frequency, moderately penetrant variants by performing disease model-based variant filtering of whole exome sequence data from 75 hereditary PCa families. Analysis of 341 candidate risk variants identified nine variants significantly associated with increased PCa risk in a population-based, case-control study of 2,495 men. In an independent nested case-control study of 7,121 men, there was risk association evidence for TANGO2 p.Ser17Ter and the established HOXB13 p.Gly84Glu variant. Meta-analysis combining the case-control studies identified two additional variants suggestively associated with risk, OR5H14 p.Met59Val and CHAD p.Ala342Asp. The TANGO2 and HOXB13 variants co-occurred in cases more often than expected by chance and never in controls. Finally, TANGO2 p.Ser17Ter was associated with aggressive disease in both case-control studies separately. Our analyses identified three new PCa susceptibility alleles in the TANGO2, OR5H14 and CHAD genes that not only segregate in multiple high-risk families but are also of importance in altering disease risk for men from the general population. This is the first successful study to utilize sequencing in high-risk families for the express purpose of identifying low-frequency, moderately penetrant PCa risk mutations.


Subject(s)
Prostatic Neoplasms/genetics , Aged , Aged, 80 and over , Case-Control Studies , Genetic Predisposition to Disease , Humans , Male , Middle Aged , Risk Factors , Exome Sequencing
19.
Cell Syst ; 4(1): 31-45.e6, 2017 Jan 25.
Article in English | MEDLINE | ID: mdl-27916600

ABSTRACT

It is unclear how standing genetic variation affects the prognosis of prostate cancer patients. To provide one controlled answer to this problem, we crossed a dominant, penetrant mouse model of prostate cancer to Diversity Outbred mice, a collection of animals that carries over 40 million SNPs. Integration of disease phenotype and SNP variation data in 493 F1 males identified a metastasis modifier locus on Chromosome 8 (LOD = 8.42); further analysis identified the genes Rwdd4, Cenpu, and Casp3 as functional effectors of this locus. Accordingly, analysis of over 5,300 prostate cancer patient samples revealed correlations between the presence of genetic variants at these loci, their expression levels, cancer aggressiveness, and patient survival. We also observed that ectopic overexpression of RWDD4 and CENPU increased the aggressiveness of two human prostate cancer cell lines. In aggregate, our approach demonstrates how well-characterized genetic variation in mice can be harnessed in conjunction with systems genetics approaches to identify and characterize germline modifiers of human disease processes.
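
For readers unfamiliar with the LOD score reported above, the following is the standard general definition (a base-10 log likelihood ratio). The abstract does not state the exact mapping model, so the likelihoods below are generic placeholders rather than the authors' specification.

```latex
\mathrm{LOD} = \log_{10}\frac{L(\text{data} \mid \text{QTL at locus})}{L(\text{data} \mid \text{no QTL})},
\qquad
\mathrm{LOD} = 8.42 \;\Rightarrow\; \frac{L_{\text{QTL}}}{L_{\text{null}}} = 10^{8.42} \approx 2.6 \times 10^{8}
```

Under this definition, the Chromosome 8 locus makes the observed data roughly 260 million times more likely than a model with no QTL at that position.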


Subject(s)
Chromosome Mapping/methods , Prostatic Neoplasms/genetics , Animals , Caspase 3/genetics , Cell Cycle Proteins/genetics , Cell Cycle Proteins/physiology , Collaborative Cross Mice/genetics , Disease Models, Animal , Genetics, Population/methods , Genome-Wide Association Study , Germ Cells/pathology , Germ-Line Mutation/genetics , Humans , Male , Mice , Multifactorial Inheritance/genetics , Neoplasm Metastasis/genetics , Neoplastic Processes , Phenotype , Polymorphism, Single Nucleotide , Prostatic Neoplasms/metabolism , Quantitative Trait Loci
20.
Genes Chromosomes Cancer ; 56(3): 177-184, 2017 Mar.
Article in English | MEDLINE | ID: mdl-27636879

ABSTRACT

MicroRNAs (miRNAs) regulate up to one-third of all protein-coding genes including genes relevant to cancer. Variants within miRNAs have been reported to be associated with prognosis, survival, response to chemotherapy across cancer types, in vitro parameters of cell growth, and altered risks for development of cancer. Five miRNA variants have been reported to be associated with risk for development of colorectal cancer (CRC). In this study, we evaluated germline genetic variation in 1,123 miRNAs in 899 individuals with CRCs categorized by clinical subtypes and in 204 controls. The role of common miRNA variation in CRC was investigated using single variant and miRNA-level association tests. Twenty-nine miRNAs and 30 variants exhibited some marginal association with CRC in at least one subtype of CRC. Previously reported associations were not confirmed (n = 4) or could not be evaluated (n = 1). The variants noted for the CRCs with deficient mismatch repair showed little overlap with the variants noted for CRCs with proficient mismatch repair, consistent with our evolving understanding of the distinct biology underlying these two groups. © 2016 The Authors Genes, Chromosomes & Cancer Published by Wiley Periodicals, Inc.


Subject(s)
Biomarkers, Tumor/genetics , Colorectal Neoplasms/genetics , Genetic Variation/genetics , Germ-Line Mutation/genetics , MicroRNAs/genetics , Case-Control Studies , Follow-Up Studies , Humans , Neoplasm Staging , Prognosis , Risk Factors
...