RESUMO
BACKGROUND: Association studies are useful to unravel the genetic basis of common human diseases. However, the presence of undetected population structure can lead to both false positive results and failures to detect genuine associations. Even when most of the approaches to deal with population stratification require genome-wide data, the use of a well-selected panel of ancestry informative markers (AIMs) may appropriately correct for population stratification. Few panels of AIMs have been developed for Latino populations and most contain a high number of markers (> 100 AIMs). For some association studies such as candidate gene approaches, it may be unfeasible to genotype a numerous set of markers to avoid false positive results. In such cases, methods that use fewer AIMs may be appropriate. RESULTS: We validated an accurate and cost-effective panel of AIMs, for use in population stratification correction of association studies and global ancestry estimation in Mexicans, as well as in populations having large proportions of both European and Native American ancestries. Based on genome-wide data from 1953 Mexican individuals, we performed a PCA and SNP weights were calculated to select subsets of unlinked AIMs within percentiles 0.10 and 0.90, ensuring that all chromosomes were represented. Correlations between PC1 calculated using genome-wide data versus each subset of AIMs (16, 32, 48 and 64) were r2 = 0.923, 0.959, 0.972 and 0.978, respectively. When evaluating PCs performance as population stratification adjustment covariates, no correlation was found between P values obtained from uncorrected and genome-wide corrected association analyses (r2 = 0.141), highlighting that population stratification correction is compulsory for association analyses in admixed populations. In contrast, high correlations were found when adjusting for both PC1 and PC2 for either subset of AIMs (r2 > 0.900). After multiple validations, including an independent sample, we selected a minimal panel of 32 AIMs, which are highly informative of the major ancestral components of Mexican mestizos, namely European and Native American ancestries. Finally, the correlation between the global ancestry proportions calculated using genome-wide data and our panel of 32 AIMs was r2 = 0.972. CONCLUSIONS: Our panel of 32 AIMs accurately estimated global ancestry and corrected for population stratification in association studies in Mexican individuals.
Assuntos
Genética Populacional , Grupos Populacionais/genética , População Branca/genética , Análise Custo-Benefício , Genética Populacional/economia , Estudo de Associação Genômica Ampla , Humanos , México/etnologia , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Breast cancer (BRCA) is a serious public health problem, as it is the most frequent malignant tumor in women worldwide. BRCA is a molecularly heterogeneous disease, particularly at gene expression (mRNAs) level. Recent evidence shows that coding RNAs represent only 34% of the total transcriptome in a human cell. The rest of the 66% of RNAs are non-coding, so we might be missing relevant biological, clinical or regulatory information. In this report, we identified two novel tumor types from TCGA with LINC00460 deregulation. We used survival analysis to demonstrate that LINC00460 expression is a marker for poor overall (OS), relapse-free (RFS) and distant metastasis-free survival (DMFS) in basal-like BRCA patients. LINC00460 expression is a potential marker for aggressive phenotypes in distinct tumors, including HPV-negative HNSC, stage IV KIRC, locally advanced lung cancer and basal-like BRCA. We show that the LINC00460 prognostic expression effect is tissue-specific, since its upregulation can predict poor OS in some tumors, but also predicts an improved clinical course in BRCA patients. We found that the LINC00460 expression is significantly enriched in the Basal-like 2 (BL2) TNBC subtype and potentially regulates the WNT differentiation pathway. LINC00460 can also modulate a plethora of immunogenic related genes in BRCA, such as SFRP5, FOSL1, IFNK, CSF2, DUSP7 and IL1A and interacts with miR-103-a-1, in-silico, which, in turn, can no longer target WNT7A. Finally, LINC00460:WNT7A ratio constitutes a composite marker for decreased OS and DMFS in Basal-like BRCA, and can predict anthracycline therapy response in ER-BRCA patients. This evidence confirms that LINC00460 is a master regulator in BRCA molecular circuits and influences clinical outcome.
RESUMO
Studies have suggested a potential role of somatic mitochondrial mutations in cancer development. To analyze the landscape of somatic mitochondrial mutation in breast cancer and to determine whether mitochondrial DNA (mtDNA) mutational burden is correlated with overall survival (OS), we sequenced whole mtDNA from 92 matched-paired primary breast tumors and peripheral blood. A total of 324 germline variants and 173 somatic mutations were found in the tumors. The most common germline allele was 663G (12S), showing lower heteroplasmy levels in peripheral blood lymphocytes than in their matched tumors, even reaching homoplasmic status in several cases. The heteroplasmy load was higher in tumors than in their paired normal tissues. Somatic mtDNA mutations were found in 73.9% of breast tumors; 59% of these mutations were located in the coding region (66.7% non-synonymous and 33.3% synonymous). Although the CO1 gene presented the highest number of mutations, tRNA genes (T,C, and W), rRNA 12S, and CO1 and ATP6 exhibited the highest mutation rates. No specific mtDNA mutational profile was associated with molecular subtypes of breast cancer, and we found no correlation between mtDNA mutational burden and OS. Future investigations will provide insight into the molecular mechanisms through which mtDNA mutations and heteroplasmy shifting contribute to breast cancer development.
RESUMO
Breast cancer is the most commonly diagnosed neoplasm in women worldwide with a well-recognized heterogeneous pathology, classified into four molecular subtypes: Luminal A, Luminal B, HER2-enriched and Basal-like, each one with different biological and clinical characteristics. Long non-coding RNAs (lncRNAs) represent 33% of the human transcriptome and play critical roles in breast carcinogenesis, but most of their functions are still unknown. Therefore, cancer research could benefit from continued exploration into the biology of lncRNAs in this neoplasm. We characterized lncRNA expression portraits in 74 breast tumors belonging to the four molecular subtypes using transcriptome microarrays. To infer the biological role of the deregulated lncRNAs in the molecular subtypes, we performed co-expression analysis of lncRNA-mRNA and gene ontology analysis. We identified 307 deregulated lncRNAs in tumor compared to normal tissue and 354 deregulated lncRNAs among the different molecular subtypes. Through co-expression analysis between lncRNAs and protein-coding genes, along with gene enrichment analysis, we inferred the potential function of the most deregulated lncRNAs in each molecular subtype, and independently validated our results taking advantage of TCGA data. Overexpression of the AC009283.1 was observed in the HER2-enriched subtype and it is localized in an amplification zone at chromosome 17q12, suggesting it to be a potential tumorigenic lncRNA. The functional role of lncRNA AC009283.1 was examined through loss of function assays in vitro and determining its impact on global gene expression. These studies revealed that AC009283.1 regulates genes involved in proliferation, cell cycle and apoptosis in a HER2 cellular model. We further confirmed these findings through ssGSEA and CEMITool analysis in an independent HER2-amplified breast cancer cohort. Our findings suggest a wide range of biological functions for lncRNAs in each breast cancer molecular subtype and provide a basis for their biological and functional study, as was conducted for AC009283.1, showing it to be a potential regulator of proliferation and apoptosis in the HER2-enriched subtype.
Assuntos
Apoptose , Neoplasias da Mama/metabolismo , Proliferação de Células , Cromossomos Humanos Par 17 , RNA Longo não Codificante/biossíntese , Receptor ErbB-2/metabolismo , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Cromossomos Humanos Par 17/genética , Cromossomos Humanos Par 17/metabolismo , Feminino , Humanos , Células MCF-7 , RNA Longo não Codificante/genética , Receptor ErbB-2/genéticaRESUMO
Breast cancer (BRCA) is a serious public health problem, as it is the most frequent malignant tumor in women worldwide. BRCA is a molecularly heterogenic disease, particularly at gene expression (mRNAs) level. Recent evidence shows that coding RNAs represent only 34% of the total transcriptome in a human cell. The rest of the 66% of RNAs are non-coding, so we might be missing relevant biological, clinical or regulatory information. In this report, we identified nine novel tumor types from TCGA with FAM83H-AS1 deregulation. We used survival analysis to demonstrate that FAM83H-AS1 expression is a marker for poor survival in IHC-detected ER and PR positive BRCA patients and found a significant correlation between FAM83H-AS1 overexpression and tamoxifen resistance. Estrogen and Progesterone receptor expression levels interact with FAM83H-AS1 to potentiate its effect in OS prediction. FAM83H-AS1 silencing impairs two important breast cancer related pathways: cell migration and cell death. Among the most relevant potential FAM83H-AS1 gene targets, we found p63 and claudin 1 (CLDN1) to be deregulated after FAM83H-AS1 knockdown. Using correlation analysis, we show that FAM83H-AS1 can regulate a plethora of cancer-related genes across multiple tumor types, including BRCA. This evidence suggests that FAM83H-AS1 is a master regulator in different cancer types, and BRCA in particular.
Assuntos
Neoplasias da Mama/genética , Neoplasias da Mama/terapia , Regulação Neoplásica da Expressão Gênica/genética , Proteínas/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Neoplasias da Mama/patologia , Movimento Celular/genética , Claudina-1/genética , Resistencia a Medicamentos Antineoplásicos/genética , Feminino , Humanos , Pessoa de Meia-Idade , Prognóstico , Proteínas/metabolismo , Receptores de Estrogênio/metabolismo , Receptores de Progesterona/metabolismo , Análise de Sobrevida , Tamoxifeno/uso terapêutico , Fatores de Transcrição/genética , Proteínas Supressoras de Tumor/genética , Adulto JovemRESUMO
The Human Papillomavirus (HPV) E1 protein is the only viral protein with enzymatic activity. The main known function of this protein is the regulation of the viral DNA replication. Nevertheless, it has been demonstrated that the ablation of HPV18 E1 mRNA in HeLa cells promotes a deregulation of several genes, particularly those involved in host defense mechanisms against viral infections; however, the specific contribution of E1 protein in HPV-independent context has not been studied. The aim of this work was to determine the effect of the HPV E1 protein in the regulation of cellular gene expression profiles evaluated through RNA-seq. We found that E1 proteins from HPV16 and 18 induced an overexpression of different set of genes associated with proliferation and differentiation processes, as well as downregulation of immune response genes, including IFNß1 and IFNλ1 and Interferon-stimulated gene (ISG), which are important components involved in the antiviral immune response. Together, our results indicate that HR-(High-Risk) and LR-(Low-Risk) HPV E1 proteins play an important role in inhibiting the anti-viral immune response.