RESUMEN
An RNA structure or modified RNA sequences can provide a platform for ribosome loading and internal translation initiation. The functional significance of internal translation has recently been highlighted by the discovery that a subset of circular RNAs (circRNAs) is internally translated. However, the molecular mechanisms underlying the internal initiation of translation in circRNAs remain unclear. Here, we identify eIF3g (a subunit of eIF3 complex) as a binding partner of eIF4A3, a core component of the exon-junction complex (EJC) that is deposited onto spliced mRNAs and plays multiple roles in the regulation of gene expression. The direct interaction between eIF4A3-eIF3g serves as a molecular linker between the eIF4A3 and eIF3 complex, thereby facilitating internal ribosomal entry. Protein synthesis from in vitro-synthesized circRNA demonstrates eIF4A3-driven internal translation, which relies on the eIF4A3-eIF3g interaction. Furthermore, our transcriptome-wide analysis shows that efficient polysomal association of endogenous circRNAs requires eIF4A3. Notably, a subset of endogenous circRNAs can express a full-length intact protein, such as ß-catenin, in an eIF4A3-dependent manner. Collectively, our results expand the understanding of the protein-coding potential of the human transcriptome, including circRNAs.
Asunto(s)
Factor 3 de Iniciación Eucariótica , Factor 4A Eucariótico de Iniciación , ARN Circular , Humanos , Factor 3 de Iniciación Eucariótica/genética , Factor 3 de Iniciación Eucariótica/metabolismo , Factor 4A Eucariótico de Iniciación/metabolismo , Proteínas , Ribosomas/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismoRESUMEN
The comorbid association of autoimmune diseases with cancers has been a major obstacle to successful anti-cancer treatment. Cancer survival rate decreases significantly in patients with preexisting autoimmunity. However, to date, the molecular and cellular profiles of such comorbidities are poorly understood. We used Aicardi-Goutières syndrome (AGS) as a model autoimmune disease and explored the underlying mechanisms of genome instability in AGS-associated-gene-deficient patient cells. We found that R-loops are highly enriched at transcription-replication conflict regions of the genome in fibroblast of patients bearing SAMHD1 mutation, which is the AGS-associated-gene mutation most frequently reported with tumor and malignancies. In SAMHD1-depleted cells, R-loops accumulated with the concomitant activation of DNA damage responses. Removal of R-loops in SAMHD1 deficiency reduced cellular responses to genome instability. Furthermore, downregulation of SAMHD1 expression is associated with various types of cancer and poor survival rate. Our findings suggest that SAMHD1 functions as a tumor suppressor by resolving R-loops, and thus, SAMHD1 and R-loop may be novel diagnostic markers and targets for patient stratification in anti-cancer therapy.
Asunto(s)
Enfermedades Autoinmunes del Sistema Nervioso/genética , Enfermedades Autoinmunes/genética , Inestabilidad Genómica/genética , Malformaciones del Sistema Nervioso/genética , Proteína 1 que Contiene Dominios SAM y HD/genética , Enfermedades Autoinmunes/inmunología , Enfermedades Autoinmunes/patología , Enfermedades Autoinmunes del Sistema Nervioso/inmunología , Enfermedades Autoinmunes del Sistema Nervioso/patología , Línea Celular Tumoral , Daño del ADN/genética , Replicación del ADN/genética , Fibroblastos/metabolismo , Genoma Humano/genética , Humanos , Mutación/genética , Neoplasias/genética , Neoplasias/terapia , Malformaciones del Sistema Nervioso/inmunología , Malformaciones del Sistema Nervioso/patología , Estructuras R-Loop/genética , Proteína 1 que Contiene Dominios SAM y HD/ultraestructura , Transcripción Genética/genética , TransfecciónRESUMEN
MicroRNAs (miRNAs) regulate target mRNAs through a combination of translational repression and mRNA destabilization, with mRNA destabilization dominating at steady state in the few contexts examined globally. Here, we extend the global steady-state measurements to additional mammalian contexts and find that regardless of the miRNA, cell type, growth condition, or translational state, mRNA destabilization explains most (66%->90%) miRNA-mediated repression. We also determine the relative dynamics of translational repression and mRNA destabilization for endogenous mRNAs as a miRNA is induced. Although translational repression occurs rapidly, its effect is relatively weak, such that by the time consequential repression ensues, the effect of mRNA destabilization dominates. These results imply that consequential miRNA-mediated repression is largely irreversible and provide other insights into the nature of miRNA-mediated regulation. They also simplify future studies, dramatically extending the known contexts and time points for which monitoring mRNA changes captures most of the direct miRNA effects.
Asunto(s)
Regulación de la Expresión Génica , MicroARNs/fisiología , Modelos Genéticos , Estabilidad del ARN , ARN Mensajero/metabolismo , MicroARNs/genética , MicroARNs/metabolismoRESUMEN
MicroRNAs (miRNAs) are small regulatory RNAs that derive from distinctive hairpin transcripts. To learn more about the miRNAs of mammals, we sequenced 60 million small RNAs from mouse brain, ovary, testes, embryonic stem cells, three embryonic stages, and whole newborns. Analysis of these sequences confirmed 398 annotated miRNA genes and identified 108 novel miRNA genes. More than 150 previously annotated miRNAs and hundreds of candidates failed to yield sequenced RNAs with miRNA-like features. Ectopically expressing these previously proposed miRNA hairpins also did not yield small RNAs, whereas ectopically expressing the confirmed and newly identified hairpins usually did yield small RNAs with the classical miRNA features, including dependence on the Drosha endonuclease for processing. These experiments, which suggest that previous estimates of conserved mammalian miRNAs were inflated, provide a substantially revised list of confidently identified murine miRNAs from which to infer the general features of mammalian miRNAs. Our analyses also revealed new aspects of miRNA biogenesis and modification, including tissue-specific strand preferences, sequential Dicer cleavage of a metazoan precursor miRNA (pre-miRNA), consequential 5' heterogeneity, newly identified instances of miRNA editing, and evidence for widespread pre-miRNA uridylation reminiscent of miRNA regulation by Lin28.
Asunto(s)
Genes/genética , Genoma/genética , MicroARNs/genética , Animales , Línea Celular , Perfilación de la Expresión Génica , Humanos , Secuencias Invertidas Repetidas/genética , Ratones , MicroARNs/biosíntesis , MicroARNs/metabolismo , Ribonucleasa III/metabolismoRESUMEN
UNLABELLED: MicroRNAs (miRNAs) are small, noncoding RNAs that posttranscriptionally regulate gene expression in many tissues. Although a number of brain-enriched miRNAs have been identified, only a few specific miRNAs have been revealed as critical regulators of synaptic plasticity, learning, and memory. miR-9-5p/3p are brain-enriched miRNAs known to regulate development and their changes have been implicated in several neurological disorders, yet their role in mature neurons in mice is largely unknown. Here, we report that inhibition of miR-9-3p, but not miR-9-5p, impaired hippocampal long-term potentiation (LTP) without affecting basal synaptic transmission. Moreover, inhibition of miR-9-3p in the hippocampus resulted in learning and memory deficits. Furthermore, miR-9-3p inhibition increased the expression of the LTP-related genes Dmd and SAP97, the expression levels of which are negatively correlated with LTP. These results suggest that miR-9-3p-mediated gene regulation plays important roles in synaptic plasticity and hippocampus-dependent memory. SIGNIFICANCE STATEMENT: Despite the abundant expression of the brain-specific microRNA miR-9-5p/3p in both proliferating and postmitotic neurons, most functional studies have focused on their role in neuronal development. Here, we examined the role of miR-9-5p/3p in adult brain and found that miR-9-3p, but not miR-9-5p, has a critical role in hippocampal synaptic plasticity and memory. Moreover, we identified in vivo binding targets of miR-9-3p that are involved in the regulation of long-term potentiation. Our study provides the very first evidence for the critical role of miR-9-3p in synaptic plasticity and memory in the adult mouse.
Asunto(s)
Hipocampo/metabolismo , MicroARNs/metabolismo , Plasticidad Neuronal/fisiología , Reconocimiento en Psicología/fisiología , Animales , Condicionamiento Psicológico/fisiología , Homólogo 1 de la Proteína Discs Large , Distrofina/metabolismo , Conducta Exploratoria/fisiología , Miedo/fisiología , Proteínas Fluorescentes Verdes/genética , Proteínas Fluorescentes Verdes/metabolismo , Guanilato-Quinasas/metabolismo , Células HEK293 , Hipocampo/citología , Humanos , Masculino , Aprendizaje por Laberinto/efectos de los fármacos , Aprendizaje por Laberinto/fisiología , Proteínas de la Membrana/metabolismo , Ratones , Ratones Endogámicos C57BL , MicroARNs/genética , Plasticidad Neuronal/efectos de los fármacos , Receptores CXCR4/genética , Receptores CXCR4/metabolismo , Reconocimiento en Psicología/efectos de los fármacos , Sinapsinas/genética , Sinapsinas/metabolismo , Transducción GenéticaRESUMEN
UNLABELLED: Hepatic resection is the most curative treatment option for early-stage hepatocellular carcinoma, but is associated with a high recurrence rate, which exceeds 50% at 5 years after surgery. Understanding the genetic basis of hepatocellular carcinoma at surgically curable stages may enable the identification of new molecular biomarkers that accurately identify patients in need of additional early therapeutic interventions. Whole exome sequencing and copy number analysis was performed on 231 hepatocellular carcinomas (72% with hepatitis B viral infection) that were classified as early-stage hepatocellular carcinomas, candidates for surgical resection. Recurrent mutations were validated by Sanger sequencing. Unsupervised genomic analyses identified an association between specific genetic aberrations and postoperative clinical outcomes. Recurrent somatic mutations were identified in nine genes, including TP53, CTNNB1, AXIN1, RPS6KA3, and RB1. Recurrent homozygous deletions in FAM123A, RB1, and CDKN2A, and high-copy amplifications in MYC, RSPO2, CCND1, and FGF19 were detected. Pathway analyses of these genes revealed aberrations in the p53, Wnt, PIK3/Ras, cell cycle, and chromatin remodeling pathways. RB1 mutations were significantly associated with cancer-specific and recurrence-free survival after resection (multivariate P = 0.038 and P = 0.012, respectively). FGF19 amplifications, known to activate Wnt signaling, were mutually exclusive with CTNNB1 and AXIN1 mutations, and significantly associated with cirrhosis (P = 0.017). CONCLUSION: RB1 mutations can be used as a prognostic molecular biomarker for resectable hepatocellular carcinoma. Further study is required to investigate the potential role of FGF19 amplification in driving hepatocarcinogenesis in patients with liver cirrhosis and to investigate the potential of anti-FGF19 treatment in these patients.
Asunto(s)
Biomarcadores de Tumor/metabolismo , Carcinoma Hepatocelular/genética , Factores de Crecimiento de Fibroblastos/genética , Neoplasias Hepáticas/genética , Proteína de Retinoblastoma/genética , Adulto , Anciano , Anciano de 80 o más Años , Carcinoma Hepatocelular/metabolismo , Carcinoma Hepatocelular/cirugía , Variaciones en el Número de Copia de ADN , Análisis Mutacional de ADN , Factor de Transcripción E2F1/metabolismo , Femenino , Humanos , Neoplasias Hepáticas/metabolismo , Neoplasias Hepáticas/cirugía , Masculino , Persona de Mediana Edad , Proteína de Retinoblastoma/metabolismoRESUMEN
MicroRNAs are endogenous approximately 23-nucleotide RNAs that can pair to sites in the messenger RNAs of protein-coding genes to downregulate the expression from these messages. MicroRNAs are known to influence the evolution and stability of many mRNAs, but their global impact on protein output had not been examined. Here we use quantitative mass spectrometry to measure the response of thousands of proteins after introducing microRNAs into cultured cells and after deleting mir-223 in mouse neutrophils. The identities of the responsive proteins indicate that targeting is primarily through seed-matched sites located within favourable predicted contexts in 3' untranslated regions. Hundreds of genes were directly repressed, albeit each to a modest degree, by individual microRNAs. Although some targets were repressed without detectable changes in mRNA levels, those translationally repressed by more than a third also displayed detectable mRNA destabilization, and, for the more highly repressed targets, mRNA destabilization usually comprised the major component of repression. The impact of microRNAs on the proteome indicated that for most interactions microRNAs act as rheostats to make fine-scale adjustments to protein output.
Asunto(s)
MicroARNs/genética , MicroARNs/metabolismo , Biosíntesis de Proteínas , Animales , Regulación de la Expresión Génica , Células HeLa , Humanos , Marcaje Isotópico , Masculino , Ratones , Neutrófilos/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos , Proteómica , TransfecciónRESUMEN
The exponential growth of big data in RNA biology (RB) has led to the development of deep learning (DL) models that have driven crucial discoveries. As constantly evidenced by DL studies in other fields, the successful implementation of DL in RB depends heavily on the effective utilization of large-scale datasets from public databases. In achieving this goal, data encoding methods, learning algorithms, and techniques that align well with biological domain knowledge have played pivotal roles. In this review, we provide guiding principles for applying these DL concepts to various problems in RB by demonstrating successful examples and associated methodologies. We also discuss the remaining challenges in developing DL models for RB and suggest strategies to overcome these challenges. Overall, this review aims to illuminate the compelling potential of DL for RB and ways to apply this powerful technology to investigate the intriguing biology of RNA more effectively.
Asunto(s)
Macrodatos , Aprendizaje Profundo , ARN , ARN/genética , Humanos , Biología Computacional/métodos , Algoritmos , AnimalesRESUMEN
Fibro-adipogenic progenitors (FAPs) are muscle-resident mesenchymal progenitors that can contribute to muscle tissue homeostasis and regeneration, as well as postnatal maturation and lifelong maintenance of the neuromuscular system. Recently, traumatic injury to the peripheral nerve was shown to activate FAPs, suggesting that FAPs can respond to nerve injury. However, questions of how FAPs can sense the anatomically distant peripheral nerve injury and whether FAPs can directly contribute to nerve regeneration remained unanswered. Here, utilizing single-cell transcriptomics and mouse models, we discovered that a subset of FAPs expressing GDNF receptors Ret and Gfra1 can respond to peripheral nerve injury by sensing GDNF secreted by Schwann cells. Upon GDNF sensing, this subset becomes activated and expresses Bdnf. FAP-specific inactivation of Bdnf (Prrx1Cre; Bdnffl/fl) resulted in delayed nerve regeneration owing to defective remyelination, indicating that GDNF-sensing FAPs play an important role in the remyelination process during peripheral nerve regeneration. In aged mice, significantly reduced Bdnf expression in FAPs was observed upon nerve injury, suggesting the clinical relevance of FAP-derived BDNF in the age-related delays in nerve regeneration. Collectively, our study revealed the previously unidentified role of FAPs in peripheral nerve regeneration, and the molecular mechanism behind FAPs' response to peripheral nerve injury.
Asunto(s)
Factor Neurotrófico Derivado del Encéfalo , Factor Neurotrófico Derivado de la Línea Celular Glial , Células Madre Mesenquimatosas , Regeneración Nerviosa , Traumatismos de los Nervios Periféricos , Animales , Factor Neurotrófico Derivado de la Línea Celular Glial/metabolismo , Factor Neurotrófico Derivado de la Línea Celular Glial/genética , Traumatismos de los Nervios Periféricos/metabolismo , Ratones , Células Madre Mesenquimatosas/metabolismo , Factor Neurotrófico Derivado del Encéfalo/metabolismo , Receptores del Factor Neurotrófico Derivado de la Línea Celular Glial/metabolismo , Receptores del Factor Neurotrófico Derivado de la Línea Celular Glial/genética , Células de Schwann/metabolismo , Masculino , Proteínas Proto-Oncogénicas c-ret/metabolismo , Proteínas Proto-Oncogénicas c-ret/genéticaRESUMEN
MicroRNAs (miRNAs) play cardinal roles in regulating biological pathways and processes, resulting in significant physiological effects. To understand the complex regulatory network of miRNAs, previous studies have utilized massivescale datasets of miRNA targeting and attempted to computationally predict the functional targets of miRNAs. Many miRNA target prediction tools have been developed and are widely used by scientists from various fields of biology and medicine. Most of these tools consider seed pairing between miRNAs and their mRNA targets and additionally consider other determinants to improve prediction accuracy. However, these tools exhibit limited prediction accuracy and high false positive rates. The utilization of additional determinants, such as RNA modifications and RNA-binding protein binding sites, may further improve miRNA target prediction. In this review, we discuss the determinants of functional miRNA targeting that are currently used in miRNA target prediction and the potentially predictive but unappreciated determinants that may improve prediction accuracy.
Asunto(s)
Marcación de Gen , MicroARNs , Biología Computacional , MicroARNs/genética , MicroARNs/metabolismo , ARN Mensajero/genética , Marcación de Gen/métodosRESUMEN
The detection of somatic DNA variants in tumor samples with low tumor purity or sequencing depth remains a daunting challenge despite numerous attempts to address this problem. In this study, we constructed a substantially extended set of actual positive variants originating from a wide range of tumor purities and sequencing depths, as well as actual negative variants derived from sequencer-specific sequencing errors. A deep learning model named AIVariant, trained on this extended dataset, outperforms previously reported methods when tested under various tumor purities and sequencing depths, especially low tumor purity and sequencing depth.
Asunto(s)
Aprendizaje Profundo , Neoplasias , Humanos , Frecuencia de los Genes , Biología Computacional/métodos , Algoritmos , Neoplasias/genética , Neoplasias/diagnóstico , MutaciónRESUMEN
Variant callers typically produce massive numbers of false positives for structural variations, such as cancer-relevant copy-number alterations and fusion genes resulting from genome rearrangements. Here we describe an ultrafast and accurate detector of somatic structural variations that reduces read-mapping costs by filtering out reads matched to pan-genome k-mer sets. The detector, which we named ETCHING (for efficient detection of chromosomal rearrangements and fusion genes), reduces the number of false positives by leveraging machine-learning classifiers trained with six breakend-related features (clipped-read count, split-reads count, supporting paired-end read count, average mapping quality, depth difference and total length of clipped bases). When benchmarked against six callers on reference cell-free DNA, validated biomarkers of structural variants, matched tumour and normal whole genomes, and tumour-only targeted sequencing datasets, ETCHING was 11-fold faster than the second-fastest structural-variant caller at comparable performance and memory use. The speed and accuracy of ETCHING may aid large-scale genome projects and facilitate practical implementations in precision medicine.
Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Neoplasias , Humanos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Genoma , Análisis de Secuencia de ADN/métodosRESUMEN
Despite substantial advances in disease genetics, studies to date have largely focused on individuals of European descent. This limits further discoveries of novel functional genetic variants in other ethnic groups. To alleviate the paucity of East Asian population genome resources, we established the Korean Variant Archive 2 (KOVA 2), which is composed of 1896 whole-genome sequences and 3409 whole-exome sequences from healthy individuals of Korean ethnicity. This is the largest genome database from the ethnic Korean population to date, surpassing the 1909 Korean individuals deposited in gnomAD. The variants in KOVA 2 displayed all the known genetic features of those from previous genome databases, and we compiled data from Korean-specific runs of homozygosity, positively selected intervals, and structural variants. In doing so, we found loci, such as the loci of ADH1A/1B and UHRF1BP1, that are strongly selected in the Korean population relative to other East Asian populations. Our analysis of allele ages revealed a correlation between variant functionality and evolutionary age. The data can be browsed and downloaded from a public website ( https://www.kobic.re.kr/kova/ ). We anticipate that KOVA 2 will serve as a valuable resource for genetic studies involving East Asian populations.
Asunto(s)
Pueblo Asiatico , Exoma , Humanos , Pueblo Asiatico/genética , República de Corea , Polimorfismo de Nucleótido SimpleRESUMEN
COVID-19 is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which infected >200 million people resulting in >4 million deaths. However, temporal landscape of the SARS-CoV-2 translatome and its impact on the human genome remain unexplored. Here, we report a high-resolution atlas of the translatome and transcriptome of SARS-CoV-2 for various time points after infecting human cells. Intriguingly, substantial amount of SARS-CoV-2 translation initiates at a novel translation initiation site (TIS) located in the leader sequence, termed TIS-L. Since TIS-L is included in all the genomic and subgenomic RNAs, the SARS-CoV-2 translatome may be regulated by a sophisticated interplay between TIS-L and downstream TISs. TIS-L functions as a strong translation enhancer for ORF S, and as translation suppressors for most of the other ORFs. Our global temporal atlas provides compelling insight into unique regulation of the SARS-CoV-2 translatome and helps comprehensively evaluate its impact on the human genome.
Asunto(s)
COVID-19/virología , Biosíntesis de Proteínas , SARS-CoV-2/genética , Transcriptoma , Regulación Viral de la Expresión Génica , Genoma Humano , Humanos , Sistemas de Lectura Abierta , ARN Viral/genética , ARN Viral/metabolismo , SARS-CoV-2/metabolismo , Proteínas Virales/genética , Proteínas Virales/metabolismoRESUMEN
Argonaute is the primary mediator of metazoan miRNA targeting (MT). Among the currently identified >1,500 human RNA-binding proteins (RBPs), there are only a handful of RBPs known to enhance MT and several others reported to suppress MT, leaving the global impact of RBPs on MT elusive. In this study, we have systematically analyzed transcriptome-wide binding sites for 150 human RBPs and evaluated the quantitative effect of individual RBPs on MT efficacy. In contrast to previous studies, we show that most RBPs significantly affect MT and that all of those MT-regulating RBPs function as MT enhancers rather than suppressors, by making the local secondary structure of the target site accessible to Argonaute. Our findings illuminate the unappreciated regulatory impact of human RBPs on MT, and as these RBPs may play key roles in the gene regulatory network governed by metazoan miRNAs, MT should be understood in the context of co-regulating RBPs.
Asunto(s)
MicroARNs/metabolismo , Proteínas de Unión al ARN/metabolismo , Regiones no Traducidas 3'/genética , Sitios de Unión , Evolución Molecular , Células HeLa , Células Hep G2 , Humanos , MicroARNs/genética , Conformación de Ácido Nucleico , Unión Proteica , ARN Mensajero/química , ARN Mensajero/genética , ARN Mensajero/metabolismo , Reproducibilidad de los Resultados , Especificidad por SustratoRESUMEN
Ionization-based volatile organic compound (VOC) sensors that use photons or electrons operating at room temperature have attracted considerable attention as a promising alternative to conventional metal oxide-based sensors that require high temperature for sensing function. However, the photoionization sensors cannot ionize many gas species for their limited photon energy, and field emission-based ionization sensors that rely on the breakdown voltage of specific gas species in a pure state may not tell different concentration. This work demonstrates the detection of VOCs using impact ionization induced by accelerated photoelectrons. Although the photoelectrons emitted by relatively low photon energy typically have insufficient kinetic energy to cause impact ionization, in this approach, they are accelerated between microgap electrodes to enhance their kinetic energy such that the impact ionization of VOCs can be achieved. The demonstrated gas sensor sensitively detects toluene concentration in a wide range from 1000 ppm to 100 ppb with fast response and recovery time at room temperature. Additionally, diverse VOC species including benzene, p-xylene, and even acetylene with high ionization energy can be detected. The proposed method could be a viable solution for VOC sensors with low cost, scalable producibility, and high performance.
RESUMEN
Insertions and deletions (INDELs) comprise a significant proportion of human genetic variation, and recent papers have revealed that many human diseases may be attributable to INDELs. With the development of next-generation sequencing (NGS) technology, many statistical/computational tools have been developed for calling INDELs. However, there are differences among those tools, and comparisons among them have been limited. In order to better understand these inter-tool differences, five popular and publicly available INDEL calling tools-GATK HaplotypeCaller, Platypus, VarScan2, Scalpel, and GotCloud-were evaluated using simulation data, 1000 Genomes Project data, and family-based sequencing data. The accuracy of INDEL calling by each tool was mainly evaluated by concordance rates. Family-based sequencing data, which consisted of 49 individuals from eight Korean families, were used to calculate Mendelian error rates. Our comparison results show that GATK HaplotypeCaller usually performs the best and that joint calling with Platypus can lead to additional improvements in accuracy. The result of this study provides important information regarding future directions for the variant detection and the algorithms development.
Asunto(s)
Análisis Mutacional de ADN , Secuenciación de Nucleótidos de Alto Rendimiento , Mutación INDEL/genética , Análisis de Secuencia de ADN , Programas Informáticos , Algoritmos , Biología Computacional , Simulación por Computador , Análisis Mutacional de ADN/métodos , Análisis Mutacional de ADN/normas , Bases de Datos Genéticas , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/normas , Humanos , Reproducibilidad de los Resultados , Análisis de Secuencia de ADN/métodos , Análisis de Secuencia de ADN/normasRESUMEN
We report a novel transcriptomic analysis workflow called LiEB (Life cycle of Epstein-Barr virus) to characterize distributions of oncogenic virus, Epstein-Barr virus (EBV) infection in human tumors. We analyzed 851 The Cancer Genome Atlas whole-transcriptome sequencing (WTS) data to investigate EBV infection by life cycle information using three-step LiEB workflow: 1) characterize virus infection generally; 2) align transcriptome sequences against a hybrid human-EBV genome, and 3) quantify EBV gene expression. Our results agreed with EBV infection status of public cell line data. Analysis in stomach adenocarcinoma identified EBV-positive cases involving PIK3CA mutations and/or CDKN2A silencing with biologically more determination, compared to previous reports. In this study, we found that a small number of colorectal adenocarcinoma cases involved with EBV lytic gene expression. Expression of EBV lytic genes was also observed in 3% of external colon cancer cohort upon WTS analysis. Gene set enrichment analysis showed elevated expression of genes related to E2F targeting and interferon-gamma responses in EBV-associated tumors. Finally, we suggest that interpretation of EBV life cycle is essential when analyzing its infection in tumors, and LiEB provides high capability of detecting EBV-positive tumors. Observation of EBV lytic gene expression in a subset of colon cancers warrants further research.
Asunto(s)
Infecciones por Virus de Epstein-Barr/complicaciones , Infecciones por Virus de Epstein-Barr/virología , Genoma Humano , Herpesvirus Humano 4/fisiología , Neoplasias/etiología , Transformación Celular Viral , Infecciones por Virus de Epstein-Barr/diagnóstico , Perfilación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Estadios del Ciclo de Vida , Técnicas de Diagnóstico Molecular , Mutación , Neoplasias/diagnóstico , Reproducibilidad de los Resultados , TranscriptomaRESUMEN
The roles of miRNAs in lung cancer have not yet been explored systematically at the genome scale despite their important regulatory functions. Here, we report an integrative analysis of miRNA and mRNA sequencing data for matched tumor-normal samples from 109 Korean female patients with non-small-cell lung adenocarcinoma (LUAD). We produced miRNA sequencing (miRNA-Seq) and RNA-Seq data for 48 patients and RNA-Seq data for 61 additional patients. Subsequent differential expression analysis with stringent criteria yielded 44 miRNAs and 2322 genes. Integrative gene set analysis of the differentially expressed miRNAs and genes using miRNA-target information revealed several regulatory processes related to the cell cycle that were targeted by tumor suppressor miRNAs (TSmiR). We performed colony formation assays in A549 and NCI-H460 cell lines to test the tumor-suppressive activity of downregulated miRNAs in cancer and identified 7 novel TSmiRs (miR-144-5p, miR-218-1-3p, miR-223-3p, miR-27a-5p, miR-30a-3p, miR-30c-2-3p, miR-338-5p). Two miRNAs, miR-30a-3p and miR-30c-2-3p, showed differential survival characteristics in the Tumor Cancer Genome Atlas (TCGA) LUAD patient cohort indicating their prognostic value. Finally, we identified a network cluster of miRNAs and target genes that could be responsible for cell cycle regulation. Our study not only provides a dataset of miRNA as well as mRNA sequencing from the matched tumor-normal samples, but also reports several novel TSmiRs that could potentially be developed into prognostic biomarkers or therapeutic RNA drugs.
Asunto(s)
Adenocarcinoma del Pulmón/genética , Neoplasias Pulmonares/genética , MicroARNs/metabolismo , ARN Mensajero/metabolismo , Células A549 , Adulto , Anciano , Anciano de 80 o más Años , Línea Celular Tumoral , Femenino , Regulación Neoplásica de la Expresión Génica/genética , Humanos , Masculino , MicroARNs/genética , Persona de Mediana Edad , ARN Mensajero/genéticaRESUMEN
MicroRNAs (miRNAs) are â¼22nt-long single-stranded RNA molecules that form a RNA-induced silencing complex with Argonaute (AGO) protein to post-transcriptionally downregulate their target messenger RNAs (mRNAs). To understand the regulatory mechanisms of miRNA, discovering the underlying functional rules for how miRNAs recognize and repress their target mRNAs is of utmost importance. To determine functional miRNA targeting rules, previous studies extensively utilized various methods including high-throughput biochemical assays and bioinformatics analyses. However, targeting rules reported in one study often fail to be reproduced in other studies and therefore the general rules for functional miRNA targeting remain elusive. In this review, we evaluate previously-reported miRNA targeting rules and discuss the biological impact of the functional miRNAs on gene-regulatory networks as well as the future direction of miRNA targeting research. [BMB Reports 2017; 50(11): 554-559].