1.
bioRxiv ; 2024 Jun 14.
Article in English | MEDLINE | ID: mdl-38915585

ABSTRACT

Genetic regulation of alternative splicing constitutes an important link between genetic variation and disease. Nonetheless, RNA splicing is regulated by both cis-acting elements and trans-acting splicing factors. Determining splicing events that are directed primarily by the cis- or trans-acting mechanisms will greatly inform our understanding of the genetic basis of disease. Here, we show that long-read RNA-seq, combined with our new method isoLASER, enables a clear segregation of cis- and trans-directed splicing events for individual samples. The genetic linkage of splicing is largely individual-specific, in stark contrast to the tissue-specific pattern of splicing profiles. Analysis of long-read RNA-seq data from human and mouse revealed thousands of cis-directed splicing events susceptible to genetic regulation. We highlight such events in the HLA genes whose analysis was challenging with short-read data. We also highlight novel cis-directed splicing events in Alzheimer's disease-relevant genes such as MAPT and BIN1. Together, the clear demarcation of cis- and trans-directed splicing paves the way for future studies of the genetic basis of disease.

2.
Am J Hum Genet ; 2024 Jun 20.
Article in English | MEDLINE | ID: mdl-38925119

ABSTRACT

Recent studies have highlighted the essential role of RNA splicing, a key mechanism of alternative RNA processing, in establishing connections between genetic variations and disease. Genetic loci influencing RNA splicing variations show considerable influence on complex traits, possibly surpassing those affecting total gene expression. Dysregulated RNA splicing has emerged as a major potential contributor to neurological and psychiatric disorders, likely due to the exceptionally high prevalence of alternatively spliced genes in the human brain. Nevertheless, establishing direct associations between genetically altered splicing and complex traits has remained an enduring challenge. We introduce Spliced-Transcriptome-Wide Associations (SpliTWAS) to integrate alternative splicing information with genome-wide association studies to pinpoint genes linked to traits through exon splicing events. We applied SpliTWAS to two schizophrenia (SCZ) RNA-sequencing datasets, BrainGVEX and CommonMind, revealing 137 and 88 trait-associated exons (in 84 and 67 genes), respectively. Enriched biological functions in the associated gene sets converged on neuronal function and development, immune cell activation, and cellular transport, which are highly relevant to SCZ. SpliTWAS variants impacted RNA-binding protein binding sites, revealing potential disruption of RNA-protein interactions affecting splicing. We extended the probabilistic fine-mapping method FOCUS to the exon level, identifying 36 genes and 48 exons as putatively causal for SCZ. We highlight VPS45 and APOPT1, where splicing of specific exons was associated with disease risk, eluding detection by conventional gene expression analysis. Collectively, this study supports the substantial role of alternative splicing in shaping the genetic basis of SCZ, providing a valuable approach for future investigations in this area.

3.
Mol Cell ; 84(11): 2087-2103.e8, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38815579

ABSTRACT

RNA splicing is pivotal in post-transcriptional gene regulation, yet the exponential expansion of intron length in humans poses a challenge for accurate splicing. Here, we identify hnRNPM as an essential RNA-binding protein that suppresses cryptic splicing through binding to deep introns, maintaining human transcriptome integrity. Long interspersed nuclear elements (LINEs) in introns harbor numerous pseudo splice sites. hnRNPM preferentially binds at intronic LINEs to repress pseudo splice site usage for cryptic splicing. Remarkably, cryptic exons can generate long dsRNAs through base-pairing of inverted ALU transposable elements interspersed among LINEs and consequently trigger an interferon response, a well-known antiviral defense mechanism. Significantly, hnRNPM-deficient tumors show upregulated interferon-associated pathways and elevated immune cell infiltration. These findings unveil hnRNPM as a guardian of transcriptome integrity by repressing cryptic splicing and suggest that targeting hnRNPM in tumors may be used to trigger an inflammatory immune response, thereby boosting cancer surveillance.


Subjects
Heterogeneous-Nuclear Ribonucleoprotein Group M , Introns , Long Interspersed Nucleotide Elements , RNA Splicing , RNA, Double-Stranded , Humans , Heterogeneous-Nuclear Ribonucleoprotein Group M/genetics , Heterogeneous-Nuclear Ribonucleoprotein Group M/metabolism , RNA, Double-Stranded/genetics , RNA, Double-Stranded/metabolism , Long Interspersed Nucleotide Elements/genetics , Interferons/metabolism , Interferons/genetics , Animals , HEK293 Cells , Mice , Transcriptome , Exons , RNA Splice Sites , Alu Elements/genetics

4.
Genome Res ; 34(4): 515-529, 2024 May 15.
Article in English | MEDLINE | ID: mdl-38702197

ABSTRACT

RNA modifications, also termed epitranscriptomic marks, encompass chemical alterations to individual nucleotides, including processes such as methylation and editing. These marks contribute to a wide range of biological processes, many of which are related to host immune system defense. The functions of immune-related RNA modifications can be categorized into three main groups: regulation of immunogenic RNAs, control of genes involved in innate immune response, and facilitation of adaptive immunity. Here, we provide an overview of recent research findings that elucidate the contributions of RNA modifications to each of these processes. We also discuss relevant methods for genome-wide identification of RNA modifications and their immunogenic substrates. Finally, we highlight recent advances in cancer immunotherapies that aim to reduce cancer cell viability by targeting the enzymes responsible for RNA modifications. Our presentation of these dynamic research avenues sets the stage for future investigations in this field.


Subjects
Epigenesis, Genetic , Immunity, Innate , Neoplasms , Transcriptome , Humans , Neoplasms/genetics , Neoplasms/immunology , Immunity, Innate/genetics , RNA Processing, Post-Transcriptional , Animals , Adaptive Immunity/genetics , RNA/genetics , RNA/metabolism

5.
Nat Commun ; 15(1): 3335, 2024 Apr 18.
Article in English | MEDLINE | ID: mdl-38637555

ABSTRACT

Understanding the function of rare non-coding variants represents a significant challenge. Using MapUTR, a screening method, we studied the function of rare 3' UTR variants affecting mRNA abundance post-transcriptionally. Among 17,301 rare gnomAD variants, an average of 24.5% were functional, with 70% in cancer-related genes, many in critical cancer pathways. This observation motivated an interrogation of 11,929 somatic mutations, uncovering 3928 (33%) functional mutations in 155 cancer driver genes. Functional MapUTR variants were enriched in microRNA- or protein-binding sites and may underlie outlier gene expression in tumors. Further, we introduce untranslated tumor mutational burden (uTMB), a metric reflecting the amount of somatic functional MapUTR variants of a tumor and show its potential in predicting patient survival. Through prime editing, we characterized three variants in cancer-relevant genes (MFN2, FOSL2, and IRAK1), demonstrating their cancer-driving potential. Our study elucidates the function of tens of thousands of non-coding variants, nominates non-coding cancer driver mutations, and demonstrates their potential contributions to cancer.


Subjects
Neoplasms , Oncogenes , Humans , 3' Untranslated Regions/genetics , RNA, Messenger/genetics , Mutation , Neoplasms/genetics

6.
Bioinformatics ; 39(11), 2023 Nov 01.
Article in English | MEDLINE | ID: mdl-37871161

ABSTRACT

MOTIVATION: Double-stranded RNAs (dsRNAs) are potent triggers of innate immune responses upon recognition by cytosolic dsRNA sensor proteins. Identification of endogenous dsRNAs helps to better understand the dsRNAome and its relevance to innate immunity related to human diseases. RESULTS: Here, we report dsRID (double-stranded RNA identifier), a machine-learning-based method to predict dsRNA regions in silico, leveraging the power of long-read RNA-sequencing (RNA-seq) and molecular traits of dsRNAs. Using models trained with PacBio long-read RNA-seq data derived from Alzheimer's disease (AD) brain, we show that our approach is highly accurate in predicting dsRNA regions in multiple datasets. Applied to an AD cohort sequenced by the ENCODE consortium, we characterize the global dsRNA profile with potentially distinct expression patterns between AD and controls. Together, we show that dsRID provides an effective approach to capture global dsRNA profiles using long-read RNA-seq data. AVAILABILITY AND IMPLEMENTATION: Software implementation of dsRID, and genomic coordinates of regions predicted by dsRID in all samples are available at the GitHub repository: https://github.com/gxiaolab/dsRID.


Subjects
Genome , RNA, Double-Stranded , Humans , RNA-Seq , Sequence Analysis, RNA , Base Sequence , Software

7.
Genome Biol ; 24(1): 171, 2023 Jul 20.
Article in English | MEDLINE | ID: mdl-37474948

ABSTRACT

Although long-read RNA-seq is increasingly applied to characterize full-length transcripts, it can also enable detection of nucleotide variants, such as genetic mutations or RNA editing sites, an application that remains significantly under-explored. Here, we present an in-depth study to detect and analyze RNA editing sites in long-read RNA-seq. Our new method, L-GIREMI, effectively handles sequencing errors and read biases. Applied to PacBio RNA-seq data, L-GIREMI affords a high accuracy in RNA editing identification. Additionally, our analysis uncovered novel insights about RNA editing occurrences in single molecules and double-stranded RNA structures. L-GIREMI provides a valuable means to study nucleotide variants in long-read RNA-seq.


Subjects
RNA Editing , Transcriptome , RNA-Seq , Nucleotides , Sequence Analysis, RNA/methods , High-Throughput Nucleotide Sequencing/methods

8.
Cell Rep ; 42(8): 112856, 2023 Aug 29.
Article in English | MEDLINE | ID: mdl-37481717

ABSTRACT

To identify addiction genes, we evaluate intravenous self-administration of cocaine or saline in 84 inbred and recombinant inbred mouse strains over 10 days. We integrate the behavior data with brain RNA-seq data from 41 strains. The self-administration of cocaine and that of saline are genetically distinct. We maximize power to map loci for cocaine intake by using a linear mixed model to account for this longitudinal phenotype while correcting for population structure. A total of 15 unique significant loci are identified in the genome-wide association study. A transcriptome-wide association study highlights the Trpv2 ion channel as a key locus for cocaine self-administration as well as identifying 17 additional genes, including Arhgef26, Slc18b1, and Slco5a1. We find numerous instances where alternate splice site selection or RNA editing altered transcript abundance. Our work emphasizes the importance of Trpv2, an ionotropic cannabinoid receptor, for the response to cocaine.


Subjects
Cocaine-Related Disorders , Cocaine , Mice , Animals , Cocaine/pharmacology , Genome-Wide Association Study , Brain , Administration, Intravenous , Mice, Inbred C57BL

9.
bioRxiv ; 2023 Jun 07.
Article in English | MEDLINE | ID: mdl-37333092

ABSTRACT

Double-stranded RNAs (dsRNAs) are potent triggers of innate immune responses upon recognition by cytosolic dsRNA sensor proteins. Identification of endogenous dsRNAs helps to better understand the dsRNAome and its relevance to innate immunity related to human diseases. Here, we report dsRID (double-stranded RNA identifier), a machine learning-based method to predict dsRNA regions in silico, leveraging the power of long-read RNA-sequencing (RNA-seq) and molecular traits of dsRNAs. Using models trained with PacBio long-read RNA-seq data derived from Alzheimer's disease (AD) brain, we show that our approach is highly accurate in predicting dsRNA regions in multiple datasets. Applied to an AD cohort sequenced by the ENCODE consortium, we characterize the global dsRNA profile with potentially distinct expression patterns between AD and controls. Together, we show that dsRID provides an effective approach to capture global dsRNA profiles using long-read RNA-seq data.

10.
Sci Adv ; 9(14): eade9997, 2023 Apr 07.
Article in English | MEDLINE | ID: mdl-37027465

ABSTRACT

RNA editing, the endogenous modification of nucleic acids, is known to be altered in genes with important neurological function in schizophrenia (SCZ). However, the global profile and molecular functions of disease-associated RNA editing remain unclear. Here, we analyzed RNA editing in postmortem brains of four SCZ cohorts and uncovered a significant and reproducible trend of hypoediting in patients of European descent. We report a set of SCZ-associated editing sites via WGCNA analysis, shared across cohorts. Using massively parallel reporter assays and bioinformatic analyses, we observed that differential 3' untranslated region (3'UTR) editing sites affecting host gene expression were enriched for mitochondrial processes. Furthermore, we characterized the impact of two recoding sites in the mitofusin 1 (MFN1) gene and showed their functional relevance to mitochondrial fusion and cellular apoptosis. Our study reveals a global reduction of editing in SCZ and a compelling link between editing and mitochondrial function in the disease.


Subjects
RNA , Schizophrenia , Humans , RNA/metabolism , Schizophrenia/genetics , Schizophrenia/metabolism , Brain/metabolism , Mitochondria/genetics

11.
bioRxiv ; 2023 Feb 24.
Article in English | MEDLINE | ID: mdl-36865202

ABSTRACT

RNA splicing plays a critical role in post-transcriptional gene regulation. Exponential expansion of intron length poses a challenge for accurate splicing. Little is known about how cells prevent inadvertent and often deleterious expression of intronic elements due to cryptic splicing. In this study, we identify hnRNPM as an essential RNA binding protein that suppresses cryptic splicing through binding to deep introns, preserving transcriptome integrity. Long interspersed nuclear elements (LINEs) harbor large amounts of pseudo splice sites in introns. hnRNPM preferentially binds at intronic LINEs and represses LINE-containing pseudo splice site usage for cryptic splicing. Remarkably, a subgroup of the cryptic exons can form long dsRNAs through base-pairing of inverted Alu transposable elements scattered in between LINEs and trigger interferon immune response, a well-known antiviral defense mechanism. Notably, these interferon-associated pathways are found to be upregulated in hnRNPM-deficient tumors, which also exhibit elevated immune cell infiltration. These findings unveil hnRNPM as a guardian of transcriptome integrity. Targeting hnRNPM in tumors may be used to trigger an inflammatory immune response thereby boosting cancer surveillance.

12.
Cancer Res ; 83(3): 374-385, 2023 Feb 03.
Article in English | MEDLINE | ID: mdl-36449563

ABSTRACT

RNA editing modifies single nucleotides of RNAs, regulating primary protein structure and protein abundance. In recent years, the diversity of proteins and complexity of gene regulation associated with RNA editing dysregulation has been increasingly appreciated in oncology. Large-scale shifts in editing have been observed in bulk tumors across various cancer types. However, RNA editing in single cells and individual cell types within tumors has not been explored. By profiling editing in single cells from lung adenocarcinoma biopsies, we found that the increased editing trend of bulk lung tumors was unique to cancer cells. Elevated editing levels were observed in cancer cells resistant to targeted therapy, and editing sites associated with drug response were enriched. Consistent with the regulation of antiviral pathways by RNA editing, higher editing levels in cancer cells were associated with reduced antitumor innate immune response, especially levels of natural killer cell infiltration. In addition, the level of RNA editing in cancer cells was positively associated with somatic point mutation burden. This observation motivated the definition of a new metric, RNA editing load, reflecting the amount of RNA mutations created by RNA editing. Importantly, in lung cancer, RNA editing load was a stronger predictor of patient survival than DNA mutations. This study provides the first single cell dissection of editing in cancer and highlights the significance of RNA editing load in cancer prognosis. SIGNIFICANCE: RNA editing analysis in single lung adenocarcinoma cells uncovers RNA mutations that correlate with tumor mutation burden and cancer innate immunity and reveals the amount of RNA mutations that strongly predicts patient survival. See related commentary by Luo and Liang, p. 351.


Subjects
Adenocarcinoma of Lung , Lung Neoplasms , Humans , RNA Editing , Adenocarcinoma of Lung/genetics , Lung Neoplasms/pathology , RNA , Prognosis , Immunity, Innate/genetics , Single-Cell Analysis

13.
Sci Adv ; 8(35): eabn6398, 2022 Sep 02.
Article in English | MEDLINE | ID: mdl-36054357

ABSTRACT

Single-cell RNA sequencing (scRNA-seq) data contain rich information at the gene, transcript, and nucleotide levels. Most analyses of scRNA-seq have focused on gene expression profiles, and it remains challenging to extract nucleotide variants and isoform-specific information. Here, we present scAllele, an integrative approach that detects single-nucleotide variants, insertions, deletions, and their allelic linkage with splicing patterns in scRNA-seq. We demonstrate that scAllele achieves better performance in identifying nucleotide variants than other commonly used tools. In addition, the read-specific variant calls by scAllele enable allele-specific splicing analysis, a unique feature not afforded by other methods. Applied to a lung cancer scRNA-seq dataset, scAllele identified variants with strong allelic linkage to alternative splicing, some of which are cancer specific and enriched in cancer-relevant pathways. scAllele represents a versatile tool to uncover multilayer information and previously unidentified biological insights from scRNA-seq data.

14.
iScience ; 25(8): 104836, 2022 Aug 19.
Article in English | MEDLINE | ID: mdl-35992085

ABSTRACT

PODXL, a protein that is dysregulated in multiple cancers, plays an important role in promoting cancer metastasis. In this study, we report that RNA editing promotes the inclusion of a PODXL alternative exon. The resulting edited PODXL long isoform is more prone to protease digestion and has the strongest effects on reducing cell migration and cisplatin chemoresistance among the three PODXL isoforms (short, unedited long, and edited long isoforms). Importantly, the editing level of the PODXL recoding site and the inclusion level of the PODXL alternative exon are strongly associated with overall patient survival in Kidney Renal Clear Cell Carcinoma (KIRC). Supported by significant enrichment of exonic RNA editing sites in alternatively spliced exons, we hypothesize that exonic RNA editing sites may enhance proteomic diversity through alternative splicing, in addition to amino acid changes, a previously under-appreciated aspect of RNA editing function.

15.
Gene ; 843: 146803, 2022 Nov 15.
Article in English | MEDLINE | ID: mdl-35961439

ABSTRACT

The zebra finch (Taeniopygia guttata), a representative oscine songbird species, has been widely studied to investigate behavioral neuroscience, most notably the neurobiological basis of vocal learning, a rare trait shared in only a few animal groups including humans. In 2019, an updated zebra finch genome annotation (bTaeGut1_v1.p) was released from the Ensembl database and is substantially more comprehensive than the first version published in 2010. In this study, we utilized the publicly available RNA-seq data generated from Illumina-based short-reads and PacBio single-molecule real-time (SMRT) long-reads to assess the bird transcriptome. To analyze the high-throughput RNA-seq data, we adopted a hybrid bioinformatic approach combining short and long-read pipelines. From our analysis, we added 220 novel genes and 8,134 transcript variants to the Ensembl annotation, and predicted a new proteome based on the refined annotation. We further validated 18 different novel proteins by using mass-spectrometry data generated from zebra finch caudal telencephalon tissue. Our results provide additional resources for future studies of zebra finches utilizing this improved bird genome annotation and proteome.


Subjects
Finches , Animals , Brain , Female , Finches/genetics , Humans , Male , Proteome/genetics , Sex Characteristics , Transcriptome/genetics , Vocalization, Animal

16.
JCI Insight ; 6(17), 2021 Sep 08.
Article in English | MEDLINE | ID: mdl-34494556

ABSTRACT

Lipin 1 regulates cellular lipid homeostasis through roles in glycerolipid synthesis (through phosphatidic acid phosphatase activity) and transcriptional coactivation. Lipin 1-deficient individuals exhibit episodic disease symptoms that are triggered by metabolic stress, such as stress caused by prolonged fasting. We sought to identify critical lipin 1 activities during fasting. We determined that lipin 1 deficiency induces widespread alternative mRNA splicing in liver during fasting, much of which is normalized by refeeding. The role of lipin 1 in mRNA splicing was largely independent of its enzymatic function. We identified interactions between lipin 1 and spliceosome proteins, as well as a requirement for lipin 1 to maintain homeostatic levels of spliceosome small nuclear RNAs and specific RNA splicing factors. In fasted Lpin1-/- liver, we identified a correspondence between alternative splicing of phospholipid biosynthetic enzymes and dysregulated phospholipid levels; splicing patterns and phospholipid levels were partly normalized by feeding. Thus, lipin 1 influences hepatic lipid metabolism through mRNA splicing, as well as through enzymatic and transcriptional activities, and fasting exacerbates the deleterious effects of lipin 1 deficiency on metabolic homeostasis.


Subjects
Adaptation, Physiological/genetics , Fasting/physiology , Lipid Metabolism/genetics , Liver/metabolism , RNA, Messenger/genetics , Alternative Splicing , Animals , Cells, Cultured , Female , Humans , Liver/cytology , Male , Mice , Mice, Inbred BALB C , Models, Animal , Phosphatidate Phosphatase , RNA Splicing , Transcription Factors/genetics

17.
PLoS Genet ; 17(6): e1009580, 2021 Jun.
Article in English | MEDLINE | ID: mdl-34166378

ABSTRACT

The squamates (lizards and snakes) are close relatives of birds and mammals, with more than 10,000 described species that display extensive variation in a number of important biological traits, including coloration, venom production, and regeneration. Due to a lack of genomic tools, few genetic studies in squamates have been carried out. The leopard gecko, Eublepharis macularius, is a popular companion animal, and displays a variety of coloration patterns. We took advantage of a large breeding colony and used linkage analysis, synteny, and homozygosity mapping to investigate a spontaneous semi-dominant mutation, "Lemon Frost", that produces white coloration and causes skin tumors (iridophoroma). We localized the mutation to a single locus which contains a strong candidate gene, SPINT1, a tumor suppressor implicated in human skin cutaneous melanoma (SKCM) and over-proliferation of epithelial cells in mice and zebrafish. Our work establishes the leopard gecko as a tractable genetic system and suggests that a tumor suppressor in melanocytes in humans can also suppress tumor development in iridophores in lizards.


Subjects
Lizards/genetics , Skin Neoplasms/genetics , Skin Pigmentation , Alleles , Animals , Genetic Linkage , Homozygote , Mutation , Proteinase Inhibitory Proteins, Secretory/genetics

18.
Nat Neurosci ; 24(3): 331-342, 2021 Mar.
Article in English | MEDLINE | ID: mdl-33619405

ABSTRACT

Human stem-cell-derived models provide the promise of accelerating our understanding of brain disorders, but not knowing whether they possess the ability to mature beyond mid- to late-fetal stages potentially limits their utility. We leveraged a directed differentiation protocol to comprehensively assess maturation in vitro. Based on genome-wide analysis of the epigenetic clock and transcriptomics, as well as RNA editing, we observe that three-dimensional human cortical organoids reach postnatal stages between 250 and 300 days, a timeline paralleling in vivo development. We demonstrate the presence of several known developmental milestones, including switches in the histone deacetylase complex and NMDA receptor subunits, which we confirm at the protein and physiological levels. These results suggest that important components of an intrinsic in vivo developmental program persist in vitro. We further map neurodevelopmental and neurodegenerative disease risk genes onto in vitro gene expression trajectories to provide a resource and webtool (Gene Expression in Cortical Organoids, GECO) to guide disease modeling.


Subjects
Cell Differentiation/physiology , DNA Methylation/physiology , Induced Pluripotent Stem Cells/cytology , Organoids/cytology , Gene Regulatory Networks , Humans , In Vitro Techniques , Neurodegenerative Diseases/genetics

19.
Genome Res ; 31(3): 359-371, 2021 Mar.
Article in English | MEDLINE | ID: mdl-33452016

ABSTRACT

Alternative splicing is an RNA processing mechanism that affects most genes in human, contributing to disease mechanisms and phenotypic diversity. The regulation of splicing involves an intricate network of cis-regulatory elements and trans-acting factors. Due to their high sequence specificity, cis-regulation of splicing can be altered by genetic variants, significantly affecting splicing outcomes. Recently, multiple methods have been applied to understanding the regulatory effects of genetic variants on splicing. However, it is still challenging to go beyond apparent association to pinpoint functional variants. To fill in this gap, we utilized large-scale data sets of the Genotype-Tissue Expression (GTEx) project to study genetically modulated alternative splicing (GMAS) via identification of allele-specific splicing events. We demonstrate that GMAS events are shared across tissues and individuals more often than expected by chance, consistent with their genetically driven nature. Moreover, although the allelic bias of GMAS exons varies across samples, the degree of variation is similar across tissues versus individuals. Thus, genetic background drives the GMAS pattern to a similar degree as tissue-specific splicing mechanisms. Leveraging the genetically driven nature of GMAS, we developed a new method to predict functional splicing-altering variants, built upon a genotype-phenotype concordance model across samples. Complemented by experimental validations, this method predicted >1000 functional variants, many of which may alter RNA-protein interactions. Lastly, 72% of GMAS-associated SNPs were in linkage disequilibrium with GWAS-reported SNPs, and such association was enriched in tissues of relevance for specific traits/diseases. Our study enables a comprehensive view of genetically driven splicing variations in human tissues.


Subjects
Alleles , Alternative Splicing/genetics , Genetic Variation , Cell Line , Exons , Female , Genome-Wide Association Study , Humans , Linkage Disequilibrium , Male , Organ Specificity/genetics , Polymorphism, Single Nucleotide/genetics

20.
Epigenetics ; 16(9): 1000-1015, 2021 Sep.
Article in English | MEDLINE | ID: mdl-33092484

ABSTRACT

microRNAs (miRNAs) are small non-coding RNAs that play critical roles in gene regulation. The presence of miRNAs in extracellular biofluids is increasingly recognized. However, most previous characterization of extracellular miRNAs focused on their overall expression levels. Alternative sequence isoforms and modifications of miRNAs were rarely considered in the extracellular space. Here, we developed a highly accurate bioinformatic method, called miNTA, to identify 3' non-templated additions (NTAs) of miRNAs using small RNA-sequencing data. Using miNTA, we conducted an in-depth analysis of miRNA 3' NTA profiles in 1047 extracellular RNA-sequencing data sets of 4 types of biofluids. This analysis identified hundreds of miRNAs with 3' uridylation or adenylation, with the former being more prevalent. Among these miRNAs, up to 53% (22%) had an average 3' uridylation (adenylation) level of at least 10% in a specific biofluid. Strikingly, we found that 3' uridylation levels enabled segregation of different types of biofluids, more effectively than overall miRNA expression levels. This observation suggests that 3' NTA levels possess fluid-specific information relatively robust to batch effects. In addition, we observed that extracellular miRNAs with 3' uridylations are enriched in processes related to angiogenesis, apoptosis, and inflammatory response, and this type of modification may stabilize base-pairing between miRNAs and their target genes. Together, our study provides a comprehensive landscape of miRNA NTAs in human biofluids, which paves the way for further biomarker discoveries. The insights generated in our work build a foundation for future functional, mechanistic, and translational discoveries.


Subjects
Body Fluids , MicroRNAs , Body Fluids/metabolism , DNA Methylation , Gene Expression Regulation , Humans , MicroRNAs/metabolism , Sequence Analysis, RNA