Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
BMC Bioinformatics ; 19(1): 367, 2018 Oct 04.
Artigo em Inglês | MEDLINE | ID: mdl-30286713

RESUMO

BACKGROUND: Genome-wide high-throughput sequencing (HTS) time series experiments are a powerful tool for monitoring various genomic elements over time. They can be used to monitor, for example, gene or transcript expression with RNA sequencing (RNA-seq), DNA methylation levels with bisulfite sequencing (BS-seq), or abundances of genetic variants in populations with pooled sequencing (Pool-seq). However, because of high experimental costs, the time series data sets often consist of a very limited number of time points with very few or no biological replicates, posing challenges in the data analysis. RESULTS: Here we present the GPrank R package for modelling genome-wide time series by incorporating variance information obtained during pre-processing of the HTS data using probabilistic quantification methods or from a beta-binomial model using sequencing depth. GPrank is well-suited for analysing both short and irregularly sampled time series. It is based on modelling each time series by two Gaussian process (GP) models, namely, time-dependent and time-independent GP models, and comparing the evidence provided by data under two models by computing their Bayes factor (BF). Genomic elements are then ranked by their BFs, and temporally most dynamic elements can be identified. CONCLUSIONS: Incorporating the variance information helps GPrank avoid false positives without compromising computational efficiency. Fitted models can be easily further explored in a browser. Detection and visualisation of temporally most active dynamic elements in the genome can provide a good starting point for further downstream analyses for increasing our understanding of the studied processes.


Assuntos
Variação Genética/genética , Genoma/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Software
2.
Proc Natl Acad Sci U S A ; 112(42): 13115-20, 2015 Oct 20.
Artigo em Inglês | MEDLINE | ID: mdl-26438844

RESUMO

Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles because of differences in transcription time, degradation rate, and RNA-processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. To investigate this issue more generally, it is useful to develop methods applicable to genome-wide datasets. We introduce a joint model of transcriptional activation and mRNA accumulation that can be used for inference of transcription rate, RNA production delay, and degradation rate given data from high-throughput sequencing time course experiments. We combine a mechanistic differential equation model with a nonparametric statistical modeling approach allowing us to capture a broad range of activation kinetics, and we use Bayesian parameter estimation to quantify the uncertainty in estimates of the kinetic parameters. We apply the model to data from estrogen receptor α activation in the MCF-7 breast cancer cell line. We use RNA polymerase II ChIP-Seq time course data to characterize transcriptional activation and mRNA-Seq time course data to quantify mature transcripts. We find that 11% of genes with a good signal in the data display a delay of more than 20 min between completing transcription and mature mRNA production. The genes displaying these long delays are significantly more likely to be short. We also find a statistical association between high delay and late intron retention in pre-mRNA data, indicating significant splicing-associated production delays in many genes.


Assuntos
Genoma Humano , Modelos Genéticos , RNA/biossíntese , Transcrição Gênica , Receptor alfa de Estrogênio/metabolismo , Humanos , Cinética , Células MCF-7 , RNA/genética , Transdução de Sinais
3.
Bioinformatics ; 32(12): i147-i155, 2016 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-27307611

RESUMO

MOTIVATION: Alternative splicing is an important mechanism in which the regions of pre-mRNAs are differentially joined in order to form different transcript isoforms. Alternative splicing is involved in the regulation of normal physiological functions but also linked to the development of diseases such as cancer. We analyse differential expression and splicing using RNA-sequencing time series in three different settings: overall gene expression levels, absolute transcript expression levels and relative transcript expression levels. RESULTS: Using estrogen receptor α signaling response as a model system, our Gaussian process-based test identifies genes with differential splicing and/or differentially expressed transcripts. We discover genes with consistent changes in alternative splicing independent of changes in absolute expression and genes where some transcripts change whereas others stay constant in absolute level. The results suggest classes of genes with different modes of alternative splicing regulation during the experiment. AVAILABILITY AND IMPLEMENTATION: R and Matlab codes implementing the method are available at https://github.com/PROBIC/diffsplicing An interactive browser for viewing all model fits is available at http://users.ics.aalto.fi/hande/splicingGP/ CONTACT: hande.topa@helsinki.fi or antti.honkela@helsinki.fi SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Processamento Alternativo , Perfilação da Expressão Gênica , Humanos , Isoformas de Proteínas , Precursores de RNA , Análise de Sequência de RNA
4.
Bioinformatics ; 31(11): 1762-70, 2015 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-25614471

RESUMO

MOTIVATION: Recent advances in high-throughput sequencing (HTS) have made it possible to monitor genomes in great detail. New experiments not only use HTS to measure genomic features at one time point but also monitor them changing over time with the aim of identifying significant changes in their abundance. In population genetics, for example, allele frequencies are monitored over time to detect significant frequency changes that indicate selection pressures. Previous attempts at analyzing data from HTS experiments have been limited as they could not simultaneously include data at intermediate time points, replicate experiments and sources of uncertainty specific to HTS such as sequencing depth. RESULTS: We present the beta-binomial Gaussian process model for ranking features with significant non-random variation in abundance over time. The features are assumed to represent proportions, such as proportion of an alternative allele in a population. We use the beta-binomial model to capture the uncertainty arising from finite sequencing depth and combine it with a Gaussian process model over the time series. In simulations that mimic the features of experimental evolution data, the proposed method clearly outperforms classical testing in average precision of finding selected alleles. We also present simulations exploring different experimental design choices and results on real data from Drosophila experimental evolution experiment in temperature adaptation. AVAILABILITY AND IMPLEMENTATION: R software implementing the test is available at https://github.com/handetopa/BBGP.


Assuntos
Evolução Molecular , Frequência do Gene , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Alelos , Animais , Drosophila/genética , Genômica/métodos , Modelos Estatísticos , Distribuição Normal , Polimorfismo de Nucleotídeo Único , Software
5.
Genome Biol ; 25(1): 144, 2024 05 31.
Artigo em Inglês | MEDLINE | ID: mdl-38822397

RESUMO

BACKGROUND: Variation in X chromosome inactivation (XCI) in human-induced pluripotent stem cells (hiPSCs) can impact their ability to model biological sex biases. The gene-wise landscape of X chromosome gene dosage remains unresolved in female hiPSCs. To characterize patterns of de-repression and escape from inactivation, we performed a systematic survey of allele specific expression in 165 female hiPSC lines. RESULTS: XCI erosion is non-random and primarily affects genes that escape XCI in human tissues. Individual genes and cell lines vary in the frequency and degree of de-repression. Bi-allelic expression increases gradually after modest decrease of XIST in cultures, whose loss is commonly used to mark lines with eroded XCI. We identify three clusters of female lines at different stages of XCI. Increased XCI erosion amplifies female-biased expression at hypomethylated sites and regions normally occupied by repressive histone marks, lowering male-biased differences in the X chromosome. In autosomes, erosion modifies sex differences in a dose-dependent way. Male-biased genes are enriched for hypermethylated regions, and de-repression of XIST-bound autosomal genes in female lines attenuates normal male-biased gene expression in eroded lines. XCI erosion can compensate for a dominant loss of function effect in several disease genes. CONCLUSIONS: We present a comprehensive view of X chromosome gene dosage in hiPSCs and implicate a direct mechanism for XCI erosion in regulating autosomal gene expression in trans. The uncommon and variable reactivation of X chromosome genes in female hiPSCs can provide insight into X chromosome's role in regulating gene expression and sex differences in humans.


Assuntos
Cromossomos Humanos X , Células-Tronco Pluripotentes Induzidas , RNA Longo não Codificante , Inativação do Cromossomo X , Humanos , Células-Tronco Pluripotentes Induzidas/metabolismo , Feminino , Cromossomos Humanos X/genética , Masculino , RNA Longo não Codificante/genética , Alelos , Regulação da Expressão Gênica , Metilação de DNA
6.
Eur J Hum Genet ; 30(5): 619-627, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-35087184

RESUMO

Biallelic loss-of-function variants in the SMG9 gene, encoding a regulatory subunit of the mRNA nonsense-mediated decay (NMD) machinery, are reported to cause heart and brain malformation syndrome. Here we report five patients from three unrelated families with intellectual disability (ID) and a novel pathogenic SMG9 c.551 T > C p.(Val184Ala) homozygous missense variant, identified using exome sequencing. Sanger sequencing confirmed recessive segregation in each family. SMG9 c.551T > C p.(Val184Ala) is most likely an autozygous variant identical by descent. Characteristic clinical findings in patients were mild to moderate ID, intention tremor, pyramidal signs, dyspraxia, and ocular manifestations. We used RNA sequencing of patients and age- and sex-matched healthy controls to assess the effect of the variant. RNA sequencing revealed that the SMG9 c.551T > C variant did not affect the splicing or expression level of SMG9 gene products, and allele-specific expression analysis did not provide evidence that the nonsense mRNA-induced NMD was affected. Differential gene expression analysis identified prevalent upregulation of genes in patients, including the genes SMOX, OSBP2, GPX3, and ZNF155. These findings suggest that normal SMG9 function may be involved in transcriptional regulation without affecting nonsense mRNA-induced NMD. In conclusion, we demonstrate that the SMG9 c.551T > C missense variant causes a neurodevelopmental disorder and impacts gene expression. NMD components have roles beyond aberrant mRNA degradation that are crucial for neurocognitive development.


Assuntos
Deficiência Intelectual , Peptídeos e Proteínas de Sinalização Intracelular , Degradação do RNAm Mediada por Códon sem Sentido , Alelos , Homozigoto , Humanos , Deficiência Intelectual/genética , Peptídeos e Proteínas de Sinalização Intracelular/genética , RNA Mensageiro/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA