Búsqueda | Portal de Búsqueda de la BVS Colombia

Robust statistical modeling improves sensitivity of high-throughput RNA structure probing experiments.

Selega, Alina; Sirocchi, Christel; Iosub, Ira; Granneman, Sander; Sanguinetti, Guido.

Nat Methods ; 14(1): 83-89, 2017 01.

Artículo en Inglés | MEDLINE | ID: mdl-27819660

RESUMEN

Structure probing coupled with high-throughput sequencing could revolutionize our understanding of the role of RNA structure in regulation of gene expression. Despite recent technological advances, intrinsic noise and high sequence coverage requirements greatly limit the applicability of these techniques. Here we describe a probabilistic modeling pipeline that accounts for biological variability and biases in the data, yielding statistically interpretable scores for the probability of nucleotide modification transcriptome wide. Using two yeast data sets, we demonstrate that our method has increased sensitivity, and thus our pipeline identifies modified regions on many more transcripts than do existing pipelines. Our method also provides confident predictions at much lower sequence coverage levels than those recommended for reliable structural probing. Our results show that statistical modeling extends the scope and potential of transcriptome-wide structure probing experiments.

Asunto(s)

Algoritmos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Modelos Estadísticos , ARN/química , ARN/genética , Análisis de Secuencia de ARN/métodos , Transcriptoma/genética , Emparejamiento Base , Secuencia de Bases , Biología Computacional/métodos , Humanos , Conformación de Ácido Nucleico

Beyond benchmarking and towards predictive models of dataset-specific single-cell RNA-seq pipeline performance.

Fang, Cindy; Selega, Alina; Campbell, Kieran R.

Genome Biol ; 25(1): 159, 2024 06 17.

Artículo en Inglés | MEDLINE | ID: mdl-38886757

RESUMEN

BACKGROUND: The advent of single-cell RNA-sequencing (scRNA-seq) has driven significant computational methods development for all steps in the scRNA-seq data analysis pipeline, including filtering, normalization, and clustering. The large number of methods and their resulting parameter combinations has created a combinatorial set of possible pipelines to analyze scRNA-seq data, which leads to the obvious question: which is best? Several benchmarking studies compare methods but frequently find variable performance depending on dataset and pipeline characteristics. Alternatively, the large number of scRNA-seq datasets along with advances in supervised machine learning raise a tantalizing possibility: could the optimal pipeline be predicted for a given dataset? RESULTS: Here, we begin to answer this question by applying 288 scRNA-seq analysis pipelines to 86 datasets and quantifying pipeline success via a range of measures evaluating cluster purity and biological plausibility. We build supervised machine learning models to predict pipeline success given a range of dataset and pipeline characteristics. We find that prediction performance is significantly better than random and that in many cases pipelines predicted to perform well provide clustering outputs similar to expert-annotated cell type labels. We identify characteristics of datasets that correlate with strong prediction performance that could guide when such prediction models may be useful. CONCLUSIONS: Supervised machine learning models have utility for recommending analysis pipelines and therefore the potential to alleviate the burden of choosing from the near-infinite number of possibilities. Different aspects of datasets influence the predictive performance of such models which will further guide users.

Asunto(s)

RNA-Seq , Análisis de Expresión Génica de una Sola Célula , Animales , Humanos , Análisis por Conglomerados , Biología Computacional/métodos , Aprendizaje Automático , RNA-Seq/métodos , Análisis de Secuencia de ARN/métodos , Aprendizaje Automático Supervisado

TrackSigFreq: subclonal reconstructions based on mutation signatures and allele frequencies.

Harrigan, Caitlin F; Rubanova, Yulia; Morris, Quaid; Selega, Alina.

Pac Symp Biocomput ; 25: 238-249, 2020.

Artículo en Inglés | MEDLINE | ID: mdl-31797600

RESUMEN

Mutational signatures are patterns of mutation types, many of which are linked to known mutagenic processes. Signature activity represents the proportion of mutations a signature generates. In cancer, cells may gain advantageous phenotypes through mutation accumulation, causing rapid growth of that subpopulation within the tumour. The presence of many subclones can make cancers harder to treat and have other clinical implications. Reconstructing changes in signature activities can give insight into the evolution of cells within a tumour. Recently, we introduced a new method, TrackSig, to detect changes in signature activities across time from single bulk tumour sample. By design, TrackSig is unable to identify mutation populations with different frequencies but little to no difference in signature activity. Here we present an extension of this method, TrackSigFreq, which enables trajectory reconstruction based on both observed density of mutation frequencies and changes in mutational signature activities. TrackSigFreq preserves the advantages of TrackSig, namely optimal and rapid mutation clustering through segmentation, while extending it so that it can identify distinct mutation populations that share similar signature activities.

Asunto(s)

Genoma Humano , Neoplasias , Biología Computacional , Frecuencia de los Genes , Humanos , Mutación , Neoplasias/genética

Kinetic CRAC uncovers a role for Nab3 in determining gene expression profiles during stress.

van Nues, Rob; Schweikert, Gabriele; de Leau, Erica; Selega, Alina; Langford, Andrew; Franklin, Ryan; Iosub, Ira; Wadsworth, Peter; Sanguinetti, Guido; Granneman, Sander.

Nat Commun ; 8(1): 12, 2017 04 11.

Artículo en Inglés | MEDLINE | ID: mdl-28400552

RESUMEN

RNA-binding proteins play a key role in shaping gene expression profiles during stress, however, little is known about the dynamic nature of these interactions and how this influences the kinetics of gene expression. To address this, we developed kinetic cross-linking and analysis of cDNAs (χCRAC), an ultraviolet cross-linking method that enabled us to quantitatively measure the dynamics of protein-RNA interactions in vivo on a minute time-scale. Here, using χCRAC we measure the global RNA-binding dynamics of the yeast transcription termination factor Nab3 in response to glucose starvation. These measurements reveal rapid changes in protein-RNA interactions within 1 min following stress imposition. Changes in Nab3 binding are largely independent of alterations in transcription rate during the early stages of stress response, indicating orthogonal transcriptional control mechanisms. We also uncover a function for Nab3 in dampening expression of stress-responsive genes. χCRAC has the potential to greatly enhance our understanding of in vivo dynamics of protein-RNA interactions.Protein RNA interactions are dynamic and regulated in response to environmental changes. Here the authors describe 'kinetic CRAC', an approach that allows time resolved analyses of protein RNA interactions with minute time point resolution and apply it to gain insight into the function of the RNA-binding protein Nab3.

Asunto(s)

Regulación Fúngica de la Expresión Génica , Proteínas Nucleares/genética , ARN de Hongos/genética , Proteínas de Unión al ARN/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Transcriptoma , Medios de Cultivo/farmacología , ADN Complementario/genética , ADN Complementario/metabolismo , Perfilación de la Expresión Génica , Glucosa/deficiencia , Cinética , Proteínas Nucleares/metabolismo , Unión Proteica , ARN de Hongos/metabolismo , Proteínas de Unión al ARN/metabolismo , Saccharomyces cerevisiae/efectos de los fármacos , Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/efectos de la radiación , Proteínas de Saccharomyces cerevisiae/metabolismo , Estrés Fisiológico , Factores de Tiempo , Rayos Ultravioleta

Trends and challenges in computational RNA biology.

Selega, Alina; Sanguinetti, Guido.

Genome Biol ; 17(1): 253, 2016 12 07.

Artículo en Inglés | MEDLINE | ID: mdl-27927225

RESUMEN

A report on the Wellcome Trust Conference on Computational RNA Biology, held in Hinxton, UK, on 17-19 October 2016.

Asunto(s)

Biología Computacional/tendencias , Genómica , ARN/genética , Humanos , Proteómica

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA