Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
PLoS Comput Biol ; 17(2): e1008608, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33566819

RESUMEN

Different miRNA profiling protocols and technologies introduce differences in the resulting quantitative expression profiles. These include differences in the presence (and measurability) of certain miRNAs. We present and examine a method based on quantile normalization, Adjusted Quantile Normalization (AQuN), to combine miRNA expression data from multiple studies in breast cancer into a single joint dataset for integrative analysis. By pooling multiple datasets, we obtain increased statistical power, surfacing patterns that do not emerge as statistically significant when separately analyzing these datasets. To merge several datasets, as we do here, one needs to overcome both technical and batch differences between these datasets. We compare several approaches for merging and jointly analyzing miRNA datasets. We investigate the statistical confidence for known results and highlight potential new findings that resulted from the joint analysis using AQuN. In particular, we detect several miRNAs to be differentially expressed in estrogen receptor (ER) positive versus ER negative samples. In addition, we identify new potential biomarkers and therapeutic targets for both clinical groups. As a specific example, using the AQuN-derived dataset we detect hsa-miR-193b-5p to have a statistically significant over-expression in the ER positive group, a phenomenon that was not previously reported. Furthermore, as demonstrated by functional assays in breast cancer cell lines, overexpression of hsa-miR-193b-5p in breast cancer cell lines resulted in decreased cell viability in addition to inducing apoptosis. Together, these observations suggest a novel functional role for this miRNA in breast cancer. Packages implementing AQuN are provided for Python and Matlab: https://github.com/YakhiniGroup/PyAQN.


Asunto(s)
Neoplasias de la Mama/genética , Neoplasias de la Mama/metabolismo , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , MicroARNs/metabolismo , Algoritmos , Biomarcadores/metabolismo , Biomarcadores de Tumor/genética , Línea Celular Tumoral , Simulación por Computador , Receptor alfa de Estrógeno/metabolismo , Femenino , Humanos , Células MCF-7 , Análisis de Secuencia por Matrices de Oligonucleótidos , Lenguajes de Programación , ARN Mensajero/genética
2.
Bioinformatics ; 32(17): i559-i566, 2016 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-27587675

RESUMEN

MOTIVATION: Complex interactions among alleles often drive differences in inherited properties including disease predisposition. Isolating the effects of these interactions requires phasing information that is difficult to measure or infer. Furthermore, prevalent sequencing technologies used in the essential first step of determining a haplotype limit the range of that step to the span of reads, namely hundreds of bases. With the advent of pseudo-long read technologies, observable partial haplotypes can span several orders of magnitude more. Yet, measuring whole-genome-single-individual haplotypes remains a challenge. A different view of whole genome measurement addresses the 3D structure of the genome-with great development of Hi-C techniques in recent years. A shortcoming of current Hi-C, however, is the difficulty in inferring information that is specific to each of a pair of homologous chromosomes. RESULTS: In this work, we develop a robust algorithmic framework that takes two measurement derived datasets: raw Hi-C and partial short-range haplotypes, and constructs the full-genome haplotype as well as phased diploid Hi-C maps. By analyzing both data sets together we thus bridge important gaps in both technologies-from short to long haplotypes and from un-phased to phased Hi-C. We demonstrate that our method can recover ground truth haplotypes with high accuracy, using measured biological data as well as simulated data. We analyze the impact of noise, Hi-C sequencing depth and measured haplotype lengths on performance. Finally, we use the inferred 3D structure of a human genome to point at transcription factor targets nuclear co-localization. AVAILABILITY AND IMPLEMENTATION: The implementation available at https://github.com/YakhiniGroup/SpectraPh CONTACT: zohar.yakhini@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Cromosomas , Genoma Humano , Haplotipos , Conformación Molecular , Algoritmos , Variación Genética , Estudio de Asociación del Genoma Completo , Humanos
3.
Nucleic Acids Res ; 41(4): 2191-201, 2013 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-23303780

RESUMEN

While it has been long recognized that genes are not randomly positioned along the genome, the degree to which its 3D structure influences the arrangement of genes has remained elusive. In particular, several lines of evidence suggest that actively transcribed genes are spatially co-localized, forming transcription factories; however, a generalized systematic test has hitherto not been described. Here we reveal transcription factories using a rigorous definition of genomic structure based on Saccharomyces cerevisiae chromosome conformation capture data, coupled with an experimental design controlling for the primary gene order. We develop a data-driven method for the interpolation and the embedding of such datasets and introduce statistics that enable the comparison of the spatial and genomic densities of genes. Combining these, we report evidence that co-regulated genes are clustered in space, beyond their observed clustering in the context of gene order along the genome and show this phenomenon is significant for 64 out of 117 transcription factors. Furthermore, we show that those transcription factors with high spatially co-localized targets are expressed higher than those whose targets are not spatially clustered. Collectively, our results support the notion that, at a given time, the physical density of genes is intimately related to regulatory activity.


Asunto(s)
Regulación Fúngica de la Expresión Génica , Saccharomyces cerevisiae/genética , Interpretación Estadística de Datos , Orden Génico , Genoma Fúngico , Modelos Genéticos , Factores de Transcripción/metabolismo
4.
Bioinformatics ; 29(11): 1455-7, 2013 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-23603334

RESUMEN

MOTIVATION: Paired-end sequencing resulting in gapped short reads is commonly used for de novo genome assembly. Assembly methods use paired-end sequences in a two-step process, first treating each read-end independently, only later invoking the pairing to join the contiguous assemblies (contigs) into gapped scaffolds. Here, we present ELOPER, a pre-processing tool for pair-end sequences that produces a better read library for assembly programs. RESULTS: ELOPER proceeds by simultaneously considering both ends of paired reads generating elongated reads. We show that ELOPER theoretically doubles read-lengths while halving the number of reads. We provide evidence that pre-processing read libraries using ELOPER leads to considerably improved assemblies as predicted from the Lander-Waterman model. AVAILABILITY: http://sourceforge.net/projects/eloper SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genómica/métodos , Programas Informáticos , Mapeo Contig , Biblioteca de Genes , Análisis de Secuencia de ADN/métodos
5.
Mol Syst Biol ; 8: 587, 2012 Jun 05.
Artículo en Inglés | MEDLINE | ID: mdl-22669615

RESUMEN

The phenotype of an organism is determined by its genotype and environment. An interaction between these two arises from the differential effect of the environment on gene expression in distinct genotypes; however, the genomic properties identifying these are not well understood. Here we analyze the transcriptomes of five C. elegans strains (genotype) cultivated in five growth conditions (environment), and find that highly regulated genes, as distinguished by intergenic lengths, motif concentration, and expression levels, are particularly biased toward genotype-environment interactions. Sequencing these strains, we find that genes with expression variation across genotypes are enriched for promoter single-nucleotide polymorphisms (SNPs), as expected. However, genes with genotype-environment interactions do not significantly differ from background in terms of their promoter SNPs. Collectively, these results indicate that the highly regulated nature of particular genes predispose them for exhibiting genotype-environment interaction as a consequence of changes to upstream regulators. This observation may provide a deeper understanding into the origin of the extraordinary gene expression diversity present in even closely related species.


Asunto(s)
Caenorhabditis elegans/genética , Interacción Gen-Ambiente , Regiones Promotoras Genéticas , Animales , Regulación de la Expresión Génica , Genómica/métodos , Genotipo , Polimorfismo de Nucleótido Simple , Transcriptoma
6.
Sci Rep ; 12(1): 15835, 2022 09 23.
Artículo en Inglés | MEDLINE | ID: mdl-36151122

RESUMEN

Excision tissue biopsy, while central to cancer treatment and precision medicine, presents risks to the patient and does not provide a sufficiently broad and faithful representation of the heterogeneity of solid tumors. Here we introduce e-biopsy-a novel concept for molecular profiling of solid tumors using molecular sampling with electroporation. As e-biopsy provides access to the molecular composition of a solid tumor by permeabilization of the cell membrane, it facilitates tumor diagnostics without tissue resection. Furthermore, thanks to its non tissue destructive characteristics, e-biopsy enables probing the solid tumor multiple times in several distinct locations in the same procedure, thereby enabling the spatial profiling of tumor molecular heterogeneity.We demonstrate e-biopsy in vivo, using the 4T1 breast cancer model in mice to assess its performance, as well as the inferred spatial differential protein expression. In particular, we show that proteomic profiles obtained via e-biopsy in vivo distinguish the tumors from healthy breast tissue and reflect spatial tumor differential protein expression. E-biopsy provides a completely new molecular sampling modality for solid tumors molecular cartography, providing information that potentially enables more rapid and sensitive detection at lesser risk, as well as more precise personalized medicine.


Asunto(s)
Neoplasias , Proteómica , Animales , Electroporación , Ratones , Proteínas de Neoplasias , Neoplasias/patología , Medicina de Precisión
7.
Sci Rep ; 9(1): 12734, 2019 09 04.
Artículo en Inglés | MEDLINE | ID: mdl-31484964

RESUMEN

Genome conformation capture techniques permit a systematic investigation into the functional spatial organization of genomes, including functional aspects like assessing the co-localization of sets of genomic elements. For example, the co-localization of genes targeted by a transcription factor (TF) within a transcription factory. We quantify spatial co-localization using a rigorous statistical model that measures the enrichment of a subset of elements in neighbourhoods inferred from Hi-C data. We also control for co-localization that can be attributed to genomic order. We systematically apply our open-sourced framework, spatial-mHG, to search for spatial co-localization phenomena in multiple unicellular Hi-C datasets with corresponding genomic annotations. Our biological findings shed new light on the functional spatial organization of genomes, including: In C. crescentus, DNA replication genes reside in two genomic clusters that are spatially co-localized. Furthermore, these clusters contain similar gene copies and lay in genomic vicinity to the ori and ter sequences. In S. cerevisae, Ty5 retrotransposon family element spatially co-localize at a spatially adjacent subset of telomeres. In N. crassa, both Proteasome lid subcomplex genes and protein refolding genes jointly spatially co-localize at a shared location. An implementation of our algorithms is available online.


Asunto(s)
Bacillus subtilis/genética , Caulobacter crescentus/genética , Genoma Bacteriano , Genoma Fúngico , Neurospora crassa/genética , Saccharomyces cerevisiae/genética , Schizosaccharomyces/genética , Genómica , Modelos Genéticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA