RESUMEN
Reporter genes integrated into the genome are a powerful tool to reveal effects of regulatory elements and local chromatin context on gene expression. However, so far such reporter assays have been of low throughput. Here, we describe a multiplexing approach for the parallel monitoring of transcriptional activity of thousands of randomly integrated reporters. More than 27,000 distinct reporter integrations in mouse embryonic stem cells, obtained with two different promoters, show â¼1,000-fold variation in expression levels. Data analysis indicates that lamina-associated domains act as attenuators of transcription, likely by reducing access of transcription factors to binding sites. Furthermore, chromatin compaction is predictive of reporter activity. We also found evidence for crosstalk between neighboring genes and estimate that enhancers can influence gene expression on average over â¼20 kb. The multiplexed reporter assay is highly flexible in design and can be modified to query a wide range of aspects of gene regulation.
Asunto(s)
Efectos de la Posición Cromosómica , Técnicas Genéticas , Animales , Cromatina/metabolismo , Células Madre Embrionarias/metabolismo , Genes Reporteros , Secuenciación de Nucleótidos de Alto Rendimiento , Ratones , Regiones Promotoras GenéticasRESUMEN
Annotating the molecular basis of human disease remains an unsolved challenge, as 93% of disease loci are non-coding and gene-regulatory annotations are highly incomplete1-3. Here we present EpiMap, a compendium comprising 10,000 epigenomic maps across 800 samples, which we used to define chromatin states, high-resolution enhancers, enhancer modules, upstream regulators and downstream target genes. We used this resource to annotate 30,000 genetic loci that were associated with 540 traits4, predicting trait-relevant tissues, putative causal nucleotide variants in enriched tissue enhancers and candidate tissue-specific target genes for each. We partitioned multifactorial traits into tissue-specific contributing factors with distinct functional enrichments and disease comorbidity patterns, and revealed both single-factor monotropic and multifactor pleiotropic loci. Top-scoring loci frequently had multiple predicted driver variants, converging through multiple enhancers with a common target gene, multiple genes in common tissues, or multiple genes and multiple tissues, indicating extensive pleiotropy. Our results demonstrate the importance of dense, rich, high-resolution epigenomic annotations for the investigation of complex traits.
Asunto(s)
Enfermedad/genética , Epigénesis Genética/genética , Epigenómica , Redes Reguladoras de Genes/genética , Sitios Genéticos/genética , Cromatina/genética , Elementos de Facilitación Genéticos/genética , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Herencia Multifactorial/genética , Especificidad de Órganos/genética , Reproducibilidad de los ResultadosRESUMEN
DNase I hypersensitive sites (DHSs) are generic markers of regulatory DNA1-5 and contain genetic variations associated with diseases and phenotypic traits6-8. We created high-resolution maps of DHSs from 733 human biosamples encompassing 438 cell and tissue types and states, and integrated these to delineate and numerically index approximately 3.6 million DHSs within the human genome sequence, providing a common coordinate system for regulatory DNA. Here we show that these maps highly resolve the cis-regulatory compartment of the human genome, which encodes unexpectedly diverse cell- and tissue-selective regulatory programs at very high density. These programs can be captured comprehensively by a simple vocabulary that enables the assignment to each DHS of a regulatory barcode that encapsulates its tissue manifestations, and global annotation of protein-coding and non-coding RNA genes in a manner orthogonal to gene expression. Finally, we show that sharply resolved DHSs markedly enhance the genetic association and heritability signals of diseases and traits. Rather than being confined to a small number of distal elements or promoters, we find that genetic signals converge on congruently regulated sets of DHSs that decorate entire gene bodies. Together, our results create a universal, extensible coordinate system and vocabulary for human regulatory DNA marked by DHSs, and provide a new global perspective on the architecture of human gene regulation.
Asunto(s)
Cromatina/genética , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Anotación de Secuencia Molecular , Cromatina/química , Cromatina/metabolismo , ADN/química , ADN/genética , Regulación de la Expresión Génica , Genes/genética , Genoma Humano/genética , Humanos , Regiones Promotoras Genéticas/genética , Secuencias Reguladoras de Ácidos Nucleicos/genéticaRESUMEN
Combinatorial binding of transcription factors to regulatory DNA underpins gene regulation in all organisms. Genetic variation in regulatory regions has been connected with diseases and diverse phenotypic traits1, but it remains challenging to distinguish variants that affect regulatory function2. Genomic DNase I footprinting enables the quantitative, nucleotide-resolution delineation of sites of transcription factor occupancy within native chromatin3-6. However, only a small fraction of such sites have been precisely resolved on the human genome sequence6. Here, to enable comprehensive mapping of transcription factor footprints, we produced high-density DNase I cleavage maps from 243 human cell and tissue types and states and integrated these data to delineate about 4.5 million compact genomic elements that encode transcription factor occupancy at nucleotide resolution. We map the fine-scale structure within about 1.6 million DNase I-hypersensitive sites and show that the overwhelming majority are populated by well-spaced sites of single transcription factor-DNA interaction. Cell-context-dependent cis-regulation is chiefly executed by wholesale modulation of accessibility at regulatory DNA rather than by differential transcription factor occupancy within accessible elements. We also show that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions1,7 is almost entirely attributable to variants within footprints, and that functional variants that affect transcription factor occupancy are nearly evenly partitioned between loss- and gain-of-function alleles. Unexpectedly, we find increased density of human genetic variation within transcription factor footprints, revealing an unappreciated driver of cis-regulatory evolution. Our results provide a framework for both global and nucleotide-precision analyses of gene regulatory mechanisms and functional genetic variation.
Asunto(s)
Huella de ADN/normas , Genoma Humano/genética , Factores de Transcripción/metabolismo , Secuencia de Consenso , ADN/genética , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Genética de Población , Estudio de Asociación del Genoma Completo , Humanos , Modelos Moleculares , Polimorfismo de Nucleótido Simple , Secuencias Reguladoras de Ácidos Nucleicos/genéticaRESUMEN
The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.
Asunto(s)
Epigénesis Genética/genética , Epigenómica , Genoma Humano/genética , Secuencia de Bases , Linaje de la Célula/genética , Células Cultivadas , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Cromosomas Humanos/química , Cromosomas Humanos/genética , Cromosomas Humanos/metabolismo , ADN/química , ADN/genética , ADN/metabolismo , Metilación de ADN , Conjuntos de Datos como Asunto , Elementos de Facilitación Genéticos/genética , Variación Genética/genética , Estudio de Asociación del Genoma Completo , Histonas/metabolismo , Humanos , Especificidad de Órganos/genética , ARN/genética , Valores de ReferenciaRESUMEN
The spatial arrangement of chromatin is linked to the regulation of nuclear processes. One striking aspect of nuclear organization is the spatial segregation of heterochromatic and euchromatic domains. The mechanisms of this chromatin segregation are still poorly understood. In this work, we investigated the link between the primary genomic sequence and chromatin domains. We analyzed the spatial intranuclear arrangement of a human artificial chromosome (HAC) in a xenospecific mouse background in comparison to an orthologous region of native mouse chromosome. The two orthologous regions include segments that can be assigned to three major chromatin classes according to their gene abundance and repeat repertoire: (1) gene-rich and SINE-rich euchromatin; (2) gene-poor and LINE/LTR-rich heterochromatin; and (3) gene-depleted and satellite DNA-containing constitutive heterochromatin. We show, using fluorescence in situ hybridization (FISH) and 4C-seq technologies, that chromatin segments ranging from 0.6 to 3 Mb cluster with segments of the same chromatin class. As a consequence, the chromatin segments acquire corresponding positions in the nucleus irrespective of their chromosomal context, thereby strongly suggesting that this is their autonomous property. Interactions with the nuclear lamina, although largely retained in the HAC, reveal less autonomy. Taken together, our results suggest that building of a functional nucleus is largely a self-organizing process based on mutual recognition of chromosome segments belonging to the major chromatin classes.
Asunto(s)
Núcleo Celular/genética , Cromosomas Artificiales Humanos/metabolismo , Eucromatina/metabolismo , Fibroblastos/metabolismo , Heterocromatina/metabolismo , Retina/metabolismo , Animales , Línea Celular Transformada , Núcleo Celular/metabolismo , Núcleo Celular/ultraestructura , Cromosomas Artificiales Humanos/ultraestructura , Eucromatina/clasificación , Eucromatina/ultraestructura , Fibroblastos/ultraestructura , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Heterocromatina/clasificación , Heterocromatina/ultraestructura , Humanos , Hibridación Fluorescente in Situ , Ratones , Cultivo Primario de Células , Retina/ultraestructuraRESUMEN
BACKGROUND: Genomewide association studies can be used to identify disease-relevant genomic regions, but interpretation of the data is challenging. The FTO region harbors the strongest genetic association with obesity, yet the mechanistic basis of this association remains elusive. METHODS: We examined epigenomic data, allelic activity, motif conservation, regulator expression, and gene coexpression patterns, with the aim of dissecting the regulatory circuitry and mechanistic basis of the association between the FTO region and obesity. We validated our predictions with the use of directed perturbations in samples from patients and from mice and with endogenous CRISPR-Cas9 genome editing in samples from patients. RESULTS: Our data indicate that the FTO allele associated with obesity represses mitochondrial thermogenesis in adipocyte precursor cells in a tissue-autonomous manner. The rs1421085 T-to-C single-nucleotide variant disrupts a conserved motif for the ARID5B repressor, which leads to derepression of a potent preadipocyte enhancer and a doubling of IRX3 and IRX5 expression during early adipocyte differentiation. This results in a cell-autonomous developmental shift from energy-dissipating beige (brite) adipocytes to energy-storing white adipocytes, with a reduction in mitochondrial thermogenesis by a factor of 5, as well as an increase in lipid storage. Inhibition of Irx3 in adipose tissue in mice reduced body weight and increased energy dissipation without a change in physical activity or appetite. Knockdown of IRX3 or IRX5 in primary adipocytes from participants with the risk allele restored thermogenesis, increasing it by a factor of 7, and overexpression of these genes had the opposite effect in adipocytes from nonrisk-allele carriers. Repair of the ARID5B motif by CRISPR-Cas9 editing of rs1421085 in primary adipocytes from a patient with the risk allele restored IRX3 and IRX5 repression, activated browning expression programs, and restored thermogenesis, increasing it by a factor of 7. CONCLUSIONS: Our results point to a pathway for adipocyte thermogenesis regulation involving ARID5B, rs1421085, IRX3, and IRX5, which, when manipulated, had pronounced pro-obesity and anti-obesity effects. (Funded by the German Research Center for Environmental Health and others.).
Asunto(s)
Adipocitos/metabolismo , Obesidad/genética , Proteínas/genética , Termogénesis/genética , Alelos , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato , Animales , Secuencia de Bases , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Epigenómica , Expresión Génica , Ingeniería Genética , Humanos , Ratones , Mitocondrias/metabolismo , Datos de Secuencia Molecular , Obesidad/metabolismo , Fenotipo , Edición de ARN , Riesgo , Termogénesis/fisiologíaRESUMEN
The three-dimensional organization of chromosomes within the nucleus and its dynamics during differentiation are largely unknown. To visualize this process in molecular detail, we generated high-resolution maps of genome-nuclear lamina interactions during subsequent differentiation of mouse embryonic stem cells via lineage-committed neural precursor cells into terminally differentiated astrocytes. This reveals that a basal chromosome architecture present in embryonic stem cells is cumulatively altered at hundreds of sites during lineage commitment and subsequent terminal differentiation. This remodeling involves both individual transcription units and multigene regions and affects many genes that determine cellular identity. Often, genes that move away from the lamina are concomitantly activated; many others, however, remain inactive yet become unlocked for activation in a next differentiation step. These results suggest that lamina-genome interactions are widely involved in the control of gene expression programs during lineage commitment and terminal differentiation.
Asunto(s)
Diferenciación Celular , Posicionamiento de Cromosoma , Células Madre Embrionarias/citología , Genoma , Lámina Nuclear/metabolismo , Animales , Astrocitos/citología , Linaje de la Célula , Drosophila , Humanos , Ratones , Neuronas/citologíaRESUMEN
In metazoans, the nuclear lamina is thought to play an important role in the spatial organization of interphase chromosomes, by providing anchoring sites for large genomic segments named lamina-associated domains (LADs). Some of these LADs are cell-type specific, while many others appear constitutively associated with the lamina. Constitutive LADs (cLADs) may contribute to a basal chromosome architecture. By comparison of mouse and human lamina interaction maps, we find that the sizes and genomic positions of cLADs are strongly conserved. Moreover, cLADs are depleted of synteny breakpoints, pointing to evolutionary selective pressure to keep cLADs intact. Paradoxically, the overall sequence conservation is low for cLADs. Instead, cLADs are universally characterized by long stretches of DNA of high A/T content. Cell-type specific LADs also tend to adhere to this "A/T rule" in embryonic stem cells, but not in differentiated cells. This suggests that the A/T rule represents a default positioning mechanism that is locally overruled during lineage commitment. Analysis of paralogs suggests that during evolution changes in A/T content have driven the relocation of genes to and from the nuclear lamina, in tight association with changes in expression level. Taken together, these results reveal that the spatial organization of mammalian genomes is highly conserved and tightly linked to local nucleotide composition.
Asunto(s)
Secuencia Rica en At , Secuencia Conservada , Genoma , Lámina Nuclear/metabolismo , Animales , Caenorhabditis elegans , Secuencia Conservada/genética , Drosophila melanogaster , Células Madre Embrionarias/metabolismo , Humanos , Lamina Tipo A/metabolismo , Lamina Tipo B/metabolismo , Ratones , Factor 1 de Transcripción de Unión a Octámeros/metabolismoRESUMEN
The architecture of human chromosomes in interphase nuclei is still largely unknown. Microscopy studies have indicated that specific regions of chromosomes are located in close proximity to the nuclear lamina (NL). This has led to the idea that certain genomic elements may be attached to the NL, which may contribute to the spatial organization of chromosomes inside the nucleus. However, sequences in the human genome that interact with the NL in vivo have not been identified. Here we construct a high-resolution map of the interaction sites of the entire genome with NL components in human fibroblasts. This map shows that genome-lamina interactions occur through more than 1,300 sharply defined large domains 0.1-10 megabases in size. These lamina-associated domains (LADs) are typified by low gene-expression levels, indicating that LADs represent a repressive chromatin environment. The borders of LADs are demarcated by the insulator protein CTCF, by promoters that are oriented away from LADs, or by CpG islands, suggesting possible mechanisms of LAD confinement. Taken together, these results demonstrate that the human genome is divided into large, discrete domains that are units of chromosome organization within the nucleus.
Asunto(s)
Posicionamiento de Cromosoma , Cromosomas Humanos/metabolismo , Lámina Nuclear/metabolismo , Factor de Unión a CCCTC , Línea Celular , Cromatina/genética , Cromatina/metabolismo , Cromosomas Humanos/genética , Islas de CpG/genética , Proteínas de Unión al ADN/metabolismo , Fibroblastos , Genoma Humano , Humanos , Lamina Tipo B/metabolismo , Lámina Nuclear/química , Regiones Promotoras Genéticas/genética , Unión Proteica , Proteínas Represoras/metabolismoRESUMEN
An important and largely unsolved problem in synthetic biology is how to target gene expression to specific cell types. Here, we apply iterative deep learning to design synthetic enhancers with strong differential activity between two human cell lines. We initially train models on published datasets of enhancer activity and chromatin accessibility and use them to guide the design of synthetic enhancers that maximize predicted specificity. We experimentally validate these sequences, use the measurements to re-optimize the predictor, and design a second generation of enhancers with improved specificity. Our design methods embed relevant transcription factor binding site (TFBS) motifs with higher frequencies than comparable endogenous enhancers while using a more selective motif vocabulary, and we show that enhancer activity is correlated with transcription factor expression at the single cell level. Finally, we characterize causal features of top enhancers via perturbation experiments and show enhancers as short as 50bp can maintain specificity.
RESUMEN
The challenge of systematically modifying and optimizing regulatory elements for precise gene expression control is central to modern genomics and synthetic biology. Advancements in generative AI have paved the way for designing synthetic sequences with the aim of safely and accurately modulating gene expression. We leverage diffusion models to design context-specific DNA regulatory sequences, which hold significant potential toward enabling novel therapeutic applications requiring precise modulation of gene expression. Our framework uses a cell type-specific diffusion model to generate synthetic 200 bp regulatory elements based on chromatin accessibility across different cell types. We evaluate the generated sequences based on key metrics to ensure they retain properties of endogenous sequences: transcription factor binding site composition, potential for cell type-specific chromatin accessibility, and capacity for sequences generated by DNA diffusion to activate gene expression in different cell contexts using state-of-the-art prediction models. Our results demonstrate the ability to robustly generate DNA sequences with cell type-specific regulatory potential. DNA-Diffusion paves the way for revolutionizing a regulatory modulation approach to mammalian synthetic biology and precision gene therapy.
RESUMEN
Mutations in the A-type lamins A and C, two major components of the nuclear lamina, cause a large group of phenotypically diverse diseases collectively referred to as laminopathies. These conditions often involve defects in chromatin organization. However, it is unclear whether A-type lamins interact with chromatin in vivo and whether aberrant chromatin-lamin interactions contribute to disease. Here, we have used an unbiased approach to comparatively map genome-wide interactions of gene promoters with lamin A and progerin, the mutated lamin A isoform responsible for the premature aging disorder Hutchinson-Gilford progeria syndrome (HGPS) in mouse cardiac myoytes and embryonic fibroblasts. We find that lamin A-associated genes are predominantly transcriptionally silent and that loss of lamin association leads to the relocation of peripherally localized genes, but not necessarily to their activation. We demonstrate that progerin induces global changes in chromatin organization by enhancing interactions with a specific subset of genes in addition to the identified lamin A-associated genes. These observations demonstrate disease-related changes in higher order genome organization in HGPS and provide novel insights into the role of lamin-chromatin interactions in chromatin organization.
Asunto(s)
Lamina Tipo A/metabolismo , Proteínas Nucleares/metabolismo , Progeria/metabolismo , Precursores de Proteínas/metabolismo , Animales , Línea Celular , Mapeo Cromosómico , Fibroblastos/metabolismo , Humanos , Lamina Tipo A/genética , Ratones , Células Musculares/metabolismo , Proteínas Nucleares/genética , Progeria/genética , Unión Proteica , Precursores de Proteínas/genéticaRESUMEN
T cell development is restricted to the thymus and is dependent on high levels of Notch signaling induced within the thymic microenvironment. To understand Notch function in thymic restriction, we investigated the basis for target gene selectivity in response to quantitative differences in Notch signal strength, focusing on the chromatin architecture of genes essential for T cell differentiation. We find that high Notch signal strength is required to activate promoters of known targets essential for T cell commitment, including Il2ra, Cd3ε, and Rag1, which feature low CpG content (LCG) and DNA inaccessibility in hematopoietic stem progenitor cells. Our findings suggest that promoter DNA inaccessibility at LCG T lineage genes provides robust protection against stochastic activation in inappropriate Notch signaling contexts, limiting T cell development to the thymus.
Asunto(s)
Islas de CpG/genética , Regiones Promotoras Genéticas/genética , Receptores Notch/metabolismo , Transducción de Señal , Linfocitos T/metabolismo , Animales , ADN/metabolismo , Desoxirribonucleasa I/metabolismo , Ratones Endogámicos C57BLRESUMEN
A central question in the post-genomic era is how genes interact to form biological pathways. Measurements of gene dependency across hundreds of cell lines have been used to cluster genes into 'co-essential' pathways, but this approach has been limited by ubiquitous false positives. In the present study, we develop a statistical method that enables robust identification of gene co-essentiality and yields a genome-wide set of functional modules. This atlas recapitulates diverse pathways and protein complexes, and predicts the functions of 108 uncharacterized genes. Validating top predictions, we show that TMEM189 encodes plasmanylethanolamine desaturase, a key enzyme for plasmalogen synthesis. We also show that C15orf57 encodes a protein that binds the AP2 complex, localizes to clathrin-coated pits and enables efficient transferrin uptake. Finally, we provide an interactive webtool for the community to explore our results, which establish co-essentiality profiling as a powerful resource for biological pathway identification and discovery of new gene functions.
Asunto(s)
Redes Reguladoras de Genes , Genes , Genoma , Clatrina/metabolismo , Endocitosis , Epigénesis Genética , Regulación de la Expresión Génica , Células HeLa , Humanos , Anotación de Secuencia Molecular , Neoplasias/genética , Plasmalógenos/biosíntesis , Transducción de Señal/genéticaRESUMEN
BACKGROUND: Spectra resulting from Surface-Enhanced Laser Desorption/Ionisation (SELDI) mass spectrometry measurements are constructed by combining sub-spectra, each of which are the result of a single firing of the laser responsible for the process of desorption/ionisation. These firings are performed at different locations of the spot on which the sample is analysed. The final spectrum is then constructed by summing over all these sub-spectra. This process is sub-optimal in that it can average out peaks from peptides that are present in low abundance or are unevenly distributed across the spot, particularly because the amount of noise varies considerably between sub-spectra. This argues for analysing sub-spectra separately and combining results afterwards. RESULTS: Here, we propose to analyse these sub-spectra one-by-one and combine the results using a framework which includes a significance test. This allows one to, for the first time, attach a confidence measure to detected peaks, based on the signal strength of a peak across sub-spectra. In a comparison with three other approaches the sub-spectral approach achieves a higher sensitivity and a low FDR. We further introduce the notion of peak-bags, which provide rich information about the sub-spectral contributions to a given peak. CONCLUSION: The proposed procedure offers better control over the process of distinguishing signal from noise, resulting in an improved performance over other available methods. Moreover, our method provides an implicit deconvolution of peaks, yielding insight in the actual shape of a peak, potentially aiding in a deeper understanding of peak distribution. AVAILABILITY: Implementations of the algorithm in R are available upon request.
Asunto(s)
Péptidos/química , Espectrometría de Masa por Láser de Matriz Asistida de Ionización Desorción/métodos , Algoritmos , Reconocimiento de Normas Patrones Automatizadas , Péptidos/análisis , Procesamiento de Señales Asistido por ComputadorRESUMEN
BACKGROUND: Mass spectrometry for biological data analysis is an active field of research, providing an efficient way of high-throughput proteome screening. A popular variant of mass spectrometry is SELDI, which is often used to measure sample populations with the goal of developing (clinical) classifiers. Unfortunately, not only is the data resulting from such measurements quite noisy, variance between replicate measurements of the same sample can be high as well. Normalisation of spectra can greatly reduce the effect of this technical variance and further improve the quality and interpretability of the data. However, it is unclear which normalisation method yields the most informative result. RESULTS: In this paper, we describe the first systematic comparison of a wide range of normalisation methods, using two objectives that should be met by a good method. These objectives are minimisation of inter-spectra variance and maximisation of signal with respect to class separation. The former is assessed using an estimation of the coefficient of variation, the latter using the classification performance of three types of classifiers on real-world datasets representing two-class diagnostic problems. To obtain a maximally robust evaluation of a normalisation method, both objectives are evaluated over multiple datasets and multiple configurations of baseline correction and peak detection methods. Results are assessed for statistical significance and visualised to reveal the performance of each normalisation method, in particular with respect to using no normalisation. The normalisation methods described have been implemented in the freely available MASDA R-package. CONCLUSION: In the general case, normalisation of mass spectra is beneficial to the quality of data. The majority of methods we compared performed significantly better than the case in which no normalisation was used. We have shown that normalisation methods that scale spectra by a factor based on the dispersion (e.g., standard deviation) of the data clearly outperform those where a factor based on the central location (e.g., mean) is used. Additional improvements in performance are obtained when these factors are estimated locally, using a sliding window within spectra, instead of globally, over full spectra. The underperforming category of methods using a globally estimated factor based on the central location of the data includes the method used by the majority of SELDI users.
Asunto(s)
Espectrometría de Masa por Láser de Matriz Asistida de Ionización Desorción/métodos , Espectrometría de Masa por Láser de Matriz Asistida de Ionización Desorción/normas , Valores de Referencia , Espectrometría de Masa por Láser de Matriz Asistida de Ionización Desorción/estadística & datos numéricosRESUMEN
In this paper, we demonstrate in situ synthesis of oligonucleotide probes on poly(dimethylsiloxane) (PDMS) microchannels through use of conventional phosphoramidite chemistry. PDMS polymer was moulded into a series of microchannels using standard soft lithography (micro-moulding), with dimensions <100 microm. The surface of the PDMS was derivatized by exposure to ultraviolet/ozone followed by vapour phase deposition of glycidoxypropyltrimethoxysilane and reaction with poly(ethylene glycol) spacer, resulting in a reactive surface for oligonucleotide coupling. High, reproducible yields were achieved for both 6mer and 21mer probes as assessed by hybridization to fluorescent oligonucleotides. Oligonucleotide surface density was comparable with that obtained on glass substrates. These results suggest PDMS as a stable and flexible alternative to glass as a suitable substrate in the fabrication and synthesis of DNA microarrays.
Asunto(s)
Dimetilpolisiloxanos/química , Análisis de Secuencia por Matrices de Oligonucleótidos , Sondas de Oligonucleótidos/síntesis química , Siliconas/química , Colorantes Fluorescentes/química , Indicadores y Reactivos , Técnicas Analíticas Microfluídicas , Análisis de Secuencia por Matrices de Oligonucleótidos/instrumentación , Sondas de Oligonucleótidos/química , Polietilenglicoles/químicaRESUMEN
Chromatin-state analysis is widely applied in the studies of development and diseases. However, existing methods operate at a single length scale, and therefore cannot distinguish large domains from isolated elements of the same type. To overcome this limitation, we present a hierarchical hidden Markov model, diHMM, to systematically annotate chromatin states at multiple length scales. We apply diHMM to analyse a public ChIP-seq data set. diHMM not only accurately captures nucleosome-level information, but identifies domain-level states that vary in nucleosome-level state composition, spatial distribution and functionality. The domain-level states recapitulate known patterns such as super-enhancers, bivalent promoters and Polycomb repressed regions, and identify additional patterns whose biological functions are not yet characterized. By integrating chromatin-state information with gene expression and Hi-C data, we identify context-dependent functions of nucleosome-level states. Thus, diHMM provides a powerful tool for investigating the role of higher-order chromatin structure in gene regulation.