Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
PLoS One ; 12(9): e0182438, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28926565

RESUMEN

In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of Single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 genomes project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from a raw genetic data, coding its genotype as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool is easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine.


Asunto(s)
Polimorfismo de Nucleótido Simple , Interfaz Usuario-Computador , Frecuencia de los Genes , Genoma Humano , Genotipo , Humanos , Internet
2.
Bioinformatics ; 33(23): 3793-3795, 2017 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-28582503

RESUMEN

MOTIVATION: High throughput chromatin conformation capture (3C) technologies, such as Hi-C and ChIA-PET, have the potential to elucidate the functional roles of non-coding variants. However, most of published genome-wide unbiased chromatin organization studies have used cultured cell lines, limiting their generalizability. RESULTS: We developed a web browser, HUGIn, to visualize Hi-C data generated from 21 human primary tissues and cell lines. HUGIn enables assessment of chromatin contacts both constitutive across and specific to tissue(s) and/or cell line(s) at any genomic loci, including GWAS SNPs, eQTLs and cis-regulatory elements, facilitating the understanding of both GWAS and eQTL results and functional genomics data. AVAILABILITY AND IMPLEMENTATION: HUGIn is available at http://yunliweb.its.unc.edu/HUGIn. CONTACT: yunli@med.unc.edu or hum@ccf.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Cromatina/ultraestructura , Estudio de Asociación del Genoma Completo/métodos , Genómica/métodos , Sitios de Carácter Cuantitativo , Secuencias Reguladoras de Ácidos Nucleicos , Programas Informáticos , Línea Celular , Cromatina/metabolismo , Humanos , Polimorfismo de Nucleótido Simple
3.
Nat Methods ; 12(11): 1077-84, 2015 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-26414013

RESUMEN

We report Single Molecule Cluster Analysis (SiMCAn), which utilizes hierarchical clustering of hidden Markov modeling-fitted single-molecule fluorescence resonance energy transfer (smFRET) trajectories to dissect the complex conformational dynamics of biomolecular machines. We used this method to study the conformational dynamics of a precursor mRNA during the splicing cycle as carried out by the spliceosome. By clustering common dynamic behaviors derived from selectively blocked splicing reactions, SiMCAn was able to identify the signature conformations and dynamic behaviors of multiple ATP-dependent intermediates. In addition, it identified an open conformation adopted late in splicing by a 3' splice-site mutant, invoking a mechanism for substrate proofreading. SiMCAn enables rapid interpretation of complex single-molecule behaviors and should prove useful for the comprehensive analysis of a plethora of dynamic cellular machines.


Asunto(s)
Análisis por Conglomerados , Precursores del ARN/química , Empalme del ARN , Adenosina Trifosfato/química , Dominio Catalítico , Simulación por Computador , Transferencia Resonante de Energía de Fluorescencia , Colorantes Fluorescentes/química , Humanos , Intrones , Cadenas de Markov , Mutación , Conformación de Ácido Nucleico , Sitios de Empalme de ARN , ARN Mensajero/química , Empalmosomas/química
4.
PLoS One ; 9(2): e85041, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24586236

RESUMEN

The folding of linear polymers into discrete three-dimensional structures is often required for biological function. The formation of long-lived intermediates is a hallmark of the folding of large RNA molecules due to the ruggedness of their energy landscapes. The precise thermodynamic nature of the barriers (whether enthalpic or entropic) that leads to intermediate formation is still poorly characterized in large structured RNA molecules. A classic approach to analyzing kinetic barriers are temperature dependent studies analyzed with Eyring's transition state theory. We applied Eyring's theory to time-resolved hydroxyl radical (•OH) footprinting kinetics progress curves collected at eight temperature from 21.5 °C to 51 °C to characterize the thermodynamic nature of folding intermediate formation for the Mg(2+)-mediated folding of the Tetrahymena thermophila group I ribozyme. A common kinetic model configuration describes this RNA folding reaction over the entire temperature range studied consisting of primary (fast) transitions to misfolded intermediates followed by much slower secondary transitions, consistent with previous studies. Eyring analysis reveals that the primary transitions are moderate in magnitude and primarily enthalpic in nature. In contrast, the secondary transitions are daunting in magnitude and entropic in nature. The entropic character of the secondary transitions is consistent with structural rearrangement of the intermediate species to the final folded form. This segregation of kinetic control reveals distinctly different molecular mechanisms during the two stages of RNA folding and documents the importance of entropic barriers to defining rugged RNA folding landscapes.


Asunto(s)
Pliegue del ARN/genética , Secuencia de Bases , Radical Hidroxilo/metabolismo , Cinética , Magnesio/metabolismo , Datos de Secuencia Molecular , Conformación de Ácido Nucleico , ARN Catalítico/genética , ARN Catalítico/metabolismo , ARN Protozoario/genética , Temperatura , Tetrahymena thermophila/genética , Tetrahymena thermophila/metabolismo , Termodinámica
5.
PLoS Comput Biol ; 9(7): e1003152, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23935473

RESUMEN

Sequence conservation and co-variation of base pairs are hallmarks of structured RNAs. For certain RNAs (e.g. riboswitches), a single sequence must adopt at least two alternative secondary structures to effectively regulate the message. If alternative secondary structures are important to the function of an RNA, we expect to observe evolutionary co-variation supporting multiple conformations. We set out to characterize the evolutionary co-variation supporting alternative conformations in riboswitches to determine the extent to which alternative secondary structures are conserved. We found strong co-variation support for the terminator, P1, and anti-terminator stems in the purine riboswitch by extending alignments to include terminator sequences. When we performed Boltzmann suboptimal sampling on purine riboswitch sequences with terminators we found that these sequences appear to have evolved to favor specific alternative conformations. We extended our analysis of co-variation to classic alignments of group I/II introns, tRNA, and other classes of riboswitches. In a majority of these RNAs, we found evolutionary evidence for alternative conformations that are compatible with the Boltzmann suboptimal ensemble. Our analyses suggest that alternative conformations are selected for and thus likely play functional roles in even the most structured of RNAs.


Asunto(s)
Evolución Molecular , Conformación de Ácido Nucleico , ARN Bacteriano/genética , ARN Bacteriano/química
6.
BMC Genomics ; 13 Suppl 4: S6, 2012 Jun 18.
Artículo en Inglés | MEDLINE | ID: mdl-22759654

RESUMEN

The structure of RiboNucleic Acid (RNA) has the potential to be altered by a Single Nucleotide Polymorphism (SNP). Disease-associated SNPs mapping to non-coding regions of the genome that are transcribed into RiboNucleic Acid (RNA) can potentially affect cellular regulation (and cause disease) by altering the structure of the transcript. We performed a large-scale meta-analysis of Selective 2'-Hydroxyl Acylation analyzed by Primer Extension (SHAPE) data, which probes the structure of RNA. We found that several single point mutations exist that significantly disrupt RNA secondary structure in the five transcripts we analyzed. Thus, every RNA that is transcribed has the potential to be a "RiboSNitch;" where a SNP causes a large conformational change that alters regulatory function. Predicting the SNPs that will have the largest effect on RNA structure remains a contemporary computational challenge. We therefore benchmarked the most popular RNA structure prediction algorithms for their ability to identify mutations that maximally affect structure. We also evaluated metrics for rank ordering the extent of the structural change. Although no single algorithm/metric combination dramatically outperformed the others, small differences in AUC (Area Under the Curve) values reveal that certain approaches do provide better agreement with experiment. The experimental data we analyzed nonetheless show that multiple single point mutations exist in all RNA transcripts that significantly disrupt structure in agreement with the predictions.


Asunto(s)
Polimorfismo de Nucleótido Simple/genética , ARN/química , ARN/genética , Algoritmos , Mutación , Conformación de Ácido Nucleico
7.
RNA ; 18(1): 77-87, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22109839

RESUMEN

A majority of SNPs (single nucleotide polymorphisms) map to noncoding and intergenic regions of the genome. Noncoding SNPs are often identified in genome-wide association studies (GWAS) as strongly associated with human disease. Two such disease-associated SNPs in the 5' UTR of the human FTL (Ferritin Light Chain) gene are predicted to alter the ensemble of structures adopted by the mRNA. High-accuracy single nucleotide resolution chemical mapping reveals that these SNPs result in substantial changes in the structural ensemble in agreement with the computational prediction. Furthermore six rescue mutations are correctly predicted to restore the mRNA to its wild-type ensemble. Our data confirm that the FTL 5' UTR is a "RiboSNitch," an RNA that changes structure if a particular disease-associated SNP is present. The structural change observed is analogous to that of a bacterial Riboswitch in that it likely regulates translation. These data further suggest that specific pairs of SNPs in high linkage disequilibrium (LD) will form RNA structure-stabilizing haplotypes (SSHs). We identified 484 SNP pairs that form SSHs in UTRs of the human genome, and in eight of the 10 SSH-containing transcripts, SNP pairs stabilize RNA protein binding sites. The ubiquitous nature of SSHs in the transcriptome suggests that certain haplotypes are conserved to avoid RiboSNitch formation.


Asunto(s)
Regiones no Traducidas 5'/genética , Genoma Humano/genética , Desequilibrio de Ligamiento , ARN/genética , Transcriptoma/genética , Apoferritinas/genética , Haplotipos , Humanos , Mutación , Conformación de Ácido Nucleico , Polimorfismo de Nucleótido Simple , ARN/química , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismo
8.
PLoS Genet ; 6(8): e1001074, 2010 Aug 19.
Artículo en Inglés | MEDLINE | ID: mdl-20808897

RESUMEN

Genome-wide association studies (GWAS) often identify disease-associated mutations in intergenic and non-coding regions of the genome. Given the high percentage of the human genome that is transcribed, we postulate that for some observed associations the disease phenotype is caused by a structural rearrangement in a regulatory region of the RNA transcript. To identify such mutations, we have performed a genome-wide analysis of all known disease-associated Single Nucleotide Polymorphisms (SNPs) from the Human Gene Mutation Database (HGMD) that map to the untranslated regions (UTRs) of a gene. Rather than using minimum free energy approaches (e.g. mFold), we use a partition function calculation that takes into consideration the ensemble of possible RNA conformations for a given sequence. We identified in the human genome disease-associated SNPs that significantly alter the global conformation of the UTR to which they map. For six disease-states (Hyperferritinemia Cataract Syndrome, beta-Thalassemia, Cartilage-Hair Hypoplasia, Retinoblastoma, Chronic Obstructive Pulmonary Disease (COPD), and Hypertension), we identified multiple SNPs in UTRs that alter the mRNA structural ensemble of the associated genes. Using a Boltzmann sampling procedure for sub-optimal RNA structures, we are able to characterize and visualize the nature of the conformational changes induced by the disease-associated mutations in the structural ensemble. We observe in several cases (specifically the 5' UTRs of FTL and RB1) SNP-induced conformational changes analogous to those observed in bacterial regulatory Riboswitches when specific ligands bind. We propose that the UTR and SNP combinations we identify constitute a "RiboSNitch," that is a regulatory RNA in which a specific SNP has a structural consequence that results in a disease phenotype. Our SNPfold algorithm can help identify RiboSNitches by leveraging GWAS data and an analysis of the mRNA structural ensemble.


Asunto(s)
Enfermedad/genética , Mutación , ARN/química , Genoma Humano , Estudio de Asociación del Genoma Completo , Humanos , Conformación de Ácido Nucleico , Polimorfismo de Nucleótido Simple , ARN/genética , Regiones no Traducidas
9.
RNA ; 16(6): 1108-17, 2010 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-20413617

RESUMEN

Structure mapping experiments (using probes such as dimethyl sulfate [DMS], kethoxal, and T1 and V1 RNases) are used to determine the secondary structures of RNA molecules. The process is iterative, combining the results of several probes with constrained minimum free-energy calculations to produce a model of the structure. We aim to evaluate whether particular probes provide more structural information, and specifically, how noise in the data affects the predictions. Our approach involves generating "decoy" RNA structures (using the sFold Boltzmann sampling procedure) and evaluating whether we are able to identify the correct structure from this ensemble of structures. We show that with perfect information, we are always able to identify the optimal structure for five RNAs of known structure. We then collected orthogonal structure mapping data (DMS and RNase T1 digest) under several solution conditions using our high-throughput capillary automated footprinting analysis (CAFA) technique on two group I introns of known structure. Analysis of these data reveals the error rates in the data under optimal (low salt) and suboptimal solution conditions (high MgCl(2)). We show that despite these errors, our computational approach is less sensitive to experimental noise than traditional constraint-based structure prediction algorithms. Finally, we propose a novel approach for visualizing the interaction of chemical and enzymatic mapping data with RNA structure. We project the data onto the first two dimensions of a multidimensional scaling of the sFold-generated decoy structures. We are able to directly visualize the structural information content of structure mapping data and reconcile multiple data sets.


Asunto(s)
Proteínas/química , ARN/química , Secuencia de Bases , Cristalografía por Rayos X , Enzimas/química , Modelos Moleculares , Conformación Molecular , Conformación de Ácido Nucleico , Estructura Secundaria de Proteína , Ribonucleasa T1/química , Ribonucleasa T1/genética , Análisis de Secuencia de ARN/métodos
10.
Proc Natl Acad Sci U S A ; 107(11): 4991-5, 2010 Mar 16.
Artículo en Inglés | MEDLINE | ID: mdl-20194778

RESUMEN

The stochasticity of chromosome organization was investigated by fluorescently labeling genetic loci in live Escherichia coli cells. In spite of the common assumption that the chromosome is well modeled by an unstructured polymer, measurements of the locus distributions reveal that the E. coli chromosome is precisely organized into a nucleoid filament with a linear order. Loci in the body of the nucleoid show a precision of positioning within the cell of better than 10% of the cell length. The precision of interlocus distance of genomically-proximate loci was better than 4% of the cell length. The measured dependence of the precision of interlocus distance on genomic distance singles out intranucleoid interactions as the mechanism responsible for chromosome organization. From the magnitude of the variance, we infer the existence of an as-yet uncharacterized higher-order DNA organization in bacteria. We demonstrate that both the stochastic and average structure of the nucleoid is captured by a fluctuating elastic filament model.


Asunto(s)
Cromosomas Bacterianos/metabolismo , ADN Bacteriano/metabolismo , Escherichia coli/metabolismo , Escherichia coli/citología , Escherichia coli/genética , Sitios Genéticos/genética , Modelos Biológicos
11.
Algorithms ; 2(1): 200-214, 2009 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-19865589

RESUMEN

Unlike protein folding, the process by which a large RNA molecule adopts a functionally active conformation remains poorly understood. Chemical mapping techniques, such as Hydroxyl Radical (·OH) footprinting report on local structural changes in an RNA as it folds with single nucleotide resolution. The analysis and interpretation of this kinetic data requires the identification and subsequent optimization of a kinetic model and its parameters. We detail our approach to this problem, specifically focusing on a novel strategy to overcome a factorial explosion in the number of possible models that need to be tested to identify the best fitting model. Previously, smaller systems (less than three intermediates) were computationally tractable using a distributed computing approach. However, for larger systems with three or more intermediates, the problem became computationally intractable. With our new enumeration strategy, we are able to significantly reduce the number of models that need to be tested using non-linear least squares optimization, allowing us to study systems with up to five intermediates. Furthermore, two intermediate systems can now be analyzed on a desktop computer, which eliminates the need for a distributed computing solution for most medium-sized data sets. Our new approach also allows us to study potential degeneracy in kinetic model selection, elucidating the limits of the method when working with large systems. This work establishes clear criteria for determining if experimental ·OH data is sufficient to determine the underlying kinetic model, or if other experimental modalities are required to resolve any degeneracy.

12.
Methods Enzymol ; 468: 47-66, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-20946764

RESUMEN

The use of highly reactive chemical species to probe the structure and dynamics of nucleic acids is greatly simplified by software that enables rapid quantification of the gel images that result from these experiments. Semiautomated footprinting analysis (SAFA) allows a user to quickly and reproducibly quantify a chemical footprinting gel image through a series of steps that rectify, assign, and integrate the relative band intensities. The output of this procedure is raw band intensities that report on the relative reactivity of each nucleotide with the chemical probe. We describe here how to obtain these raw band intensities using SAFA and the subsequent normalization and analysis procedures required to process these data. In particular, we focus on analyzing time-resolved hydroxyl radical ((•)OH) data, which we use to monitor the kinetics of folding of a large RNA (the L-21 T. thermophila group I intron). Exposing the RNA to bursts of (•)OH radicals at specific time points during the folding process monitors the time progress of the reaction. Specifically, we identify protected (nucleotides that become inaccessible to the (•)OH radical probe when folded) and invariant (nucleotides with constant accessibility to the (•)OH probe) residues that we use for monitoring and normalization of the data. With this analysis, we obtain time-progress curves from which we determine kinetic rates of folding. We also report on a data visualization tool implemented in SAFA that allows users to map data onto a secondary structure diagram.


Asunto(s)
Radical Hidroxilo/química , ARN/química , Cinética , Conformación de Ácido Nucleico
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...