Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nat Methods ; 15(7): 505-511, 2018 07.
Artigo em Inglês | MEDLINE | ID: mdl-29867192

RESUMO

Specialized RNA-seq methods are required to identify the 5' ends of transcripts, which are critical for studies of gene regulation, but these methods have not been systematically benchmarked. We directly compared six such methods, including the performance of five methods on a single human cellular RNA sample and a new spike-in RNA assay that helps circumvent challenges resulting from uncertainties in annotation and RNA processing. We found that the 'cap analysis of gene expression' (CAGE) method performed best for mRNA and that most of its unannotated peaks were supported by evidence from other genomic methods. We applied CAGE to eight brain-related samples and determined sample-specific transcription start site (TSS) usage, as well as a transcriptome-wide shift in TSS usage between fetal and adult brain.


Assuntos
RNA/química , Análise de Sequência de RNA/métodos , Sequência de Bases , Encéfalo , Células-Tronco Embrionárias , Biblioteca Gênica , Humanos , RNA/genética , RNA/metabolismo
3.
Bioinformatics ; 25(20): 2639-45, 2009 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-19628506

RESUMO

MOTIVATION: The roughness of energy landscapes is a major obstacle to protein structure prediction, since it forces conformational searches to spend much time struggling to escape numerous traps. Specifically, beta-sheet formation is prone to stray, since many possible combinations of hydrogen bonds are dead ends in terms of beta-sheet assembly. It has been shown that cooperative terms for backbone hydrogen bonds ease this problem by augmenting hydrogen bond patterns that are consistent with beta sheets. Here, we present a novel cooperative hydrogen-bond term that is both effective in promoting beta sheets and computationally efficient. In addition, the new term is differentiable and operates on all-atom protein models. RESULTS: Energy optimization of poly-alanine chains under the new term led to significantly more beta-sheet content than optimization under a non-cooperative term. Furthermore, the optimized structure included very few non-native patterns. AVAILABILITY: The new term is implemented within the MESHI package and is freely available at http://cs.bgu.ac.il/ approximately meshi.


Assuntos
Estrutura Secundária de Proteína , Proteínas/química , Simulação por Computador , Bases de Dados de Proteínas , Ligação de Hidrogênio , Modelos Moleculares , Dobramento de Proteína , Termodinâmica
4.
Nat Med ; 26(9): 1375-1379, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32778826

RESUMO

The promise of precision medicine lies in data diversity. More than the sheer size of biomedical data, it is the layering of multiple data modalities, offering complementary perspectives, that is thought to enable the identification of patient subgroups with shared pathophysiology. In the present study, we use autism to test this notion. By combining healthcare claims, electronic health records, familial whole-exome sequences and neurodevelopmental gene expression patterns, we identified a subgroup of patients with dyslipidemia-associated autism.


Assuntos
Transtorno Autístico/diagnóstico , Dislipidemias/diagnóstico , Medicina de Precisão/métodos , Transtorno Autístico/genética , Transtorno Autístico/patologia , Dislipidemias/genética , Dislipidemias/patologia , Registros Eletrônicos de Saúde , Exoma/genética , Feminino , Predisposição Genética para Doença/genética , Humanos , Lipídeos/sangue , Masculino , Técnicas de Diagnóstico Molecular , Sequenciamento do Exoma
5.
Genome Biol ; 16: 195, 2015 Sep 17.
Artigo em Inglês | MEDLINE | ID: mdl-26381377

RESUMO

Allelic expression analysis has become important for integrating genome and transcriptome data to characterize various biological phenomena such as cis-regulatory variation and nonsense-mediated decay. We analyze the properties of allelic expression read count data and technical sources of error, such as low-quality or double-counted RNA-seq reads, genotyping errors, allelic mapping bias, and technical covariates due to sample preparation and sequencing, and variation in total read depth. We provide guidelines for correcting such errors, show that our quality control measures improve the detection of relevant allelic expression, and introduce tools for the high-throughput production of allelic expression data from RNA-sequencing data.


Assuntos
Alelos , Perfilação da Expressão Gênica/métodos , Software , Linhagem Celular , Interpretação Estatística de Dados , Expressão Gênica , Perfilação da Expressão Gênica/normas , Técnicas de Genotipagem/normas , Humanos , Análise de Sequência de RNA
6.
Curr Protoc Bioinformatics ; 43: 11.10.1-11.10.33, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-25431634

RESUMO

This unit describes how to use BWA and the Genome Analysis Toolkit (GATK) to map genome sequencing data to a reference and produce high-quality variant calls that can be used in downstream analyses. The complete workflow includes the core NGS data processing steps that are necessary to make the raw data suitable for analysis by the GATK, as well as the key methods involved in variant discovery using the GATK.


Assuntos
Variação Genética , Genoma Humano , Software , Calibragem , Bases de Dados Genéticas , Haploidia , Haplótipos/genética , Humanos , Anotação de Sequência Molecular , Polimorfismo de Nucleotídeo Único/genética , Alinhamento de Sequência
7.
Biol Direct ; 7: 37, 2012 Oct 30.
Artigo em Inglês | MEDLINE | ID: mdl-23111013

RESUMO

BACKGROUND: The dramatic reduction in the cost of sequencing has allowed many researchers to join in the effort of sequencing and annotating prokaryotic genomes. Annotation methods vary considerably and may fail to identify some genes. Here we draw attention to a large number of likely genes missing from annotations using common tools such as Glimmer and BLAST. RESULTS: By analyzing 1,474 prokaryotic genome annotations in GenBank, we identify 13,602 likely missed genes that are homologs to non-hypothetical proteins, and 11,792 likely missed genes that are homologs only to hypothetical proteins, yet have supporting evidence of their protein-coding nature from COMBREX, a newly created gene function database. We also estimate the likelihood that each potential missing gene found is a genuine protein-coding gene using COMBREX. CONCLUSIONS: Our analysis of the causes of missed genes suggests that larger annotation centers tend to produce annotations with fewer missed genes than smaller centers, and many of the missed genes are short genes <300 bp. Over 1,000 of the likely missed genes could be associated with phenotype information available in COMBREX. 359 of these genes, found in pathogenic organisms, may be potential targets for pharmaceutical research. The newly identified genes are available on COMBREX's website. REVIEWERS: This article was reviewed by Daniel Haft, Arcady Mushegian, and M. Pilar Francino (nominated by David Ardell).


Assuntos
Bases de Dados de Ácidos Nucleicos , Genes Bacterianos , Anotação de Sequência Molecular/métodos , Fases de Leitura Aberta , Bactérias/genética , Biologia Computacional/métodos , Variação Genética , Genoma Bacteriano , Alinhamento de Sequência , Análise de Sequência de DNA , Homologia de Sequência , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA