Pesquisa | Portal de Pesquisa da BVS

Comprehensive comparative analysis of 5'-end RNA-sequencing methods.

Adiconis, Xian; Haber, Adam L; Simmons, Sean K; Levy Moonshine, Ami; Ji, Zhe; Busby, Michele A; Shi, Xi; Jacques, Justin; Lancaster, Madeline A; Pan, Jen Q; Regev, Aviv; Levin, Joshua Z.

Nat Methods ; 15(7): 505-511, 2018 07.

Artigo em Inglês | MEDLINE | ID: mdl-29867192

RESUMO

Specialized RNA-seq methods are required to identify the 5' ends of transcripts, which are critical for studies of gene regulation, but these methods have not been systematically benchmarked. We directly compared six such methods, including the performance of five methods on a single human cellular RNA sample and a new spike-in RNA assay that helps circumvent challenges resulting from uncertainties in annotation and RNA processing. We found that the 'cap analysis of gene expression' (CAGE) method performed best for mRNA and that most of its unannotated peaks were supported by evidence from other genomic methods. We applied CAGE to eight brain-related samples and determined sample-specific transcription start site (TSS) usage, as well as a transcriptome-wide shift in TSS usage between fetal and adult brain.

Assuntos

RNA/química , Análise de Sequência de RNA/métodos , Sequência de Bases , Encéfalo , Células-Tronco Embrionárias , Biblioteca Gênica , Humanos , RNA/genética , RNA/metabolismo

The COMBREX project: design, methodology, and initial results.

Anton, Brian P; Chang, Yi-Chien; Brown, Peter; Choi, Han-Pil; Faller, Lina L; Guleria, Jyotsna; Hu, Zhenjun; Klitgord, Niels; Levy-Moonshine, Ami; Maksad, Almaz; Mazumdar, Varun; McGettrick, Mark; Osmani, Lais; Pokrzywa, Revonda; Rachlin, John; Swaminathan, Rajeswari; Allen, Benjamin; Housman, Genevieve; Monahan, Caitlin; Rochussen, Krista; Tao, Kevin; Bhagwat, Ashok S; Brenner, Steven E; Columbus, Linda; de Crécy-Lagard, Valérie; Ferguson, Donald; Fomenkov, Alexey; Gadda, Giovanni; Morgan, Richard D; Osterman, Andrei L; Rodionov, Dmitry A; Rodionova, Irina A; Rudd, Kenneth E; Söll, Dieter; Spain, James; Xu, Shuang-Yong; Bateman, Alex; Blumenthal, Robert M; Bollinger, J Martin; Chang, Woo-Suk; Ferrer, Manuel; Friedberg, Iddo; Galperin, Michael Y; Gobeill, Julien; Haft, Daniel; Hunt, John; Karp, Peter; Klimke, William; Krebs, Carsten; Macelis, Dana.

PLoS Biol ; 11(8): e1001638, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-24013487

Assuntos

Genômica/métodos , Humanos , Modelos Teóricos

Enhancement of beta-sheet assembly by cooperative hydrogen bonds potential.

Levy-Moonshine, Ami; Amir, El-Ad David; Keasar, Chen.

Bioinformatics ; 25(20): 2639-45, 2009 Oct 15.

Artigo em Inglês | MEDLINE | ID: mdl-19628506

RESUMO

MOTIVATION: The roughness of energy landscapes is a major obstacle to protein structure prediction, since it forces conformational searches to spend much time struggling to escape numerous traps. Specifically, beta-sheet formation is prone to stray, since many possible combinations of hydrogen bonds are dead ends in terms of beta-sheet assembly. It has been shown that cooperative terms for backbone hydrogen bonds ease this problem by augmenting hydrogen bond patterns that are consistent with beta sheets. Here, we present a novel cooperative hydrogen-bond term that is both effective in promoting beta sheets and computationally efficient. In addition, the new term is differentiable and operates on all-atom protein models. RESULTS: Energy optimization of poly-alanine chains under the new term led to significantly more beta-sheet content than optimization under a non-cooperative term. Furthermore, the optimized structure included very few non-native patterns. AVAILABILITY: The new term is implemented within the MESHI package and is freely available at http://cs.bgu.ac.il/ approximately meshi.

Assuntos

Estrutura Secundária de Proteína , Proteínas/química , Simulação por Computador , Bases de Dados de Proteínas , Ligação de Hidrogênio , Modelos Moleculares , Dobramento de Proteína , Termodinâmica

A multidimensional precision medicine approach identifies an autism subtype characterized by dyslipidemia.

Luo, Yuan; Eran, Alal; Palmer, Nathan; Avillach, Paul; Levy-Moonshine, Ami; Szolovits, Peter; Kohane, Isaac S.

Nat Med ; 26(9): 1375-1379, 2020 09.

Artigo em Inglês | MEDLINE | ID: mdl-32778826

RESUMO

The promise of precision medicine lies in data diversity. More than the sheer size of biomedical data, it is the layering of multiple data modalities, offering complementary perspectives, that is thought to enable the identification of patient subgroups with shared pathophysiology. In the present study, we use autism to test this notion. By combining healthcare claims, electronic health records, familial whole-exome sequences and neurodevelopmental gene expression patterns, we identified a subgroup of patients with dyslipidemia-associated autism.

Assuntos

Transtorno Autístico/diagnóstico , Dislipidemias/diagnóstico , Medicina de Precisão/métodos , Transtorno Autístico/genética , Transtorno Autístico/patologia , Dislipidemias/genética , Dislipidemias/patologia , Registros Eletrônicos de Saúde , Exoma/genética , Feminino , Predisposição Genética para Doença/genética , Humanos , Lipídeos/sangue , Masculino , Técnicas de Diagnóstico Molecular , Sequenciamento do Exoma

Tools and best practices for data processing in allelic expression analysis.

Castel, Stephane E; Levy-Moonshine, Ami; Mohammadi, Pejman; Banks, Eric; Lappalainen, Tuuli.

Genome Biol ; 16: 195, 2015 Sep 17.

Artigo em Inglês | MEDLINE | ID: mdl-26381377

RESUMO

Allelic expression analysis has become important for integrating genome and transcriptome data to characterize various biological phenomena such as cis-regulatory variation and nonsense-mediated decay. We analyze the properties of allelic expression read count data and technical sources of error, such as low-quality or double-counted RNA-seq reads, genotyping errors, allelic mapping bias, and technical covariates due to sample preparation and sequencing, and variation in total read depth. We provide guidelines for correcting such errors, show that our quality control measures improve the detection of relevant allelic expression, and introduce tools for the high-throughput production of allelic expression data from RNA-sequencing data.

Assuntos

Alelos , Perfilação da Expressão Gênica/métodos , Software , Linhagem Celular , Interpretação Estatística de Dados , Expressão Gênica , Perfilação da Expressão Gênica/normas , Técnicas de Genotipagem/normas , Humanos , Análise de Sequência de RNA

From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.

Van der Auwera, Geraldine A; Carneiro, Mauricio O; Hartl, Christopher; Poplin, Ryan; Del Angel, Guillermo; Levy-Moonshine, Ami; Jordan, Tadeusz; Shakir, Khalid; Roazen, David; Thibault, Joel; Banks, Eric; Garimella, Kiran V; Altshuler, David; Gabriel, Stacey; DePristo, Mark A.

Curr Protoc Bioinformatics ; 43: 11.10.1-11.10.33, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-25431634

RESUMO

This unit describes how to use BWA and the Genome Analysis Toolkit (GATK) to map genome sequencing data to a reference and produce high-quality variant calls that can be used in downstream analyses. The complete workflow includes the core NGS data processing steps that are necessary to make the raw data suitable for analysis by the GATK, as well as the key methods involved in variant discovery using the GATK.

Assuntos

Variação Genética , Genoma Humano , Software , Calibragem , Bases de Dados Genéticas , Haploidia , Haplótipos/genética , Humanos , Anotação de Sequência Molecular , Polimorfismo de Nucleotídeo Único/genética , Alinhamento de Sequência

Thousands of missed genes found in bacterial genomes and their analysis with COMBREX.

Wood, Derrick E; Lin, Henry; Levy-Moonshine, Ami; Swaminathan, Rajiswari; Chang, Yi-Chien; Anton, Brian P; Osmani, Lais; Steffen, Martin; Kasif, Simon; Salzberg, Steven L.

Biol Direct ; 7: 37, 2012 Oct 30.

Artigo em Inglês | MEDLINE | ID: mdl-23111013

RESUMO

BACKGROUND: The dramatic reduction in the cost of sequencing has allowed many researchers to join in the effort of sequencing and annotating prokaryotic genomes. Annotation methods vary considerably and may fail to identify some genes. Here we draw attention to a large number of likely genes missing from annotations using common tools such as Glimmer and BLAST. RESULTS: By analyzing 1,474 prokaryotic genome annotations in GenBank, we identify 13,602 likely missed genes that are homologs to non-hypothetical proteins, and 11,792 likely missed genes that are homologs only to hypothetical proteins, yet have supporting evidence of their protein-coding nature from COMBREX, a newly created gene function database. We also estimate the likelihood that each potential missing gene found is a genuine protein-coding gene using COMBREX. CONCLUSIONS: Our analysis of the causes of missed genes suggests that larger annotation centers tend to produce annotations with fewer missed genes than smaller centers, and many of the missed genes are short genes <300 bp. Over 1,000 of the likely missed genes could be associated with phenotype information available in COMBREX. 359 of these genes, found in pathogenic organisms, may be potential targets for pharmaceutical research. The newly identified genes are available on COMBREX's website. REVIEWERS: This article was reviewed by Daniel Haft, Arcady Mushegian, and M. Pilar Francino (nominated by David Ardell).

Assuntos

Bases de Dados de Ácidos Nucleicos , Genes Bacterianos , Anotação de Sequência Molecular/métodos , Fases de Leitura Aberta , Bactérias/genética , Biologia Computacional/métodos , Variação Genética , Genoma Bacteriano , Alinhamento de Sequência , Análise de Sequência de DNA , Homologia de Sequência , Software

RESUMO

Assuntos

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA