Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
BMC Bioinformatics ; 19(1): 82, 2018 03 05.
Artigo em Inglês | MEDLINE | ID: mdl-29506466

RESUMO

BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However less focus is given to exploring the underlying reasons for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. RESULTS: To this end we find most informative basepairs and visualize the effect of these basepairs on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of the particular pair but also provides a large amount of information about which other pairs are present or not present. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. CONCLUSION: We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble.


Assuntos
Algoritmos , Teoria da Informação , Conformação de Ácido Nucleico , RNA/química , Pareamento de Bases/genética , Sequência de Bases , Análise por Conglomerados , Entropia , Mutação/genética
2.
BMC Genomics ; 18(1): 522, 2017 07 10.
Artigo em Inglês | MEDLINE | ID: mdl-28693467

RESUMO

BACKGROUND: Repetitive elements are now known to have relevant cellular functions, including self-complementary sequences that form double stranded (ds) RNA. There are numerous pathways that determine the fate of endogenous dsRNA, and misregulation of endogenous dsRNA is a driver of autoimmune disease, particularly in the brain. Unfortunately, the alignment of high-throughput, short-read sequences to repeat elements poses a dilemma: Such sequences may align equally well to multiple genomic locations. In order to differentiate repeat elements, current alignment methods depend on sequence variation in the reference genome. Reads are discarded when no such variations are present. However, RNA hyper-editing, a possible fate for dsRNA, introduces enough variation to distinguish between repeats that are otherwise identical. RESULTS: To take advantage of this variation, we developed a new algorithm, RepProfile, that simultaneously aligns reads and predicts novel variations. RepProfile accurately aligns hyper-edited reads that other methods discard. In particular we predict hyper-editing of Drosophila melanogaster repeat elements in vivo at levels previously described only in vitro, and provide validation by Sanger sequencing sixty-two individual cloned sequences. We find that hyper-editing is concentrated in genes involved in cell-cell communication at the synapse, including some that are associated with neurodegeneration. We also find that hyper-editing tends to occur in short runs. CONCLUSIONS: Previous studies of RNA hyper-editing discarded ambiguously aligned reads, ignoring hyper-editing in long, perfect dsRNA - the perfect substrate for hyper-editing. We provide a method that simulation and Sanger validation show accurately predicts such RNA editing, yielding a superior picture of hyper-editing.


Assuntos
Drosophila melanogaster/genética , Edição de RNA , Alinhamento de Sequência , Algoritmos , Animais , Rearranjo Gênico , Polimorfismo de Nucleotídeo Único , Sequências Repetitivas de Ácido Nucleico/genética
3.
PLoS Negl Trop Dis ; 3(6): e449, 2009 Jun 02.
Artigo em Inglês | MEDLINE | ID: mdl-19488406

RESUMO

BACKGROUND: Blood flukes of the genus Schistosoma are platyhelminth parasites that infect 200 million people worldwide. Digestion of nutrients from the host bloodstream is essential for parasite development and reproduction. A network of proteolytic enzymes (proteases) facilitates hydrolysis of host hemoglobin and serum proteins. METHODOLOGY/PRINCIPAL FINDINGS: We identified a new cathepsin L termed SmCL3 using PCR strategies based on S. mansoni EST sequence data. An ortholog is present in Schistosoma japonicum. SmCL3 was heterologously expressed as an active enzyme in the yeast, Pichia pastoris. Recombinant SmCL3 has a broad pH activity range against peptidyl substrates and is inhibited by Clan CA protease inhibitors. Consistent with a function in degrading host proteins, SmCL3 hydrolyzes serum albumin and hemoglobin, is localized to the adult gastrodermis, and is expressed mainly in those life stages infecting the mammalian host. The predominant form of SmCL3 in the parasite exists as a zymogen, which is unusual for proteases. This zymogen includes an unusually long prodomain with alpha helical secondary structure motifs. The striking specificity of SmCL3 for amino acids with large aromatic side chains (Trp and Tyr) at the P2 substrate position, as determined with positional scanning-synthetic combinatorial library, is consistent with a molecular model that shows a large and deep S2 pocket. A sequence similarity network (SSN) view clusters SmCL3 and other cathepsins L in accordance with previous large-scale phylogenetic analyses that identify six super kingdoms. CONCLUSIONS/SIGNIFICANCE: SmCL3 is a gut-associated cathepsin L that may contribute to the network of proteases involved in degrading host blood proteins as nutrients. Furthermore, this enzyme exhibits some unusual sequence and biophysical features that may result in additional functions. The visualization of network inter-relationships among cathepsins L suggests that these enzymes are suitable 'marker sequences' for inclusion in future phylogenetic analyses.


Assuntos
Catepsina L/isolamento & purificação , Catepsina L/metabolismo , Schistosoma mansoni/enzimologia , Sequência de Aminoácidos , Animais , Catepsina L/genética , Cricetinae , DNA de Protozoário/genética , Estabilidade Enzimática , Etiquetas de Sequências Expressas , Trato Gastrointestinal/química , Expressão Gênica , Hemoglobinas/metabolismo , Humanos , Concentração de Íons de Hidrogênio , Modelos Moleculares , Dados de Sequência Molecular , Filogenia , Pichia/genética , Reação em Cadeia da Polimerase/métodos , Inibidores de Proteases/farmacologia , Schistosoma mansoni/genética , Homologia de Sequência de Aminoácidos , Albumina Sérica/metabolismo , Caramujos , Especificidade por Substrato
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA