Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros

Base de dados
Ano de publicação
Tipo de documento
Intervalo de ano de publicação
1.
BMC Bioinformatics ; 14 Suppl 14: S10, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24267485

RESUMO

BACKGROUND: Advances in technology have led to the generation of massive amounts of complex and multifarious biological data in areas ranging from genomics to structural biology. The volume and complexity of such data leads to significant challenges in terms of its analysis, especially when one seeks to generate hypotheses or explore the underlying biological processes. At the state-of-the-art, the application of automated algorithms followed by perusal and analysis of the results by an expert continues to be the predominant paradigm for analyzing biological data. This paradigm works well in many problem domains. However, it also is limiting, since domain experts are forced to apply their instincts and expertise such as contextual reasoning, hypothesis formulation, and exploratory analysis after the algorithm has produced its results. In many areas where the organization and interaction of the biological processes is poorly understood and exploratory analysis is crucial, what is needed is to integrate domain expertise during the data analysis process and use it to drive the analysis itself. RESULTS: In context of the aforementioned background, the results presented in this paper describe advancements along two methodological directions. First, given the context of biological data, we utilize and extend a design approach called experiential computing from multimedia information system design. This paradigm combines information visualization and human-computer interaction with algorithms for exploratory analysis of large-scale and complex data. In the proposed approach, emphasis is laid on: (1) allowing users to directly visualize, interact, experience, and explore the data through interoperable visualization-based and algorithmic components, (2) supporting unified query and presentation spaces to facilitate experimentation and exploration, (3) providing external contextual information by assimilating relevant supplementary data, and (4) encouraging user-directed information visualization, data exploration, and hypotheses formulation. Second, to illustrate the proposed design paradigm and measure its efficacy, we describe two prototype web applications. The first, called XMAS (Experiential Microarray Analysis System) is designed for analysis of time-series transcriptional data. The second system, called PSPACE (Protein Space Explorer) is designed for holistic analysis of structural and structure-function relationships using interactive low-dimensional maps of the protein structure space. Both these systems promote and facilitate human-computer synergy, where cognitive elements such as domain knowledge, contextual reasoning, and purpose-driven exploration, are integrated with a host of powerful algorithmic operations that support large-scale data analysis, multifaceted data visualization, and multi-source information integration. CONCLUSIONS: The proposed design philosophy, combines visualization, algorithmic components and cognitive expertise into a seamless processing-analysis-exploration framework that facilitates sense-making, exploration, and discovery. Using XMAS, we present case studies that analyze transcriptional data from two highly complex domains: gene expression in the placenta during human pregnancy and reaction of marine organisms to heat stress. With PSPACE, we demonstrate how complex structure-function relationships can be explored. These results demonstrate the novelty, advantages, and distinctions of the proposed paradigm. Furthermore, the results also highlight how domain insights can be combined with algorithms to discover meaningful knowledge and formulate evidence-based hypotheses during the data analysis process. Finally, user studies against comparable systems indicate that both XMAS and PSPACE deliver results with better interpretability while placing lower cognitive loads on the users. XMAS is available at: http://tintin.sfsu.edu:8080/xmas. PSPACE is available at: http://pspace.info/.


Assuntos
Expressão Gênica , Proteínas/química , Algoritmos , Computadores , Feminino , Genômica , Humanos , Modelos Moleculares , Gravidez , Estrutura Terciária de Proteína , Proteínas/genética
2.
BMC Bioinformatics ; 14 Suppl 2: S20, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23368815

RESUMO

BACKGROUND: Disulfide bonds constitute one of the most important cross-linkages in proteins and significantly influence protein structure and function. At the state-of-the-art, various methodological frameworks have been proposed for identification of disulfide bonds. These include among others, mass spectrometry-based methods, sequence-based predictive approaches, as well as techniques like crystallography and NMR. Each of these frameworks has its advantages and disadvantages in terms of pre-requisites for applicability, throughput, and accuracy. Furthermore, the results from different methods may concur or conflict in parts. RESULTS: In this paper, we propose a novel and theoretically rigorous framework for disulfide bond determination based on information fusion from different methods using an extended formulation of Dempster-Shafer theory. A key advantage of our approach is that it can automatically deal with concurring as well as conflicting evidence in a data-driven manner. Using the proposed framework, we have developed a method for disulfide bond determination that combines results from sequence-based prediction and mass spectrometric inference. This method leads to more accurate disulfide bond determination than any of the constituent methods taken individually. Furthermore, experiments indicate that the method improves the accuracy of bond identification as compared to leading extant methods at the state-of-the-art. Finally, the proposed framework is extensible in that results from any number of approaches can be incorporated. Results obtained using this framework can especially be useful in cases where the complexity of the bonding patterns coupled with specificities of the fragmentation pattern or limitations of computational models impair any single method to perform consistently across a diverse set of molecules.


Assuntos
Dissulfetos/química , Modelos Teóricos , Conformação Proteica , Proteínas/química , Algoritmos , Simulação por Computador , Espectrometria de Massas
3.
IEEE Trans Nanobioscience ; 12(4): 340-2, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24425101

RESUMO

MS2DB++ is a web server for computationally determining disulfide connectivity in proteins by combining evidence from multiple methods. The constituent methods implemented as part of the MS2DB++ webserver include a mass spectrometry-based method and two protein sequence-based predictive methods. The software also allows users to incorporate results from up to two other external methods of choice. The results from all these methods are combined using Dempster-Shafer theory through the use of four different formulations for evidence combination. In practice, MS2DB++ can be especially helpful in obtaining the disulfide topology in cases where no single method performs consistently across a set of molecules due to complexity of the bonding topology, specificities of the fragmentation pattern, or limitations of computational models.


Assuntos
Biologia Computacional/métodos , Dissulfetos/química , Internet , Proteínas/química , Software , Espectrometria de Massas , Modelos Estatísticos
4.
IEEE Trans Nanobioscience ; 12(2): 69-71, 2013 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-23096131

RESUMO

MS2DB+ is an open-source platform-independent web application for determining, in polynomial time, the disulfide linkages in proteins using tandem mass spectrometry (MS/MS) data. It utilizes an efficient approximation algorithm which allows the consideration of multiple ion-types (a, a(o), a*, b, b(o), b*, c, x, y, y(o), y*, and z) in the analysis. Once putative disulfide bonds are identified, a graph optimization approach is used to obtain the most likely global disulfide connectivity pattern.


Assuntos
Dissulfetos/química , Software , Algoritmos , Íons , Proteínas/química , Espectrometria de Massas em Tandem
5.
BMC Bioinformatics ; 12 Suppl 1: S12, 2011 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-21342541

RESUMO

BACKGROUND: Determining the disulfide (S-S) bond pattern in a protein is often crucial for understanding its structure and function. In recent research, mass spectrometry (MS) based analysis has been applied to this problem following protein digestion under both partial reduction and non-reduction conditions. However, this paradigm still awaits solutions to certain algorithmic problems fundamental amongst which is the efficient matching of an exponentially growing set of putative S-S bonded structural alternatives to the large amounts of experimental spectrometric data. Current methods circumvent this challenge primarily through simplifications, such as by assuming only the occurrence of certain ion-types (b-ions and y-ions) that predominate in the more popular dissociation methods, such as collision-induced dissociation (CID). Unfortunately, this can adversely impact the quality of results. METHOD: We present an algorithmic approach to this problem that can, with high computational efficiency, analyze multiple ions types (a, b, bo, b*, c, x, y, yo, y*, and z) and deal with complex bonding topologies, such as inter/intra bonding involving more than two peptides. The proposed approach combines an approximation algorithm-based search formulation with data driven parameter estimation. This formulation considers only those regions of the search space where the correct solution resides with a high likelihood. Putative disulfide bonds thus obtained are finally combined in a globally consistent pattern to yield the overall disulfide bonding topology of the molecule. Additionally, each bond is associated with a confidence score, which aids in interpretation and assimilation of the results. RESULTS: The method was tested on nine different eukaryotic Glycosyltransferases possessing disulfide bonding topologies of varying complexity. Its performance was found to be characterized by high efficiency (in terms of time and the fraction of search space considered), sensitivity, specificity, and accuracy. The method was also compared with other techniques at the state-of-the-art. It was found to perform as well or better than the competing techniques. An implementation is available at: http://tintin.sfsu.edu/~whemurad/disulfidebond. CONCLUSIONS: This research addresses some of the significant challenges in MS-based disulfide bond determination. To the best of our knowledge, this is the first algorithmic work that can consider multiple ion types in this problem setting while simultaneously ensuring polynomial time complexity and high accuracy of results.


Assuntos
Algoritmos , Dissulfetos/análise , Proteínas/análise , Biologia Computacional/métodos , Glicosiltransferases/análise , Íons/análise , Espectrometria de Massas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA