Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Más filtros











Base de datos
Intervalo de año de publicación
1.
Artículo en Inglés | MEDLINE | ID: mdl-34990369

RESUMEN

The encounter of large amounts of biological sequence data generated during the last decades and the algorithmic and hardware improvements have offered the possibility to apply machine learning techniques in bioinformatics. While the machine learning community is aware of the necessity to rigorously distinguish data transformation from data comparison and adopt reasonable combinations thereof, this awareness is often lacking in the field of comparative sequence analysis. With realization of the disadvantages of alignments for sequence comparison, some typical applications use more and more so-called alignment-free approaches. In light of this development, we present a conceptual framework for alignment-free sequence comparison, which highlights the delineation of: 1) the sequence data transformation comprising of adequate mathematical sequence coding and feature generation, from 2) the subsequent (dis-)similarity evaluation of the transformed data by means of problem-specific but mathematically consistent proximity measures. We consider coding to be an information-loss free data transformation in order to get an appropriate representation, whereas feature generation is inevitably information-lossy with the intention to extract just the task-relevant information. This distinction sheds light on the plethora of methods available and assists in identifying suitable methods in machine learning and data analysis to compare the sequences under these premises.


Asunto(s)
Algoritmos , Aprendizaje Automático , Alineación de Secuencia , Análisis de Secuencia , Matemática
2.
Entropy (Basel) ; 23(10)2021 Oct 17.
Artículo en Inglés | MEDLINE | ID: mdl-34682081

RESUMEN

In the present article we propose the application of variants of the mutual information function as characteristic fingerprints of biomolecular sequences for classification analysis. In particular, we consider the resolved mutual information functions based on Shannon-, Rényi-, and Tsallis-entropy. In combination with interpretable machine learning classifier models based on generalized learning vector quantization, a powerful methodology for sequence classification is achieved which allows substantial knowledge extraction in addition to the high classification ability due to the model-inherent robustness. Any potential (slightly) inferior performance of the used classifier is compensated by the additional knowledge provided by interpretable models. This knowledge may assist the user in the analysis and understanding of the used data and considered task. After theoretical justification of the concepts, we demonstrate the approach for various example data sets covering different areas in biomolecular sequence analysis.

3.
Org Lett ; 12(13): 2970-3, 2010 Jul 02.
Artículo en Inglés | MEDLINE | ID: mdl-20515060

RESUMEN

A series of dithiolane-substituted subphthalocyanines have been synthesized that can form self-assembled monolayers on gold surfaces, as confirmed by diverse characterization techniques.


Asunto(s)
Compuestos de Boro/síntesis química , Oro/química , Indoles/síntesis química , Membranas Artificiales , Compuestos de Boro/química , Cristalografía por Rayos X , Indoles/química , Isoindoles , Modelos Moleculares , Estructura Molecular , Tamaño de la Partícula , Estereoisomerismo , Propiedades de Superficie
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA