Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Más filtros

Banco de datos
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
Brief Bioinform ; 24(2)2023 03 19.
Artículo en Inglés | MEDLINE | ID: mdl-36869850

RESUMEN

Alignment is the cornerstone of many long-read pipelines and plays an essential role in resolving structural variants (SVs). However, forced alignments of SVs embedded in long reads, inflexibility of integrating novel SVs models and computational inefficiency remain problems. Here, we investigate the feasibility of resolving long-read SVs with alignment-free algorithms. We ask: (1) Is it possible to resolve long-read SVs with alignment-free approaches? and (2) Does it provide an advantage over existing approaches? To this end, we implemented the framework named Linear, which can flexibly integrate alignment-free algorithms such as the generative model for long-read SV detection. Furthermore, Linear addresses the problem of compatibility of alignment-free approaches with existing software. It takes as input long reads and outputs standardized results existing software can directly process. We conducted large-scale assessments in this work and the results show that the sensitivity, and flexibility of Linear outperform alignment-based pipelines. Moreover, the computational efficiency is orders of magnitude faster.


Asunto(s)
Genoma Humano , Programas Informáticos , Humanos , Algoritmos , Análisis de Secuencia , Modelos Estadísticos , Análisis de Secuencia de ADN/métodos , Secuenciación de Nucleótidos de Alto Rendimiento
2.
Bioinformatics ; 40(2)2024 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-38269626

RESUMEN

MOTIVATION: The minimizer concept is a data structure for sequence sketching. The standard canonical minimizer selects a subset of k-mers from the given DNA sequence by comparing the forward and reverse k-mers in a window simultaneously according to a predefined selection scheme. It is widely employed by sequence analysis such as read mapping and assembly. k-mer density, k-mer repetitiveness (e.g. k-mer bias), and computational efficiency are three critical measurements for minimizer selection schemes. However, there exist trade-offs between kinds of minimizer variants. Generic, effective, and efficient are always the requirements for high-performance minimizer algorithms. RESULTS: We propose a simple minimizer operator as a refinement of the standard canonical minimizer. It takes only a few operations to compute. However, it can improve the k-mer repetitiveness, especially for the lexicographic order. It applies to other selection schemes of total orders (e.g. random orders). Moreover, it is computationally efficient and the density is close to that of the standard minimizer. The refined minimizer may benefit high-performance applications like binning and read mapping. AVAILABILITY AND IMPLEMENTATION: The source code of the benchmark in this work is available at the github repository https://github.com/xp3i4/mini_benchmark.


Asunto(s)
Algoritmos , Programas Informáticos , Análisis de Secuencia de ADN , Secuenciación de Nucleótidos de Alto Rendimiento
3.
J Chem Inf Model ; 56(6): 1175-83, 2016 06 27.
Artículo en Inglés | MEDLINE | ID: mdl-27187084

RESUMEN

PharmMapper is a web server for drug target identification by reversed pharmacophore matching the query compound against an annotated pharmacophore model database, which provides a computational polypharmacology prediction approach for drug repurposing and side effect risk evaluation. But due to the inherent nondiscriminative feature of the simple fit scores used for prediction results ranking, the signal/noise ratio of the prediction results is high, posing a challenge for predictive reliability. In this paper, we improved the predictive accuracy of PharmMapper by generating a ligand-target pairwise fit score matrix from profiling all the annotated pharmacophore models against corresponding ligands in the original complex structures that were used to extract these pharmacophore models. The matrix reflects the noise baseline of fit score distribution of the background database, thus enabling estimation of the probability of finding a given target randomly with the calculated ligand-pharmacophore fit score. Two retrospective tests were performed which confirmed that the probability-based ranking score outperformed the simple fit score in terms of identification of both known drug targets and adverse drug reaction related off-targets.


Asunto(s)
Biología Computacional/métodos , Polifarmacología , Sitios de Unión , Minería de Datos , Bases de Datos Farmacéuticas , Internet , Ligandos , Terapia Molecular Dirigida , Programas Informáticos
4.
Genome Biol ; 25(1): 155, 2024 06 13.
Artículo en Inglés | MEDLINE | ID: mdl-38872200

RESUMEN

Advances in sequencing technology have facilitated population-scale long-read structural variant (SV) detection. Arguably, one of the main challenges in population-scale analysis is developing effective computational pipelines. Here, we present a new filter-based pipeline for population-scale long-read SV detection. It better captures SV signals at an early stage than conventional assembly-based or alignment-based pipelines. Assessments in this work suggest that the filter-based pipeline helps better resolve intra-read rearrangements. Moreover, it is also more computationally efficient than conventional pipelines and thus may facilitate population-scale long-read applications.


Asunto(s)
Programas Informáticos , Humanos , Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN/métodos , Algoritmos , Variación Estructural del Genoma
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA