Magic-BLAST, an accurate RNA-seq aligner for long and short reads.
BMC Bioinformatics; 20(1): 405, 2019 Jul 25.
Article | En | MEDLINE | ID: mdl-31345161
BACKGROUND: Next-generation sequencing technologies can produce tens of millions of reads, often paired-end, from transcripts or genomes. But few programs can align RNA on the genome and accurately discover introns, especially with long reads. We introduce Magic-BLAST, a new aligner based on ideas from the Magic pipeline.

RESULTS: Magic-BLAST uses innovative techniques that include the optimization of a spliced alignment score and selective masking during seed selection. We evaluate the performance of Magic-BLAST to accurately map short or long sequences and its ability to discover introns on real RNA-seq data sets from PacBio, Roche and Illumina runs, and on six benchmarks, and compare it to other popular aligners. Additionally, we look at alignments of human idealized RefSeq mRNA sequences perfectly matching the genome.

CONCLUSIONS: We show that Magic-BLAST is the best at intron discovery over a wide range of conditions and the best at mapping reads longer than 250 bases, from any platform. It is versatile and robust to high levels of mismatches or extreme base composition, and reasonably fast. It can align reads to a BLAST database or a FASTA file. It can accept a FASTQ file as input or automatically retrieve an accession from the SRA repository at the NCBI.
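The input and reference combinations named in the conclusions (FASTQ file or SRA accession as reads, BLAST database or FASTA file as reference) can be illustrated with a small Python sketch that assembles a command line. The helper `build_magicblast_command` and all file names and the accession are hypothetical. The flags `-query`, `-infmt fastq`, `-sra`, `-db` and `-subject` are given as recalled from the NCBI Magic-BLAST documentation, not from this abstract, and should be checked against `magicblast -help` for the installed version.

```python
from typing import List, Optional


def build_magicblast_command(
    reference: str,
    reference_is_blast_db: bool = True,
    fastq_path: Optional[str] = None,
    sra_accession: Optional[str] = None,
    executable: str = "magicblast",
) -> List[str]:
    """Assemble an argument list for one Magic-BLAST run (hypothetical helper).

    Exactly one read source must be given: a local FASTQ file or an SRA
    accession. The reference is either a BLAST database name or a FASTA file.
    Flag names are assumptions to verify with `magicblast -help`.
    """
    if (fastq_path is None) == (sra_accession is None):
        raise ValueError("provide exactly one of fastq_path or sra_accession")

    cmd = [executable]

    if sra_accession is not None:
        # Reads are retrieved from the SRA repository by accession.
        cmd += ["-sra", sra_accession]
    else:
        # Reads come from a local FASTQ file.
        cmd += ["-query", fastq_path, "-infmt", "fastq"]

    # Reference: BLAST database (-db) or plain FASTA file (-subject).
    cmd += ["-db" if reference_is_blast_db else "-subject", reference]
    return cmd


if __name__ == "__main__":
    # Placeholder accession and reference names, for illustration only.
    print(" ".join(build_magicblast_command("my_reference_db",
                                            sra_accession="SRR0000000")))
    print(" ".join(build_magicblast_command("genome.fa",
                                            reference_is_blast_db=False,
                                            fastq_path="reads.fastq")))
```

The function returns a list so that it can be passed directly to `subprocess.run` without shell quoting; it does not run the aligner itself.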
Keywords
Full text: 1
Collection: 01-international
Database: MEDLINE
Main subject: Software / RNA / Sequence Alignment / Sequence Analysis, RNA
Study type: Prognostic_studies
Limits: Humans
Language: En
Journal: BMC Bioinformatics
Journal subject: MEDICAL INFORMATICS
Year: 2019
Document type: Article
Affiliation country: United States
Publication country: United Kingdom