Your browser doesn't support javascript.
loading
Critical assessment of bioinformatics methods for the characterization of pathological repeat expansions with single-molecule sequencing data.
Chiara, Matteo; Zambelli, Federico; Picardi, Ernesto; Horner, David S; Pesole, Graziano.
Affiliation
  • Chiara M; Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy.
  • Zambelli F; Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy.
  • Picardi E; Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy.
  • Horner DS; Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy.
  • Pesole G; Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy.
Brief Bioinform ; 21(6): 1971-1986, 2020 12 01.
Article in En | MEDLINE | ID: mdl-31792498
A number of studies have reported the successful application of single-molecule sequencing technologies to the determination of the size and sequence of pathological expanded microsatellite repeats over the last 5 years. However, different custom bioinformatics pipelines were employed in each study, preventing meaningful comparisons and somewhat limiting the reproducibility of the results. In this review, we provide a brief summary of state-of-the-art methods for the characterization of expanded repeats alleles, along with a detailed comparison of bioinformatics tools for the determination of repeat length and sequence, using both real and simulated data. Our reanalysis of publicly available human genome sequencing data suggests a modest, but statistically significant, increase of the error rate of single-molecule sequencing technologies at genomic regions containing short tandem repeats. However, we observe that all the methods herein tested, irrespective of the strategy used for the analysis of the data (either based on the alignment or assembly of the reads), show high levels of sensitivity in both the detection of expanded tandem repeats and the estimation of the expansion size, suggesting that approaches based on single-molecule sequencing technologies are highly effective for the detection and quantification of tandem repeat expansions and contractions.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Molecular Sequence Data / Sequence Analysis, DNA / Microsatellite Repeats / Computational Biology / High-Throughput Nucleotide Sequencing Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2020 Type: Article Affiliation country: Italy

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Molecular Sequence Data / Sequence Analysis, DNA / Microsatellite Repeats / Computational Biology / High-Throughput Nucleotide Sequencing Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2020 Type: Article Affiliation country: Italy