Results 1 - 4 of 4
1.
BMC Genomics ; 22(1): 822, 2021 Nov 14.
Article in English | MEDLINE | ID: mdl-34773979

ABSTRACT

BACKGROUND: We benchmarked sequencing technology and assembly strategies for short-read, long-read, and hybrid assemblers with respect to correctness, contiguity, and completeness of assemblies in genomes of Francisella tularensis. Benchmarking allowed in-depth analyses of genomic structures of the Francisella pathogenicity islands and insertion sequences. Five major high-throughput sequencing technologies were applied, including next-generation "short-read" and third-generation "long-read" sequencing methods. RESULTS: We focused on short-read assemblers, hybrid assemblers, and analysis of the genomic structure with particular emphasis on insertion sequences and the Francisella pathogenicity island. Among eight short-read assembly methods, the A5-miseq pipeline performed best for MiSeq data, Mira for Ion Torrent data, and ABySS for HiSeq data. Two approaches were applied to benchmark long-read and hybrid assembly strategies: long-read-first assembly followed by correction with short reads (Canu/Pilon, Flye/Pilon) and short-read-first assembly along with scaffolding based on long reads (Unicycler, SPAdes). Hybrid assembly can resolve large repetitive regions best with a "long-read first" approach. CONCLUSIONS: Genomic structures of the Francisella pathogenicity islands frequently showed misassembly. Insertion sequences (IS) could be used to perform an evolutionary conservation analysis. A phylogenetic structure of insertion sequences and the evolution within the clades elucidated the clade structure of the highly conserved F. tularensis.
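The abstract compares assemblies by contiguity without naming a metric. A widely used one is N50: the contig length at which the longest contigs together cover at least half of the assembly. The sketch below is only an illustration of that general metric, not the authors' benchmarking code; the function name `n50` and the contig lengths are hypothetical.

```python
def n50(contig_lengths):
    """Return the N50 of a list of contig lengths (0 for an empty list)."""
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        # The first contig that pushes the running sum past half is the N50.
        if running >= half_total:
            return length
    return 0


# Hypothetical contig lengths (bp) for two assemblies of the same genome;
# these are not values from the study.
fragmented = [400_000, 300_000, 300_000, 250_000, 250_000, 200_000, 200_000]
contiguous = [1_800_000, 100_000]

print(n50(fragmented), n50(contiguous))
```

A higher N50 for the same total length indicates fewer, longer contigs. It says nothing about correctness, so a misassembled region can still inflate it.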


Subject(s)
Francisella tularensis, Bacterial Genome, DNA Transposable Elements, Francisella tularensis/genetics, Genomics, High-Throughput Nucleotide Sequencing, Phylogeny, DNA Sequence Analysis
2.
Front Mol Biosci ; 9: 944639, 2022.
Article in English | MEDLINE | ID: mdl-36545510

ABSTRACT

It has been shown that the best coverage of the HepG2 cell line transcriptome encoded by genes of a single chromosome, chromosome 18, is achieved by a combination of two sequencing platforms, Illumina RNA-Seq and Oxford Nanopore Technologies (ONT), using cut-off levels of FPKM > 0 and TPM > 0, respectively. In this study, we investigated the extent to which the combination of these transcriptomic analysis methods makes it possible to achieve a high coverage of the transcriptome encoded by the genes of other human chromosomes. A comparative analysis of transcriptome coverage for various types of biological material was carried out, and the HepG2 cell line transcriptome was compared with the transcriptome of liver tissue cells. In addition, the contribution of variability in the coverage of expressed genes in human transcriptomes to the creation of a draft human transcriptome was evaluated. For human liver tissues, ONT makes an extremely insignificant contribution to the overall coverage of the transcriptome. Thus, to ensure maximum coverage of the liver tissue transcriptome, it is sufficient to apply only one technology: Illumina RNA-Seq (FPKM > 0).
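The question of how much a second platform adds on top of the first can be framed as a set difference over detected genes. The following is a minimal sketch under that assumption. The function `added_fraction`, the sample names, and the gene IDs are all hypothetical, and the detected-gene sets are assumed to have been thresholded already (FPKM > 0 for Illumina, TPM > 0 for ONT).

```python
def added_fraction(primary_detected, secondary_detected, gene_set):
    """Fraction of gene_set found only by the secondary platform."""
    if not gene_set:
        return 0.0
    gained = (secondary_detected - primary_detected) & gene_set
    return len(gained) / len(gene_set)


# Hypothetical gene IDs and detection calls; not data from the study.
genes = {"g1", "g2", "g3", "g4", "g5"}
samples = {
    "cell_line": {"illumina": {"g1", "g2", "g4"}, "ont": {"g1", "g3", "g5"}},
    "tissue": {"illumina": {"g1", "g2", "g3", "g4"}, "ont": {"g1", "g2"}},
}

gains = {
    name: added_fraction(calls["illumina"], calls["ont"], genes)
    for name, calls in samples.items()
}
print(gains)
```

A gain near zero for a sample type would correspond to the situation the abstract reports for liver tissue, where the second platform contributes almost nothing beyond the first.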

3.
Front Genet ; 12: 674534, 2021.
Article in English | MEDLINE | ID: mdl-34194472

ABSTRACT

The cutoff level applied in sequencing analysis varies according to the sequencing technology, sample type, and study purpose, which can strongly affect the coverage and reliability of the data obtained. In this study, we aimed to determine the optimal combination of parameters for reliable RNA transcriptome data analysis. Toward this end, we compared the results obtained from different transcriptome analysis platforms (quantitative polymerase chain reaction, Illumina RNASeq, and Oxford Nanopore Technologies MinION) for the transcriptome encoded by human chromosome 18 (Chr 18) using the same sample types (HepG2 cells and liver tissue). A total of 275 protein-coding genes encoded by Chr 18 was taken as the gene set for evaluation. The combination of Illumina RNASeq and MinION nanopore technologies enabled the detection of at least one transcript for each protein-coding gene encoded by Chr 18. This combination also reduced the probability of false-positive detection of low-copy transcripts due to the simultaneous confirmation of the presence of a transcript by the two fundamentally different technologies: short reads essential for reliable detection (Illumina RNASeq) and long-read sequencing data (MinION). The combination of these technologies achieved complete coverage of all 275 protein-coding genes on Chr 18, identifying transcripts with non-zero expression levels. This approach can help distinguish biological from technical reasons for the absence of mRNA detection for a given gene in transcriptomics.
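The two ideas in this abstract, coverage by either platform and confirmation by both, map onto a set union and a set intersection after thresholding. The sketch below assumes per-gene expression tables keyed by gene ID and a strict "greater than zero" cutoff. The function `coverage_report` and the toy values are hypothetical and are not taken from the paper.

```python
def coverage_report(illumina_fpkm, ont_tpm, gene_set):
    """Split gene_set by which platform(s) report non-zero expression."""
    short = {g for g in gene_set if illumina_fpkm.get(g, 0.0) > 0}
    long_ = {g for g in gene_set if ont_tpm.get(g, 0.0) > 0}
    return {
        "either": short | long_,             # detected by at least one platform
        "both": short & long_,               # confirmed by both platforms
        "neither": gene_set - (short | long_),
    }


# Hypothetical gene IDs and expression values; not data from the study.
gene_set = {"A", "B", "C", "D"}
illumina_fpkm = {"A": 2.0, "B": 0.3, "C": 0.0}
ont_tpm = {"A": 1.5, "C": 4.0, "D": 0.0}

report = coverage_report(illumina_fpkm, ont_tpm, gene_set)
coverage = len(report["either"]) / len(gene_set)
print(sorted(report["both"]), sorted(report["neither"]), coverage)
```

Genes in "both" have independent support from two technologies. Genes in "neither" are the ones for which a biological absence and a technical miss still need to be told apart.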

4.
Mol Neurodegener ; 13(1): 46, 2018 Aug 21.
Article in English | MEDLINE | ID: mdl-30126445

ABSTRACT

BACKGROUND: Many neurodegenerative diseases are caused by nucleotide repeat expansions, but most expansions, like the C9orf72 'GGGGCC' (G4C2) repeat that causes approximately 5-7% of all amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) cases, are too long to sequence using short-read sequencing technologies. It is unclear whether long-read sequencing technologies can traverse these long, challenging repeat expansions. Here, we demonstrate that two long-read sequencing technologies, Pacific Biosciences' (PacBio) and Oxford Nanopore Technologies' (ONT), can sequence through disease-causing repeats cloned into plasmids, including the FTD/ALS-causing G4C2 repeat expansion. We also report the first long-read sequencing data characterizing the C9orf72 G4C2 repeat expansion at the nucleotide level in two symptomatic expansion carriers using PacBio whole-genome sequencing and a no-amplification (No-Amp) targeted approach based on CRISPR/Cas9. RESULTS: Both the PacBio and ONT platforms successfully sequenced through the repeat expansions in plasmids. Throughput on the MinION was a challenge for whole-genome sequencing; we were unable to attain reads covering the human C9orf72 repeat expansion using 15 flow cells. We obtained 8× coverage across the C9orf72 locus using the PacBio Sequel, accurately reporting the unexpanded allele at eight repeats, and reading through the entire expansion with 1324 repeats (7941 nucleotides). Using the No-Amp targeted approach, we attained > 800× coverage and were able to identify the unexpanded allele, closely estimate expansion size, and assess nucleotide content in a single experiment. We estimate the individual's repeat region was > 99% G4C2 content, though we cannot rule out small interruptions. 
CONCLUSIONS: Our findings indicate that long-read sequencing is well suited to characterizing known repeat expansions, and for discovering new disease-causing, disease-modifying, or risk-modifying repeat expansions that have gone undetected with conventional short-read sequencing. The PacBio No-Amp targeted approach may have future potential in clinical and genetic counseling environments. Larger and deeper long-read sequencing studies in C9orf72 expansion carriers will be important to determine heterogeneity and whether the repeats are interrupted by non-G4C2 content, potentially mitigating or modifying disease course or age of onset, as interruptions are known to do in other repeat-expansion disorders. These results have broad implications across all diseases where the genetic etiology remains unclear.
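Counting repeat units in a read and estimating how much of a region consists of G4C2 can be illustrated with plain string operations. This is a simplified sketch, not the authors' pipeline: real long reads contain sequencing errors that exact matching ignores, and the function names and toy sequences below are hypothetical.

```python
import re


def longest_repeat_run(sequence, unit="GGGGCC"):
    """Return (copies, start) of the longest uninterrupted tandem run of unit."""
    best_copies, best_start = 0, -1
    # Leftmost, non-overlapping scan for one or more back-to-back copies.
    for match in re.finditer(f"(?:{re.escape(unit)})+", sequence):
        copies = len(match.group()) // len(unit)
        if copies > best_copies:
            best_copies, best_start = copies, match.start()
    return best_copies, best_start


def unit_content(region, unit="GGGGCC"):
    """Fraction of region bases covered by non-overlapping copies of unit."""
    if not region:
        return 0.0
    return region.count(unit) * len(unit) / len(region)


# Toy sequences for illustration only; not reads from the study.
read = "ACGT" + "GGGGCC" * 8 + "TTGA" + "GGGGCC" * 3
expanded = "GGGGCC" * 5 + "GGGGTC" + "GGGGCC" * 4  # one interrupted unit

print(longest_repeat_run(read))
print(unit_content(expanded))
```

In the toy `expanded` region, a single substituted base breaks the run into two shorter runs and lowers the unit content below 1.0. Small interruptions of this kind are what the abstract says cannot be ruled out.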


Subject(s)
C9orf72 Protein/genetics, DNA Repeat Expansion/genetics, Frontotemporal Dementia/genetics, DNA Sequence Analysis/methods, Adult, Aged, Female, Humans, Male, Nucleic Acid Amplification Techniques/methods