Pesquisa | BVS Doenças Infecciosas e Parasitárias

1.

New insights into biologic interpretation of bioinformatic pipelines for fish eDNA metabarcoding: A case study in Pearl River estuary.

Li, Zhuoying; Zhao, Wencheng; Jiang, Yun; Wen, Yongjing; Li, Min; Liu, Li; Zou, Keshu.

J Environ Manage ; 368: 122136, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-39128344

RESUMO

Environmental DNA (eDNA) metabarcoding is an emerging tool for monitoring biological communities in aquatic ecosystems. The selection of bioinformatic pipelines significantly impacts the results of biodiversity assessments. However, there is currently no consensus on the appropriate bioinformatic pipelines for fish community analysis in eDNA metabarcoding. In this study, we compared three bioinformatic pipelines (Uparse, DADA2, and UNOISE3) using real and mock (constructed with 15/30 known fish) communities to investigate the differences in biological interpretation during the data analysis process in eDNA metabarcoding. Performance evaluation and diversity analyses revealed that the choice of bioinformatic pipeline could impact the biological results of metabarcoding experiments. Among the three pipelines, the operational taxonomic units (OTU)-based pipeline (Uparse) showed the best performance (sensitivity: 0.6250 ± 0.0166; compositional similarity: 0.4000 ± 0.0571), the highest richness (25-102) and minimal inter-group differences in alpha diversity. It suggested the OTU-based pipeline possessed superior capability in fish diversity monitoring compared to ASV/ZOTU-based pipeline. Additionally, the Bray-Curtis distance matrix achieved the highest discriminative effect in the PCoA (43.3%-53.89%) and inter-group analysis (P < 0.01), indicating it was better at distinguishing compositional differences or specific genera of fish community at different sampling sites than other distance matrices. These findings provide new insights into fish community monitoring through eDNA metabarcoding in estuarine environments.

Assuntos

Biodiversidade , Biologia Computacional , Código de Barras de DNA Taxonômico , DNA Ambiental , Estuários , Peixes , Rios , Animais , Peixes/genética , Biologia Computacional/métodos , DNA Ambiental/análise , Ecossistema , Monitoramento Ambiental/métodos

2.

The effect of metabarcoding 18S rRNA region choice on diversity of microeukaryotes including phytoplankton.

Bukin, Yuri S; Mikhailov, Ivan S; Petrova, Darya P; Galachyants, Yuri P; Zakharova, Yulia R; Likhoshway, Yelena V.

World J Microbiol Biotechnol ; 39(9): 229, 2023 Jun 21.

Artigo em Inglês | MEDLINE | ID: mdl-37341802

RESUMO

Metabarcoding using high throughput sequencing of amplicons of the 18S rRNA gene is one of the widely used methods for assessing the diversity of microeukaryotes in various ecosystems. We investigated the effectiveness of the V4 and V8-V9 regions of the 18S rRNA gene by comparing the results of metabarcoding microeukaryotic communities using the DADA2 (ASV), USEARCH-UNOISE3 (ZOTU), and USEARCH-UPARSE (OTU with 97% similarity) algorithms. Both regions showed similar levels of genetic variability and taxa identification accuracy. Richness for DADA2 datasets of both regions was lower than for UNOISE3 and UPARSE datasets, which is due to more accurate error correction in amplicons. Microeukaryotic communities (autotrophs and heterotrophs) structure identified using both regions showed a significant relationship with phytoplankton (autotrophs) communities structure based on microscopy in a seasonal freshwater sample series. The strongest relationship was found between the phytoplankton species and V8-V9 ASVs produced by DADA2.

Assuntos

Ecossistema , Fitoplâncton , Fitoplâncton/genética , RNA Ribossômico 18S/genética , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala

3.

hgtseq: A Standard Pipeline to Study Horizontal Gene Transfer.

Carpanzano, Simone; Santorsola, Mariangela; Lescai, Francesco.

Int J Mol Sci ; 23(23)2022 Nov 22.

Artigo em Inglês | MEDLINE | ID: mdl-36498841

RESUMO

Horizontal gene transfer (HGT) is well described in prokaryotes: it plays a crucial role in evolution, and has functional consequences in insects and plants. However, less is known about HGT in humans. Studies have reported bacterial integrations in cancer patients, and microbial sequences have been detected in data from well-known human sequencing projects. Few of the existing tools for investigating HGT are highly automated. Thanks to the adoption of Nextflow for life sciences workflows, and to the standards and best practices curated by communities such as nf-core, fully automated, portable, and scalable pipelines can now be developed. Here we present nf-core/hgtseq to facilitate the analysis of HGT from sequencing data in different organisms. We showcase its performance by analysing six exome datasets from five mammals. Hgtseq can be run seamlessly in any computing environment and accepts data generated by existing exome and whole-genome sequencing projects; this will enable researchers to expand their analyses into this area. Fundamental questions are still open about the mechanisms and the extent or role of horizontal gene transfer: by releasing hgtseq we provide a standardised tool which will enable a systematic investigation of this phenomenon, thus paving the way for a better understanding of HGT.

Assuntos

Evolução Molecular , Transferência Genética Horizontal , Animais , Humanos , Células Procarióticas , Bactérias/genética , Sequência de Bases , Filogenia , Mamíferos/genética

4.

Current advances in circular RNA detection and investigation methods: Are we running in circles?

Drula, RareÈ; Braicu, Cornelia; Neagoe, Ioana-Berindan.

Wiley Interdiscip Rev RNA ; 15(3): e1850, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38702943

RESUMO

Circular RNAs (circRNAs), characterized by their closed-loop structure, have emerged as significant transcriptomic regulators, with roles spanning from microRNA sponging to modulation of gene expression and potential peptide coding. The discovery and functional analysis of circRNAs have been propelled by advancements in both experimental and bioinformatics tools, yet the field grapples with challenges related to their detection, isoform diversity, and accurate quantification. This review navigates through the evolution of circRNA research methodologies, from early detection techniques to current state-of-the-art approaches that offer comprehensive insights into circRNA biology. We examine the limitations of existing methods, particularly the difficulty in differentiating circRNA isoforms and distinguishing circRNAs from their linear counterparts. A critical evaluation of various bioinformatics tools and novel experimental strategies is presented, emphasizing the need for integrated approaches to enhance our understanding and interpretation of circRNA functions. Our insights underscore the dynamic and rapidly advancing nature of circRNA research, highlighting the ongoing development of analytical frameworks designed to address the complexity of circRNAs and facilitate the assessment of their clinical utility. As such, this comprehensive overview aims to catalyze further advancements in circRNA study, fostering a deeper understanding of their roles in cellular processes and potential implications in disease. This article is categorized under: RNA Methods > RNA Nanotechnology RNA Methods > RNA Analyses in Cells RNA Methods > RNA Analyses In Vitro and In Silico.

Assuntos

RNA Circular , RNA Circular/metabolismo , RNA Circular/genética , Humanos , Biologia Computacional/métodos , Animais

5.

Fusion InPipe, an integrative pipeline for gene fusion detection from RNA-seq data in acute pediatric leukemia.

Vicente-Garcés, Clara; Maynou, Joan; Fernández, Guerau; Esperanza-Cebollada, Elena; Torrebadell, Montserrat; Català, Albert; Rives, Susana; Camós, Mireia; Vega-García, Nerea.

Front Mol Biosci ; 10: 1141310, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37363396

RESUMO

RNA sequencing (RNA-seq) is a reliable tool for detecting gene fusions in acute leukemia. Multiple bioinformatics pipelines have been developed to analyze RNA-seq data, but an agreed gold standard has not been established. This study aimed to compare the applicability of 5 fusion calling pipelines (Arriba, deFuse, CICERO, FusionCatcher, and STAR-Fusion), as well as to define and develop an integrative bioinformatics pipeline (Fusion InPipe) to detect clinically relevant gene fusions in acute pediatric leukemia. We analyzed RNA-seq data by each pipeline individually and by Fusion InPipe. Each algorithm individually called most of the fusions with similar sensitivity and precision. However, not all rearrangements were called, suggesting that choosing a single pipeline might cause missing important fusions. To improve this, we integrated the results of the five algorithms in just one pipeline, Fusion InPipe, comparing the output from the agreement of 5/5, 4/5, and 3/5 algorithms. The maximum sensitivity was achieved with the agreement of 3/5 algorithms, with a global sensitivity of 95%, achieving a 100% in patients' data. Furthermore, we showed the necessity of filtering steps to reduce the false positive detection rate. Here, we demonstrate that Fusion InPipe is an excellent tool for fusion detection in pediatric acute leukemia with the best performance when selecting those fusions called by at least 3/5 pipelines.

6.

nf-core/clipseq - a robust Nextflow pipeline for comprehensive CLIP data analysis.

West, Charlotte; Capitanchik, Charlotte; Cheshire, Chris; Luscombe, Nicholas M; Chakrabarti, Anob; Ule, Jernej.

Wellcome Open Res ; 8: 286, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37829674

RESUMO

Crosslinking and immunoprecipitation (CLIP) technologies have become a central component of the molecular biologists' toolkit to study protein-RNA interactions and thus to uncover core principles of RNA biology. There has been a proliferation of CLIP-based experimental protocols, as well as computational tools, especially for peak-calling. Consequently, there is an urgent need for a well-documented bioinformatic pipeline that enshrines the principles of robustness, reproducibility, scalability, portability and flexibility while embracing the diversity of experimental and computational CLIP tools. To address this, we present nf-core/clipseq - a robust Nextflow pipeline for quality control and analysis of CLIP sequencing data. It is part of the international nf-core community effort to develop and curate a best-practice, gold-standard set of pipelines for data analysis. The standards enabled by Nextflow and nf-core, including workflow management, version control, continuous integration and containerisation ensure that these key needs are met. Furthermore, multiple tools are implemented ( e.g. for peak-calling), alongside visualisation of quality control metrics to empower the user to make their own informed decisions based on their data. nf-core/clipseq remains under active development, with plans to incorporate newly released tools to ensure that pipeline remains up-to-date and relevant for the community. Engagement with users and developers is encouraged through the nf-core GitHub repository and Slack channel to promote collaboration. It is available at https://nf-co.re/clipseq.

7.

Single cell genomics as a transformative approach for aquaculture research and innovation.

Daniels, Rose Ruiz; Taylor, Richard S; Robledo, Diego; Macqueen, Daniel J.

Rev Aquac ; 15(4): 1618-1637, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-38505116

RESUMO

Single cell genomics encompasses a suite of rapidly maturing technologies that measure the molecular profiles of individual cells within target samples. These approaches provide a large up-step in biological information compared to long-established 'bulk' methods that profile the average molecular profiles of all cells in a sample, and have led to transformative advances in understanding of cellular biology, particularly in humans and model organisms. The application of single cell genomics is fast expanding to non-model taxa, including aquaculture species, where numerous research applications are underway with many more envisaged. In this review, we highlight the potential transformative applications of single cell genomics in aquaculture research, considering barriers and potential solutions to the broad uptake of these technologies. Focusing on single cell transcriptomics, we outline considerations for experimental design, including the essential requirement to obtain high quality cells/nuclei for sequencing in ectothermic aquatic species. We further outline data analysis and bioinformatics considerations, tailored to studies with the under-characterized genomes of aquaculture species, where our knowledge of cellular heterogeneity and cell marker genes is immature. Overall, this review offers a useful source of knowledge for researchers aiming to apply single cell genomics to address biological challenges faced by the global aquaculture sector though an improved understanding of cell biology.

8.

Perturbation on gut microbiota impedes the onset of obesity in high fat diet-induced mice.

Yu, Zhongjia; Yu, Xiang-Fang; Kerem, Goher; Ren, Pei-Gen.

Front Endocrinol (Lausanne) ; 13: 795371, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-36017311

RESUMO

High-calorie intake has become one of the most common causes of dietary obesity, which eventually develops into type 2 diabetes mellitus (T2DM). Microbiota, along with the length of the gastrointestinal tract, is related to metabolic disorders, but its shifts and following impact on metabolic disorders due to external perturbation are still unclear. To evaluate shifts of microbiota from the proximal to the distal intestine and their impact on metabolic disorders, we profiled jejunal and colonic microbiota with the perturbation using high salt (HS) and antibiotic-induced microbiota depletion (AIMD) in diet-induced obesity (DIO) mice and analyzed the association with parameters of both obesity and blood glucose. After ten weeks of feeding DIO mice with HS intake and AIMD, they failed to develop obesity. The DIO mice with HS intake had T2DM symptoms, whereas the AIMD DIO mice showed no significant difference in blood glucose parameters. We observed that the jejunal and colonic microbiota had shifted due to settled perturbation, and jejunal microbiota within a group were more dispersed than colonic microbiota. After further analyzing jejunal microbiota using quantified amplicon sequencing, we found that the absolute abundance of Colidextribacter (R = 0.695, p = 0.001) and Faecalibaculum (R = 0.631, p = 0.005) in the jejunum was positively correlated with the changes in BW and FBG levels. The predicted pathway of glucose and metabolism of other substances significantly changed between groups (p <0.05). We demonstrated that the onset of obesity and T2DM in DIO mice is impeded when the gut microbiota is perturbed; thus, this pathogenesis depends on the gut microbiota.

Assuntos

Diabetes Mellitus Tipo 2 , Microbioma Gastrointestinal , Doenças Metabólicas , Animais , Glicemia , Diabetes Mellitus Tipo 2/complicações , Dieta Hiperlipídica/efeitos adversos , Doenças Metabólicas/complicações , Camundongos , Camundongos Obesos , Obesidade/etiologia , Obesidade/metabolismo

9.

DnoisE: distance denoising by entropy. An open-source parallelizable alternative for denoising sequence datasets.

Antich, Adrià; Palacín, Creu; Turon, Xavier; Wangensteen, Owen S.

PeerJ ; 10: e12758, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35111399

RESUMO

DNA metabarcoding is broadly used in biodiversity studies encompassing a wide range of organisms. Erroneous amplicons, generated during amplification and sequencing procedures, constitute one of the major sources of concern for the interpretation of metabarcoding results. Several denoising programs have been implemented to detect and eliminate these errors. However, almost all denoising software currently available has been designed to process non-coding ribosomal sequences, most notably prokaryotic 16S rDNA. The growing number of metabarcoding studies using coding markers such as COI or RuBisCO demands a re-assessment and calibration of denoising algorithms. Here we present DnoisE, the first denoising program designed to detect erroneous reads and merge them with the correct ones using information from the natural variability (entropy) associated to each codon position in coding barcodes. We have developed an open-source software using a modified version of the UNOISE algorithm. DnoisE implements different merging procedures as options, and can incorporate codon entropy information either retrieved from the data or supplied by the user. In addition, the algorithm of DnoisE is parallelizable, greatly reducing runtimes on computer clusters. Our program also allows different input file formats, so it can be readily incorporated into existing metabarcoding pipelines.

Assuntos

Algoritmos , Software , Entropia , DNA Ribossômico , Códon

10.

Current Status of Next-Generation Sequencing Approaches for Candidate Gene Discovery in Familial Parkinson´s Disease.

Pillay, Nikita Simone; Ross, Owen A; Christoffels, Alan; Bardien, Soraya.

Front Genet ; 13: 781816, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35299952

RESUMO

Parkinson's disease is a neurodegenerative disorder with a heterogeneous genetic etiology. The advent of next-generation sequencing (NGS) technologies has aided novel gene discovery in several complex diseases, including PD. This Perspective article aimed to explore the use of NGS approaches to identify novel loci in familial PD, and to consider their current relevance. A total of 17 studies, spanning various populations (including Asian, Middle Eastern and European ancestry), were identified. All the studies used whole-exome sequencing (WES), with only one study incorporating both WES and whole-genome sequencing. It is worth noting how additional genetic analyses (including linkage analysis, haplotyping and homozygosity mapping) were incorporated to enhance the efficacy of some studies. Also, the use of consanguineous families and the specific search for de novo mutations appeared to facilitate the finding of causal mutations. Across the studies, similarities and differences in downstream analysis methods and the types of bioinformatic tools used, were observed. Although these studies serve as a practical guide for novel gene discovery in familial PD, these approaches have not significantly resolved the "missing heritability" of PD. We speculate that what is needed is the use of third-generation sequencing technologies to identify complex genomic rearrangements and new sequence variation, missed with existing methods. Additionally, the study of ancestrally diverse populations (in particular those of Black African ancestry), with the concomitant optimization and tailoring of sequencing and analytic workflows to these populations, are critical. Only then, will this pave the way for exciting new discoveries in the field.

11.

Benchmark of thirteen bioinformatic pipelines for metagenomic virus diagnostics using datasets from clinical samples.

de Vries, Jutte J C; Brown, Julianne R; Fischer, Nicole; Sidorov, Igor A; Morfopoulou, Sofia; Huang, Jiabin; Munnink, Bas B Oude; Sayiner, Arzu; Bulgurcu, Alihan; Rodriguez, Christophe; Gricourt, Guillaume; Keyaerts, Els; Beller, Leen; Bachofen, Claudia; Kubacki, Jakub; Samuel, Cordey; Florian, Laubscher; Dennis, Schmitz; Beer, Martin; Hoeper, Dirk; Huber, Michael; Kufner, Verena; Zaheri, Maryam; Lebrand, Aitana; Papa, Anna; van Boheemen, Sander; Kroes, Aloys C M; Breuer, Judith; Lopez-Labrador, F Xavier; Claas, Eric C J.

J Clin Virol ; 141: 104908, 2021 08.

Artigo em Inglês | MEDLINE | ID: mdl-34273858

RESUMO

INTRODUCTION: Metagenomic sequencing is increasingly being used in clinical settings for difficult to diagnose cases. The performance of viral metagenomic protocols relies to a large extent on the bioinformatic analysis. In this study, the European Society for Clinical Virology (ESCV) Network on NGS (ENNGS) initiated a benchmark of metagenomic pipelines currently used in clinical virological laboratories. METHODS: Metagenomic datasets from 13 clinical samples from patients with encephalitis or viral respiratory infections characterized by PCR were selected. The datasets were analyzed with 13 different pipelines currently used in virological diagnostic laboratories of participating ENNGS members. The pipelines and classification tools were: Centrifuge, DAMIAN, DIAMOND, DNASTAR, FEVIR, Genome Detective, Jovian, MetaMIC, MetaMix, One Codex, RIEMS, VirMet, and Taxonomer. Performance, characteristics, clinical use, and user-friendliness of these pipelines were analyzed. RESULTS: Overall, viral pathogens with high loads were detected by all the evaluated metagenomic pipelines. In contrast, lower abundance pathogens and mixed infections were only detected by 3/13 pipelines, namely DNASTAR, FEVIR, and MetaMix. Overall sensitivity ranged from 80% (10/13) to 100% (13/13 datasets). Overall positive predictive value ranged from 71-100%. The majority of the pipelines classified sequences based on nucleotide similarity (8/13), only a minority used amino acid similarity, and 6 of the 13 pipelines assembled sequences de novo. No clear differences in performance were detected that correlated with these classification approaches. Read counts of target viruses varied between the pipelines over a range of 2-3 log, indicating differences in limit of detection. CONCLUSION: A wide variety of viral metagenomic pipelines is currently used in the participating clinical diagnostic laboratories. Detection of low abundant viral pathogens and mixed infections remains a challenge, implicating the need for standardization and validation of metagenomic analysis for clinical diagnostic use. Future studies should address the selective effects due to the choice of different reference viral databases.

Assuntos

Biologia Computacional , Vírus , Benchmarking , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Metagenômica , Vírus/genética

12.

Evaluating Bioinformatic Pipeline Performance for Forensic Microbiome Analysis^*,,.

Kaszubinski, Sierra F; Pechal, Jennifer L; Schmidt, Carl J; Jordan, Heather R; Benbow, Mark E; Meek, Mariah H.

J Forensic Sci ; 65(2): 513-525, 2020 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-31657871

RESUMO

Microbial communities have potential evidential utility for forensic applications. However, bioinformatic analysis of high-throughput sequencing data varies widely among laboratories. These differences can potentially affect microbial community composition and downstream analyses. To illustrate the importance of standardizing methodology, we compared analyses of postmortem microbiome samples using several bioinformatic pipelines, varying minimum library size or minimum number of sequences per sample, and sample size. Using the same input sequence data, we found that three open-source bioinformatic pipelines, MG-RAST, mothur, and QIIME2, had significant differences in relative abundance, alpha-diversity, and beta-diversity, despite the same input data. Increasing minimum library size and sample size increased the number of low-abundant and infrequent taxa detected. Our results show that bioinformatic pipeline and parameter choice affect results in important ways. Given the growing potential application of forensic microbiology to the criminal justice system, continued research on standardizing computational methodology will be important for downstream applications.

Assuntos

Bactérias/genética , Biologia Computacional , Microbiota , Conjuntos de Dados como Assunto , Ciências Forenses , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Boca/microbiologia , RNA Ribossômico 16S , Reto/microbiologia

13.

Bioinformatic Tools and Genome Analysis of Citrus tristeza virus.

Ruiz-García, Ana Belén; Bester, Rachelle; Olmos, Antonio; Maree, Hans Jacob.

Methods Mol Biol ; 2015: 163-178, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-31222703

RESUMO

High-throughput sequencing (HTS) is a powerful tool employed by plant virologists for the detection of viruses, the characterization of virus genomes and the study of host-pathogen interactions. Virus detection has been an important application of this technology, which has resulted in the discovery of novel viruses or viral strains as well as for the detection of known viruses in a plant sample. Here we describe the entire process that needs to be considered for the genome analysis of Citrus tristeza virus (CTV) by HTS, including the experimental design, sample preparation, nucleic acid purification, HTS library construction, and bioinformatic analysis.

Assuntos

Closterovirus/genética , Biologia Computacional/métodos , Genoma Viral/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos

14.

Chromatin Immunoprecipitation and High-Throughput Sequencing (ChIP-Seq): Tips and Tricks Regarding the Laboratory Protocol and Initial Downstream Data Analysis.

Patten, Darren K; Corleone, Giacomo; Magnani, Luca.

Methods Mol Biol ; 1767: 271-288, 2018.

Artigo em Inglês | MEDLINE | ID: mdl-29524141

RESUMO

Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become an essential tool for epigenetic scientists. ChIP-seq is used to map protein-DNA interactions and epigenetic marks such as histone modifications at the genome-wide level. Here we describe a complete ChIP-seq laboratory protocol (tailored toward processing tissue samples as well as cell lines) and the bioinformatic pipelines utilized for handling raw sequencing files through to peak calling.

Assuntos

Imunoprecipitação da Cromatina/métodos , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Animais , Biblioteca Gênica , Humanos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA