Results 1 - 20 of 91
1.
Bioinformatics ; 40(4)2024 Mar 29.
Article in English | MEDLINE | ID: mdl-38561180

ABSTRACT

SUMMARY: Sequence technology advancements have led to an exponential increase in bacterial genomes, necessitating robust taxonomic classification methods. The Percentage Of Conserved Proteins (POCP), proposed initially by Qin et al. (2014), is a valuable metric for assessing prokaryote genus boundaries. Here, I introduce a computational pipeline for automated POCP calculation, aiming to enhance reproducibility and ease of use in taxonomic studies. AVAILABILITY AND IMPLEMENTATION: The POCP-nf pipeline uses DIAMOND for faster protein alignments, achieving similar sensitivity to BLASTP. The pipeline is implemented in Nextflow with Conda and Docker support and is freely available on GitHub under https://github.com/hoelzer/pocp. The open-source code can be easily adapted for various prokaryotic genome and protein datasets. Detailed documentation and usage instructions are provided in the repository.
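The POCP metric mentioned above can be sketched in a few lines. This is a minimal illustration, not the POCP-nf implementation: the `Hit` record and function names are hypothetical, and the thresholds (E-value below 1e-5, identity above 40%, alignable region above 50% of the query length) are taken from the definition by Qin et al. (2014), where POCP = (C1 + C2) / (T1 + T2) × 100, with C the conserved proteins and T the total proteins of each genome.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One protein alignment hit (hypothetical record, e.g. parsed from tabular output)."""
    query_id: str
    evalue: float
    identity_pct: float
    aligned_len: int
    query_len: int


def conserved_queries(hits, max_evalue=1e-5, min_identity=40.0, min_coverage=0.5):
    """Return the set of query proteins with at least one hit passing all thresholds."""
    conserved = set()
    for h in hits:
        coverage = h.aligned_len / h.query_len
        if h.evalue < max_evalue and h.identity_pct > min_identity and coverage > min_coverage:
            conserved.add(h.query_id)
    return conserved


def pocp(c1, c2, t1, t2):
    """POCP = (C1 + C2) / (T1 + T2) * 100 for a pair of genomes."""
    return 100.0 * (c1 + c2) / (t1 + t2)


if __name__ == "__main__":
    hits = [
        Hit("p1", 1e-30, 85.0, 300, 320),  # passes all thresholds
        Hit("p2", 1e-3, 90.0, 200, 210),   # fails the E-value threshold
        Hit("p3", 1e-20, 35.0, 150, 160),  # fails the identity threshold
    ]
    c1 = len(conserved_queries(hits))
    print(c1, pocp(c1, 2, 3, 4))
```

The alignment runs in both directions (genome 1 against genome 2 and the reverse), so `conserved_queries` would be called once per direction to obtain C1 and C2.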


Subject(s)
Prokaryotic Cells , Software , Reproducibility of Results , Genome, Bacterial
2.
BMC Genomics ; 25(1): 528, 2024 May 28.
Article in English | MEDLINE | ID: mdl-38807060

ABSTRACT

BACKGROUND: Direct RNA sequencing (dRNA-seq) on the Oxford Nanopore Technologies (ONT) platforms can produce reads covering up to full-length gene transcripts, while containing decipherable information about RNA base modifications and poly-A tail lengths. Although many published studies have been expanding the potential of dRNA-seq, its sequencing accuracy and error patterns remain understudied. RESULTS: We present the first comprehensive evaluation of sequencing accuracy and characterisation of systematic errors in dRNA-seq data from diverse organisms and synthetic in vitro transcribed RNAs. We found that for sequencing kits SQK-RNA001 and SQK-RNA002, the median read accuracy ranged from 87% to 92% across species, and deletions significantly outnumbered mismatches and insertions. Due to their high abundance in the transcriptome, heteropolymers and short homopolymers were the major contributors to the overall sequencing errors. We also observed systematic biases across all species at the levels of single nucleotides and motifs. In general, cytosine/uracil-rich regions were more likely to be erroneous than guanines and adenines. By examining raw signal data, we identified the underlying signal-level features potentially associated with the error patterns and their dependency on sequence contexts. While read quality scores can be used to approximate error rates at base and read levels, failure to detect DNA adapters may be a source of errors and data loss. By comparing distinct basecallers, we reason that some sequencing errors are attributable to signal insufficiency rather than algorithmic (basecalling) artefacts. Lastly, we generated dRNA-seq data using the latest SQK-RNA004 sequencing kit released at the end of 2023 and found that although the overall read accuracy increased, the systematic errors remain largely identical compared to the previous kits. 
CONCLUSIONS: As the first systematic investigation of dRNA-seq errors, this study offers a comprehensive overview of reproducible error patterns across diverse datasets, identifies potential signal-level insufficiency, and lays the foundation for error correction methods.
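The relationship between quality scores and error rates mentioned above can be sketched as follows. The Phred conversion (error probability = 10^(-Q/10)) is the standard definition; the read-accuracy formula is one common convention (matches divided by all aligned columns) and is an assumption here, not necessarily the definition used in the study. All function names are illustrative.

```python
def phred_to_error_prob(q):
    """Convert a Phred quality score to an error probability: p = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def expected_error_rate(quals):
    """Mean per-base error probability implied by a read's quality scores."""
    if not quals:
        raise ValueError("empty quality list")
    return sum(phred_to_error_prob(q) for q in quals) / len(quals)


def read_accuracy(matches, mismatches, insertions, deletions):
    """Observed accuracy under an assumed convention: matches over all alignment columns."""
    total = matches + mismatches + insertions + deletions
    if total == 0:
        raise ValueError("no aligned columns")
    return matches / total


if __name__ == "__main__":
    print(expected_error_rate([10, 10, 20, 20]))
    print(read_accuracy(matches=90, mismatches=3, insertions=2, deletions=5))
```

Comparing `1 - expected_error_rate(...)` with `read_accuracy(...)` for the same read is one simple way to check how well quality scores approximate the observed error rate.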


Subject(s)
Nanopore Sequencing , Sequence Analysis, RNA , Sequence Analysis, RNA/methods , Nanopore Sequencing/methods , Nanopores , Humans , Animals , RNA/genetics , High-Throughput Nucleotide Sequencing/methods
3.
J Virol ; 97(10): e0020523, 2023 10 31.
Article in English | MEDLINE | ID: mdl-37728614

ABSTRACT

IMPORTANCE: A common hypothesis holds that bats (order Chiroptera) are outstanding reservoirs for zoonotic viruses because of a special antiviral interferon (IFN) system. However, functional studies about key components of the bat IFN system are rare. RIG-I is a cellular sensor for viral RNA signatures that activates the antiviral signaling chain to induce IFN. We cloned and functionally characterized RIG-I genes from two species of the suborders Yangochiroptera and Yinpterochiroptera. The bat RIG-Is were conserved in their sequence and domain organization, and similar to human RIG-I in (i) mediating virus- and IFN-activated gene expression, (ii) antiviral signaling, (iii) temperature dependence, and (iv) recognition of RNA ligands. Moreover, RIG-I of Rousettus aegyptiacus (suborder Yinpterochiroptera) and of humans were found to recognize SARS-CoV-2 infection. Thus, members of both bat suborders encode RIG-Is that are comparable to their human counterpart. The ability of bats to harbor zoonotic viruses therefore seems due to other features.


Subject(s)
Chiroptera , Receptors, Retinoic Acid , SARS-CoV-2 , Animals , Humans , Chiroptera/metabolism , COVID-19 , Receptors, Immunologic/chemistry , Receptors, Immunologic/genetics , Receptors, Immunologic/metabolism , SARS-CoV-2/physiology , Viruses , Receptors, Retinoic Acid/chemistry , Receptors, Retinoic Acid/genetics , Receptors, Retinoic Acid/metabolism
4.
PLoS Comput Biol ; 19(8): e1011422, 2023 08.
Article in English | MEDLINE | ID: mdl-37639475

ABSTRACT

The study of viral communities has revealed the enormous diversity and impact these biological entities have on various ecosystems. These observations have sparked widespread interest in developing computational strategies that support the comprehensive characterisation of viral communities based on sequencing data. Here we introduce VIRify, a new computational pipeline designed to provide a user-friendly and accurate functional and taxonomic characterisation of viral communities. VIRify identifies viral contigs and prophages from metagenomic assemblies and annotates them using a collection of viral profile hidden Markov models (HMMs). These include our manually-curated profile HMMs, which serve as specific taxonomic markers for a wide range of prokaryotic and eukaryotic viral taxa and are thus used to reliably classify viral contigs. We tested VIRify on assemblies from two microbial mock communities, a large metagenomics study, and a collection of publicly available viral genomic sequences from the human gut. The results showed that VIRify could identify sequences from both prokaryotic and eukaryotic viruses, and provided taxonomic classifications from the genus to the family rank with an average accuracy of 86.6%. In addition, VIRify allowed the detection and taxonomic classification of a range of prokaryotic and eukaryotic viruses present in 243 marine metagenomic assemblies. Finally, the use of VIRify led to a large expansion in the number of taxonomically classified human gut viral sequences and the improvement of outdated and shallow taxonomic classifications. Overall, we demonstrate that VIRify is a novel and powerful resource that offers an enhanced capability to detect a broad range of viral contigs and taxonomically classify them.


Subject(s)
Eukaryota , Microbiota , Humans , Eukaryotic Cells , Genome, Viral/genetics , Metagenome/genetics
5.
BMC Genomics ; 24(1): 548, 2023 Sep 16.
Article in English | MEDLINE | ID: mdl-37715127

ABSTRACT

Mycoplasmopsis (M.) bovis, the agent of mastitis, pneumonia, and arthritis in cattle, harbors a small genome of approximately 1 Mbp. Combining data from Illumina and Nanopore technologies, we sequenced and assembled the genomes of 35 European strains and isolate DL422_88 from Cuba. While the high proportion of repetitive structures in M. bovis genomes represents a particular challenge, implementation of our own pipeline Mycovista (available on GitHub: www.github.com/sandraTriebel/mycovista) in a hybrid approach enabled contiguous assembly of the genomes and, consequently, improved annotation rates considerably. To put our European strain panel in a global context, we analyzed the new genome sequences together with 175 genome assemblies from public databases. Construction of a phylogenetic tree based on core genes of these 219 strains revealed a clustering pattern according to geographical origin, with European isolates positioned on clades 4 and 5. Genomic data allowing assignment of strains to tissue specificity or certain disease manifestations could not be identified. Seven strains isolated from cattle with systemic circular condition (SCC), still a largely unknown manifestation of M. bovis disease, were located on both clades 4 and 5. Pairwise association analysis revealed 108 genomic elements associated with a particular clade of the phylogenetic tree. Among these hits, 25 genes are functionally annotated and could be linked to an M. bovis protein, e.g. various proteases and nucleases, as well as ten variable surface lipoproteins (Vsps) and other surface proteins. These clade-specific genes could serve as useful markers in epidemiological and clinical surveys.


Subject(s)
Genomics , Mycoplasma bovis , Female , Animals , Cattle , Phylogeny , Cluster Analysis , Databases, Factual , Endonucleases , Mycoplasma bovis/genetics
6.
BMC Genomics ; 24(1): 288, 2023 May 29.
Article in English | MEDLINE | ID: mdl-37248517

ABSTRACT

BACKGROUND: Chlamydia (C.) psittaci, the causative agent of avian chlamydiosis and human psittacosis, is a genetically heterogeneous species. Its broad host range includes parrots and many other birds, but occasionally also humans (via zoonotic transmission), ruminants, horses, swine and rodents. To assess whether there are genetic markers associated with host tropism, we comparatively analyzed whole-genome sequences of 61 C. psittaci strains, 47 of which carry a 7.6-kbp plasmid. RESULTS: Following clean-up, reassembly and polishing of poorly assembled genomes from public databases, phylogenetic analyses using C. psittaci whole-genome sequence alignment revealed four major clades within this species. Clade 1 represents the most recent lineage comprising 40/61 strains and contains 9/10 of the psittacine strains, including type strain 6BC, and 10/13 of human isolates. Strains from different non-psittacine hosts clustered in Clades 2-4. We found that clade membership correlates with typing schemes based on SNP types, ompA genotypes, multilocus sequence types as well as plasticity zone (PZ) structure and host preference. Genome analysis also revealed that i) sequence variation in the major outer membrane porin MOMP can result in 3D structural changes of immunogenic domains, ii) past host change of Clade 3 and 4 strains could be associated with loss of MAC/perforin in the PZ, rather than the large cytotoxin, iii) the distinct phylogeny of atypical strains (Clades 3 and 4) is also reflected in their repertoire of inclusion proteins (Inc family) and polymorphic membrane proteins (Pmps). CONCLUSIONS: Our study identified a number of genomic features that can be correlated with the phylogeny and host preference of C. psittaci strains. Our data show that intra-species genomic divergence is associated with past host change and includes deletions in the plasticity zone, structural variations in immunogenic domains and distinct repertoires of virulence factors.


Subject(s)
Chlamydia , Chlamydophila psittaci , Psittacosis , Animals , Humans , Horses , Swine , Chlamydophila psittaci/genetics , Psittacosis/veterinary , Phylogeny , Chlamydia/genetics , Birds , Genomics
7.
BMC Genomics ; 24(1): 258, 2023 May 12.
Article in English | MEDLINE | ID: mdl-37173617

ABSTRACT

BACKGROUND: Bacterial epidemiology needs to understand the spread and dissemination of strains in a One Health context. This is important for highly pathogenic bacteria such as Bacillus anthracis, Brucella species, and Francisella tularensis. Whole genome sequencing (WGS) has paved the way for genetic marker detection and high-resolution genotyping. While such tasks are established for Illumina short-read sequencing, Oxford Nanopore Technology (ONT) long-read sequencing has yet to be evaluated for such highly pathogenic bacteria with little genomic variation between strains. In this study, three independent sequencing runs were performed using Illumina, ONT flow cell version 9.4.1, and 10.4 for six strains of each of Ba. anthracis, Br. suis and F. tularensis. Data from ONT sequencing alone, Illumina sequencing alone and two hybrid assembly approaches were compared. RESULTS: As previously shown, ONT produces ultra-long reads, while Illumina produces short reads with higher sequencing accuracy. Flow cell version 10.4 improved sequencing accuracy over version 9.4.1. The correct (sub-)species were inferred from all tested technologies, individually. Moreover, the sets of genetic markers for virulence were almost identical for the respective species. The long reads of ONT allowed the assembly not only of the chromosomes of all species to near closure, but also of the virulence plasmids of Ba. anthracis. Assemblies based on nanopore data alone, Illumina data alone, and both hybrid assemblies correctly detected canonical (sub-)clades for Ba. anthracis and F. tularensis as well as multilocus sequence types for Br. suis. For F. tularensis, high-resolution genotyping using core-genome MLST (cgMLST) and core-genome Single-Nucleotide-Polymorphism (cgSNP) typing produced highly comparable results between data from Illumina and both ONT flow cell versions. For Ba. anthracis, only data from flow cell version 10.4 produced similar results to Illumina for both high-resolution typing methods. 
However, for Br. suis, high-resolution genotyping yielded larger differences comparing Illumina data to data from both ONT flow cell versions. CONCLUSIONS: In summary, combining data from ONT and Illumina for high-resolution genotyping might be feasible for F. tularensis and Ba. anthracis, but not yet for Br. suis. The ongoing improvement of nanopore technology and subsequent data analysis may facilitate high-resolution genotyping for all bacteria with highly stable genomes in future.


Subject(s)
Bacillus anthracis , Brucella suis , Francisella tularensis , Nanopores , Francisella tularensis/genetics , Brucella suis/genetics , Bacillus anthracis/genetics , Multilocus Sequence Typing , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods
8.
BMC Genomics ; 24(1): 151, 2023 Mar 27.
Article in English | MEDLINE | ID: mdl-36973643

ABSTRACT

BACKGROUND: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. RESULTS: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields accuracy at least comparable to that of Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from fewer than 20,000 ONT reads. CONCLUSIONS: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.
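Since the abstract above points to homopolymers as the main location of ONT errors, a small sketch of how homopolymer runs can be located in a sequence may help. This is an illustration only and not the correction pipeline described by the authors; the function name and the default minimum run length of 3 are assumptions.

```python
from itertools import groupby


def homopolymer_runs(seq, min_len=3):
    """Return (start, base, length) for every run of identical bases of at least min_len."""
    runs = []
    pos = 0
    for base, group in groupby(seq):
        length = sum(1 for _ in group)
        if length >= min_len:
            runs.append((pos, base, length))
        pos += length
    return runs


if __name__ == "__main__":
    # Positions are 0-based offsets into the input string.
    for start, base, length in homopolymer_runs("ACGGGGTAAAC"):
        print(start, base, length)
```

Regions reported this way are natural candidates for closer inspection when comparing a long-read assembly with a reference or with short-read data.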


Subject(s)
Nanopore Sequencing , Xanthomonas , Transcription Activator-Like Effectors/genetics , Xanthomonas/genetics , Genome
9.
J Virol ; 96(5): e0155621, 2022 03 09.
Article in English | MEDLINE | ID: mdl-35019718

ABSTRACT

Thogotoviruses are tick-borne arboviruses that comprise a unique genus within the Orthomyxoviridae family. Infections with thogotoviruses primarily cause disease in livestock with occasional reports of human infections suggesting a zoonotic potential. In the past, multiple genetically distinct thogotoviruses were isolated mostly from collected ticks. However, many aspects regarding their phylogenetic relationships, morphological characteristics, and virulence in mammals remain unclear. For the present comparative study, we used a collection of 10 different thogotovirus isolates from different geographic areas. Next-generation sequencing and subsequent phylogenetic analyses revealed a distinct separation of these viruses into two major clades, the Thogoto-like and Dhori-like viruses. Electron microscopy demonstrated a heterogeneous morphology with spherical and filamentous particles being present in virus preparations. To study their pathogenicity, we analyzed the viruses in a small animal model system. In intraperitoneally infected C57BL/6 mice, all isolates showed a tropism for liver, lung, and spleen. Importantly, we did not observe horizontal transmission to uninfected, highly susceptible contact mice. The isolates enormously differed in their capacity to induce disease, ranging from subclinical to fatal outcomes. In vivo multistep passaging experiments of two low-pathogenic isolates showed no increased virulence and sequence analyses of the passaged viruses indicated a high stability of the viral genomes after 10 mouse passages. In summary, our analysis demonstrates the broad genetic and phenotypic variability within the thogotovirus genus. Moreover, thogotoviruses are well adapted to mammals but their horizontal transmission seems to depend on ticks as their vectors. IMPORTANCE Since their discovery over 60 years ago, 15 genetically distinct members of the thogotovirus genus have been isolated. 
These arboviruses belong to the Orthomyxovirus family and share many features with influenza viruses. However, many of these isolates have not been characterized in depth. In the present study, we comparatively analyzed a collection of 10 different thogotovirus isolates to answer basic questions about their phylogenetic relationships, morphology, and pathogenicity in mice. Our results highlight shared and unique characteristics of this diverse genus. Taken together, these observations provide a framework for the phylogenetic classification and phenotypic characterization of newly identified thogotovirus isolates that could potentially cause severe human infections as exemplified by the recently reported, fatal Bourbon virus cases in the United States.


Subject(s)
Orthomyxoviridae Infections , Thogotovirus , Animals , Disease Models, Animal , Genetic Variation , Genome, Viral/genetics , Genomic Instability , Mice , Mice, Inbred C57BL , Microscopy, Electron , Orthomyxoviridae Infections/transmission , Orthomyxoviridae Infections/virology , Phylogeny , Thogotovirus/classification , Thogotovirus/genetics , Thogotovirus/pathogenicity , Thogotovirus/ultrastructure , Ticks/virology
10.
Brief Bioinform ; 22(2): 642-663, 2021 03 22.
Article in English | MEDLINE | ID: mdl-33147627

ABSTRACT

SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) is a novel virus of the family Coronaviridae. The virus causes the infectious disease COVID-19. The biology of coronaviruses has been studied for many years. However, bioinformatics tools designed explicitly for SARS-CoV-2 have only recently been developed as a rapid reaction to the need for fast detection, understanding and treatment of COVID-19. To control the ongoing COVID-19 pandemic, it is of utmost importance to get insight into the evolution and pathogenesis of the virus. In this review, we cover bioinformatics workflows and tools for the routine detection of SARS-CoV-2 infection, the reliable analysis of sequencing data, the tracking of the COVID-19 pandemic and evaluation of containment measures, the study of coronavirus evolution, the discovery of potential drug targets and development of therapeutic strategies. For each tool, we briefly describe its use case and how it advances research specifically for SARS-CoV-2. All tools are free to use and available online, either through web applications or public code repositories. Contact: evbc@unj-jena.de.


Subject(s)
COVID-19/prevention & control , Computational Biology , SARS-CoV-2/isolation & purification , Biomedical Research , COVID-19/epidemiology , COVID-19/virology , Genome, Viral , Humans , Pandemics , SARS-CoV-2/genetics
11.
Bioinformatics ; 38(17): 4223-4225, 2022 09 02.
Article in English | MEDLINE | ID: mdl-35799354

ABSTRACT

SUMMARY: The ongoing pandemic caused by SARS-CoV-2 emphasizes the importance of genomic surveillance to understand the evolution of the virus, to monitor the viral population, and to plan epidemiological responses. Detailed analysis, easy visualization and intuitive filtering of the latest viral sequences are powerful for this purpose. We present CovRadar, a tool for genomic surveillance of the SARS-CoV-2 Spike protein. CovRadar consists of an analytical pipeline and a web application that enable the analysis and visualization of hundreds of thousands of sequences. First, CovRadar extracts the regions of interest using local alignment, then builds a multiple sequence alignment, infers variants and consensus and finally presents the results in an interactive app, making accessing and reporting simple, flexible and fast. AVAILABILITY AND IMPLEMENTATION: CovRadar is freely accessible at https://covradar.net; its open-source code is available at https://gitlab.com/dacs-hpi/covradar. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
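The step of inferring a consensus and variants from a multiple sequence alignment, as described above, can be illustrated with a simple majority rule. This is a hedged sketch and not CovRadar's actual method: the function names are hypothetical, the alignment is assumed to be a list of equal-length strings, and ties are resolved in favor of the character seen first in a column.

```python
from collections import Counter


def consensus(alignment):
    """Majority-rule consensus over the columns of an equal-length alignment."""
    if not alignment:
        raise ValueError("empty alignment")
    if len({len(row) for row in alignment}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*alignment))


def variants(reference, sequence):
    """List (position, reference_char, observed_char) where an aligned sequence differs."""
    if len(reference) != len(sequence):
        raise ValueError("sequences must be aligned to the same length")
    return [(i, r, s) for i, (r, s) in enumerate(zip(reference, sequence)) if r != s]


if __name__ == "__main__":
    msa = ["ACGT", "ACGA", "TCGT"]
    cons = consensus(msa)
    print(cons)
    for row in msa:
        print(row, variants(cons, row))
```

Positions are 0-based column indices of the alignment, so they would still need to be mapped to reference coordinates in a real analysis.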


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , Genomics , Mutation
12.
Eur Arch Otorhinolaryngol ; 280(5): 2373-2385, 2023 May.
Article in English | MEDLINE | ID: mdl-36441246

ABSTRACT

PURPOSE: The aim of the present study was to assess the efficacy of the Ronch®AP palatal device in treating patients with moderate and severe forms of obstructive sleep apnea syndrome. METHODS: In a randomized controlled trial, 22 patients were examined with the Ronch®AP palatal device after 4 weeks of usage. Their results were compared to a control group of 30 patients who did not receive any treatment during this time. None of the included patients tolerated CPAP therapy. Among other parameters, the apnea-hypopnea index (AHI) was measured using nocturnal cardiorespiratory polysomnography. Daytime sleepiness was assessed using the Epworth Sleepiness Scale. The Pittsburgh Sleep Quality Index was used to analyze sleep quality. RESULTS: Using the Ronch®AP palatal device, AHI was reduced from an average of 35.34 ± 14.9/h to 19.18 ± 14.93/h, whereas the control group only showed a minimal mean reduction from 31.32 ± 12.76/h to 29.37 ± 17.11/h. The difference in reduction between the two randomized groups was highly significant (d = -14.2, 95% CI 5.9-22.6, t = 3.4, df = 49.9, p = 0.001). Epworth Sleepiness Scale score was lowered from 9.18 ± 4.73 to 7.82 ± 4.14 on average and sleep quality improved by -1.91 ± 2.31. Both changes were also statistically relevant (p < 0.005). CONCLUSIONS: The Ronch®AP device is an effective alternative treatment option for patients suffering from moderate and severe forms of obstructive sleep apnea syndrome and not tolerating CPAP therapy. TRIAL REGISTRATION NUMBER: 407-16 with approval from the local ethical committee (Ethikkommission der Medizinischen Fakultät der LMU München).
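The reported between-group difference can be checked against the group means given in the abstract. The sketch below only redoes the arithmetic from the published summary values; it does not reproduce the trial's statistical test, and the function names are illustrative. The standard error is back-calculated from the reported difference and t statistic, which is an assumption about how those figures relate.

```python
def reduction(before, after):
    """Mean reduction in AHI (events per hour) within one group."""
    return before - after


def between_group_difference(treated, control):
    """Difference between the mean reductions of the treated and control groups."""
    return reduction(*treated) - reduction(*control)


def implied_standard_error(difference, t_statistic):
    """Standard error implied by a difference and its t statistic (SE = |d| / |t|)."""
    return abs(difference) / abs(t_statistic)


if __name__ == "__main__":
    treated = (35.34, 19.18)  # mean AHI before and after, device group
    control = (31.32, 29.37)  # mean AHI before and after, control group
    diff = between_group_difference(treated, control)
    se = implied_standard_error(diff, 3.4)
    print(f"difference in AHI reduction: {diff:.2f} events/h")
    print(f"implied standard error: {se:.2f} events/h")
```

The computed difference of about 14.2 events/h matches the magnitude of the reported d, and the reported interval (5.9 to 22.6) has a half-width of about 8.35, roughly twice the implied standard error, as expected for a 95% interval with about 50 degrees of freedom.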


Subject(s)
Sleep Apnea, Obstructive , Sleepiness , Humans , Sleep Apnea, Obstructive/diagnosis , Sleep Apnea, Obstructive/therapy , Polysomnography , Treatment Outcome , Palate, Hard , Continuous Positive Airway Pressure
13.
Clin Infect Dis ; 75(Suppl 1): S110-S120, 2022 08 15.
Article in English | MEDLINE | ID: mdl-35749674

ABSTRACT

BACKGROUND: Comprehensive pathogen genomic surveillance represents a powerful tool to complement and advance precision vaccinology. The emergence of the Alpha variant in December 2020 and the resulting efforts to track the spread of this and other severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants of concern led to an expansion of genomic sequencing activities in Germany. METHODS: At Robert Koch Institute (RKI), the German National Institute of Public Health, we established the Integrated Molecular Surveillance for SARS-CoV-2 (IMS-SC2) network to perform SARS-CoV-2 genomic surveillance at the national scale. SARS-CoV-2-positive samples from laboratories distributed across Germany regularly undergo whole-genome sequencing at RKI. RESULTS: We report analyses of 3623 SARS-CoV-2 genomes collected between December 2020 and December 2021, of which 3282 were randomly sampled. All variants of concern were identified in the sequenced sample set, at ratios equivalent to those in the 100-fold larger German GISAID sequence dataset from the same time period. Phylogenetic analysis confirmed variant assignments. Multiple mutations of concern emerged during the observation period. To model vaccine effectiveness in vitro, we employed authentic-virus neutralization assays, confirming that both the Beta and Zeta variants are capable of immune evasion. The IMS-SC2 sequence dataset facilitated an estimate of the SARS-CoV-2 incidence based on genetic evolution rates. Together with modeled vaccine efficacies, Delta-specific incidence estimation indicated that the German vaccination campaign contributed substantially to a deceleration of the nascent German Delta wave. CONCLUSIONS: SARS-CoV-2 molecular and genomic surveillance may inform public health policies including vaccination strategies and enable a proactive approach to controlling coronavirus disease 2019 spread as the virus evolves.


Subject(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiology , COVID-19/prevention & control , Genome, Viral , Genomics , Humans , Phylogeny , SARS-CoV-2/genetics , Vaccinology
14.
Genome Res ; 29(9): 1545-1554, 2019 09.
Article in English | MEDLINE | ID: mdl-31439691

ABSTRACT

Sequence analyses of RNA virus genomes remain challenging owing to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called "quasispecies." Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (1) RNA virus genomes and (2) subgenome-length (sg) RNAs composed of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. By using DRS, we were able to map the longest (∼26-kb) contiguous read to the viral reference genome. By combining Illumina and Oxford Nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV)-229E genome (27.3 kb). Furthermore, by using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by showing the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that DRS may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.


Subject(s)
Coronavirus/genetics , Nanopore Sequencing/methods , Sequence Analysis, RNA/methods , Cell Line , Evolution, Molecular , Genetic Variation , Genome Size , Humans , Methylation , Quasispecies
15.
BMC Plant Biol ; 22(1): 340, 2022 Jul 14.
Article in English | MEDLINE | ID: mdl-35836106

ABSTRACT

BACKGROUND: Fruits are the seed-bearing structures of flowering plants and are highly diverse in terms of morphology, texture and maturation. Dehiscent fruits split open upon maturation to discharge their seeds while indehiscent fruits are dispersed as a whole. Indehiscent fruits evolved from dehiscent fruits several times independently in the crucifer family (Brassicaceae). The fruits of Lepidium appelianum, for example, are indehiscent while the fruits of the closely related L. campestre are dehiscent. Here, we investigate the molecular and genetic mechanisms underlying the evolutionary transition from dehiscent to indehiscent fruits using these two Lepidium species as model system. RESULTS: We have sequenced the transcriptomes and small RNAs of floral buds, flowers and fruits of L. appelianum and L. campestre and analyzed differentially expressed genes (DEGs) and differently differentially expressed genes (DDEGs). DEGs are genes that show significantly different transcript levels in the same structures (buds, flowers and fruits) in different species, or in different structures in the same species. DDEGs are genes for which the change in expression level between two structures is significantly different in one species than in the other. Comparing the two species, the highest number of DEGs was found in flowers, followed by fruits and floral buds while the highest number of DDEGs was found in fruits versus flowers followed by flowers versus floral buds. Several gene ontology terms related to cell wall synthesis and degradation were overrepresented in different sets of DEGs highlighting the importance of these processes for fruit opening. Furthermore, the fruit valve identity genes FRUITFULL and YABBY3 were among the DEGs identified. Finally, the microRNA miR166 as well as the TCP transcription factors BRANCHED1 (BRC1) and TCP FAMILY TRANSCRIPTION FACTOR 4 (TCP4) were found to be DDEGs. 
CONCLUSIONS: Our study reveals differences in gene expression between dehiscent and indehiscent fruits and uncovers miR166, BRC1 and TCP4 as candidate genes for the evolutionary transition from dehiscent to indehiscent fruits in Lepidium.
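The distinction between DEGs and DDEGs drawn above amounts to comparing a single expression change with a difference between two expression changes. The sketch below shows only the point estimates, assuming normalized expression values and a pseudocount of 1; it is not the statistical procedure used in the study, which tests these quantities for significance, and all names are illustrative.

```python
import math


def log2_fold_change(expr_from, expr_to, pseudocount=1.0):
    """log2 fold change between two structures; the pseudocount avoids division by zero."""
    return math.log2((expr_to + pseudocount) / (expr_from + pseudocount))


def difference_of_fold_changes(sp1_from, sp1_to, sp2_from, sp2_to):
    """DDEG-style effect size: the change in species 1 minus the change in species 2."""
    return log2_fold_change(sp1_from, sp1_to) - log2_fold_change(sp2_from, sp2_to)


if __name__ == "__main__":
    # Hypothetical normalized expression of one gene in flowers and fruits of two species.
    print(log2_fold_change(3, 15))
    print(difference_of_fold_changes(3, 15, 7, 7))
```

A gene whose expression rises from flower to fruit in one species but stays flat in the other gets a large difference of fold changes, which is the pattern a DDEG describes.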


Subject(s)
Brassicaceae , Lepidium , Brassicaceae/genetics , Brassicaceae/metabolism , Flowers/genetics , Fruit/genetics , Fruit/metabolism , Gene Expression Regulation, Plant , Lepidium/genetics , Transcriptome
16.
Bioinformatics ; 37(7): 1018-1020, 2021 05 17.
Article in English | MEDLINE | ID: mdl-32735310

ABSTRACT

SUMMARY: PoSeiDon is an easy-to-use pipeline that helps researchers to find recombination events and sites under positive selection in protein-coding sequences. By entering homologous sequences, PoSeiDon builds an alignment, estimates a best-fitting substitution model and performs a recombination analysis followed by the construction of all corresponding phylogenies. Finally, significantly positive selected sites are detected according to different models for the full alignment and possible recombination fragments. The results of PoSeiDon are summarized in a user-friendly HTML page providing all intermediate results and the graphical representation of recombination events and positively selected sites. AVAILABILITY AND IMPLEMENTATION: PoSeiDon is freely available at https://github.com/hoelzer/poseidon. The pipeline is implemented in Nextflow with Docker support and processes the output of various tools.


Subject(s)
Recombination, Genetic , Software , Phylogeny
17.
Bioinformatics ; 37(4): 448-455, 2021 05 01.
Article in English | MEDLINE | ID: mdl-32915967

ABSTRACT

MOTIVATION: By binding to specific structures on antigenic proteins, the so-called epitopes, B-cell antibodies can neutralize pathogens. The identification of B-cell epitopes is of great value for the development of specific serodiagnostic assays and the optimization of medical therapy. However, identifying diagnostically or therapeutically relevant epitopes is a challenging task that usually involves extensive laboratory work. In this study, we show that the time-, cost- and labor-intensive process of epitope detection in the lab can be significantly reduced using in silico prediction. RESULTS: Here, we present EpiDope, a Python tool which uses a deep neural network to detect linear B-cell epitope regions on individual protein sequences. With an area under the curve of 0.67 ± 0.07 in the receiver operating characteristic curve, EpiDope exceeds all other currently used linear B-cell epitope prediction tools. Our software is shown to reliably predict linear B-cell epitopes of a given protein sequence, thus contributing to a significant reduction of laboratory experiments and costs required for the conventional approach. AVAILABILITY AND IMPLEMENTATION: EpiDope is available on GitHub (http://github.com/mcollatz/EpiDope). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
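For readers unfamiliar with the metric quoted above, the area under the ROC curve can be sketched in a few lines of Python. This is a generic, pairwise formulation of the AUC (the probability that a randomly chosen positive is scored above a randomly chosen negative) and is not taken from the EpiDope code. The function name, labels and scores are hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: 1 for epitope residues, 0 for non-epitope residues.
    scores: predicted scores, higher meaning "more likely epitope".
    Ties between a positive and a negative count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical per-residue labels and prediction scores
toy_labels = [1, 1, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]

print(roc_auc(toy_labels, toy_scores))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which puts the reported 0.67 ± 0.07 in context.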


Subject(s)
Epitopes, B-Lymphocyte , Software , Amino Acid Sequence , Computer Simulation , Epitope Mapping , Neural Networks, Computer
18.
PLoS Comput Biol ; 17(2): e1008716, 2021 02.
Article in English | MEDLINE | ID: mdl-33561126

ABSTRACT

Metagenomics has redefined many areas of microbiology. However, metagenome-assembled genomes (MAGs) are often fragmented, primarily when sequencing was performed with short reads. Recent long-read sequencing technologies promise to improve genome reconstruction. However, the integration of two different sequencing modalities makes downstream analyses complex. We, therefore, developed MUFFIN, a complete metagenomic workflow that uses short and long reads to produce high-quality bins and their annotations. The workflow is written using Nextflow, a workflow orchestration software, to achieve high reproducibility and fast and straightforward use. This workflow also produces the taxonomic classification and KEGG pathways of the bins and can be further used for quantification and annotation by optionally providing RNA-Seq data. We tested the workflow using twenty biogas reactor samples and assessed the capacity of MUFFIN to process and output relevant files needed to analyze the microbial community and their function. MUFFIN produces functional pathway predictions and, if RNA-Seq data are provided, de novo metatranscript annotations across the metagenomic sample and for each bin. MUFFIN is available on GitHub under GNUv3 licence: https://github.com/RVanDamme/MUFFIN.


Subject(s)
Computational Biology/methods , Metagenome , Metagenomics , Software , Workflow , Algorithms , Bioreactors , Computer Simulation , Genomics , Humans , RNA-Seq , Reproducibility of Results , Sequence Analysis, DNA
19.
Eur Arch Otorhinolaryngol ; 279(2): 801-810, 2022 Feb.
Article in English | MEDLINE | ID: mdl-34019148

ABSTRACT

PURPOSE: For perennial inhalant allergens such as house dust mite (HDM), the German guideline on allergen-specific immunotherapy explicitly recommends provocation testing. This procedure is time-consuming, expensive, and potentially dangerous for the patient. Recently it has been discussed whether provocation tests are really necessary for diagnosing allergy to Dermatophagoides species. Thus, our study aimed to analyze the concordance between nasal provocation tests with Dermatophagoides species and anamnestic data. METHODS: We retrospectively analyzed the concordance between patients' histories including self-reported symptom scores and the results of provocation testing in 471 individuals with proven sensitization to Dermatophagoides species. RESULTS: 248 patients had a positive nasal provocation test (NPT) result to Dermatophagoides species and 223 individuals a negative NPT result. Patients allergic to HDM suffered significantly more often from atopic dermatitis (14% vs. 7%, p = 0.046) and more from asthma (20% vs. 19%, p = 0.851). Moreover, individuals with clinically silent sensitization complained less about nasal secretion (37% vs. 45%, p = 0.244) but significantly more about nasal dryness (46% vs. 34%, p = 0.046) whereas rates of nasal airway obstruction, ocular complaints and sleep quality were comparable in both groups. Allergic patients reported more often perennial (34% vs. 30%, p = 0.374) and location-dependent (39% vs. 31%, p = 0.090) symptoms. However, the discrepant prevalence of atopic dermatitis was the only statistically significant difference between both groups. CONCLUSION: Despite slight differences between both patient groups, clinical data are not sufficient to distinguish between silent sensitization and clinically relevant allergic rhinitis to HDM. Therefore, nasal provocation testing remains the gold standard for assessing clinical relevance in patients sensitized to Dermatophagoides species.


Subject(s)
Pyroglyphidae , Rhinitis, Allergic , Allergens , Animals , Antigens, Dermatophagoides , Dust , Humans , Retrospective Studies , Rhinitis, Allergic/diagnosis , Rhinitis, Allergic/epidemiology , Skin Tests , Sleep Quality
20.
Genomics ; 113(6): 4022-4027, 2021 11.
Article in English | MEDLINE | ID: mdl-34648882

ABSTRACT

Rhodotorula toruloides is an oleaginous yeast with high biotechnological potential. In order to understand the molecular physiology of lipid synthesis in R. toruloides and to advance metabolic engineering, a high-resolution genome is required. We constructed a genome draft of R. toruloides CBS 14, using a hybrid assembly approach, consisting of short and long reads generated by Illumina and Nanopore sequencing, respectively. The genome draft consists of 23 contigs and 3 scaffolds, with an N50 length of 1,529,952 bp, thus largely representing chromosomal organization. The total size of the genome is 20,534,857 bp and the overall GC content is 61.83%. Transcriptomic data from different growth conditions was used to aid species-specific gene annotation. We annotated 9464 genes and identified 11,691 transcripts. Furthermore, we demonstrated the presence of a potential plasmid, an extrachromosomal circular structure of about 11 kb with a copy number about three times as high as the other chromosomes.
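The two assembly statistics reported above, N50 and GC content, follow standard definitions that can be sketched in plain Python. The sequence lengths and the short sequence below are made-up toy inputs, not data from the R. toruloides assembly, and the function names are chosen for this illustration only.

```python
def n50(lengths):
    """Return the N50 of a set of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly size.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no sequence lengths given")


def gc_content(sequence):
    """Percentage of G and C among the unambiguous bases (A, C, G, T)."""
    sequence = sequence.upper()
    gc = sequence.count("G") + sequence.count("C")
    acgt = gc + sequence.count("A") + sequence.count("T")
    return 100.0 * gc / acgt if acgt else 0.0


# Toy inputs (assumed for illustration)
toy_lengths = [80, 70, 50, 40, 30, 20]  # total 290, half is 145
toy_sequence = "GGCCAT"

print(n50(toy_lengths))
print(round(gc_content(toy_sequence), 2))
```

A large N50 relative to total genome size, as in the abstract, indicates that most of the assembly sits in a few long sequences.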


Subject(s)
Rhodotorula , Transcriptome , Chromosomes , Molecular Sequence Annotation , Rhodotorula/genetics