Results 1 - 20 of 118
1.
Nucleic Acids Res ; 52(6): 2865-2885, 2024 Apr 12.
Article in English | MEDLINE | ID: mdl-38471806

ABSTRACT

A comprehensive understanding of molecular changes during brain aging is essential to mitigate cognitive decline and delay neurodegenerative diseases. The interpretation of mRNA alterations during brain aging is influenced by the health and age of the animal cohorts studied. Here, we carefully consider these factors and provide an in-depth investigation of mRNA splicing and dynamics in the aging mouse brain, combining short- and long-read sequencing technologies with extensive bioinformatic analyses. Our findings encompass a spectrum of age-related changes, including differences in isoform usage, decreased mRNA dynamics and a module showing increased expression of neuronal genes. Notably, our results indicate a reduced abundance of mRNA isoforms leading to nonsense-mediated RNA decay and suggest a regulatory role for RNA-binding proteins, indicating that their regulation may be altered leading to the reshaping of the aged brain transcriptome. Collectively, our study highlights the importance of studying mRNA splicing events during brain aging.


Subject(s)
Alternative Splicing, Brain, RNA Splicing, Animals, Mice, Brain/metabolism, Gene Expression Profiling/methods, RNA Splicing/genetics, Messenger RNA/genetics, Messenger RNA/metabolism, Transcriptome/genetics
2.
Cell Rep ; 43(1): 113644, 2024 01 23.
Article in English | MEDLINE | ID: mdl-38180837

ABSTRACT

Extensive remodeling of the female mammary epithelium during development and pregnancy has been linked to cancer susceptibility. The faithful response of mammary epithelial cells (MECs) to hormone signaling is key to avoiding breast cancer development. Here, we show that lactogenic differentiation of murine MECs requires silencing of genes encoding ribosomal RNA (rRNA) by the antisense transcript PAPAS. Accordingly, knockdown of PAPAS derepresses rRNA genes, attenuates the response to lactogenic hormones, and induces malignant transformation. Restoring PAPAS levels in breast cancer cells reduces tumorigenicity and lung invasion and activates many interferon-regulated genes previously linked to metastasis suppression. Mechanistically, PAPAS transcription depends on R-loop formation at the 3' end of rRNA genes, which is repressed by RNase H1 and replication protein A (RPA) overexpression in breast cancer cells. Depletion of PAPAS and upregulation of RNase H1 and RPA in human breast cancer underpin the clinical relevance of our findings.


Subject(s)
Breast Neoplasms, Animal Mammary Glands, Pregnancy, Female, Mice, Animals, Humans, Animal Mammary Glands/metabolism, Breast/metabolism, Cell Differentiation, Breast Neoplasms/genetics, Breast Neoplasms/metabolism, Neoplastic Cell Transformation/metabolism, Epithelial Cells/metabolism
3.
Viruses ; 15(10)2023 09 30.
Article in English | MEDLINE | ID: mdl-37896809

ABSTRACT

The 2023 International Virus Bioinformatics Meeting was held in Valencia, Spain, from 24-26 May 2023, attracting approximately 180 participants worldwide. The primary objective of the conference was to establish a dynamic scientific environment conducive to discussion, collaboration, and the generation of novel research ideas. As the first in-person event following the SARS-CoV-2 pandemic, the meeting facilitated highly interactive exchanges among attendees. It served as a pivotal gathering for gaining insights into the current status of virus bioinformatics research and engaging with leading researchers and emerging scientists. The event comprised eight invited talks, 19 contributed talks, and 74 poster presentations across eleven sessions spanning three days. Topics covered included machine learning, bacteriophages, virus discovery, virus classification, virus visualization, viral infection, viromics, molecular epidemiology, phylodynamic analysis, RNA viruses, viral sequence analysis, viral surveillance, and metagenomics. This report provides rewritten abstracts of the presentations, a summary of the key research findings, and highlights shared during the meeting.


Subject(s)
Bacteriophages, RNA Viruses, Virus Diseases, Viruses, Humans, Computational Biology, Viruses/genetics
4.
J Fungi (Basel) ; 9(10)2023 Oct 14.
Article in English | MEDLINE | ID: mdl-37888270

ABSTRACT

Neutrophils are critical phagocytic cells in innate immunity, playing a significant role in defending against invasive fungal pathogens. This study aimed to explore the transcriptional activation of human neutrophils in response to different fungal pathogens, including Candida albicans and Aspergillus fumigatus, compared to the bacterial pathogen Escherichia coli. We identified distinct transcriptional profiles and stress-related pathways in neutrophils during fungal infections, highlighting their functional diversity and adaptability. The transcriptional response was largely redundant across all pathogens in immune-relevant categories and cytokine pathway activation. However, differences in the magnitude of differentially expressed genes (DEGs) were observed, with A. fumigatus inducing a lower transcriptional effect compared to C. albicans and E. coli. Notably, specific gene signatures associated with cell death were differentially regulated by fungal pathogens, potentially increasing neutrophil susceptibility to autophagy, pyroptosis, and neutrophil extracellular trap (NET) formation. These findings provide valuable insights into the complex immunological responses of neutrophils during fungal infections, offering new avenues for diagnostic and therapeutic strategies, particularly in the management of invasive fungal diseases.

5.
Viruses ; 15(9)2023 Aug 29.
Article in English | MEDLINE | ID: mdl-37766241

ABSTRACT

Viruses are abundant and diverse entities that have important roles in public health, ecology, and agriculture. The identification and surveillance of viruses rely on an understanding of their genome organization, sequences, and replication strategy. Despite technological advancements in sequencing methods, our current understanding of virus diversity remains incomplete, highlighting the need to explore undiscovered viruses. Virus databases play a crucial role in providing access to sequences, annotations and other metadata, and analysis tools for studying viruses. However, there has not been a comprehensive review of virus databases in the last five years. This study aimed to fill this gap by identifying 24 active virus databases and included an extensive evaluation of their content, functionality and compliance with the FAIR principles. In this study, we thoroughly assessed the search capabilities of five database catalogs, which serve as comprehensive repositories housing a diverse array of databases and offering essential metadata. Moreover, we conducted a comprehensive review of different types of errors, encompassing taxonomy, names, missing information, sequences, sequence orientation, and chimeric sequences, with the intention of empowering users to effectively tackle these challenges. We expect this review to aid users in selecting suitable virus databases and other resources, and to help databases in error management and improve their adherence to the FAIR principles. The databases listed here represent the current knowledge of viruses and will help users find databases of interest based on content, functionality, and scope. The use of virus databases is integral to gaining new insights into the biology, evolution, and transmission of viruses, and developing new strategies to manage virus outbreaks and preserve global health.

6.
J Virol ; 97(10): e0020523, 2023 10 31.
Article in English | MEDLINE | ID: mdl-37728614

ABSTRACT

IMPORTANCE: A common hypothesis holds that bats (order Chiroptera) are outstanding reservoirs for zoonotic viruses because of a special antiviral interferon (IFN) system. However, functional studies about key components of the bat IFN system are rare. RIG-I is a cellular sensor for viral RNA signatures that activates the antiviral signaling chain to induce IFN. We cloned and functionally characterized RIG-I genes from two species of the suborders Yangochiroptera and Yinpterochiroptera. The bat RIG-Is were conserved in their sequence and domain organization, and similar to human RIG-I in (i) mediating virus- and IFN-activated gene expression, (ii) antiviral signaling, (iii) temperature dependence, and (iv) recognition of RNA ligands. Moreover, RIG-I of Rousettus aegyptiacus (suborder Yinpterochiroptera) and of humans were found to recognize SARS-CoV-2 infection. Thus, members of both bat suborders encode RIG-Is that are comparable to their human counterpart. The ability of bats to harbor zoonotic viruses therefore seems due to other features.


Subject(s)
Chiroptera, Retinoic Acid Receptors, SARS-CoV-2, Animals, Humans, Chiroptera/metabolism, COVID-19, Immunologic Receptors/chemistry, Immunologic Receptors/genetics, Immunologic Receptors/metabolism, SARS-CoV-2/physiology, Viruses, Retinoic Acid Receptors/chemistry, Retinoic Acid Receptors/genetics, Retinoic Acid Receptors/metabolism
7.
BMC Genomics ; 24(1): 548, 2023 Sep 16.
Article in English | MEDLINE | ID: mdl-37715127

ABSTRACT

Mycoplasmopsis (M.) bovis, the agent of mastitis, pneumonia, and arthritis in cattle, harbors a small genome of approximately 1 Mbp. Combining data from Illumina and Nanopore technologies, we sequenced and assembled the genomes of 35 European strains and isolate DL422_88 from Cuba. While the high proportion of repetitive structures in M. bovis genomes represents a particular challenge, implementation of our own pipeline Mycovista (available on GitHub www.github.com/sandraTriebel/mycovista) in a hybrid approach enabled contiguous assembly of the genomes and, consequently, improved annotation rates considerably. To put our European strain panel in a global context, we analyzed the new genome sequences together with 175 genome assemblies from public databases. Construction of a phylogenetic tree based on core genes of these 219 strains revealed a clustering pattern according to geographical origin, with European isolates positioned on clades 4 and 5. Genomic data allowing assignment of strains to tissue specificity or certain disease manifestations could not be identified. Seven strains isolated from cattle with systemic circular condition (SCC), still a largely unknown manifestation of M. bovis disease, were located on both clades 4 and 5. Pairwise association analysis revealed 108 genomic elements associated with a particular clade of the phylogenetic tree. Further analyzing these hits, 25 genes are functionally annotated and could be linked to an M. bovis protein, e.g. various proteases and nucleases, as well as ten variable surface lipoproteins (Vsps) and other surface proteins. These clade-specific genes could serve as useful markers in epidemiological and clinical surveys.


Subject(s)
Genomics, Mycoplasma bovis, Female, Animals, Cattle, Phylogeny, Cluster Analysis, Factual Databases, Endonucleases, Mycoplasma bovis/genetics
8.
PLoS Comput Biol ; 19(8): e1011422, 2023 08.
Article in English | MEDLINE | ID: mdl-37639475

ABSTRACT

The study of viral communities has revealed the enormous diversity and impact these biological entities have on various ecosystems. These observations have sparked widespread interest in developing computational strategies that support the comprehensive characterisation of viral communities based on sequencing data. Here we introduce VIRify, a new computational pipeline designed to provide a user-friendly and accurate functional and taxonomic characterisation of viral communities. VIRify identifies viral contigs and prophages from metagenomic assemblies and annotates them using a collection of viral profile hidden Markov models (HMMs). These include our manually-curated profile HMMs, which serve as specific taxonomic markers for a wide range of prokaryotic and eukaryotic viral taxa and are thus used to reliably classify viral contigs. We tested VIRify on assemblies from two microbial mock communities, a large metagenomics study, and a collection of publicly available viral genomic sequences from the human gut. The results showed that VIRify could identify sequences from both prokaryotic and eukaryotic viruses, and provided taxonomic classifications from the genus to the family rank with an average accuracy of 86.6%. In addition, VIRify allowed the detection and taxonomic classification of a range of prokaryotic and eukaryotic viruses present in 243 marine metagenomic assemblies. Finally, the use of VIRify led to a large expansion in the number of taxonomically classified human gut viral sequences and the improvement of outdated and shallow taxonomic classifications. Overall, we demonstrate that VIRify is a novel and powerful resource that offers an enhanced capability to detect a broad range of viral contigs and taxonomically classify them.


Subject(s)
Eukaryota, Microbiota, Humans, Eukaryotic Cells, Viral Genome/genetics, Metagenome/genetics
9.
Article in English | MEDLINE | ID: mdl-37489568

ABSTRACT

A novel actinomycetal strain, designated M600PL45_2T, was isolated from marine sediments obtained from Ingleses beach, Porto, on the Northern Coast of Portugal and was subjected to a polyphasic taxonomic characterisation study. The Gram-reaction-positive strain described here is characterised by the production of a brown pigment in both solid and liquid medium and forms typical helical hyphae that differentiate into smooth spores. The results of a phylogenetic analysis based on the 16S rRNA gene sequence indicated that M600PL45_2T has a high similarity to two members of the genus Streptomyces, Streptomyces bathyalis ASO4wetT (98.51 %) and Streptomyces daqingensis NEAU ZJC8T (98.44 %). The genome of M600PL45_2T has a size of 6 695 159 bp, a DNA G+C content of 70.71 mol% and 5538 coding sequences. M600PL45_2T grows at 15-37 °C and with a maximal growth rate between 25 °C and 30 °C. Growth at pH 6.0 to 9.0 with the optimal range between 6.0 and 7.5 was observed. M600PL45_2T showed a high salinity tolerance, growing with 0-10 % (w/v) NaCl, with best growth with 1-3 % (w/v) NaCl. Major cellular fatty acids are iso-C15:0 (25.03 %), anteiso-C15:0 (17.70 %) and iso-C16:0 (26.90 %). The novel isolate was able to grow in media containing a variety of nitrogen and carbon sources. An antimicrobial activity screening indicated that an extract of M600PL45_2T has inhibitory activity against Staphylococcus aureus. On the basis of the polyphasic data, M600PL45_2T (= CECT 30365T = DSM 114036T) is introduced as the type strain of a novel species, which we named Streptomyces marispadix sp. nov.


Subject(s)
Fatty Acids, Sodium Chloride, Base Composition, Fatty Acids/chemistry, Phylogeny, 16S Ribosomal RNA/genetics, DNA Sequence Analysis, Bacterial DNA/genetics, Bacterial Typing Techniques, Geologic Sediments
10.
Front Vet Sci ; 10: 1178922, 2023.
Article in English | MEDLINE | ID: mdl-37323838

ABSTRACT

Bacteria of the genus Salmonella pose a major risk to livestock, the food economy, and public health. Salmonella infections are one of the leading causes of food poisoning. The identification of serovars of Salmonella achieved by their diverse surface antigens is essential to gain information on their epidemiological context. Traditionally, slide agglutination has been used for serotyping. In recent years, whole-genome sequencing (WGS) followed by in silico serotyping has been established as an alternative method for serotyping and the detection of genetic markers for Salmonella. Until now, WGS data generated with Illumina sequencing are used to validate in silico serotyping methods. Oxford Nanopore Technologies (ONT) opens the possibility to sequence ultra-long reads and has frequently been used for bacterial sequencing. In this study, ONT sequencing data of 28 Salmonella strains of different serovars with epidemiological relevance in humans, food, and animals were taken to investigate the performance of the in silico serotyping tools SISTR and SeqSero2 compared to traditional slide agglutination tests. Moreover, the detection of genetic markers for resistance against antimicrobial agents, virulence, and plasmids was studied by comparing WGS data based on ONT with WGS data based on Illumina. Based on the ONT data from flow cell version R9.4.1, in silico serotyping achieved an accuracy of 96.4 and 92% for the tools SISTR and SeqSero2, respectively. Highly similar sets of genetic markers comparing both sequencing technologies were identified. Taking the ongoing improvement of basecalling and flow cells into account, ONT data can be used for Salmonella in silico serotyping and genetic marker detection.

11.
BMC Genomics ; 24(1): 288, 2023 May 29.
Article in English | MEDLINE | ID: mdl-37248517

ABSTRACT

BACKGROUND: Chlamydia (C.) psittaci, the causative agent of avian chlamydiosis and human psittacosis, is a genetically heterogeneous species. Its broad host range includes parrots and many other birds, but occasionally also humans (via zoonotic transmission), ruminants, horses, swine and rodents. To assess whether there are genetic markers associated with host tropism we comparatively analyzed whole-genome sequences of 61 C. psittaci strains, 47 of which carry a 7.6-kbp plasmid. RESULTS: Following clean-up, reassembly and polishing of poorly assembled genomes from public databases, phylogenetic analyses using C. psittaci whole-genome sequence alignment revealed four major clades within this species. Clade 1 represents the most recent lineage comprising 40/61 strains and contains 9/10 of the psittacine strains, including type strain 6BC, and 10/13 of human isolates. Strains from different non-psittacine hosts clustered in Clades 2-4. We found that clade membership correlates with typing schemes based on SNP types, ompA genotypes, multilocus sequence types as well as plasticity zone (PZ) structure and host preference. Genome analysis also revealed that i) sequence variation in the major outer membrane porin MOMP can result in 3D structural changes of immunogenic domains, ii) past host change of Clade 3 and 4 strains could be associated with loss of MAC/perforin in the PZ, rather than the large cytotoxin, iii) the distinct phylogeny of atypical strains (Clades 3 and 4) is also reflected in their repertoire of inclusion proteins (Inc family) and polymorphic membrane proteins (Pmps). CONCLUSIONS: Our study identified a number of genomic features that can be correlated with the phylogeny and host preference of C. psittaci strains. Our data show that intra-species genomic divergence is associated with past host change and includes deletions in the plasticity zone, structural variations in immunogenic domains and distinct repertoires of virulence factors.


Subject(s)
Chlamydia, Chlamydophila psittaci, Psittacosis, Animals, Humans, Horses, Swine, Chlamydophila psittaci/genetics, Psittacosis/veterinary, Phylogeny, Chlamydia/genetics, Birds, Genomics
12.
Nucleic Acids Res ; 51(12): 6479-6494, 2023 07 07.
Article in English | MEDLINE | ID: mdl-37224537

ABSTRACT

A fundamental step in the influenza A virus (IAV) replication cycle is the coordinated packaging of eight distinct genomic RNA segments (i.e. vRNAs) into a viral particle. Although this process is thought to be controlled by specific vRNA-vRNA interactions between the genome segments, few functional interactions have been validated. Recently, a large number of potentially functional vRNA-vRNA interactions have been detected in purified virions using the RNA interactome capture method SPLASH. However, their functional significance in coordinated genome packaging remains largely unclear. Here, we show by systematic mutational analysis that mutant A/SC35M (H7N7) viruses lacking several prominent SPLASH-identified vRNA-vRNA interactions involving the HA segment package the eight genome segments as efficiently as the wild-type virus. We therefore propose that the vRNA-vRNA interactions identified by SPLASH in IAV particles are not necessarily critical for the genome packaging process, leaving the underlying molecular mechanism elusive.


Subject(s)
Influenza A Virus H7N7 Subtype, Viral Genome Packaging, Humans, Viral Genome, Influenza A Virus H7N7 Subtype/physiology, Human Influenza/virology, Viral RNA/metabolism, Virus Assembly
13.
Front Endocrinol (Lausanne) ; 14: 1021640, 2023.
Article in English | MEDLINE | ID: mdl-36936174

ABSTRACT

Placenta accreta spectrum (PAS) is one of the major causes of maternal morbidity and mortality worldwide with increasing incidence. PAS refers to a group of pathological conditions ranging from the abnormal attachment of the placenta to the uterus wall to its perforation and, in extreme cases, invasion into surrounding organs. Among them, placenta accreta is characterized by a direct adhesion of the villi to the myometrium without invasion and remains the most common diagnosis of PAS. Here, we identify the potential regulatory miRNA and target networks contributing to placenta accreta development. Using small RNA-Seq followed by RT-PCR confirmation, altered miRNA expression, including that of members of placenta-specific miRNA clusters (e.g., C19MC and C14MC), was identified in placenta accreta samples compared to normal placental tissues. In situ hybridization (ISH) revealed expression of altered miRNAs mostly in trophoblast but also in endothelial cells and this profile was similar among all evaluated degrees of PAS. Kyoto encyclopedia of genes and genomes (KEGG) analyses showed enriched pathways dysregulated in PAS associated with cell cycle regulation, inflammation, and invasion. mRNAs of genes associated with cell cycle and inflammation were downregulated in PAS. At the protein level, NF-κB was upregulated while PTEN was downregulated in placenta accreta tissue. The identified miRNAs and their targets are associated with signaling pathways relevant to controlling trophoblast function. Therefore, this study provides miRNA:mRNA associations that could be useful for understanding PAS onset and progression.


Subject(s)
MicroRNAs, Placenta Accreta, Pregnancy, Humans, Female, Placenta Accreta/genetics, Placenta Accreta/metabolism, Placenta Accreta/pathology, MicroRNAs/genetics, MicroRNAs/metabolism, Endothelial Cells/metabolism, Placenta/metabolism, Myometrium
14.
bioRxiv ; 2023 Mar 17.
Article in English | MEDLINE | ID: mdl-36993667

ABSTRACT

Oxford Nanopore Technologies (ONT) allows direct sequencing of ribonucleic acids (RNA) and, in addition, detection of possible RNA modifications due to deviations from the expected ONT signal. The software available so far for this purpose can only detect a small number of modifications. Alternatively, two samples can be compared for different RNA modifications. We present Magnipore, a novel tool to search for significant signal shifts between samples of Oxford Nanopore data from similar or related species. Magnipore classifies them into mutations and potential modifications. We use Magnipore to compare SARS-CoV-2 samples. Included were representatives of the early 2020s Pango lineages (n=6), samples from Pango lineages B.1.1.7 (n=2, Alpha), B.1.617.2 (n=1, Delta), and B.1.529 (n=7, Omicron). Magnipore utilizes position-wise Gaussian distribution models and a comprehensible significance threshold to find differential signals. In the case of Alpha and Delta, Magnipore identifies 55 detected mutations and 15 sites that hint at differential modifications. We predicted potential virus-variant and variant-group-specific differential modifications. Magnipore contributes to advancing RNA modification analysis in the context of viruses and virus variants.
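
The idea of comparing position-wise Gaussian models between two samples can be illustrated with a minimal sketch. It is not Magnipore's code: the score (mean difference divided by the pooled standard deviation), the function names, and the threshold of 2.0 are all assumptions chosen for illustration, since the abstract does not state the actual statistic or cutoff.

```python
import math
from statistics import mean, stdev


def fit_gaussian(signals):
    """Return (mean, standard deviation) of the signal values at one position."""
    return mean(signals), stdev(signals)


def signal_shift(sample_a, sample_b):
    """Standardised distance between two position-wise Gaussian models.

    Hypothetical score: absolute mean difference divided by the pooled
    standard deviation. This is an assumption, not Magnipore's statistic.
    """
    mu_a, sd_a = fit_gaussian(sample_a)
    mu_b, sd_b = fit_gaussian(sample_b)
    pooled = math.sqrt((sd_a ** 2 + sd_b ** 2) / 2)
    if pooled == 0:
        # Both samples are constant: any mean difference is an infinite shift.
        return math.inf if mu_a != mu_b else 0.0
    return abs(mu_a - mu_b) / pooled


def differential_positions(signals_a, signals_b, threshold=2.0):
    """Return the positions whose shift score exceeds the assumed threshold."""
    return [
        pos
        for pos, (a, b) in enumerate(zip(signals_a, signals_b))
        if signal_shift(a, b) > threshold
    ]


# Toy data: two positions, four signal measurements per sample and position.
signals_a = [[80.0, 81.0, 79.0, 80.5], [95.0, 96.0, 94.0, 95.5]]
signals_b = [[80.2, 80.8, 79.5, 80.1], [105.0, 106.0, 104.0, 105.5]]
flagged = differential_positions(signals_a, signals_b)
```

In this toy input only the second position (index 1) shows a clear shift. Per the abstract, a flagged position would then still have to be classified as a mutation or a potential modification, which this sketch does not attempt.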

15.
BMC Genomics ; 24(1): 151, 2023 Mar 27.
Article in English | MEDLINE | ID: mdl-36973643

ABSTRACT

BACKGROUND: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. RESULTS: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields accuracy at least comparable to that of Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. CONCLUSIONS: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.


Subject(s)
Nanopore Sequencing, Xanthomonas, Transcription Activator-Like Effectors/genetics, Xanthomonas/genetics, Genome
16.
Curr Top Microbiol Immunol ; 439: 305-339, 2023.
Article in English | MEDLINE | ID: mdl-36592250

ABSTRACT

Coronaviruses have a broad host range and exhibit high zoonotic potential. In this chapter, we describe their genomic organization in terms of encoded proteins and provide an introduction to the peculiar discontinuous transcription mechanism. Further, we present evolutionary conserved genomic RNA secondary structure features, which are involved in the complex replication mechanism. With a focus on computational methods, we review the emergence of SARS-CoV-2 starting with the 2019 strains. In that context, we also discuss the debated hypothesis of whether SARS-CoV-2 was created in a laboratory. We focus on the molecular evolution and the epidemiological dynamics of this recently emerged pathogen and we explain how variants of concern are detected and characterised. COVID-19, the disease caused by SARS-CoV-2, can spread through different transmission routes and also depends on a number of risk factors. We describe how current computational models of viral epidemiology, or more specifically, phylodynamics, have facilitated and will continue to enable a better understanding of the epidemic dynamics of SARS-CoV-2.


Subject(s)
COVID-19, SARS-CoV-2, Animals, SARS-CoV-2/genetics, COVID-19/genetics, Viral Genome, Genomics, Life Cycle Stages
17.
Proc Natl Acad Sci U S A ; 119(35): e2122636119, 2022 08 30.
Article in English | MEDLINE | ID: mdl-36018838

ABSTRACT

Taxonomic classification, that is, the assignment to biological clades with shared ancestry, is a common task in genetics, mainly based on a genome similarity search of large genome databases. The classification quality depends heavily on the database, since representative relatives must be present. Many genomic sequences cannot be classified at all or only with a high misclassification rate. Here we present BERTax, a deep neural network program based on natural language processing to precisely classify the superkingdom and phylum of DNA sequences taxonomically without the need for a known representative relative from a database. We show BERTax to be at least on par with the state-of-the-art approaches when taxonomically similar species are part of the training data. For novel organisms, however, BERTax clearly outperforms any existing approach. Finally, we show that BERTax can also be combined with database approaches to further increase the prediction quality in almost all cases. Since BERTax is not based on similar entries in databases, it allows precise taxonomic classification of a broader range of genomic sequences, thus increasing the overall information gain.


Subject(s)
Taxonomic DNA Barcoding, DNA, Deep Learning, Software, Algorithms, Base Sequence, DNA/classification, DNA/genetics, Taxonomic DNA Barcoding/methods, Genome, Genomics
18.
Viruses ; 14(7)2022 07 12.
Article in English | MEDLINE | ID: mdl-35891501

ABSTRACT

Viruses are the cause of a considerable burden to human, animal and plant health, while on the other hand playing an important role in regulating entire ecosystems. The power of new sequencing technologies combined with new tools for processing "Big Data" offers unprecedented opportunities to answer fundamental questions in virology. Virologists have an urgent need for virus-specific bioinformatics tools. These developments have led to the formation of the European Virus Bioinformatics Center, a network of experts in virology and bioinformatics who are joining forces to enable extensive exchange and collaboration between these research areas. The EVBC strives to provide talented researchers with a supportive environment free of gender bias, but the gender gap in science, especially in math-intensive fields such as computer science, persists. To bring more talented women into research and keep them there, we need to highlight role models to spark their interest, and we need to ensure that female scientists are not kept at lower levels but are given the opportunity to lead the field. Here we showcase the work of the EVBC and highlight the achievements of some outstanding women experts in virology and viral bioinformatics.


Subject(s)
Computational Biology, Research Personnel, Viruses, Europe, Female, Humans, Research Personnel/statistics & numerical data, Viruses/genetics
19.
Front Microbiol ; 13: 909692, 2022.
Article in English | MEDLINE | ID: mdl-35814690

ABSTRACT

During the last decades, antimicrobial resistance (AMR) has become a global public health concern. Nowadays multi-drug resistance is commonly observed in strains of Vibrio cholerae, the etiological agent of cholera. In order to limit the spread of pathogenic drug-resistant bacteria and to maintain treatment options, the analysis of clinical samples and their AMR profiles is essential. Particularly, in low-resource settings a timely analysis of AMR profiles is often impaired due to lengthy culturing procedures for antibiotic susceptibility testing or lack of laboratory capacity. In this study, we explore the applicability of whole genome sequencing for the prediction of AMR profiles of V. cholerae. We developed the pipeline CholerAegon for the in silico prediction of AMR profiles of 82 V. cholerae genomes assembled from long and short sequencing reads. By correlating the predicted profiles with results from phenotypic antibiotic susceptibility testing we show that the prediction can replace in vitro susceptibility testing for five of seven antibiotics. Because of the relatively low costs, possibility for real-time data analyses, and portability, the Oxford Nanopore Technologies MinION sequencing platform, especially in light of an upcoming less error-prone technology for the platform, appears to be well suited for pathogen genomic analyses such as the one described here. Together with CholerAegon, it can leverage pathogen genomics to improve disease surveillance and to control further spread of antimicrobial resistance.

20.
BMC Plant Biol ; 22(1): 340, 2022 Jul 14.
Article in English | MEDLINE | ID: mdl-35836106

ABSTRACT

BACKGROUND: Fruits are the seed-bearing structures of flowering plants and are highly diverse in terms of morphology, texture and maturation. Dehiscent fruits split open upon maturation to discharge their seeds while indehiscent fruits are dispersed as a whole. Indehiscent fruits evolved from dehiscent fruits several times independently in the crucifer family (Brassicaceae). The fruits of Lepidium appelianum, for example, are indehiscent while the fruits of the closely related L. campestre are dehiscent. Here, we investigate the molecular and genetic mechanisms underlying the evolutionary transition from dehiscent to indehiscent fruits using these two Lepidium species as model system. RESULTS: We have sequenced the transcriptomes and small RNAs of floral buds, flowers and fruits of L. appelianum and L. campestre and analyzed differentially expressed genes (DEGs) and differently differentially expressed genes (DDEGs). DEGs are genes that show significantly different transcript levels in the same structures (buds, flowers and fruits) in different species, or in different structures in the same species. DDEGs are genes for which the change in expression level between two structures is significantly different in one species than in the other. Comparing the two species, the highest number of DEGs was found in flowers, followed by fruits and floral buds while the highest number of DDEGs was found in fruits versus flowers followed by flowers versus floral buds. Several gene ontology terms related to cell wall synthesis and degradation were overrepresented in different sets of DEGs highlighting the importance of these processes for fruit opening. Furthermore, the fruit valve identity genes FRUITFULL and YABBY3 were among the DEGs identified. Finally, the microRNA miR166 as well as the TCP transcription factors BRANCHED1 (BRC1) and TCP FAMILY TRANSCRIPTION FACTOR 4 (TCP4) were found to be DDEGs. 
CONCLUSIONS: Our study reveals differences in gene expression between dehiscent and indehiscent fruits and uncovers miR166, BRC1 and TCP4 as candidate genes for the evolutionary transition from dehiscent to indehiscent fruits in Lepidium.
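
The distinction drawn in this abstract between DEGs and DDEGs can be made concrete with a small sketch. A DEG compares expression levels directly, while a DDEG compares the change between two structures across the two species, that is, a difference of fold changes. The function names, the pseudocount, and the toy expression values below are assumptions for illustration. The sketch computes an effect size only and does not perform the significance testing that the study's definitions require.

```python
import math


def log2_fold_change(expr_from, expr_to, pseudocount=1.0):
    """Log2 fold change between two structures; the pseudocount avoids log(0)."""
    return math.log2((expr_to + pseudocount) / (expr_from + pseudocount))


def ddeg_effect(species_1, species_2, pseudocount=1.0):
    """Difference of flower-to-fruit log2 fold changes between two species.

    A value far from zero means the gene changes differently between the
    two structures in one species than in the other (DDEG candidate).
    """
    lfc_1 = log2_fold_change(species_1["flower"], species_1["fruit"], pseudocount)
    lfc_2 = log2_fold_change(species_2["flower"], species_2["fruit"], pseudocount)
    return lfc_1 - lfc_2


# Hypothetical normalised expression values for a single gene.
dehiscent = {"flower": 7.0, "fruit": 31.0}     # rises fourfold after the pseudocount
indehiscent = {"flower": 15.0, "fruit": 15.0}  # unchanged between structures
effect = ddeg_effect(dehiscent, indehiscent)
```

Here the gene rises by two log2 units in one species and stays flat in the other, so the effect is 2.0. Calling it a DDEG in practice would need replicates and a statistical model with a species-by-structure interaction term.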


Subject(s)
Brassicaceae, Lepidium, Brassicaceae/genetics, Brassicaceae/metabolism, Flowers/genetics, Fruit/genetics, Fruit/metabolism, Plant Gene Expression Regulation, Lepidium/genetics, Transcriptome