RESUMEN
COVID-19 pandemic caused by SARS-CoV-2 infection is a public health emergency. COVID-19 typically exhibits respiratory illness. Unexpectedly, emerging clinical reports indicate that neurological symptoms continue to rise, suggesting detrimental effects of SARS-CoV-2 on the central nervous system (CNS). Here, we show that a Düsseldorf isolate of SARS-CoV-2 enters 3D human brain organoids within 2 days of exposure. We identified that SARS-CoV-2 preferably targets neurons of brain organoids. Imaging neurons of organoids reveal that SARS-CoV-2 exposure is associated with altered distribution of Tau from axons to soma, hyperphosphorylation, and apparent neuronal death. Our studies, therefore, provide initial insights into the potential neurotoxic effect of SARS-CoV-2 and emphasize that brain organoids could model CNS pathologies of COVID-19.
Asunto(s)
Betacoronavirus/fisiología , Encéfalo/virología , Neuronas/virología , Animales , Muerte Celular , Chlorocebus aethiops , Humanos , Enfermedades del Sistema Nervioso/virología , Organoides , SARS-CoV-2 , Células Vero , Proteínas tau/metabolismoRESUMEN
BACKGROUND: Tracing of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission chains is still a major challenge for public health authorities, when incidental contacts are not recalled or are not perceived as potential risk contacts. Viral sequencing can address key questions about SARS-CoV-2 evolution and may support reconstruction of viral transmission networks by integration of molecular epidemiology into classical contact tracing. METHODS: In collaboration with local public health authorities, we set up an integrated system of genomic surveillance in an urban setting, combining a) viral surveillance sequencing, b) genetically based identification of infection clusters in the population, c) integration of public health authority contact tracing data, and d) a user-friendly dashboard application as a central data analysis platform. RESULTS: Application of the integrated system from August to December 2020 enabled a characterization of viral population structure, analysis of 4 outbreaks at a maximum care hospital, and genetically based identification of 5 putative population infection clusters, all of which were confirmed by contact tracing. The system contributed to the development of improved hospital infection control and prevention measures and enabled the identification of previously unrecognized transmission chains, involving a martial arts gym and establishing a link between the hospital to the local population. CONCLUSIONS: Integrated systems of genomic surveillance could contribute to the monitoring and, potentially, improved management of SARS-CoV-2 transmission in the population.
Asunto(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiología , Trazado de Contacto , Brotes de Enfermedades/prevención & control , Genómica , Humanos , SARS-CoV-2/genéticaRESUMEN
BackgroundTracking person-to-person SARS-CoV-2 transmission in the population is important to understand the epidemiology of community transmission and may contribute to the containment of SARS-CoV-2. Neither contact tracing nor genomic surveillance alone, however, are typically sufficient to achieve this objective.AimWe demonstrate the successful application of the integrated genomic surveillance (IGS) system of the German city of Düsseldorf for tracing SARS-CoV-2 transmission chains in the population as well as detecting and investigating travel-associated SARS-CoV-2 infection clusters.MethodsGenomic surveillance, phylogenetic analysis, and structured case interviews were integrated to elucidate two genetically defined clusters of SARS-CoV-2 isolates detected by IGS in Düsseldorf in July 2021.ResultsCluster 1 (n = 67 Düsseldorf cases) and Cluster 2 (n = 36) were detected in a surveillance dataset of 518 high-quality SARS-CoV-2 genomes from Düsseldorf (53% of total cases, sampled mid-June to July 2021). Cluster 1 could be traced back to a complex pattern of transmission in nightlife venues following a putative importation by a SARS-CoV-2-infected return traveller (IP) in late June; 28 SARS-CoV-2 cases could be epidemiologically directly linked to IP. Supported by viral genome data from Spain, Cluster 2 was shown to represent multiple independent introduction events of a viral strain circulating in Catalonia and other European countries, followed by diffuse community transmission in Düsseldorf.ConclusionIGS enabled high-resolution tracing of SARS-CoV-2 transmission in an internationally connected city during community transmission and provided infection chain-level evidence of the downstream propagation of travel-imported SARS-CoV-2 cases.
Asunto(s)
COVID-19 , Enfermedades Transmisibles Importadas , Humanos , SARS-CoV-2/genética , Viaje , Enfermedades Transmisibles Importadas/epidemiología , COVID-19/epidemiología , Filogenia , Trazado de Contacto , Alemania/epidemiología , GenómicaRESUMEN
We whole-genome sequenced 55 SARS-CoV-2 isolates from Germany to investigate SARS-CoV-2 outbreaks in 2020 in the Heinsberg district and Düsseldorf. While the genetic structure of the Heinsberg outbreak indicates a clonal origin, reflecting superspreading dynamics from mid-February during the carnival season, distinct viral strains were circulating in Düsseldorf in March, reflecting the city's international links. Limited detection of Heinsberg strains in the Düsseldorf area despite geographical proximity may reflect efficient containment and contact-tracing efforts.
Asunto(s)
Betacoronavirus/genética , Técnicas de Laboratorio Clínico/métodos , Infecciones por Coronavirus/diagnóstico , Genoma Viral/genética , Pandemias , Neumonía Viral/diagnóstico , Secuenciación Completa del Genoma/métodos , Betacoronavirus/aislamiento & purificación , Betacoronavirus/patogenicidad , COVID-19 , Prueba de COVID-19 , Infecciones por Coronavirus/epidemiología , Brotes de Enfermedades , Alemania/epidemiología , Humanos , Neumonía Viral/epidemiología , ADN Polimerasa Dirigida por ARN , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , SARS-CoV-2RESUMEN
RNA-based regulation has become a major research topic in molecular biology. The analysis of epigenetic and expression data is therefore incomplete if RNA-based regulation is not taken into account. Thus, it is increasingly important but not yet standard to combine RNA-centric data and analysis tools with other types of experimental data such as RNA-seq or ChIP-seq. Here, we present the RNA workbench, a comprehensive set of analysis tools and consolidated workflows that enable the researcher to combine these two worlds. Based on the Galaxy framework the workbench guarantees simple access, easy extension, flexible adaption to personal and security needs, and sophisticated analyses that are independent of command-line knowledge. Currently, it includes more than 50 bioinformatics tools that are dedicated to different research areas of RNA biology including RNA structure analysis, RNA alignment, RNA annotation, RNA-protein interaction, ribosome profiling, RNA-seq analysis and RNA target prediction. The workbench is developed and maintained by experts in RNA bioinformatics and the Galaxy framework. Together with the growing community evolving around this workbench, we are committed to keep the workbench up-to-date for future standards and needs, providing researchers with a reliable and robust framework for RNA data analysis. AVAILABILITY: The RNA workbench is available at https://github.com/bgruening/galaxy-rna-workbench.
Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , ARN/química , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Biología Computacional , Internet , Conformación de Ácido Nucleico , ARN/metabolismo , ARN no Traducido/química , Flujo de TrabajoRESUMEN
What does it take to convert a heap of sequencing data into a publishable result? First, common tools are employed to reduce primary data (sequencing reads) to a form suitable for further analyses (i.e., the list of variable sites). The subsequent exploratory stage is much more ad hoc and requires the development of custom scripts and pipelines, making it problematic for biomedical researchers. Here, we describe a hybrid platform combining common analysis pathways with the ability to explore data interactively. It aims to fully encompass and simplify the "raw data-to-publication" pathway and make it reproducible.
Asunto(s)
Investigación Biomédica/métodos , Investigación Biomédica/organización & administración , Biología Computacional , Secuenciación de Nucleótidos de Alto Rendimiento , Investigadores , Programas Informáticos , HumanosRESUMEN
CLIP-seq experiments are currently the most important means for determining the binding sites of RNA binding proteins on a genome-wide level. The computational analysis can be divided into three steps. In the first pre-processing stage, raw reads have to be trimmed and mapped to the genome. This step has to be specifically adapted for each CLIP-seq protocol. The next step is peak calling, which is required to remove unspecific signals and to determine bona fide protein binding sites on target RNAs. Here, both protocol-specific approaches as well as generic peak callers are available. Despite some peak callers being more widely used, each peak caller has its specific assets and drawbacks, and it might be advantageous to compare the results of several methods. Although peak calling is often the final step in many CLIP-seq publications, an important follow-up task is the determination of binding models from CLIP-seq data. This is central because CLIP-seq experiments are highly dependent on the transcriptional state of the cell in which the experiment was performed. Thus, relying solely on binding sites determined by CLIP-seq from different cells or conditions can lead to a high false negative rate. This shortcoming can, however, be circumvented by applying models that predict additional putative binding sites.
Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Inmunoprecipitación/métodos , Proteínas de Unión al ARN/genética , ARN/química , Análisis de Secuencia de ARN/estadística & datos numéricos , Programas Informáticos , Anticuerpos/química , Secuencia de Bases , Sitios de Unión , Línea Celular , Biblioteca de Genes , Humanos , Conformación de Ácido Nucleico , Unión Proteica , ARN/genética , ARN/metabolismo , Proteínas de Unión al ARN/metabolismo , Análisis de Secuencia de ARN/métodos , TranscriptomaRESUMEN
Glutamate is the most important excitatory neurotransmitter in the brain. The N-methyl-D-aspartate (NMDA) receptor is a glutamate-gated ionotropic cation channel that is composed of several subunits and modulated by a glycine binding site. Many forms of synaptic plasticity depend on the influx of calcium ions through NMDA receptors, and NMDA receptor dysfunction has been linked to a number of neuropsychiatric disorders, including schizophrenia. Whole-exome sequencing was performed in a family with a strong history of psychotic disorders over three generations. We used an iterative strategy to obtain condense and meaningful variants. In this highly affected family, we found a frameshift mutation (rs10666583) in the GRIN3B gene, which codes for the GluN3B subunit of the NMDA receptor in all family members with a psychotic disorder, but not in the healthy relatives. Matsuno et al., also reported this null variant as a risk factor for schizophrenia in 2015. In a broader sample of 22 patients with psychosis, the allele frequency of the rs10666583 mutation variant was increased compared to those of healthy population samples and unaffected relatives. Compared to the 1000 Genomes Project population, we found a significant increase of this variant with a large effect size among patients. The amino acid shift degrades the S1/S2 glycine binding domain of the dominant modulatory GluN3B subunit of the NMDA receptor, which subsequently affects the permeability of the channel pore to calcium ions. A decreased glycine affinity for the GluN3B subunit might cause impaired functional capability of the NMDA receptor and could be an important risk factor for the pathogenesis of psychotic disorders.
Asunto(s)
Exoma/genética , Predisposición Genética a la Enfermedad , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Mutación Missense/genética , Trastornos Psicóticos/genética , Receptores de N-Metil-D-Aspartato/genética , Esquizofrenia/genética , Adulto , Anciano , Estudios de Casos y Controles , Femenino , Humanos , Masculino , Persona de Mediana Edad , Linaje , Factores de Riesgo , Adulto JovenRESUMEN
BACKGROUND: Many functional RNA molecules fold into pseudoknot structures, which are often essential for the formation of an RNA's 3D structure. Currently the design of RNA molecules, which fold into a specific structure (known as RNA inverse folding) within biotechnological applications, is lacking the feature of incorporating pseudoknot structures into the design. Hairpin-(H)- and kissing hairpin-(K)-type pseudoknots cover a wide range of biologically functional pseudoknots and can be represented on a secondary structure level. RESULTS: The RNA inverse folding program antaRNA, which takes secondary structure, target GC-content and sequence constraints as input, is extended to provide solutions for such H- and K-type pseudoknotted secondary structure constraint. We demonstrate the easy and flexible interchangeability of modules within the antaRNA framework by incorporating pKiss as structure prediction tool capable of predicting the mentioned pseudoknot types. The performance of the approach is demonstrated on a subset of the Pseudobase ++ dataset. CONCLUSIONS: This new service is available via a standalone version and is also part of the Freiburg RNA Tools webservice. Furthermore, antaRNA is available in Galaxy and is part of the RNA-workbench Docker image.
Asunto(s)
Algoritmos , Pliegue del ARN , ARN/química , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Humanos , Modelos MolecularesRESUMEN
The SARS-CoV-2 genome occupies a unique place in infection biology - it is the most highly sequenced genome on earth (making up over 20% of public sequencing datasets) with fine scale information on sampling date and geography, and has been subject to unprecedented intense analysis. As a result, these phylogenetic data are an incredibly valuable resource for science and public health. However, the vast majority of the data was sequenced by tiling amplicons across the full genome, with amplicon schemes that changed over the pandemic as mutations in the viral genome interacted with primer binding sites. In combination with the disparate set of genome assembly workflows and lack of consistent quality control (QC) processes, the current genomes have many systematic errors that have evolved with the virus and amplicon schemes. These errors have significant impacts on the phylogeny, and therefore over the last few years, many thousands of hours of researchers time has been spent in "eyeballing" trees, looking for artefacts, and then patching the tree. Given the huge value of this dataset, we therefore set out to reprocess the complete set of public raw sequence data in a rigorous amplicon-aware manner, and build a cleaner phylogeny. Here we provide a global tree of 3,960,704 samples, built from a consistently assembled set of high quality consensus sequences from all available public data as of March 2023, viewable at https://viridian.taxonium.org. Each genome was constructed using a novel assembly tool called Viridian (https://github.com/iqbal-lab-org/viridian), developed specifically to process amplicon sequence data, eliminating artefactual errors and mask the genome at low quality positions. We provide simulation and empirical validation of the methodology, and quantify the improvement in the phylogeny. Phase 2 of our project will address the fact that the data in the public archives is heavily geographically biased towards the Global North. We therefore have contributed new raw data to ENA/SRA from many countries including Ghana, Thailand, Laos, Sri Lanka, India, Argentina and Singapore. We will incorporate these, along with all public raw data submitted between March 2023 and the current day, into an updated set of assemblies, and phylogeny. We hope the tree, consensus sequences and Viridian will be a valuable resource for researchers.
RESUMEN
Accurate and comprehensive immunogenetic reference panels are key to the successful implementation of population-scale immunogenomics. The 5Mbp Major Histocompatibility Complex (MHC) is the most polymorphic region of the human genome and associated with multiple immune-mediated diseases, transplant matching and therapy responses. Analysis of MHC genetic variation is severely complicated by complex patterns of sequence variation, linkage disequilibrium and a lack of fully resolved MHC reference haplotypes, increasing the risk of spurious findings on analyzing this medically important region. Integrating Illumina, ultra-long Nanopore, and PacBio HiFi sequencing as well as bespoke bioinformatics, we completed five of the alternative MHC reference haplotypes of the current (GRCh38/hg38) build of the human reference genome and added one other. The six assembled MHC haplotypes encompass the DR1 and DR4 haplotype structures in addition to the previously completed DR2 and DR3, as well as six distinct classes of the structurally variable C4 region. Analysis of the assembled haplotypes showed that MHC class II sequence structures, including repeat element positions, are generally conserved within the DR haplotype supergroups, and that sequence diversity peaks in three regions around HLA-A, HLA-B+C, and the HLA class II genes. Demonstrating the potential for improved short-read analysis, the number of proper read pairs recruited to the MHC was found to be increased by 0.06%-0.49% in a 1000 Genomes Project read remapping experiment with seven diverse samples. Furthermore, the assembled haplotypes can serve as references for the community and provide the basis of a structurally accurate genotyping graph of the complete MHC region.
Asunto(s)
Antígenos de Histocompatibilidad Clase II , Complejo Mayor de Histocompatibilidad , Humanos , Haplotipos , Alelos , Antígenos de Histocompatibilidad Clase II/genética , Complejo Mayor de Histocompatibilidad/genética , Antígenos HLA/genética , Antígenos HLA-C/genéticaRESUMEN
Typical genotyping workflows map reads to a reference genome before identifying genetic variants. Generating such alignments introduces reference biases and comes with substantial computational burden. Furthermore, short-read lengths limit the ability to characterize repetitive genomic regions, which are particularly challenging for fast k-mer-based genotypers. In the present study, we propose a new algorithm, PanGenie, that leverages a haplotype-resolved pangenome reference together with k-mer counts from short-read sequencing data to genotype a wide spectrum of genetic variation-a process we refer to as genome inference. Compared with mapping-based approaches, PanGenie is more than 4 times faster at 30-fold coverage and achieves better genotype concordances for almost all variant types and coverages tested. Improvements are especially pronounced for large insertions (≥50 bp) and variants in repetitive regions, enabling the inclusion of these classes of variants in genome-wide association studies. PanGenie efficiently leverages the increasing amount of haplotype-resolved assemblies to unravel the functional impact of previously inaccessible variants while being faster compared with alignment-based workflows.
Asunto(s)
Variación Genética , Genoma Humano , Genómica , Algoritmos , Genoma Humano/genética , Estudio de Asociación del Genoma Completo , Genómica/métodos , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: RNA plays essential roles in all known forms of life. Clustering RNA sequences with common sequence and structure is an essential step towards studying RNA function. With the advent of high-throughput sequencing techniques, experimental and genomic data are expanding to complement the predictive methods. However, the existing methods do not effectively utilize and cope with the immense amount of data becoming available. RESULTS: Hundreds of thousands of non-coding RNAs have been detected; however, their annotation is lagging behind. Here we present GraphClust2, a comprehensive approach for scalable clustering of RNAs based on sequence and structural similarities. GraphClust2 bridges the gap between high-throughput sequencing and structural RNA analysis and provides an integrative solution by incorporating diverse experimental and genomic data in an accessible manner via the Galaxy framework. GraphClust2 can efficiently cluster and annotate large datasets of RNAs and supports structure-probing data. We demonstrate that the annotation performance of clustering functional RNAs can be considerably improved. Furthermore, an off-the-shelf procedure is introduced for identifying locally conserved structure candidates in long RNAs. We suggest the presence and the sparseness of phylogenetically conserved local structures for a collection of long non-coding RNAs. CONCLUSIONS: By clustering data from 2 cross-linking immunoprecipitation experiments, we demonstrate the benefits of GraphClust2 for motif discovery under the presence of biological and methodological biases. Finally, we uncover prominent targets of double-stranded RNA binding protein Roquin-1, such as BCOR's 3' untranslated region that contains multiple binding stem-loops that are evolutionary conserved.
Asunto(s)
ARN no Traducido/química , ARN no Traducido/genética , Análisis de Secuencia de ARN/métodos , Análisis por Conglomerados , Biología Computacional , Secuenciación de Nucleótidos de Alto Rendimiento , Anotación de Secuencia Molecular , Conformación de Ácido Nucleico , Programas InformáticosRESUMEN
The primary problem with the explosion of biomedical datasets is not the data, not computational resources, and not the required storage space, but the general lack of trained and skilled researchers to manipulate and analyze these data. Eliminating this problem requires development of comprehensive educational resources. Here we present a community-driven framework that enables modern, interactive teaching of data analytics in life sciences and facilitates the development of training materials. The key feature of our system is that it is not a static but a continuously improved collection of tutorials. By coupling tutorials with a web-based analysis framework, biomedical researchers can learn by performing computation themselves through a web browser without the need to install software or search for example datasets. Our ultimate goal is to expand the breadth of training materials to include fundamental statistical and data science topics and to precipitate a complete re-engineering of undergraduate and graduate curricula in life sciences. This project is accessible at https://training.galaxyproject.org.
Asunto(s)
Biología Computacional/educación , Biología Computacional/métodos , Investigadores/educación , Curriculum , Análisis de Datos , Educación a Distancia/métodos , Educación a Distancia/tendencias , Humanos , Programas InformáticosRESUMEN
In optogenetics, light-activated proteins are used to monitor and modulate cellular behaviour with light. Combining genetic targeting of distinct cellular populations with defined patterns of optical stimulation enables one to study specific cell classes in complex biological tissues. In the current study we attempted to investigate the functional relevance of heterocellular electrotonic coupling in cardiac tissue in situ. In order to do that, we used a Cre-Lox approach to express the light-gated cation channel Channelrhodopsin-2 (ChR2) specifically in either cardiac myocytes or non-myocytes. Despite high specificity when using the same Cre driver lines in a previous study in combination with a different optogenetic probe, we found patchy off-target ChR2 expression in cryo-sections and extended z-stack imaging through the ventricular wall of hearts cleared using CLARITY. Based on immunohistochemical analysis, single-cell electrophysiological recordings and whole-genome sequencing, we reason that non-specificity is caused on the Cre recombination level. Our study highlights the importance of careful design and validation of the Cre recombination targets for reliable cell class specific expression of optogenetic tools.
Asunto(s)
Miocitos Cardíacos/metabolismo , Optogenética/métodos , Animales , Channelrhodopsins/genética , Regulación de la Expresión Génica/genética , Regulación de la Expresión Génica/efectos de la radiación , Ratones , Ratones Endogámicos C57BL , Miocitos Cardíacos/citología , Miocitos Cardíacos/efectos de la radiaciónRESUMEN
Magnetochemistry recently emerged as a promising approach to control addressable spin arrays on surfaces. Here we report on the binding, spatial ordering, and magnetic properties of Fe on a highly regular Co-tetraphenylporphyrin (Co-TPP) template and highlight how the Fe controls the magnetism of the Co centers. As evidenced by scanning tunneling microscopy (STM) single Fe atoms attach to the saddle-shape conformers site-selectively in a unique coordination environment offered through a heptamer defined by the Co-N-C-C-C-N cyclic subunit. While the magnetic moment of Co is quenched for bare Co-TPP/Ag(111), the Fe presence revives it. Our X-ray magnetic circular dichroism (XMCD) experiments, complemented by density functional theory (DFT) calculations, evidence a ferromagnetic coupling between the Fe and the Co center concomitant with a complex charge redistribution involving the porphyrin ligand. Thus, we demonstrate an unusual metalloporphyrin coordination geometry that opens pathways to spatially order and engineer magnetic moments in surface-based nanostructures.
RESUMEN
Under natural conditions, plants are exposed to rapidly changing light intensities. To acclimate to such fluctuations, plants have evolved adaptive mechanisms that optimally exploit available light energy and simultaneously minimise damage of the photosynthetic apparatus through excess light. An important mechanism is the dissipation of excess excitation energy as heat which can be measured as nonphotochemical quenching of chlorophyll fluorescence (NPQ). In this paper, we present a highly simplified mathematical model that captures essential experimentally observed features of the short term adaptive quenching dynamics. We investigate the stationary and dynamic behaviour of the model and systematically analyse the dependence of characteristic system properties on key parameters such as rate constants and pool sizes. Comparing simulations with experimental data allows to derive conclusions about the validity of the simplifying assumptions and we further propose hypotheses regarding the role of the xanthophyll cycle in NPQ. We envisage that the presented theoretical description of the light reactions in conjunction with short term adaptive processes serves as a basis for the development of more detailed mechanistic models by which the molecular mechanisms of NPQ can be theoretically studied.