ABSTRACT
The Extracellular RNA Communication Consortium (ERCC) was launched to accelerate progress in the new field of extracellular RNA (exRNA) biology and to establish whether exRNAs and their carriers, including extracellular vesicles (EVs), can mediate intercellular communication and be utilized for clinical applications. Phase 1 of the ERCC focused on exRNA/EV biogenesis and function, discovery of exRNA biomarkers, development of exRNA/EV-based therapeutics, and construction of a robust set of reference exRNA profiles for a variety of biofluids. Here, we present progress by ERCC investigators in these areas, and we discuss collaborative projects directed at development of robust methods for EV/exRNA isolation and analysis and tools for sharing and computational analysis of exRNA profiling data.
Subjects
Cell-Free Nucleic Acids/genetics , Cell-Free Nucleic Acids/metabolism , Extracellular Vesicles/genetics , Biomarkers , Humans , Knowledge Bases , MicroRNAs/genetics , RNA/genetics
ABSTRACT
Assessing the real-world impact of biomedical research is notoriously difficult. Here, we present the framework for building a prospective science-centered information system from scratch that has been afforded by the Sidra Medical and Research Center in Qatar. This experiment is part of the global conversation on maximizing returns on research investment.
Subjects
Biomedical Research/economics , Biomedical Research/organization & administration , Human Genome Project , Humans , Information Systems , Knowledge Bases , Qatar , Research Design
ABSTRACT
Recent years have seen revived interest in computer-assisted organic synthesis [1,2]. The use of reaction- and neural-network algorithms that can plan multistep synthetic pathways has revolutionized this field [1,3-7], including examples leading to advanced natural products [6,7]. Such methods typically operate on full, literature-derived 'substrate(s)-to-product' reaction rules and cannot be easily extended to the analysis of reaction mechanisms. Here we show that computers equipped with a comprehensive knowledge-base of mechanistic steps augmented by physical-organic chemistry rules, as well as quantum mechanical and kinetic calculations, can use a reaction-network approach to analyse the mechanisms of some of the most complex organic transformations: namely, cationic rearrangements. Such rearrangements are a cornerstone of organic chemistry textbooks and entail notable changes in the molecule's carbon skeleton [8-12]. The algorithm we describe and deploy at https://HopCat.allchemy.net/ generates, within minutes, networks of possible mechanistic steps, traces plausible step sequences and calculates expected product distributions. We validate this algorithm by three sets of experiments whose analysis would probably prove challenging even to highly trained chemists: (1) predicting the outcomes of tail-to-head terpene (THT) cyclizations in which substantially different outcomes are encoded in modular precursors differing in minute structural details; (2) comparing the outcome of THT cyclizations in solution or in a supramolecular capsule; and (3) analysing complex reaction mixtures. Our results support a vision in which computers no longer just manipulate known reaction types [1-7] but will help rationalize and discover new, mechanistically complex transformations.
Subjects
Algorithms , Chemistry Techniques, Synthetic , Cyclization , Neural Networks, Computer , Terpenes , Cations/chemistry , Knowledge Bases , Terpenes/chemistry , Chemistry Techniques, Synthetic/methods , Biological Products/chemical synthesis , Biological Products/chemistry , Reproducibility of Results , Solutions
ABSTRACT
Cyclic peptides offer a range of notable advantages, including potent antibacterial properties, high binding affinity and specificity to target molecules, and minimal toxicity, making them highly promising candidates for drug development. However, a comprehensive database that consolidates both synthetically derived and naturally occurring cyclic peptides is conspicuously absent. To address this void, we introduce CyclicPepedia (https://www.biosino.org/iMAC/cyclicpepedia/), a pioneering database that encompasses 8744 known cyclic peptides. This repository, structured as a composite knowledge network, offers a wealth of information encompassing various aspects of cyclic peptides, such as cyclic peptides' sources, categorizations, structural characteristics, pharmacokinetic profiles, physicochemical properties, patented drug applications, and a collection of crucial publications. Supported by a user-friendly knowledge retrieval system and calculation tools specifically designed for cyclic peptides, CyclicPepedia will be able to facilitate advancements in cyclic peptide drug development.
Subjects
Knowledge Bases , Peptides, Cyclic , Peptides, Cyclic/chemistry , Databases, Protein
ABSTRACT
Training algorithms to computationally plan multistep organic syntheses has been a challenge for more than 50 years [1-7]. However, the field has progressed greatly since the development of early programs such as LHASA [1,7], for which reaction choices at each step were made by human operators. Multiple software platforms [6,8-14] are now capable of completely autonomous planning. But these programs 'think' only one step at a time and have so far been limited to relatively simple targets, the syntheses of which could arguably be designed by human chemists within minutes, without the help of a computer. Furthermore, no algorithm has yet been able to design plausible routes to complex natural products, for which much more far-sighted, multistep planning is necessary [15,16] and closely related literature precedents cannot be relied on. Here we demonstrate that such computational synthesis planning is possible, provided that the program's knowledge of organic chemistry and data-based artificial intelligence routines are augmented with causal relationships [17,18], allowing it to 'strategize' over multiple synthetic steps. Using a Turing-like test administered to synthesis experts, we show that the routes designed by such a program are largely indistinguishable from those designed by humans. We also successfully validated three computer-designed syntheses of natural products in the laboratory. Taken together, these results indicate that expert-level automated synthetic planning is feasible, pending continued improvements to the reaction knowledge base and further code optimization.
Subjects
Artificial Intelligence , Biological Products/chemical synthesis , Chemistry Techniques, Synthetic/methods , Chemistry, Organic/methods , Software , Artificial Intelligence/standards , Automation/methods , Automation/standards , Benzylisoquinolines/chemical synthesis , Benzylisoquinolines/chemistry , Chemistry Techniques, Synthetic/standards , Chemistry, Organic/standards , Indans/chemical synthesis , Indans/chemistry , Indole Alkaloids/chemical synthesis , Indole Alkaloids/chemistry , Knowledge Bases , Lactones/chemical synthesis , Lactones/chemistry , Macrolides/chemical synthesis , Macrolides/chemistry , Reproducibility of Results , Sesquiterpenes/chemical synthesis , Sesquiterpenes/chemistry , Software/standards , Tetrahydroisoquinolines/chemical synthesis , Tetrahydroisoquinolines/chemistry
ABSTRACT
Emerging CRISPR-Cas9 technology permits synthetic lethality (SL) screening of large numbers of gene pairs from gene combination double knockout (CDKO) experiments. However, the poor integration and annotation of CDKO SL data in current SL databases limit their utility, and diverse methods of calculating SL scores prohibit their comparison. To overcome these shortcomings, we have developed the SL knowledge base (SLKB), which incorporates data from 11 CDKO experiments in 22 cell lines, 16,059 SL gene pairs and 264,424 non-SL gene pairs. Additionally, within SLKB, we have implemented five SL calculation methods: median score with and without background control normalization (Median-B/NB), sgRNA-derived score (sgRNA-B/NB), Horlbeck score, GEMINI score and MAGeCK score. The five scores have demonstrated a mere 1.21% overlap among their top 10% SL gene pairs, reflecting high diversity. Users can browse SL networks and assess the impact of scoring methods using Venn diagrams. The SL network generated from all data in SLKB shows a greater likelihood of SL gene pair connectivity with other SL gene pairs than with non-SL pairs. Comparison of SL networks between two cell lines demonstrated a greater likelihood of sharing SL hub genes than SL gene pairs. The SLKB website and pipeline can be freely accessed at https://slkb.osubmi.org and https://slkb.docs.osubmi.org/, respectively.
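The comparison of top-scoring gene pairs across scoring methods can be illustrated with a minimal sketch. It is not the SLKB pipeline: the function names, the toy scores, the assumption that a higher score means stronger SL, and the choice to measure overlap as shared pairs divided by the union of all top sets are all illustrative assumptions, since the abstract does not state how the 1.21% figure was defined.

```python
from typing import Dict, FrozenSet, Set

Pair = FrozenSet[str]  # an unordered gene pair, e.g. frozenset({"A", "B"})


def top_fraction(scores: Dict[Pair, float], fraction: float = 0.10) -> Set[Pair]:
    """Return the highest-scoring pairs (assumes higher score = stronger SL)."""
    k = max(1, int(len(scores) * fraction))  # keep at least one pair
    ranked = sorted(scores, key=scores.get, reverse=True)
    return set(ranked[:k])


def overlap_fraction(methods: Dict[str, Dict[Pair, float]],
                     fraction: float = 0.10) -> float:
    """Pairs in every method's top set, divided by pairs in any top set."""
    tops = [top_fraction(s, fraction) for s in methods.values()]
    shared = set.intersection(*tops)
    union = set.union(*tops)
    return len(shared) / len(union)


# Toy data for two hypothetical scoring methods over four gene pairs.
ab, ac = frozenset({"A", "B"}), frozenset({"A", "C"})
bc, bd = frozenset({"B", "C"}), frozenset({"B", "D"})
toy = {
    "method_1": {ab: 0.9, ac: 0.8, bc: 0.1, bd: 0.2},
    "method_2": {ab: 0.7, ac: 0.1, bc: 0.9, bd: 0.2},
}
print(f"{overlap_fraction(toy, fraction=0.5):.2%}")
```

With real data, each inner dictionary would hold one method's scores for the same set of gene pairs; methods whose scores run in the opposite direction would need their sign flipped first.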
Subjects
Knowledge Bases , Synthetic Lethal Mutations , Humans , RNA, Guide, CRISPR-Cas Systems , Internet Use
ABSTRACT
Metabolite-associated cell communications (MACCs) play critical roles in maintaining normal biological function in humans by coordinating cells, organs and physiological systems. Although substantial information on MACCs has been continuously reported, no relevant database has been available so far. To address this gap, we developed the first knowledgebase (MACC) to comprehensively describe human metabolite-associated cell communications through curation of the experimental literature. MACC currently contains: (a) 4206 carefully curated metabolite-associated cell communication pairs involving 244 human endogenous metabolites and reported biological effects in vivo and in vitro; (b) 226 comprehensive cell subtypes and 296 disease states, such as cancers, autoimmune diseases, and pathogenic infections; (c) 4508 metabolite-related enzymes and transporters, involving 542 pathways; (d) an interactive tool with a user-friendly interface to visualize networks of multiple metabolite-cell interactions; and (e) an overall expression landscape of metabolite-associated gene sets derived from over 1500 single-cell expression profiles to infer metabolite variations across different cells in the sample. MACC also provides cross-links to well-known databases such as HMDB, DrugBank, TTD and PubMed. As a complement to ligand-receptor databases, MACC may give new perspectives on alternative communication between cells via metabolite secretion and adsorption, together with the resulting biological functions. MACC is publicly accessible at: http://macc.badd-cao.net/.
Subjects
Cell Communication , Disease , Knowledge Bases , Metabolome , Humans
ABSTRACT
First released in 2006, DrugBank (https://go.drugbank.com) has grown to become the 'gold standard' knowledge resource for drug, drug-target and related pharmaceutical information. DrugBank is widely used across many diverse biomedical research and clinical applications, and averages more than 30 million views/year. Since its last update in 2018, we have been actively enhancing the quantity and quality of the drug data in this knowledgebase. In this latest release (DrugBank 6.0), the number of FDA approved drugs has grown from 2646 to 4563 (a 72% increase), the number of investigational drugs has grown from 3394 to 6231 (a 38% increase), the number of drug-drug interactions increased from 365 984 to 1 413 413 (a 300% increase), and the number of drug-food interactions expanded from 1195 to 2475 (a 200% increase). In addition to this notable expansion in database size, we have added thousands of new, colorful, richly annotated pathways depicting drug mechanisms and drug metabolism. Likewise, existing datasets have been significantly improved and expanded, by adding more information on drug indications, drug-drug interactions, drug-food interactions and many other relevant data types for 11 891 drugs. We have also added experimental and predicted MS/MS spectra, 1D/2D-NMR spectra, CCS (collision cross section), RT (retention time) and RI (retention index) data for 9464 of DrugBank's 11 710 small molecule drugs. These and other improvements should make DrugBank 6.0 even more useful to a much wider research audience ranging from medicinal chemists to metabolomics specialists to pharmacologists.
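The growth figures above can be recomputed from the before and after counts. The following is a minimal sketch of that arithmetic; the function name is illustrative and nothing here comes from DrugBank itself. The recomputed values need not agree with the rounded percentages quoted in the abstract.

```python
def percent_increase(before: int, after: int) -> float:
    """Relative growth from `before` to `after`, expressed in percent."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (after - before) / before * 100.0


# Before/after counts as quoted in the abstract (2018 release vs. DrugBank 6.0).
counts = {
    "FDA-approved drugs": (2646, 4563),
    "investigational drugs": (3394, 6231),
    "drug-drug interactions": (365_984, 1_413_413),
    "drug-food interactions": (1195, 2475),
}

for label, (before, after) in counts.items():
    print(f"{label}: {percent_increase(before, after):.0f}% increase")
```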
Subjects
Knowledge Bases , Metabolomics , Tandem Mass Spectrometry , Databases, Factual , Food-Drug Interactions
ABSTRACT
In 2003, the Human Disease Ontology (DO, https://disease-ontology.org/) was established at Northwestern University. In the intervening 20 years, the DO has expanded to become a highly utilized disease knowledge resource. Serving as the nomenclature and classification standard for human diseases, the DO provides a stable, etiology-based structure integrating mechanistic drivers of human disease. Over the past two decades the DO has grown from a collection of clinical vocabularies into an expertly curated semantic resource of over 11300 common and rare diseases, linking disease concepts through more than 37000 vocabulary cross mappings (v2023-08-08). Here, we introduce the recently launched DO Knowledgebase (DO-KB), which expands the DO's representation of the diseaseome and enhances the findability, accessibility, interoperability and reusability (FAIR) of disease data through a new SPARQL service and a new Faceted Search Interface. The DO-KB is an integrated data system, built upon the DO's semantic disease knowledge backbone, with resources that expose and connect the DO's semantic knowledge with disease-related data across Open Linked Data resources. This update includes descriptions of efforts to assess the DO's global impact and improvements to data quality and content, with emphasis on changes in the last two years.
Subjects
Ecosystem , Knowledge Bases , Humans , Rare Diseases , Semantics , Time Factors
ABSTRACT
Single-cell proteomics (SCP) has emerged as a powerful tool for detecting cellular heterogeneity, offering unprecedented insights into biological mechanisms that are masked in bulk cell populations. With the rapid advancements in AI-based time trajectory analysis and cell subpopulation identification, there is a pressing need for a database that not only provides SCP raw data but also explicitly describes experimental details and protein expression profiles. However, no such database has been available so far. In this study, we therefore developed 'SingPro', a database specializing in single-cell proteomics. It is unique in (a) systematically providing SCP raw data for both mass spectrometry-based and flow cytometry-based studies and (b) explicitly describing the experimental details of each SCP study and the expression profile of any studied protein. Anticipating robust interest from the research community, this database is poised to become an invaluable repository for OMICs-based biomedical studies. Access to SingPro is unrestricted and does not require a login: http://idrblab.org/singpro/.
Subjects
Databases, Protein , Protein Processing, Post-Translational , Proteomics , Knowledge Bases , Mass Spectrometry , Single-Cell Analysis
ABSTRACT
Allosteric regulation, induced by perturbations at an allosteric site topographically distinct from the orthosteric site, is one of the most direct and efficient ways to fine-tune macromolecular function. The Allosteric Database (ASD; accessible online at http://mdl.shsmu.edu.cn/ASD) has been systematically developed since 2009 to provide comprehensive information on allosteric regulation. In recent years, allostery has seen sustained growth and wide-ranging applications in life sciences, from basic research to new therapeutics development, while also elucidating emerging obstacles across allosteric research stages. To overcome these challenges and maintain high-quality data center services, novel features were curated in the ASD2023 update: (i) 66 589 potential allosteric sites, covering > 80% of the human proteome and constituting the human allosteric pocketome; (ii) 748 allosteric protein-protein interaction (PPI) modulators with clear mechanisms, aiding protein machine studies and PPI-targeted drug discovery; (iii) 'Allosteric Hit-to-Lead,' a pioneering dataset providing panoramic views from 87 well-defined allosteric hits to 6565 leads and (iv) 456 dualsteric modulators for exploring the simultaneous regulation of allosteric and orthosteric sites. Meanwhile, ASD2023 maintains a significant growth of foundational allosteric data. Based on these efforts, the allosteric knowledgebase is progressively evolving towards an integrated landscape, facilitating advancements in allosteric target identification, mechanistic exploration and drug discovery.
Subjects
Allosteric Site , Knowledge Bases , Humans , Allosteric Regulation , Drug Discovery , Ligands , Proteome , Protein Interaction Maps
ABSTRACT
Tumorigenic functions due to the formation of fusion genes have been targeted for cancer therapeutics (e.g. kinase inhibitors). However, many fusion proteins involved in various cellular processes have not been studied for targeted therapeutics, because the lack of complete fusion protein sequences and their whole 3D structures has made it challenging to develop new therapeutic strategies. To fill these critical gaps, we developed a computational pipeline and a resource of human fusion proteins named FusionPDB, available at https://compbio.uth.edu/FusionPDB. FusionPDB is organized into four levels: 43K fusion protein sequences (14.7K in-frame fusion genes, Level 1), over 2300 + 1267 fusion protein 3D structures (from 2300 recurrent and 266 manually curated in-frame fusion genes, Level 2), pLDDT score analysis for the 1267 fusion proteins from 266 manually curated fusion genes (Level 3), and virtual screening outcomes for 68 selected fusion proteins from 266 manually curated fusion genes (Level 4). FusionPDB is the only resource providing whole 3D structures of fusion proteins and comprehensive knowledge of human fusion proteins. It will be regularly updated until it covers all human fusion proteins.
Subjects
Databases, Protein , Humans , Amino Acid Sequence , Knowledge Bases , Neoplasms/genetics , Protein Conformation
ABSTRACT
The Reactome Knowledgebase (https://reactome.org), an Elixir and GCBR core biological data resource, provides manually curated molecular details of a broad range of normal and disease-related biological processes. Processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Here we review progress towards annotation of the entire human proteome, targeted annotation of disease-causing genetic variants of proteins and of small-molecule drugs in a pathway context, and towards supporting explicit annotation of cell- and tissue-specific pathways. Finally, we briefly discuss issues involved in making Reactome more fully interoperable with other related resources such as the Gene Ontology and maintaining the resulting community resource network.
Subjects
Knowledge Bases , Metabolic Networks and Pathways , Signal Transduction , Humans , Metabolic Networks and Pathways/genetics , Proteome/genetics
ABSTRACT
Single-cell proteomics enables the direct quantification of protein abundance at single-cell resolution, providing valuable insights into cellular phenotypes beyond what can be inferred from transcriptome analysis alone. However, the lack of large-scale integrated databases hinders researchers from accessing and exploring single-cell proteomics data, impeding the advancement of this field. To fill this gap, we present a comprehensive database, the Single-cell Proteomic DataBase (SPDB, https://scproteomicsdb.com/), for general single-cell proteomic data, including antibody-based and mass spectrometry-based single-cell proteomics. Equipped with standardized data processing and a user-friendly web interface, SPDB provides unified data formats for convenient interaction with downstream analysis, and offers not only dataset-level but also protein-level data search and exploration capabilities. To enable detailed presentation of single-cell proteomic data, SPDB also provides a module for visualizing data from the perspectives of cell metadata or protein features. The current version of SPDB encompasses 133 antibody-based single-cell proteomic datasets involving more than 300 million cells and over 800 marker/surface proteins, and 10 mass spectrometry-based single-cell proteomic datasets involving more than 4000 cells and over 7000 proteins. Overall, SPDB is envisioned as a useful resource that will serve the wider research community by providing detailed insights into proteomics from the single-cell perspective.
Subjects
Proteins , Proteomics , Antibodies , Knowledge Bases , Mass Spectrometry , Humans , Animals , Single-Cell Analysis
ABSTRACT
Human endogenous retroviruses (HERVs), remnants of ancient exogenous retroviruses that infected and integrated into germ cells, comprise ~8% of the human genome. These HERVs have been implicated in numerous diseases, and extensive research has been conducted to uncover their specific roles. Despite these efforts, a comprehensive source of HERV-disease associations is still lacking. To address this gap, we introduce the HervD Atlas (https://ngdc.cncb.ac.cn/hervd/), an integrated knowledgebase of HERV-disease associations manually curated from all related published literature. In the current version, HervD Atlas collects 60 726 HERV-disease associations from 254 publications (out of 4692 screened), covering 21 790 HERVs (21 049 HERV-Terms and 741 HERV-Elements) belonging to six types, 149 diseases and 610 related/affected genes. Notably, an interactive knowledge graph that systematically integrates all the HERV-disease associations and corresponding affected genes into a comprehensive network provides a powerful tool to uncover and deduce the complex interplay between HERVs and diseases. The HervD Atlas also features a user-friendly web interface that allows efficient browsing, searching, and downloading of all association information, research metadata, and annotation information. Overall, the HervD Atlas is an essential resource for comprehensive, up-to-date knowledge on HERV-disease research, potentially facilitating the development of novel HERV-associated diagnostic and therapeutic strategies.
Subjects
Endogenous Retroviruses , Knowledge Bases , Virus Diseases , Humans , Virus Diseases/genetics , Virus Diseases/virology , Atlases as Topic , Internet Use
ABSTRACT
The Nucleic Acid Knowledgebase (nakb.org) is a new data resource, updated weekly, for experimentally determined 3D structures containing DNA and/or RNA nucleic acid polymers and their biological assemblies. NAKB indexes nucleic acid-containing structures derived from all major structure determination methods (X-ray, NMR and EM), including all held by the Protein Data Bank (PDB). As the planned successor to the Nucleic Acid Database (NDB), NAKB's design preserves all functionality of the NDB and provides novel nucleic acid-centric content, including structural and functional annotations, as well as annotations from and links to external resources. A variety of custom interactive tools have been developed to enable rapid exploration and drill-down of NAKB's content.
Subjects
Nucleic Acid Conformation , Nucleic Acids , DNA/chemistry , Knowledge Bases , Nucleic Acids/genetics , RNA/chemistry
ABSTRACT
Aging entails gradual functional decline influenced by interconnected factors. Multiple hallmarks have been proposed as common and conserved underlying denominators of aging at the molecular, cellular and systemic levels across multiple species. Thus, understanding the function of aging hallmarks and their relationships across species can facilitate the translation of anti-aging drug development from model organisms to humans. Here, we built AgeAnnoMO (https://relab.xidian.edu.cn/AgeAnnoMO/#/), a knowledgebase of multi-omics annotation for animal aging. AgeAnnoMO comprises an extensive collection of 136 datasets from eight modalities, encompassing 8596 samples from 50 representative species, making it a comprehensive resource for aging and longevity research. AgeAnnoMO characterizes multiple aging regulators across species via multi-omics data, comprehensively annotating aging-related genes, proteins, metabolites, mitochondrial genes, microbiotas and age-specific TCR and BCR sequences tied to aging hallmarks for these species and tissues. AgeAnnoMO not only facilitates a deeper and more generalizable understanding of aging mechanisms, but also provides potential insights into specificity across tissues and species in the aging process, which is important for developing effective anti-aging interventions for diverse populations. We anticipate that AgeAnnoMO will provide a valuable resource for comprehending and integrating the conserved driving hallmarks in aging biology and identifying targetable biomarkers for aging research.
Subjects
Aging , Knowledge Bases , Multiomics , Animals , Humans , Aging/genetics , Biomarkers , Longevity/genetics
ABSTRACT
Biomarkers play an important role in various areas such as personalized medicine, drug development, clinical care, and molecular breeding. However, existing biomarker resources predominantly focus on human diseases, leaving a significant gap in non-human animal disease understanding and breeding research. To address this limitation, we present BioKA (Biomarker Knowledgebase for Animals, https://ngdc.cncb.ac.cn/bioka), a curated and integrated knowledgebase encompassing multiple animal species, diseases/traits, and annotated resources. Currently, BioKA houses 16 296 biomarkers associated with 951 mapped diseases/traits across 31 species from 4747 references, including 11 925 gene/protein biomarkers, 1784 miRNA biomarkers, 1043 mutation biomarkers, 773 metabolic biomarkers, 357 circRNA biomarkers and 127 lncRNA biomarkers. Furthermore, BioKA integrates various annotations, such as GO terms, protein structures, protein-protein interaction networks and miRNA targets, and constructs an interactive knowledge network of biomarkers including circRNA-miRNA-mRNA associations, lncRNA-miRNA associations and protein-protein associations, which is convenient for efficient data exploration. Moreover, BioKA provides detailed information on 308 breeds/strains of 13 species and homologous annotations for 8784 biomarkers across 16 species, and offers three online application tools. The comprehensive knowledge provided by BioKA not only advances human disease research but also contributes to a deeper understanding of animal diseases and supports livestock breeding.
Subjects
Biomarkers , Knowledge Bases , Animals , MicroRNAs/genetics , Proteins , RNA, Circular , RNA, Long Noncoding
ABSTRACT
Plant Reactome (https://plantreactome.gramene.org) is a freely accessible, comprehensive plant pathway knowledgebase. It provides curated reference pathways from rice (Oryza sativa) and gene-orthology-based pathway projections to 129 additional species, spanning single-cell photoautotrophs, non-vascular plants, and higher plants, thus encompassing a wide-ranging taxonomic diversity. Currently, Plant Reactome houses a collection of 339 reference pathways, covering metabolic and transport pathways, hormone signaling, genetic regulations of developmental processes, and intricate transcriptional networks that orchestrate a plant's response to abiotic and biotic stimuli. Beyond being a mere repository, Plant Reactome serves as a dynamic data discovery platform. Users can analyze and visualize omics data, such as gene expression, gene-gene interaction, proteome, and metabolome data, all within the rich context of plant pathways. Plant Reactome is dedicated to fostering data interoperability, upholding global data standards, and embracing the tenets of the Findable, Accessible, Interoperable and Re-usable (FAIR) data policy.
Subjects
Knowledge Bases , Metabolic Networks and Pathways , Multiomics , Plants , Metabolic Networks and Pathways/genetics , Plants/genetics , Plants/metabolism , Signal Transduction/genetics , Internet , Databases, Protein
ABSTRACT
The Catalogue Of Somatic Mutations In Cancer (COSMIC), https://cancer.sanger.ac.uk/cosmic, is an expert-curated knowledgebase providing data on somatic variants in cancer, supported by a comprehensive suite of tools for interpreting genomic data, discerning the impact of somatic alterations on disease, and facilitating translational research. The catalogue is accessed and used by thousands of cancer researchers and clinicians daily, allowing them to quickly access information from an immense pool of data curated from over 29 thousand scientific publications and large studies. Within the last 4 years, COSMIC has substantially expanded its utility by adding new resources: the Mutational Signatures catalogue, the Cancer Mutation Census, and Actionability. To improve data accessibility and interoperability, somatic variants have received stable genomic identifiers that are associated with their genomic coordinates in GRCh37 and GRCh38, and new export files with reduced data redundancy have been made available for download.