RESUMO
The European Nucleotide Archive (ENA; https://www.ebi.ac.uk/ena) is maintained by the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI). The ENA is one of the three members of the International Nucleotide Sequence Database Collaboration (INSDC). It serves the bioinformatics community worldwide via the submission, processing, archiving and dissemination of sequence data. The ENA supports data types ranging from raw reads, through alignments and assemblies to functional annotation. The data is enriched with contextual information relating to samples and experimental configurations. In this article, we describe recent progress and improvements to ENA services. In particular, we focus upon three areas of work in 2023: FAIRness of ENA data, pandemic preparedness and foundational technology. For FAIRness, we have introduced minimal requirements for spatiotemporal annotation, created a metadata-based classification system, incorporated third party metadata curations with archived records, and developed a new rapid visualisation platform, the ENA Notebooks. For foundational enhancements, we have improved the INSDC data exchange and synchronisation pipelines, and invested in site reliability engineering for ENA infrastructure. In order to support genomic surveillance efforts, we have continued to provide ENA services in support of SARS-CoV-2 data mobilisation and have adapted these for broader pathogen surveillance efforts.
Assuntos
Genômica , Nucleotídeos , Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Internet , Reprodutibilidade dos Testes , Europa (Continente)RESUMO
The European Nucleotide Archive (ENA; https://www.ebi.ac.uk/ena), maintained by the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI), offers those producing data an open and supported platform for the management, archiving, publication, and dissemination of data; and to the scientific community as a whole, it offers a globally comprehensive data set through a host of data discovery and retrieval tools. Here, we describe recent updates to the ENA's submission and retrieval services as well as focused efforts to improve connectivity, reusability, and interoperability of ENA data and metadata.
Assuntos
Bases de Dados de Ácidos Nucleicos , Academias e Institutos , Biologia Computacional , Internet , Software , Conjuntos de Dados como AssuntoRESUMO
The European Nucleotide Archive (ENA, https://www.ebi.ac.uk/ena), maintained at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) provides freely accessible services, both for deposition of, and access to, open nucleotide sequencing data. Open scientific data are of paramount importance to the scientific community and contribute daily to the acceleration of scientific advance. Here, we outline the major updates to ENA's services and infrastructure that have been delivered over the past year.
Assuntos
Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Nucleotídeos/genética , Software , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Internet , Anotação de Sequência Molecular , Nucleotídeos/classificaçãoRESUMO
The European Nucleotide Archive (ENA; https://www.ebi.ac.uk/ena), provided by the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI), has for almost forty years continued in its mission to freely archive and present the world's public sequencing data for the benefit of the entire scientific community and for the acceleration of the global research effort. Here we highlight the major developments to ENA services and content in 2020, focussing in particular on the recently released updated ENA browser, modernisation of our release process and our data coordination collaborations with specific research communities.
Assuntos
Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos/tendências , Ácidos Nucleicos/genética , Nucleotídeos/genética , Bases de Dados de Ácidos Nucleicos/estatística & dados numéricos , Europa (Continente) , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Internet , Anotação de Sequência Molecular , Ácidos Nucleicos/química , Nucleotídeos/química , Análise de Sequência de DNA , Análise de Sequência de RNARESUMO
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic will be remembered as one of the defining events of the 21st century. The rapid global outbreak has had significant impacts on human society and is already responsible for millions of deaths. Understanding and tackling the impact of the virus has required a worldwide mobilisation and coordination of scientific research. The COVID-19 Data Portal (https://www.covid19dataportal.org/) was first released as part of the European COVID-19 Data Platform, on April 20th 2020 to facilitate rapid and open data sharing and analysis, to accelerate global SARS-CoV-2 and COVID-19 research. The COVID-19 Data Portal has fortnightly feature releases to continue to add new data types, search options, visualisations and improvements based on user feedback and research. The open datasets and intuitive suite of search, identification and download services, represent a truly FAIR (Findable, Accessible, Interoperable and Reusable) resource that enables researchers to easily identify and quickly obtain the key datasets needed for their COVID-19 research.
Assuntos
Pesquisa Biomédica , COVID-19 , Bases de Dados Factuais , Conjuntos de Dados como Assunto , Disseminação de Informação , Publicação de Acesso Aberto , SARS-CoV-2 , COVID-19/epidemiologia , COVID-19/genética , COVID-19/virologia , Bases de Dados Bibliográficas , Surtos de Doenças , Humanos , Pandemias , SARS-CoV-2/química , SARS-CoV-2/genética , SARS-CoV-2/metabolismo , SARS-CoV-2/ultraestrutura , Fatores de Tempo , Proteínas Virais/química , Proteínas Virais/genéticaRESUMO
The European Nucleotide Archive (ENA, https://www.ebi.ac.uk/ena) at the European Molecular Biology Laboratory's European Bioinformatics Institute provides open and freely available data deposition and access services across the spectrum of nucleotide sequence data types. Making the world's public sequencing datasets available to the scientific community, the ENA represents a globally comprehensive nucleotide sequence resource. Here, we outline ENA services and content in 2019 and provide an insight into selected key areas of development in this period.
Assuntos
Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Genômica , Biologia Computacional/métodos , Europa (Continente) , Genômica/métodos , Anotação de Sequência Molecular , Software , Interface Usuário-Computador , NavegadorRESUMO
The COVID-19 pandemic has seen large-scale pathogen genomic sequencing efforts, becoming part of the toolbox for surveillance and epidemic research. This resulted in an unprecedented level of data sharing to open repositories, which has actively supported the identification of SARS-CoV-2 structure, molecular interactions, mutations and variants, and facilitated vaccine development and drug reuse studies and design. The European COVID-19 Data Platform was launched to support this data sharing, and has resulted in the deposition of several million SARS-CoV-2 raw reads. In this paper we describe (1) open data sharing, (2) tools for submission, analysis, visualisation and data claiming (e.g. ORCiD), (3) the systematic analysis of these datasets, at scale via the SARS-CoV-2 Data Hubs as well as (4) lessons learnt. This paper describes a component of the Platform, the SARS-CoV-2 Data Hubs, which enable the extension and set up of infrastructure that we intend to use more widely in the future for pathogen surveillance and pandemic preparedness.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , Pandemias , COVID-19/epidemiologia , Genômica , Disseminação de InformaçãoRESUMO
Background: A consensus on the diagnostic utility of cerebrospinal fluid adenosine deaminase (ADA) for tuberculous meningitis (TBM) is lacking. Methods: Patients aged ≥12 years admitted with CNS infections were enrolled prospectively. ADA was measured with spectrophotometry. Results: We enrolled 251 TBM and 131 other CNS infections. The optimal cutoff of ADA was calculated at 5.5 U/l against microbiological reference standard with area under curve 0.743, sensitivity 80.7%, specificity 60.3%, positive likelihood ratio 2.03 and negative likelihood ratio 3.12. The widely used cutoff value 10 U/l had specificity 82% and sensitivity 50%. The discriminating power was higher for TBM versus viral meningoencephalitis than bacterial or cryptococcal meningitis. Conclusion: Cerebrospinal fluid ADA has a low-to-modest diagnostic utility.
The diagnosis of tuberculosis (TB) of the brain is mainly made by testing cerebrospinal fluid, a clear liquid that flows in and around the brain and spinal cord. Adenosine deaminase (ADA) is a protein whose production and activity are increased in many diseases, such as TB. ADA testing in cerebrospinal fluid is widely used for the diagnosis of brain TB. However, the experts have split opinions regarding its confirmatory role. This study explores ADA measurement in cerebrospinal fluid for differentiating TB from other brain infections. The report says that this simple and inexpensive test can be helpful, but it cannot make or refute the diagnosis of brain TB and should only be considered along with other tests.
Assuntos
Tuberculose Meníngea , Humanos , Adenosina Desaminase/líquido cefalorraquidiano , Hospitalização , Padrões de Referência , Estudos Retrospectivos , Sensibilidade e Especificidade , Tuberculose Meníngea/diagnóstico , Tuberculose Meníngea/líquido cefalorraquidianoRESUMO
The COVID-19 pandemic has exemplified the importance of interoperable and equitable data sharing for global surveillance and to support research. While many challenges could be overcome, at least in some countries, many hurdles within the organizational, scientific, technical and cultural realms still remain to be tackled to be prepared for future threats. We propose to (i) continue supporting global efforts that have proven to be efficient and trustworthy toward addressing challenges in pathogen molecular data sharing; (ii) establish a distributed network of Pathogen Data Platforms to (a) ensure high quality data, metadata standardization and data analysis, (b) perform data brokering on behalf of data providers both for research and surveillance, (c) foster capacity building and continuous improvements, also for pandemic preparedness; (iii) establish an International One Health Pathogens Portal, connecting pathogen data isolated from various sources (human, animal, food, environment), in a truly One Health approach and following FAIR principles. To address these challenging endeavors, we have started an ELIXIR Focus Group where we invite all interested experts to join in a concerted, expert-driven effort toward sustaining and ensuring high-quality data for global surveillance and research.
Assuntos
COVID-19 , Animais , Humanos , COVID-19/epidemiologia , Pandemias , Fortalecimento Institucional , Disseminação de InformaçãoRESUMO
OBJECTIVES: Despite the acute and life-threatening repercussions that tuberculosis (TB) may have on the burgeoning older population in endemic countries like India, the spectrum of geriatric TB emergencies is not adequately understood. METHODS: We performed a prospective observational study at the emergency department of an academic hospital in north India between January 2019 and June 2020, investigating the clinical and laboratory features and outcomes of active TB in older patients aged 60 years and above. RESULTS: Out of 71 geriatric TB emergencies, central nervous system disease predominated (n = 41, 57.7%), followed by pulmonary (n = 16, 22.5%), pleural TB (n = 8, 11.3%), and multisite involvement (n = 6, 8.4%). Nearly 71.8% were male, and 77.4% belonged to low socioeconomic status (lower-middle or lower class). Usual predisposing factors were tobacco smoking (38.0%), chronic alcohol use (27.0%), and diabetes mellitus (23.9%). Atypical features were more frequent with extrapulmonary TB. Only 28.2% were microbiologically confirmed cases, and rifampicin resistance was seen in only one case. The mortality rate was considerably high (24.0%), highest with pulmonary TB (37.0%). CONCLUSION: Older patients with TB emergencies have atypical presentations, diagnostic difficulties, and high mortality.
RESUMO
Data sharing enables research communities to exchange findings and build upon the knowledge that arises from their discoveries. Areas of public and animal health as well as food safety would benefit from rapid data sharing when it comes to emergencies. However, ethical, regulatory and institutional challenges, as well as lack of suitable platforms which provide an infrastructure for data sharing in structured formats, often lead to data not being shared or at most shared in form of supplementary materials in journal publications. Here, we describe an informatics platform that includes workflows for structured data storage, managing and pre-publication sharing of pathogen sequencing data and its analysis interpretations with relevant stakeholders.
Assuntos
Bases de Dados Factuais , Disseminação de Informação , Bactérias/classificação , Metagenômica , Filogenia , Interface Usuário-ComputadorRESUMO
BACKGROUND: Lipidapheresis techniques are increasingly used to treat drug-resistant hyperlipidemia but few efficacy studies under routine application are available. In this multicenter observational study we investigated direct adsorption of lipoproteins (DALI) and lipoprotein filtration (MONET) for the short and the long-term effects on lipid-lowering effects. METHODS: Data of 122 apheresis patients from 11 centers (DALI: n = 78, MONET: n = 44) were prospectively collected for a period of 2 years. Routine lipid measurements were evaluated (2154 DALI and 1297 MONET sessions). It was investigated whether the relative reduction of LDL-C during apheresis session achieves at least 60%. Also relative reduction of total cholesterol, HDL, triglyceride, and Lp(a) were analyzed. RESULTS: The relative reduction of LDL-C was at least 60%: DALI: 70.62%, 95% CI = [69.34; 71.90] and MONET: 64.12%, 95% CI = [60.79; 67.46]. Also triglycerides were reduced with both systems: DALI 38.63%, 95% CI = [33.95; 43.30] vs. MONET 57.68%, 95% CI = [51.91; 63.45]. Relative reductions of total cholesterol were in the range of 50% (DALI 95% CI = [46.49; 49.65] MONET 95% CI = [48.93; 55.26]) and of Lp(a) in the range of 65% (DALI 95% CI = [61.92; 65.83] MONET 95% CI = [63.71; 70.30]. HDL reduction was: DALI 15.01%, 95% CI = [13.22; 16.79] and MONET 22.59%, 95% CI = [19.33; 25.84]. For both devices treated patient plasma/blood volume and in case of DALI the use of the larger adsorber configurations (DALI 1000 and DALI 1250) were independent positive predictors of the relative reduction of LDL-C and of Lp(a). CONCLUSIONS: Both systems effectively improved lipid profile and reduced atherogenic lipids. The results point to the importance of the individualized application of these valuable therapies to achieve clinical targets.
Assuntos
Remoção de Componentes Sanguíneos/métodos , Hiperlipidemias/terapia , Lipídeos/sangue , Adsorção , Idoso , Biomarcadores/sangue , Remoção de Componentes Sanguíneos/efeitos adversos , HDL-Colesterol/sangue , LDL-Colesterol/sangue , Bases de Dados Factuais , Regulação para Baixo , Feminino , Filtração , Alemanha , Humanos , Hiperlipidemias/sangue , Hiperlipidemias/diagnóstico , Masculino , Pessoa de Meia-Idade , Estudos Prospectivos , Fatores de Tempo , Resultado do Tratamento , Triglicerídeos/sangueRESUMO
BACKGROUND: Lipidapheresis was introduced for intractable hyperlipidemia as a more selective therapy than plasma exchange aiming to enhance efficacy and limit side-effects. Although this therapy is regarded safe, multicenter data from routine application are limited. We investigated direct adsorption of lipoproteins (DALI) and lipofiltration (MONET) regarding the short and the long-term safety aspects. METHODS: This multicenter observational study prospectively evaluated 2154 DALI and 1297 MONET sessions of 122 patients during a period of 2 years. Safety parameters included clinical side-effects (adverse device effects, ADEs), technical complications, blood pressure and pulse rate. Also routinely performed laboratory parameters were documented. Analysis of laboratory parameters was not corrected for blood dilution. RESULTS: Overall 0.4% DALI and 0.5% MONET treatments were affected by ADE. Technical complications occurred in 2.1% and in 0.8% DALI and MONET sessions, respectively. The most frequent ADE was hypotension, and the majority of technical problems were related to vascular access. Both types of treatments led to a drop of thrombocytes in the range of 7-8%. Hematocrit and erythrocytes decreased only during the DALI treatments by about 6%. Leucocytes decreased during the DALI therapy (â¼15%), whereas they increased during the MONET application (â¼11%). MONET treatment was associated with a higher reduction of proteins (fibrinogen: 58% vs. 23%, albumin: 12% vs. 7%, CRP: 33% vs. 19% for MONET and DALI, respectively). Apart from severe thrombocytopenia in two DALI patients, changes of other parameters were typically transient. CONCLUSIONS: Under routine use the frequency of side-effects was low. Still, monitoring of blood count and proteins in chronic apheresis patients is recommended.