Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters

Database
Language
Affiliation country
Publication year range
1.
Microbiol Resour Announc ; 13(2): e0104523, 2024 Feb 15.
Article in English | MEDLINE | ID: mdl-38289057

ABSTRACT

The ATCC Genome Portal (AGP, https://genomes.atcc.org/) is a database of authenticated genomes for bacteria, fungi, protists, and viruses held in ATCC's biorepository. It now includes 3,938 assemblies (253% increase) produced under ISO 9000 by ATCC. Here, we present new features and content added to the AGP for the research community.

2.
mSphere ; 7(3): e0007722, 2022 06 29.
Article in English | MEDLINE | ID: mdl-35491842

ABSTRACT

The availability of public genomics data has become essential for modern life sciences research, yet the quality, traceability, and curation of these data have significant impacts on a broad range of microbial genomics research. While microbial genome databases such as NCBI's RefSeq database leverage the scalability of crowd sourcing for growth, genomics data provenance and authenticity of the source materials used to produce data are not strict requirements. Here, we describe the de novo assembly of 1,113 bacterial genome references produced from authenticated materials sourced from the American Type Culture Collection (ATCC), each with full genomics data provenance relating to bioinformatics methods, quality control, and passage history. Comparative genomics analysis of ATCC standard reference genomes (ASRGs) revealed significant issues with regard to NCBI's RefSeq bacterial genome assemblies related to completeness, mutations, structure, strain metadata, and gaps in traceability to the original biological source materials. Nearly half of RefSeq assemblies lack details on sample source information, sequencing technology, or bioinformatics methods. Deep curation of these records is not within the scope of NCBI's core mission in supporting open science, which aims to collect sequence records that are submitted by the public. Nonetheless, we propose that gaps in metadata accuracy and data provenance represent an "elephant in the room" for microbial genomics research. Effectively addressing these issues will require raising the level of accountability for data depositors and acknowledging the need for higher expectations of quality among the researchers whose research depends on accurate and attributable reference genome data. IMPORTANCE The traceability of microbial genomics data to authenticated physical biological materials is not a requirement for depositing these data into public genome databases. This creates significant risks for the reliability and data provenance of these important genomics research resources, the impact of which is not well understood. We sought to investigate this by carrying out a comparative genomics study of 1,113 ATCC standard reference genomes (ASRGs) produced by ATCC from authenticated and traceable materials using the latest sequencing technologies. We found widespread discrepancies in genome assembly quality, genetic variability, and the quality and completeness of the associated metadata among hundreds of reference genomes for ATCC strains found in NCBI's RefSeq database. We present a comparative analysis of de novo-assembled ASRGs, their respective metadata, and variant analysis using RefSeq genomes as a reference. Although assembly quality in RefSeq has generally improved over time, we found that significant quality issues remain, especially as related to genomic data and metadata provenance. Our work highlights the importance of data authentication and provenance for the microbial genomics community, and underscores the risks of ignoring this issue in the future.


Subject(s)
Databases, Genetic , Genomics , Genome, Bacterial , Genome, Microbial , Reproducibility of Results
3.
Viruses ; 13(12)2021 12 04.
Article in English | MEDLINE | ID: mdl-34960703

ABSTRACT

SARS-CoV-2 pathogenesis, vaccine, and therapeutic studies rely on the use of animals challenged with highly pathogenic virus stocks produced in cell cultures. Ideally, these virus stocks should be genetically and functionally similar to the original clinical isolate, retaining wild-type properties to be reliably used in animal model studies. It is well-established that SARS-CoV-2 isolates serially passaged on Vero cell lines accumulate mutations and deletions in the furin cleavage site; however, these can be eliminated when passaged on Calu-3 lung epithelial cell lines, as presented in this study. As numerous stocks of SARS-CoV-2 variants of concern are being grown in cell cultures with the intent for use in animal models, it is essential that propagation methods generate virus stocks that are pathogenic in vivo. Here, we found that the propagation of a B.1.351 SARS-CoV-2 stock on Calu-3 cells eliminated viruses that previously accumulated mutations in the furin cleavage site. Notably, there were alternative variants that accumulated at the same nucleotide positions in virus populations grown on Calu-3 cells at multiple independent facilities. When a Calu-3-derived B.1.351 virus stock was used to infect hamsters, the virus remained pathogenic and the Calu-3-specific variants persisted in the population. These results suggest that Calu-3-derived virus stocks are pathogenic but care should still be taken to evaluate virus stocks for newly arising mutations during propagation.


Subject(s)
SARS-CoV-2/growth & development , Serial Passage/methods , Spike Glycoprotein, Coronavirus/genetics , Animals , COVID-19/virology , Cell Line, Tumor , Chlorocebus aethiops , Cricetinae , Furin/metabolism , Humans , Mutation , SARS-CoV-2/genetics , SARS-CoV-2/pathogenicity , Vero Cells
4.
Microbiol Resour Announc ; 10(47): e0081821, 2021 Nov 24.
Article in English | MEDLINE | ID: mdl-34817215

ABSTRACT

Lack of data provenance negatively impacts scientific reproducibility and the reliability of genomic data. The ATCC Genome Portal (https://genomes.atcc.org) addresses this by providing data provenance information for microbial whole-genome assemblies originating from authenticated biological materials. To date, we have sequenced 1,579 complete genomes, including 466 type strains and 1,156 novel genomes.

5.
bioRxiv ; 2020 Jun 10.
Article in English | MEDLINE | ID: mdl-32577643

ABSTRACT

The function of the SARS-CoV-2 accessory protein p6, encoded by ORF6, is not fully known. Based upon its similarity to p6 from SARS-CoV, it may play a similar role, namely as an antagonist of type I interferon (IFN) signaling. Here we report the sequencing of a SARS-CoV-2 strain passaged six times after original isolation from a clinical patient in Hong Kong. The genome sequence shows a 27 nt in-frame deletion (Δ27,264-27,290) within ORF6, predicted to result in a 9 aa deletion ( ΔFKVSIWNLD ) from the central portion of p6. This deletion is predicted to result in a dramatic alteration in the three-dimensional structure of the resultant protein (p6 Δ22-30 ), possibly with significant functional implications. Analysis of the original clinical sample indicates that the deletion was not present, while sequencing of subsequent passages of the strain identifies the deletion as a majority variant. This suggests that the deletion originated ab initio during passaging and subsequently propagated into the majority, possibly due to the removal of selective pressure through the IFN-deficient Vero E6 cell line. The specific function of the SARS-CoV-2 p6 N-terminus, if any, has not yet been determined. However, this deletion is predicted to cause a shift from N-endo to N-ecto in the transmembrane localization of the SARS-CoV-2 p6 Δ22-30 N-terminus, possibly leading to the ablation of its native function.

SELECTION OF CITATIONS
SEARCH DETAIL