Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 127
Filtrar
1.
Front Microbiol ; 15: 1272972, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38440140

RESUMEN

Introduction: Whole Genome Sequencing (WGS) of the SARS-CoV-2 virus is crucial in the surveillance of the COVID-19 pandemic. Several primer schemes have been developed to sequence nearly all of the ~30,000 nucleotide SARS-CoV-2 genome, using a multiplex PCR approach to amplify cDNA copies of the viral genomic RNA. Midnight primers and ARTIC V4.1 primers are the most popular primer schemes that can amplify segments of SARS-CoV-2 (400 bp and 1200 bp, respectively) tiled across the viral RNA genome. Mutations within primer binding sites and primer-primer interactions can result in amplicon dropouts and coverage bias, yielding low-quality genomes with 'Ns' inserted in the missing amplicon regions, causing inaccurate lineage assignments, and making it challenging to monitor lineage-specific mutations in Variants of Concern (VoCs). Methods: In this study we used a set of seven long-range PCR primer pairs to sequence clinical isolates of SARS-CoV-2 on Oxford Nanopore sequencer. These long-range primers generate seven amplicons approximately 4500 bp that covered whole genome of SARS-CoV-2. One of these regions includes the full-length S-gene by using a set of flanking primers. We also evaluated the performance of these long-range primers with Midnight primers by sequencing 94 clinical isolates in a Nanopore flow cell. Results and discussion: Using a small set of long-range primers to sequence SARS-CoV-2 genomes reduces the possibility of amplicon dropout and coverage bias. The key finding of this study is that long range primers can be used in single-molecule sequencing of RNA viruses in surveillance of emerging variants. We also show that by designing primers flanking the S-gene, we can obtain reliable identification of SARS-CoV-2 variants.

2.
bioRxiv ; 2023 Jul 11.
Artículo en Inglés | MEDLINE | ID: mdl-37502853

RESUMEN

Whole Genome Sequencing (WGS) of the SARS-CoV-2 virus is crucial in the surveillance of the COVID-19 pandemic. Several primer schemes have been developed to sequence the ~30,000 nucleotide SARS-CoV-2 genome that use a multiplex PCR approach to amplify cDNA copies of the viral genomic RNA. Midnight primers and ARTIC V4.1 primers are the most popular primer schemes that can amplify segments of SARS-CoV-2 (400 bp and 1200 bp, respectively) tiled across the viral RNA genome. Mutations within primer binding sites and primer-primer interactions can result in amplicon dropouts and coverage bias, yielding low-quality genomes with 'Ns' inserted in the missing amplicon regions, causing inaccurate lineage assignments, and making it challenging to monitor lineage-specific mutations in Variants of Concern (VoCs). This study uses seven long-range PCR primers with an amplicon size of ~4500 bp to tile across the complete SARS-CoV-2 genome. One of these regions includes the full-length S-gene by using a set of flanking primers. Using a small set of long-range primers to sequence SARS-CoV-2 genomes reduces the possibility of amplicon dropout and coverage bias.

3.
Microbiol Resour Announc ; 12(3): e0000723, 2023 Mar 16.
Artículo en Inglés | MEDLINE | ID: mdl-36779742

RESUMEN

An eleven-year-old tested positive for SARS-CoV-2 Lambda variant. Sequencing was performed on the Oxford Nanopore and the Illumina NextSeq 500. Both platforms identified all 7 of the synonymous mutations in the sample, while all 28 nonsynonymous mutations were identified from Oxford Nanopore and 20 nonsynonymous mutations were identified from Illumina.

4.
J Appl Microbiol ; 134(1)2023 Jan 23.
Artículo en Inglés | MEDLINE | ID: mdl-36626787

RESUMEN

Omics research inevitably involves the collection and analysis of big data, which can only be handled by automated approaches. Here we point out that the analysis of big data in the field of genomics dictates certain requirements, such as specialized software, quality control of input data, and simplification for visualization of the results. The latter results in a loss of information, as is exemplified for phylogenetic trees. Clear communication of big data analyses can be enhanced by novel visualization strategies. The interpretation of findings is sometimes hampered when dedicated analytical tools are not fully understood by microbiologists, while the researchers performing these analyses may not have a full overview of the biology of the microbes under study. These issues are illustrated here, using SARS-Cov-2 and Salmonella enterica as zoonotic examples. Whereas in scientific communications jargon should be avoided or explained, nomenclature to group similar organisms and distinguish these from more distant relatives is not only essential, but also influences the interpretation of results. Unfortunately, changes in taxonomically accepted names are now so frequent that they hamper rather than assist research, as is illustrated with difficulties of microbiome studies. Nomenclature to group viral isolates, as is done for SARS-Cov2, is also not without difficulties. Some weaknesses in current omics research stem from poor quality of data or biased databases, and problems can be magnified by machine learning approaches. Moreover, the overall opus of scientific publications can now be considered "big data", as is illustrated by the avalanche of COVID-19-related publications. The peer-review model of scientific publishing is only barely coping with this novel situation, resulting in retractions and the publication of bogus works. The avalanche of scientific publications that originated from the current pandemic can obstruct literature searches, and this will unfortunately continue over time.


Asunto(s)
COVID-19 , Animales , Humanos , SARS-CoV-2/genética , Filogenia , ARN Viral , Genómica , Zoonosis
5.
J Appl Microbiol ; 133(6): 3690-3698, 2022 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-36074056

RESUMEN

AIMS: The current Monkeypox virus (MPX) outbreak is not only the largest known outbreak to date caused by a strain belonging to the West-African clade, but also results in remarkably different clinical and epidemiological features compared to previous outbreaks of this virus. Here, we consider the possibility that mutations in the viral genome may be responsible for its changed characteristics. METHODS AND RESULTS: Six genome sequences of isolates from the current outbreak were compared to five genomes of isolates from the 2017 outbreak in Nigeria and to two historic genomes, all belonging to the West-African clade. We report differences that are consistently present in the 2022 isolates but not in the others. Although some variation in repeat units was observed, only two were consistently found in the 2022 genomes only, and these were located in intergenic regions. A total of 55 single nucleotide polymorphisms were consistently present in the 2022 isolates compared to the 2017 isolates. Of these, 25 caused an amino acid substitution in a predicted protein. CONCLUSIONS: The nature of the substitution and the annotation of the affected protein identified potential candidates that might affect the virulence of the virus. These included the viral DNA helicase and transcription factors. SIGNIFICANCE: This bioinformatic analysis provides guidance for wet-lab research to identify changed properties of the MPX.


Asunto(s)
Brotes de Enfermedades , Monkeypox virus , Monkeypox virus/genética , Nigeria/epidemiología , Genoma Viral/genética , ADN Viral
6.
FEMS Microbiol Rev ; 46(3)2022 05 06.
Artículo en Inglés | MEDLINE | ID: mdl-35076068

RESUMEN

The genomic diversity of SARS-CoV-2 is the result of a relatively low level of spontaneous mutations introduced during viral replication. With millions of SARS-CoV-2 genome sequences now available, we can begin to assess the overall genetic repertoire of this virus. We find that during 2020, there was a global wave of one variant that went largely unnoticed, possibly because its members were divided over several sublineages (B.1.177 and sublineages B.1.177.XX). We collectively call this Janus, and it was eventually replaced by the Alpha (B.1.1.7) variant of concern (VoC), next replaced by Delta (B.1.617.2), which itself might soon be replaced by a fourth pandemic wave consisting of Omicron (B.1.1.529). We observe that splitting up and redefining variant lineages over time, as was the case with Janus and is now happening with Alpha, Delta and Omicron, is not helpful to describe the epidemic waves spreading globally. Only ∼5% of the 30 000 nucleotides of the SARS-CoV-2 genome are found to be variable. We conclude that a fourth wave of the pandemic with the Omicron variant might not be that different from other VoCs, and that we may already have the tools in hand to effectively deal with this new VoC.


Asunto(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiología , Humanos , Mutación , Pandemias , SARS-CoV-2/genética , Glicoproteína de la Espiga del Coronavirus/genética
7.
Commun Biol ; 4(1): 117, 2021 01 26.
Artículo en Inglés | MEDLINE | ID: mdl-33500552

RESUMEN

In this study, more than one hundred thousand Escherichia coli and Shigella genomes were examined and classified. This is, to our knowledge, the largest E. coli genome dataset analyzed to date. A Mash-based analysis of a cleaned set of 10,667 E. coli genomes from GenBank revealed 14 distinct phylogroups. A representative genome or medoid identified for each phylogroup was used as a proxy to classify 95,525 unassembled genomes from the Sequence Read Archive (SRA). We find that most of the sequenced E. coli genomes belong to four phylogroups (A, C, B1 and E2(O157)). Authenticity of the 14 phylogroups is supported by several different lines of evidence: phylogroup-specific core genes, a phylogenetic tree constructed with 2613 single copy core genes, and differences in the rates of gene gain/loss/duplication. The methodology used in this work is able to reproduce known phylogroups, as well as to identify previously uncharacterized phylogroups in E. coli species.


Asunto(s)
Escherichia coli/clasificación , Escherichia coli/genética , Genoma Bacteriano , Biología Computacional/métodos , Proteínas de Escherichia coli/genética , Especiación Genética , Genómica/métodos , Filogenia , Análisis de Secuencia de ADN , Shigella/clasificación , Shigella/genética
8.
Nucleic Acids Res ; 49(2): e7, 2021 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-32710622

RESUMEN

Traditional epitranscriptomics relies on capturing a single RNA modification by antibody or chemical treatment, combined with short-read sequencing to identify its transcriptomic location. This approach is labor-intensive and may introduce experimental artifacts. Direct sequencing of native RNA using Oxford Nanopore Technologies (ONT) can allow for directly detecting the RNA base modifications, although these modifications might appear as sequencing errors. The percent Error of Specific Bases (%ESB) was higher for native RNA than unmodified RNA, which enabled the detection of ribonucleotide modification sites. Based on the %ESB differences, we developed a bioinformatic tool, epitranscriptional landscape inferring from glitches of ONT signals (ELIGOS), that is based on various types of synthetic modified RNA and applied to rRNA and mRNA. ELIGOS is able to accurately predict known classes of RNA methylation sites (AUC > 0.93) in rRNAs from Escherichiacoli, yeast, and human cells, using either unmodified in vitro transcription RNA or a background error model, which mimics the systematic error of direct RNA sequencing as the reference. The well-known DRACH/RRACH motif was localized and identified, consistent with previous studies, using differential analysis of ELIGOS to study the impact of RNA m6A methyltransferase by comparing wild type and knockouts in yeast and mouse cells. Lastly, the DRACH motif could also be identified in the mRNA of three human cell lines. The mRNA modification identified by ELIGOS is at the level of individual base resolution. In summary, we have developed a bioinformatic software package to uncover native RNA modifications.


Asunto(s)
Biología Computacional/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Procesamiento Postranscripcional del ARN , RNA-Seq , Error Científico Experimental , Programas Informáticos , Adenina/análogos & derivados , Adenina/análisis , Animales , Línea Celular , Escherichia coli/genética , Humanos , Meiosis , Metiltransferasas/deficiencia , Metiltransferasas/metabolismo , Ratones , Ratones Noqueados , Motivos de Nucleótidos , ARN Bacteriano/genética , ARN de Hongos/genética , ARN Mensajero/genética , ARN Ribosómico/genética , Curva ROC , Saccharomyces cerevisiae/genética , Análisis de Secuencia de ADN , Moldes Genéticos , Transcripción Genética
9.
Comput Struct Biotechnol J ; 18: 3890-3896, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33335686

RESUMEN

Large-scale protein analysis has been used to characterize large numbers of proteins across numerous species. One of the applications is to use as a high-throughput screening method for pathogenicity of genomes. Unlike sequence homology methods, protein comparison at a functional level provides us with a unique opportunity to classify proteins, based on their functional structures without dealing with sequence complexity of distantly related species. Protein functions can be abstractly described by a set of protein functional domains, such as PfamA domains; a set of genomes can then be mapped to a matrix, with each row representing a genome, and the columns representing the presence or absence of a given functional domain. However, a powerful tool is needed to analyze the large sparse matrices generated by millions of genomes that will become available in the near future. The ProdMX is a tool with user-friendly utilities developed to facilitate high-throughput analysis of proteins with an ability to be included as an effective module in the high-throughput pipeline. The ProdMX employs a compressed sparse matrix algorithm to reduce computational resources and time used to perform the matrix manipulation during functional domain analysis. The ProdMX is a free and publicly available Python package which can be installed with popular package mangers such as PyPI and Conda, or with a standard installer from source code available on the ProdMX GitHub repository at https://github.com/visanuwan/prodmx.

10.
Microbiol Resour Announc ; 10(1)2020 Dec 17.
Artículo en Inglés | MEDLINE | ID: mdl-33334896

RESUMEN

Two coding-complete sequences of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) were obtained from samples from two patients in Arkansas, in the southeastern corner of the United States. The viral genome was obtained using the ARTIC Network protocol and Oxford Nanopore Technologies sequencing.

11.
Molecules ; 25(22)2020 Nov 12.
Artículo en Inglés | MEDLINE | ID: mdl-33198233

RESUMEN

The advancements of information technology and related processing techniques have created a fertile base for progress in many scientific fields and industries. In the fields of drug discovery and development, machine learning techniques have been used for the development of novel drug candidates. The methods for designing drug targets and novel drug discovery now routinely combine machine learning and deep learning algorithms to enhance the efficiency, efficacy, and quality of developed outputs. The generation and incorporation of big data, through technologies such as high-throughput screening and high through-put computational analysis of databases used for both lead and target discovery, has increased the reliability of the machine learning and deep learning incorporated techniques. The use of these virtual screening and encompassing online information has also been highlighted in developing lead synthesis pathways. In this review, machine learning and deep learning algorithms utilized in drug discovery and associated techniques will be discussed. The applications that produce promising results and methods will be reviewed.


Asunto(s)
Biología Computacional/métodos , Descubrimiento de Drogas/métodos , Aprendizaje Automático , Algoritmos , Teorema de Bayes , Bases de Datos Factuales , Aprendizaje Profundo , Humanos , Internet , Método de Montecarlo , Reproducibilidad de los Resultados , Programas Informáticos , Máquina de Vectores de Soporte
12.
Acta Neuropathol Commun ; 8(1): 87, 2020 06 20.
Artículo en Inglés | MEDLINE | ID: mdl-32563269

RESUMEN

Molecular biomarkers provide both diagnostic and prognostic results for patients with diffuse glioma, the most common primary brain tumor in adults. Here, we used a long-read nanopore-based sequencing technique to simultaneously assess IDH mutation status and MGMT methylation level in 4 human cell lines and 8 fresh human brain tumor biopsies. Currently, these biomarkers are assayed separately, and results can take days to weeks. We demonstrated the use of nanopore Cas9-targeted sequencing (nCATS) to identify IDH1 and IDH2 mutations within 36 h and compared this approach against currently used clinical methods. nCATS was also able to simultaneously provide high-resolution evaluation of MGMT methylation levels not only at the promoter region, as with currently used methods, but also at CpGs across the proximal promoter region, the entirety of exon 1, and a portion of intron 1. We compared the methylation levels of all CpGs to MGMT expression in all cell lines and tumors and observed a positive correlation between intron 1 methylation and MGMT expression. Finally, we identified single nucleotide variants in 3 target loci. This pilot study demonstrates the feasibility of using nCATS as a clinical tool for cancer precision medicine.


Asunto(s)
Neoplasias Encefálicas/diagnóstico , Proteína 9 Asociada a CRISPR/genética , Metilasas de Modificación del ADN/química , Enzimas Reparadoras del ADN/química , Glioma/diagnóstico , Isocitrato Deshidrogenasa/genética , Análisis de Secuencia de ARN/métodos , Proteínas Supresoras de Tumor/química , Adulto , Anciano , Oxidorreductasas de Alcohol , Biomarcadores de Tumor/genética , Biopsia , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patología , Línea Celular Tumoral , Femenino , Glioma/genética , Glioma/patología , Humanos , Masculino , Metilación , Persona de Mediana Edad , Proyectos Piloto , Adulto Joven
13.
Microbiol Resour Announc ; 9(24)2020 Jun 11.
Artículo en Inglés | MEDLINE | ID: mdl-32527771

RESUMEN

Here, we present a 16S rRNA gene amplicon sequence data set and profiles demonstrating the bacterial diversity of baby and adult elephants from four different geographical locations in Thailand. The dominant phyla among baby and adult elephants were Bacteroidetes, Firmicutes, Proteobacteria, Kiritimatiellaeota, Euryarchaeota, and Tenericutes.

14.
Microbiologyopen ; 9(2): e973, 2020 02.
Artículo en Inglés | MEDLINE | ID: mdl-31742930

RESUMEN

The intraspecies genomic diversity of the single-strand RNA (+) virus species hepatitis A virus (Hepatovirus), hepatitis C virus (Hepacivirus), and hepatitis E virus (Orthohepevirus) was compared. These viral species all can cause liver inflammation (hepatitis), but share no gene similarity. The codon usage of human hepatitis A virus (HAV) is suboptimal for replication in its host, a characteristic it shares with taxonomically related rodent, simian, and bat hepatitis A virus species. We found this codon usage to be strikingly similar to that of Triatoma virus that infects blood-sucking kissing bugs. The codon usage of that virus is well adapted to its insect host. The codon usage of HAV is also similar to other invertebrate viruses of various taxonomic families. An evolutionary ancestor of HAV and related virus species is hypothesized to be an insect virus that underwent a host jump to infect mammals. The similarity between HAV and invertebrate viruses goes beyond codon usage, as they also share amino acid composition characteristics, while not sharing direct sequence homology. In contrast, hepatitis C virus and hepatitis E virus are highly similar in codon usage preference, nucleotide composition, and amino acid composition, and share these characteristics with Human pegivirus A, West Nile virus, and Zika virus. We present evidence that these observations are only partly explained by differences in nucleotide composition of the complete viral codon regions. We consider the combination of nucleotide composition, amino acid composition, and codon usage preference suitable to provide information on possible evolutionary similarities between distant virus species that cannot be investigated by phylogeny.


Asunto(s)
Evolución Molecular , Genoma Viral , Genómica , Hepacivirus/genética , Virus de la Hepatitis A/genética , Virus de la Hepatitis E/genética , Codón , Genómica/métodos , Hepacivirus/clasificación , Virus de la Hepatitis A/clasificación , Virus de la Hepatitis E/clasificación , Humanos , Filogenia
15.
J Microbiol Methods ; 166: 105739, 2019 11.
Artículo en Inglés | MEDLINE | ID: mdl-31626891

RESUMEN

Gut metagenome profiling using the Oxford Nanopore Technologies (ONT) sequencer was assessed in a pilot-sized study of 10 subjects. The taxonomic abundance of gut microbiota derived from ONT was comparable with Illumina Technology (IT) for the high-abundance species. IT better detected low-abundance species through amplification, when material was limited.


Asunto(s)
Bacterias/clasificación , Microbioma Gastrointestinal/genética , Neoplasias de Cabeza y Cuello/epidemiología , Metagenoma/genética , Secuenciación de Nanoporos/métodos , Anciano , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Masculino , Persona de Mediana Edad , Proyectos Piloto , Análisis de Secuencia de ADN/métodos
16.
BMC Cancer ; 19(1): 827, 2019 Aug 22.
Artículo en Inglés | MEDLINE | ID: mdl-31438887

RESUMEN

BACKGROUND: SMARCB1-deficient sinonasal carcinoma (SDSC) is an aggressive subtype of head and neck cancers that has a poor prognosis despite multimodal therapy. We present a unique case with next generation sequencing data of a patient who had SDSC with perineural invasion to the trigeminal nerve that progressed to a brain metastasis and eventually leptomeningeal spread. CASE PRESENTATION: A 42 year old female presented with facial pain and had resection of a tumor along the V2 division of the trigeminal nerve on the right. She underwent adjuvant stereotactic radiation. She developed further neurological symptoms and imaging demonstrated the tumor had infiltrated into the cavernous sinus as well as intradurally. She had surgical resection for removal of her brain metastasis and decompression of the cavernous sinus. Following her second surgery, she had adjuvant radiation and chemotherapy. Several months later she had quadriparesis and imaging was consistent with leptomeningeal spread. She underwent palliative radiation and ultimately transitioned quickly to comfort care and expired. Overall survival from time of diagnosis was 13 months. Next generation sequencing was carried out on her primary tumor and brain metastasis. The brain metastatic tissue had an increased tumor mutational burden in comparison to the primary. CONCLUSIONS: This is the first report of SDSC with perineural invasion progressing to leptomeningeal carcinomatosis. Continued next generation sequencing of the primary and metastatic tissue by clinicians is encouraged toprovide further insights into metastatic progression of rare solid tumors.


Asunto(s)
Carcinoma/etiología , Carcinoma/patología , Neoplasias de los Senos Paranasales/etiología , Neoplasias de los Senos Paranasales/patología , Proteína SMARCB1/deficiencia , Adulto , Biomarcadores de Tumor , Carcinoma/diagnóstico por imagen , Progresión de la Enfermedad , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Inmunohistoquímica , Carcinomatosis Meníngea/diagnóstico , Carcinomatosis Meníngea/secundario , Metástasis de la Neoplasia , Estadificación de Neoplasias , Neoplasias de los Senos Paranasales/diagnóstico por imagen , Polimorfismo de Nucleótido Simple , Tomografía Computarizada por Rayos X
18.
Front Microbiol ; 10: 857, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31080442

RESUMEN

To achieve maximum transmission chain tracking in the current Ebola outbreak, whole genome sequencing (WGS) has been proposed to provide optimal information. However, WGS remains a costly and time-intensive procedure that is poorly suited for the large numbers of samples being generated, especially under severe time and work-environment constraints as in the present DRC outbreak. To better prepare for future outbreaks, where an apparent single outbreak may actually represent overlapping outbreaks caused by independent variants, and where rapid identification of emerging new transmission chains will be essential, a more practical method would be to amplify and sequence genomic areas that reveal the highest information to differentiate EBOV variants. We have identified four highly informative polymorphism PCR sequencing targets, suitable for rapid tracing of transmission chains and identification of new sources of Ebola outbreaks, an approach which will be far more practical in the field than WGS.

19.
Artículo en Inglés | MEDLINE | ID: mdl-30834378

RESUMEN

Escherichia coli ATCC 11775 is a strain that was identified in 1941 and is now considered a type strain for the species. We present here the complete genome sequence for E. coli ATCC 11775. The genome was sequenced using Oxford Nanopore Technologies products and had 4,903,501 and 131,333 nucleotides of sequence length of the individual chromosome and plasmid, respectively.

20.
Front Microbiol ; 10: 260, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-30858830

RESUMEN

Long-read nanopore sequencing by a MinION device offers the unique possibility to directly sequence native RNA. We combined an enzymatic poly-A tailing reaction with the native RNA sequencing to (i) sequence complex population of single-stranded (ss)RNA viruses in parallel, (ii) detect genome, subgenomic mRNA/mRNA simultaneously, (iii) detect a complex transcriptomic architecture without the need for assembly, (iv) enable real-time detection. Using this protocol, positive-ssRNA, negative-ssRNA, with/without a poly(A)-tail, segmented/non-segmented genomes were mixed and sequenced in parallel. Mapping of the generated sequences on the reference genomes showed 100% length recovery with up to 97% identity. This work provides a proof of principle and the validity of this strategy, opening up a wide range of applications to study RNA viruses.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...