Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
Más filtros

Tipo del documento
Intervalo de año de publicación
1.
Nucleic Acids Res ; 52(3): e15, 2024 Feb 09.
Artículo en Inglés | MEDLINE | ID: mdl-38084888

RESUMEN

Whole genome sequencing has increasingly become the essential method for studying the genetic mechanisms of antimicrobial resistance and for surveillance of drug-resistant bacterial pathogens. The majority of bacterial genomes sequenced to date have been sequenced with Illumina sequencing technology, owing to its high-throughput, excellent sequence accuracy, and low cost. However, because of the short-read nature of the technology, these assemblies are fragmented into large numbers of contigs, hindering the obtaining of full information of the genome. We develop Pasa, a graph-based algorithm that utilizes the pangenome graph and the assembly graph information to improve scaffolding quality. By leveraging the population information of the bacteria species, Pasa is able to utilize the linkage information of the gene families of the species to resolve the contig graph of the assembly. We show that our method outperforms the current state of the arts in terms of accuracy, and at the same time, is computationally efficient to be applied to a large number of existing draft assemblies.


Asunto(s)
Algoritmos , Bacterias , Genoma Bacteriano , Bacterias/clasificación , Bacterias/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos
2.
BMC Bioinformatics ; 25(1): 193, 2024 May 16.
Artículo en Inglés | MEDLINE | ID: mdl-38755527

RESUMEN

We have developed AMRViz, a toolkit for analyzing, visualizing, and managing bacterial genomics samples. The toolkit is bundled with the current best practice analysis pipeline allowing researchers to perform comprehensive analysis of a collection of samples directly from raw sequencing data with a single command line. The analysis results in a report showing the genome structure, genome annotations, antibiotic resistance and virulence profile for each sample. The pan-genome of all samples of the collection is analyzed to identify core- and accessory-genes. Phylogenies of the whole genome as well as all gene clusters are also generated. The toolkit provides a web-based visualization dashboard allowing researchers to interactively examine various aspects of the analysis results. Availability: AMRViz is implemented in Python and NodeJS, and is publicly available under open source MIT license at https://github.com/amromics/amrviz .


Asunto(s)
Genoma Bacteriano , Genómica , Programas Informáticos , Genómica/métodos , Farmacorresistencia Bacteriana/genética , Filogenia , Bacterias/genética , Bacterias/efectos de los fármacos , Antibacterianos/farmacología
3.
BMC Genomics ; 25(1): 709, 2024 Jul 22.
Artículo en Inglés | MEDLINE | ID: mdl-39039439

RESUMEN

Whole genome analysis for microbial genomics is critical to studying and monitoring antimicrobial resistance strains. The exponential growth of microbial sequencing data necessitates a fast and scalable computational pipeline to generate the desired outputs in a timely and cost-effective manner. Recent methods have been implemented to integrate individual genomes into large collections of specific bacterial populations and are widely employed for systematic genomic surveillance. However, they do not scale well when the population expands and turnaround time remains the main issue for this type of analysis. Here, we introduce AMRomics, an optimized microbial genomics pipeline that can work efficiently with big datasets. We use different bacterial data collections to compare AMRomics against competitive tools and show that our pipeline can generate similar results of interest but with better performance. The software is open source and is publicly available at https://github.com/amromics/amromics under an MIT license.


Asunto(s)
Genoma Bacteriano , Genómica , Programas Informáticos , Flujo de Trabajo , Genómica/métodos , Biología Computacional/métodos , Bacterias/genética , Genoma Microbiano , Farmacorresistencia Bacteriana/genética
4.
BMC Cancer ; 22(1): 85, 2022 Jan 20.
Artículo en Inglés | MEDLINE | ID: mdl-35057759

RESUMEN

BACKGROUND: Circulating cell-free DNA (cfDNA) in the plasma of cancer patients contains cell-free tumour DNA (ctDNA) derived from tumour cells and it has been widely recognized as a non-invasive source of tumour DNA for diagnosis and prognosis of cancer. Molecular profiling of ctDNA is often performed using targeted sequencing or low-coverage whole genome sequencing (WGS) to identify tumour specific somatic mutations or somatic copy number aberrations (sCNAs). However, these approaches cannot efficiently detect all tumour-derived genomic changes in ctDNA. METHODS: We performed WGS analysis of cfDNA from 4 breast cancer patients and 2 patients with benign tumours. We sequenced matched germline DNA for all 6 patients and tumour samples from the breast cancer patients. All samples were sequenced on Illumina HiSeqXTen sequencing platform and achieved approximately 30x, 60x and 100x coverage on germline, tumour and plasma DNA samples, respectively. RESULTS: The mutational burden of the plasma samples (1.44 somatic mutations/Mb of genome) was higher than the matched tumour samples. However, 90% of high confidence somatic cfDNA variants were not detected in matched tumour samples and were found to comprise two background plasma mutational signatures. In contrast, cfDNA from the di-nucleosome fraction (300 bp-350 bp) had much higher proportion (30%) of variants shared with tumour. Despite high coverage sequencing we were unable to detect sCNAs in plasma samples. CONCLUSIONS: Deep sequencing analysis of plasma samples revealed higher fraction of unique somatic mutations in plasma samples, which were not detected in matched tumour samples. Sequencing of di-nucleosome bound cfDNA fragments may increase recovery of tumour mutations from plasma.


Asunto(s)
Neoplasias de la Mama/genética , ADN Tumoral Circulante/sangre , Análisis Mutacional de ADN/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Secuenciación Completa del Genoma/métodos , Adulto , Biomarcadores de Tumor/genética , Neoplasias de la Mama/sangre , Femenino , Humanos , Mutación , Pronóstico
5.
PLoS Comput Biol ; 17(1): e1008586, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33471816

RESUMEN

A streaming assembly pipeline utilising real-time Oxford Nanopore Technology (ONT) sequencing data is important for saving sequencing resources and reducing time-to-result. A previous approach implemented in npScarf provided an efficient streaming algorithm for hybrid assembly but was relatively prone to mis-assemblies compared to other graph-based methods. Here we present npGraph, a streaming hybrid assembly tool using the assembly graph instead of the separated pre-assembly contigs. It is able to produce more complete genome assembly by resolving the path finding problem on the assembly graph using long reads as the traversing guide. Application to synthetic and real data from bacterial isolate genomes show improved accuracy while still maintaining a low computational cost. npGraph also provides a graphical user interface (GUI) which provides a real-time visualisation of the progress of assembly. The tool and source code is available at https://github.com/hsnguyen/assembly.


Asunto(s)
Nanoporos , Alineación de Secuencia/métodos , Análisis de Secuencia de ADN/métodos , Algoritmos , Biología Computacional , ADN Bacteriano/análisis , ADN Bacteriano/genética , Genoma Bacteriano/genética , Nanotecnología , Programas Informáticos , Interfaz Usuario-Computador
6.
J Antimicrob Chemother ; 74(3): 582-593, 2019 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-30445429

RESUMEN

BACKGROUND: Polymyxin B and E (colistin) have been pivotal in the treatment of XDR Gram-negative bacterial infections; however, resistance has emerged. A structurally related lipopeptide, octapeptin C4, has shown significant potency against XDR bacteria, including polymyxin-resistant strains, but its mode of action remains undefined. OBJECTIVES: We sought to compare and contrast the acquisition of resistance in an XDR Klebsiella pneumoniae (ST258) clinical isolate in vitro with all three lipopeptides to potentially unveil variations in their mode of action. METHODS: The isolate was exposed to increasing concentrations of polymyxins and octapeptin C4 over 20 days. Day 20 strains underwent WGS, complementation assays, antimicrobial susceptibility testing and lipid A analysis. RESULTS: Twenty days of exposure to the polymyxins resulted in a 1000-fold increase in the MIC, whereas for octapeptin C4 a 4-fold increase was observed. There was no cross-resistance observed between the polymyxin- and octapeptin-resistant strains. Sequencing of polymyxin-resistant isolates revealed mutations in previously known resistance-associated genes, including crrB, mgrB, pmrB, phoPQ and yciM, along with novel mutations in qseC. Octapeptin C4-resistant isolates had mutations in mlaDF and pqiB, genes related to phospholipid transport. These genetic variations were reflected in distinct phenotypic changes to lipid A. Polymyxin-resistant isolates increased 4-amino-4-deoxyarabinose fortification of lipid A phosphate groups, whereas the lipid A of octapeptin C4-resistant strains harboured a higher abundance of hydroxymyristate and palmitoylate. CONCLUSIONS: Octapeptin C4 has a distinct mode of action compared with the polymyxins, highlighting its potential as a future therapeutic agent to combat the increasing threat of XDR bacteria.


Asunto(s)
Antibacterianos/farmacología , Colistina/farmacología , Farmacorresistencia Bacteriana Múltiple , Klebsiella pneumoniae/efectos de los fármacos , Lipopéptidos/farmacología , Péptidos Cíclicos/farmacología , Polimixina B/farmacología , Humanos , Infecciones por Klebsiella/microbiología , Klebsiella pneumoniae/aislamiento & purificación , Pruebas de Sensibilidad Microbiana , Mutación , Secuenciación Completa del Genoma
7.
Bioinformatics ; 34(5): 873-874, 2018 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-29092025

RESUMEN

Motivation: Targeted sequencing using capture probes has become increasingly popular in clinical applications due to its scalability and cost-effectiveness. The approach also allows for higher sequencing coverage of the targeted regions resulting in better analysis statistical power. However, because of the dynamics of the hybridization process, it is difficult to evaluate the efficiency of the probe design prior to the experiments which are time consuming and costly. Results: We developed CapSim, a software package for simulation of targeted sequencing. Given a genome sequence and a set of probes, CapSim simulates the fragmentation, the dynamics of probe hybridization and the sequencing of the captured fragments on Illumina and PacBio sequencing platforms. The simulated data can be used for evaluating the performance of the analysis pipeline, as well as the efficiency of the probe design. Parameters of the various stages in the sequencing process can also be evaluated in order to optimize the experiments. Availability and implementation: CapSim is publicly available under BSD license at https://github.com/Devika1/capsim. Contact: l.coin@imb.uq.edu.au. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Genómica/métodos , Programas Informáticos
8.
BMC Bioinformatics ; 19(1): 267, 2018 07 16.
Artículo en Inglés | MEDLINE | ID: mdl-30012093

RESUMEN

BACKGROUND: Tandem repeats comprise significant proportion of the human genome including coding and regulatory regions. They are highly prone to repeat number variation and nucleotide mutation due to their repetitive and unstable nature, making them a major source of genomic variation between individuals. Despite recent advances in high throughput sequencing, analysis of tandem repeats in the context of complex diseases is still hindered by technical limitations. We report a novel targeted sequencing approach, which allows simultaneous analysis of hundreds of repeats. We developed a Bayesian algorithm, namely - GtTR - which combines information from a reference long-read dataset with a short read counting approach to genotype tandem repeats at population scale. PCR sizing analysis was used for validation. RESULTS: We used a PacBio long-read sequenced sample to generate a reference tandem repeat genotype dataset with on average 13% absolute deviation from PCR sizing results. Using this reference dataset GtTR generated estimates of VNTR copy number with accuracy within 95% high posterior density (HPD) intervals of 68 and 83% for capture sequence data and 200X WGS data respectively, improving to 87 and 94% with use of a PCR reference. We show that the genotype resolution increases as a function of depth, such that the median 95% HPD interval lies within 25, 14, 12 and 8% of the its midpoint copy number value for 30X, 200X WGS, 395X and 800X capture sequence data respectively. We validated nine targets by PCR sizing analysis and genotype estimates from sequencing results correlated well with PCR results. CONCLUSIONS: The novel genotyping approach described here presents a new cost-effective method to explore previously unrecognized class of repeat variation in GWAS studies of complex diseases at the population level. Further improvements in accuracy can be obtained by improving accuracy of the reference dataset.


Asunto(s)
Algoritmos , Dosificación de Gen , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Secuencias Repetidas en Tándem/genética , Alelos , Secuencia de Bases , Teorema de Bayes , Simulación por Computador , Genoma Humano , Genotipo , Humanos , Repeticiones de Minisatélite/genética , Secuenciación Completa del Genoma
9.
BMC Bioinformatics ; 19(1): 261, 2018 07 13.
Artículo en Inglés | MEDLINE | ID: mdl-30001702

RESUMEN

BACKGROUND: Detection of genomic inversions remains challenging. Many existing methods primarily target inzversions with a non repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombination (NAHR) inversions largely unexplored. RESULT: We present npInv, a novel tool specifically for detecting and genotyping NAHR inversion using long read sub-alignment of long read sequencing data. We benchmark npInv with other tools in both simulation and real data. We use npInv to generate a whole-genome inversion map for NA12878 consisting of 30 NAHR inversions (of which 15 are novel), including all previously known NAHR mediated inversions in NA12878 with flanking IR less than 7kb. Our genotyping accuracy on this dataset was 94%. We used PCR to confirm the presence of two of these novel inversions. We show that there is a near linear relationship between the length of flanking IR and the minimum inversion size, without inverted repeats. CONCLUSION: The application of npInv shows high accuracy in both simulation and real data. The results give deeper insight into understanding inversion.


Asunto(s)
Inversión Cromosómica/genética , Genotipo , Humanos
10.
Bioinformatics ; 33(24): 3988-3990, 2017 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-28961965

RESUMEN

MOTIVATION: The recent introduction of a barcoding protocol for Oxford Nanopore sequencing has increased the versatility of the technology. Several bioinformatics tools have been developed to demultiplex barcoded reads, but none of them supports streaming analysis. This limits the use of multiplexed sequencing in real-time applications, which is one of the main advantages of the technology. RESULTS: We introduced npBarcode, an open source and cross-platform tool for barcode demultiplexing in streaming fashion that can be used to pipe data to further real-time analyses. The tool also provides a friendly graphical user interface by integrating the module into npReader, making possible to monitor the progress concurrently when the sequencing is still in progress. We show that our algorithm achieves accuracies at least as good as competing tools. AVAILABILITY AND IMPLEMENTATION: npBarcode is bundled in Japsa-a Java tools kit for genome analysis, and is freely available at https://github.com/mdcao/japsa. CONTACT: s.nguyen@uq.edu.au or l.coin@imb.uq.edu.au. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Procesamiento Automatizado de Datos , Nanoporos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Algoritmos , Reproducibilidad de los Resultados
11.
Mol Biol Evol ; 33(5): 1349-57, 2016 05.
Artículo en Inglés | MEDLINE | ID: mdl-26912811

RESUMEN

Methods for measuring genetic distances in phylogenetics are known to be sensitive to the evolutionary model assumed. However, there is a lack of established methodology to accommodate the trade-off between incorporating sufficient biological reality and avoiding model overfitting. In addition, as traditional methods measure distances based on the observed number of substitutions, their tend to underestimate distances between diverged sequences due to backward and parallel substitutions. Various techniques were proposed to correct this, but they lack the robustness against sequences that are distantly related and of unequal base frequencies. In this article, we present a novel genetic distance estimate based on information theory that overcomes the above two hurdles. Instead of examining the observed number of substitutions, this method estimates genetic distances using Shannon's mutual information. This naturally provides an effective framework for balancing model complexity and goodness of fit. Our distance estimate is shown to be approximately linear to elapsed time and hence is less sensitive to the divergence of sequence data and compositional biased sequences. Using extensive simulation data, we show that our method 1) consistently reconstructs more accurate phylogeny topologies than existing methods, 2) is robust in extreme conditions such as diverged phylogenies, unequal base frequencies data, and heterogeneous mutation patterns, and 3) scales well with large phylogenies.


Asunto(s)
Evolución Biológica , Modelos Genéticos , Análisis de Secuencia/métodos , Algoritmos , Composición de Base , Simulación por Computador , Evolución Molecular , Variación Genética , Teoría de la Información , Filogenia
12.
Brief Bioinform ; 16(2): 193-204, 2015 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-24504770

RESUMEN

Short tandem repeats are highly polymorphic and associated with a wide range of phenotypic variation, some of which cause neurodegenerative disease in humans. With advances in high-throughput sequencing technologies, there are novel opportunities to study genetic variation. While available sequencing technologies and bioinformatics tools provide options for mining high-throughput sequencing data, their suitability for analysis of repeat variation is an open question, with tools for quantifying variability in repetitive sequence still in their infancy. We present here a comprehensive survey and empirical evaluation of current sequencing technologies and bioinformatics tools in all stages of an analysis pipeline. While there is not one optimal pipeline to suit all circumstances, we find that the choice of alignment and repeat genotyping tools greatly impacts the accuracy and efficiency by which short tandem repeat variation can be detected. We further note that to detect variation relevant to many repeat diseases, it is essential to choose technologies that offer either long read-lengths or paired-end sequencing, coupled with specific genotyping tools.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/estadística & datos numéricos , Repeticiones de Microsatélite , Biología Computacional/métodos , Variación Genética , Humanos , Alineación de Secuencia/estadística & datos numéricos
13.
Bioinformatics ; 32(5): 764-6, 2016 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-26556383

RESUMEN

MOTIVATION: The recently released Oxford Nanopore MinION sequencing platform presents many innovative features opening up potential for a range of applications not previously possible. Among these features, the ability to sequence in real-time provides a unique opportunity for many time-critical applications. While many software packages have been developed to analyze its data, there is still a lack of toolkits that support the streaming and real-time analysis of MinION sequencing data. RESULTS: We developed npReader, an open-source software package to facilitate real-time analysis of MinION sequencing data. npReader can simultaneously extract sequence reads and stream them to downstream analysis pipelines while the samples are being sequenced on the MinION device. It provides a command line interface for easy integration into a bioinformatics work flow, as well as a graphical user interface which concurrently displays the statistics of the run. It also provides an application programming interface for development of streaming algorithms in order to fully utilize the extent of nanopore sequencing potential. AVAILABILITY AND IMPLEMENTATION: npReader is written in Java and is freely available at https://github.com/mdcao/npReader CONTACT: m.cao1@uq.edu.au or l.coin@imb.uq.edu.au.


Asunto(s)
Programas Informáticos , Algoritmos , Biología Computacional , Nanoporos , Análisis de Secuencia de ADN
14.
Nucleic Acids Res ; 42(3): e16, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24353318

RESUMEN

The advances of high-throughput sequencing offer an unprecedented opportunity to study genetic variation. This is challenged by the difficulty of resolving variant calls in repetitive DNA regions. We present a Bayesian method to estimate repeat-length variation from paired-end sequence read data. The method makes variant calls based on deviations in sequence fragment sizes, allowing the analysis of repeats at lengths of relevance to a range of phenotypes. We demonstrate the method's ability to detect and quantify changes in repeat lengths from short read genomic sequence data across genotypes. We use the method to estimate repeat variation among 12 strains of Arabidopsis thaliana and demonstrate experimentally that our method compares favourably against existing methods. Using this method, we have identified all repeats across the genome, which are likely to be polymorphic. In addition, our predicted polymorphic repeats also included the only known repeat expansion in A. thaliana, suggesting an ability to discover potential unstable repeats.


Asunto(s)
Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Secuencias Repetidas en Tándem , Arabidopsis/genética , Teorema de Bayes , Programas Informáticos
15.
Genome Biol ; 25(1): 209, 2024 Aug 06.
Artículo en Inglés | MEDLINE | ID: mdl-39107817

RESUMEN

Pangenome inference is an indispensable step in bacterial genomics, yet its scalability poses a challenge due to the rapid growth of genomic collections. This paper presents PanTA, a software package designed for constructing pangenomes of large bacterial datasets, showing unprecedented efficiency levels multiple times higher than existing tools. PanTA introduces a novel mechanism to construct the pangenome progressively without rebuilding the accumulated collection from scratch. The progressive mode is shown to consume orders of magnitude less computational resources than existing solutions in managing growing datasets. The software is open source and is publicly available at https://github.com/amromics/panta and at 10.6084/m9.figshare.23724705 .


Asunto(s)
Genoma Bacteriano , Programas Informáticos , Genómica/métodos , Bacterias/genética , Filogenia
16.
iScience ; 27(9): 110623, 2024 Sep 20.
Artículo en Inglés | MEDLINE | ID: mdl-39228791

RESUMEN

Machine learning has the potential to be a powerful tool in the fight against antimicrobial resistance (AMR), a critical global health issue. Machine learning can identify resistance mechanisms from DNA sequence data without prior knowledge. The first step in building a machine learning model is a feature extraction from sequencing data. Traditional methods like single nucleotide polymorphism (SNP) calling and k-mer counting yield numerous, often redundant features, complicating prediction and analysis. In this paper, we propose PanKA, a method using the pangenome to extract a concise set of relevant features for predicting AMR. PanKA not only enables fast model training and prediction but also improves accuracy. Applied to the Escherichia coli and Klebsiella pneumoniae bacterial species, our model is more accurate than conventional and state-of-the-art methods in predicting AMR.

17.
BMC Genomics ; 14: 76, 2013 Feb 02.
Artículo en Inglés | MEDLINE | ID: mdl-23374135

RESUMEN

BACKGROUND: Among repetitive genomic sequence, the class of tri-nucleotide repeats has received much attention due to their association with human diseases. Tri-nucleotide repeat diseases are caused by excessive sequence length variability; diseases such as Huntington's disease and Fragile X syndrome are tied to an increase in the number of repeat units in a tract. Motivated by the recent discovery of a tri-nucleotide repeat associated genetic defect in Arabidopsis thaliana, this study takes a cross-species approach to investigating these repeat tracts, with the goal of using commonalities between species to identify potential disease-related properties. RESULTS: We find that statistical enrichment in regulatory function associations for coding region repeats - previously observed in human - is consistent across multiple organisms. By distinguishing between homo-amino acid tracts that are encoded by tri-nucleotide repeats, and those encoded by varying codons, we show that amino acid repeats - not tri-nucleotide repeats - fully explain these regulatory associations. Using this same separation between repeat- and non-repeat-encoded homo-amino acid tracts, we show that poly-glutamine tracts are disproportionately encoded by tri-nucleotide repeats, and those tracts that are encoded by tri-nucleotide repeats are also significantly longer; these results are consistent across multiple species. CONCLUSION: These findings establish similarities in tri-nucleotide repeats across species at the level of protein functionality and protein sequence. The tendency of tri-nucleotide repeats to encode longer poly-glutamine tracts indicates a link with the poly-glutamine repeat diseases. The cross-species nature of this tendency suggests that unknown repeat diseases are yet to be uncovered in other species. Future discoveries of new non-human repeat associated defects may provide the breadth of information needed to unravel the mechanisms that underpin this class of human disease.


Asunto(s)
Péptidos/genética , Animales , Arabidopsis/genética , Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Frecuencia de los Genes , Genes , Inestabilidad Genómica , Humanos , Ratones , Sistemas de Lectura Abierta , Fenotipo , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Saccharomyces cerevisiae/genética , Repeticiones de Trinucleótidos
18.
Nat Med ; 28(8): 1619-1629, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35970920

RESUMEN

Checkpoint inhibitor (CPI) therapies provide limited benefit to patients with tumors of low immune reactivity. T cell-inducing vaccines hold promise to exert long-lasting disease control in combination with CPI therapy. Safety, tolerability and recommended phase 2 dose (RP2D) of an individualized, heterologous chimpanzee adenovirus (ChAd68) and self-amplifying mRNA (samRNA)-based neoantigen vaccine in combination with nivolumab and ipilimumab were assessed as primary endpoints in an ongoing phase 1/2 study in patients with advanced metastatic solid tumors (NCT03639714). The individualized vaccine regimen was safe and well tolerated, with no dose-limiting toxicities. Treatment-related adverse events (TRAEs) >10% included pyrexia, fatigue, musculoskeletal and injection site pain and diarrhea. Serious TRAEs included one count each of pyrexia, duodenitis, increased transaminases and hyperthyroidism. The RP2D was 1012 viral particles (VP) ChAd68 and 30 µg samRNA. Secondary endpoints included immunogenicity, feasibility of manufacturing and overall survival (OS). Vaccine manufacturing was feasible, with vaccination inducing long-lasting neoantigen-specific CD8 T cell responses. Several patients with microsatellite-stable colorectal cancer (MSS-CRC) had improved OS. Exploratory biomarker analyses showed decreased circulating tumor DNA (ctDNA) in patients with prolonged OS. Although small study size limits statistical and translational analyses, the increased OS observed in MSS-CRC warrants further exploration in larger randomized studies.


Asunto(s)
Neoplasias Colorrectales , Pan troglodytes , Adenoviridae/genética , Animales , Neoplasias Colorrectales/tratamiento farmacológico , Fiebre , Humanos , ARN Mensajero/uso terapéutico
19.
Adv Exp Med Biol ; 696: 657-66, 2011.
Artículo en Inglés | MEDLINE | ID: mdl-21431607

RESUMEN

A biological compression model, expert model, is presented which is superior to existing compression algorithms in both compression performance and speed. The model is able to compress whole eukaryotic genomes. Most importantly, the model provides a framework for knowledge discovery from biological data. It can be used for repeat element discovery, sequence alignment and phylogenetic analysis. We demonstrate that the model can handle statistically biased sequences and distantly related sequences where conventional knowledge discovery tools often fail.


Asunto(s)
Algoritmos , Compresión de Datos/estadística & datos numéricos , Biología Computacional , Sistemas Especialistas , Genoma Humano , Genómica/estadística & datos numéricos , Humanos , Teoría de la Información , Bases del Conocimiento , Modelos Genéticos , Modelos Estadísticos , Filogenia , Secuencias Repetitivas de Ácidos Nucleicos , Alineación de Secuencia/estadística & datos numéricos
20.
BMC Bioinformatics ; 11: 599, 2010 Dec 16.
Artículo en Inglés | MEDLINE | ID: mdl-21159205

RESUMEN

BACKGROUND: Traditional genome alignment methods consider sequence alignment as a variation of the string edit distance problem, and perform alignment by matching characters of the two sequences. They are often computationally expensive and unable to deal with low information regions. Furthermore, they lack a well-principled objective function to measure the performance of sets of parameters. Since genomic sequences carry genetic information, this article proposes that the information content of each nucleotide in a position should be considered in sequence alignment. An information-theoretic approach for pairwise genome local alignment, namely XMAligner, is presented. Instead of comparing sequences at the character level, XMAligner considers a pair of nucleotides from two sequences to be related if their mutual information in context is significant. The information content of nucleotides in sequences is measured by a lossless compression technique. RESULTS: Experiments on both simulated data and real data show that XMAligner is superior to conventional methods especially on distantly related sequences and statistically biased data. XMAligner can align sequences of eukaryote genome size with only a modest hardware requirement. Importantly, the method has an objective function which can obviate the need to choose parameter values for high quality alignment. The alignment results from XMAligner can be integrated into a visualisation tool for viewing purpose. CONCLUSIONS: The information-theoretic approach for sequence alignment is shown to overcome the mentioned problems of conventional character matching alignment methods. The article shows that, as genomic sequences are meant to carry information, considering the information content of nucleotides is helpful for genomic sequence alignment. AVAILABILITY: Downloadable binaries, documentation and data can be found at ftp://ftp.infotech.monash.edu.au/software/DNAcompress-XM/XMAligner/.


Asunto(s)
Algoritmos , Compresión de Datos , Alineación de Secuencia/métodos , Secuencia de Bases , Genómica/métodos , Modelos Teóricos , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA