Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Malar J ; 17(1): 299, 2018 Aug 17.
Artigo em Inglês | MEDLINE | ID: mdl-30119664

RESUMO

BACKGROUND: Much of the extensive research regarding transmission of malaria is underpinned by mathematical modelling. Compartmental models, which focus on interactions and transitions between population strata, have been a mainstay of such modelling for more than a century. However, modellers are increasingly adopting agent-based approaches, which model hosts, vectors and/or their interactions on an individual level. One reason for the increasing popularity of such models is their potential to provide enhanced realism by allowing system-level behaviours to emerge as a consequence of accumulated individual-level interactions, as occurs in real populations. METHODS: A systematic review of 90 articles published between 1998 and May 2018 was performed, characterizing agent-based models (ABMs) relevant to malaria transmission. The review provides an overview of approaches used to date, determines the advantages of these approaches, and proposes ideas for progressing the field. RESULTS: The rationale for ABM use over other modelling approaches centres around three points: the need to accurately represent increased stochasticity in low-transmission settings; the benefits of high-resolution spatial simulations; and heterogeneities in drug and vaccine efficacies due to individual patient characteristics. The success of these approaches provides avenues for further exploration of agent-based techniques for modelling malaria transmission. Potential extensions include varying elimination strategies across spatial landscapes, extending the size of spatial models, incorporating human movement dynamics, and developing increasingly comprehensive parameter estimation and optimization techniques. CONCLUSION: Collectively, the literature covers an extensive array of topics, including the full spectrum of transmission and intervention regimes. Bringing these elements together under a common framework may enhance knowledge of, and guide policies towards, malaria elimination. However, because of the diversity of available models, endorsing a standardized approach to ABM implementation may not be possible. Instead it is recommended that model frameworks be contextually appropriate and sufficiently described. One key recommendation is to develop enhanced parameter estimation and optimization techniques. Extensions of current techniques will provide the robust results required to enhance current elimination efforts.


Assuntos
Transmissão de Doença Infecciosa , Interações Hospedeiro-Parasita , Malária/transmissão , Modelos Estatísticos , Mosquitos Vetores/fisiologia , Animais , Humanos
2.
BMC Genomics ; 18(1): 259, 2017 03 27.
Artigo em Inglês | MEDLINE | ID: mdl-28347272

RESUMO

BACKGROUND: Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. RESULTS: We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. CONCLUSIONS: This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.


Assuntos
Genoma , RNA não Traduzido/metabolismo , Animais , Teorema de Bayes , Sítios de Ligação , Sequência Conservada , Humanos , Íntrons , Camundongos , Desenvolvimento Muscular/genética , Conformação de Ácido Nucleico , RNA não Traduzido/química , RNA não Traduzido/genética , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Interface Usuário-Computador , Peixe-Zebra/genética
3.
Proc Natl Acad Sci U S A ; 110(33): 13428-33, 2013 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-23878210

RESUMO

Eradication of an invasive species can provide significant environmental, economic, and social benefits, but eradication programs often fail. Constant and careful monitoring improves the chance of success, but an invasion may seem to be in decline even when it is expanding in abundance or spatial extent. Determining whether an invasion is in decline is a challenging inference problem for two reasons. First, it is typically infeasible to regularly survey the entire infested region owing to high cost. Second, surveillance methods are imperfect and fail to detect some individuals. These two factors also make it difficult to determine why an eradication program is failing. Agent-based methods enable inferences to be made about the locations of undiscovered individuals over time to identify trends in invader abundance and spatial extent. We develop an agent-based Bayesian method and apply it to Australia's largest eradication program: the campaign to eradicate the red imported fire ant (Solenopsis invicta) from Brisbane. The invasion was deemed to be almost eradicated in 2004 but our analyses indicate that its geographic range continued to expand despite a sharp decline in number of nests. We also show that eradication would probably have been achieved with a relatively small increase in the area searched and treated. Our results demonstrate the importance of inferring temporal and spatial trends in ongoing invasions. The method can handle incomplete observations and takes into account the effects of human intervention. It has the potential to transform eradication practices.


Assuntos
Formigas/fisiologia , Conservação dos Recursos Naturais/métodos , Monitoramento Ambiental/métodos , Controle de Insetos/métodos , Espécies Introduzidas/estatística & dados numéricos , Modelos Biológicos , Animais , Teorema de Bayes , Dinâmica Populacional , Queensland
4.
Cladistics ; 31(4): 438-440, 2015 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-34772263

RESUMO

A recent article published in Cladistics is critical of a number of heuristic methods for phylogenetic inference based on parsimony scores. One of my papers is among those criticized, and I would appreciate the opportunity to make a public response. The specific criticism is that I have re-invented an algorithm for economizing parsimony calculations on trees that differ by a subtree pruning and regrafting (SPR) rearrangement. This criticism is justified, and I apologize for incorrectly claiming originality for my presentation of this algorithm. However, I would like to clarify the intent of my paper, if I can do so without detracting from the sincerity of my apology. My paper is not about that algorithm, nor even primarily about parsimony. Rather, it is about a novel strategy for Markov chain Monte Carlo (MCMC) sampling in a state space consisting of trees. The sampler involves drawing from conditional distributions over sets of trees: a Gibbs-like strategy that had not previously been used to sample tree-space. I would like to see this technique incorporated into MCMC samplers for phylogenetics, as it may have advantages over commonly used Metropolis-like strategies. I have recently used it to sample phylogenies of a biological invasion, and I am finding many applications for it in agent-based Bayesian ecological modelling. It is thus my contention that my 2005 paper retains substantial value.

5.
BMC Bioinformatics ; 13: 179, 2012 Jul 27.
Artigo em Inglês | MEDLINE | ID: mdl-22838505

RESUMO

BACKGROUND: Many problems in bioinformatics involve classification based on features such as sequence, structure or morphology. Given multiple classifiers, two crucial questions arise: how does their performance compare, and how can they best be combined to produce a better classifier? A classifier can be evaluated in terms of sensitivity and specificity using benchmark, or gold standard, data, that is, data for which the true classification is known. However, a gold standard is not always available. Here we demonstrate that a Bayesian model for comparing medical diagnostics without a gold standard can be successfully applied in the bioinformatics domain, to genomic scale data sets. We present a new implementation, which unlike previous implementations is applicable to any number of classifiers. We apply this model, for the first time, to the problem of finding the globally optimal logical combination of classifiers. RESULTS: We compared three classifiers of protein subcellular localisation, and evaluated our estimates of sensitivity and specificity against estimates obtained using a gold standard. The method overestimated sensitivity and specificity with only a small discrepancy, and correctly ranked the classifiers. Diagnostic tests for swine flu were then compared on a small data set. Lastly, classifiers for a genome-wide association study of macular degeneration with 541094 SNPs were analysed. In all cases, run times were feasible, and results precise. The optimal logical combination of classifiers was also determined for all three data sets. Code and data are available from http://bioinformatics.monash.edu.au/downloads/. CONCLUSIONS: The examples demonstrate the methods are suitable for both small and large data sets, applicable to the wide range of bioinformatics classification problems, and robust to dependence between classifiers. In all three test cases, the globally optimal logical combination of the classifiers was found to be their union, according to three out of four ranking criteria. We propose as a general rule of thumb that the union of classifiers will be close to optimal.


Assuntos
Biologia Computacional/métodos , Algoritmos , Teorema de Bayes , Classificação/métodos , Estudo de Associação Genômica Ampla , Humanos , Degeneração Macular/genética , Polimorfismo de Nucleotídeo Único , Proteínas/análise , Sensibilidade e Especificidade
6.
Bioinformatics ; 27(5): 604-10, 2011 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-21208984

RESUMO

MOTIVATION: The analysis of multiple sequence alignments is allowing researchers to glean valuable insights into evolution, as well as identify genomic regions that may be functional, or discover novel classes of functional elements. Understanding the distribution of conservation levels that constitutes the evolutionary landscape is crucial to distinguishing functional regions from non-functional. Recent evidence suggests that a binary classification of evolutionary rates is inappropriate for this purpose and finds only highly conserved functional elements. Given that the distribution of evolutionary rates is multi-modal, determining the number of modes is of paramount concern. Through simulation, we evaluate the performance of a number of information criterion approaches derived from MCMC simulations in determining the dimension of a model. RESULTS: We utilize a deviance information criterion (DIC) approximation that is more robust than the approximations from other information criteria, and show our information criteria approximations do not produce superfluous modes when estimating conservation distributions under a variety of circumstances. We analyse the distribution of conservation for a multiple alignment comprising four primate species and mouse, and repeat this on two additional multiple alignments of similar species. We find evidence of six distinct classes of evolutionary rates that appear to be robust to the species used. AVAILABILITY: Source code and data are available at http://dl.dropbox.com/u/477240/changept.zip.


Assuntos
Evolução Molecular , Modelos Estatísticos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Algoritmos , Animais , Teorema de Bayes , Biologia Computacional/métodos , Simulação por Computador , DNA/análise , Genômica/métodos , Camundongos , Primatas
7.
Mol Biol Evol ; 27(4): 942-53, 2010 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-19955480

RESUMO

The proportion of functional sequence in the human genome is currently a subject of debate. The most widely accepted figure is that approximately 5% is under purifying selection. In Drosophila, estimates are an order of magnitude higher, though this corresponds to a similar quantity of sequence. These estimates depend on the difference between the distribution of genomewide evolutionary rates and that observed in a subset of sequences presumed to be neutrally evolving. Motivated by the widening gap between these estimates and experimental evidence of genome function, especially in mammals, we developed a sensitive technique for evaluating such distributions and found that they are much more complex than previously apparent. We found strong evidence for at least nine well-resolved evolutionary rate classes in an alignment of four Drosophila species and at least seven classes in an alignment of four mammals, including human. We also identified at least three rate classes in human ancestral repeats. By positing that the largest of these ancestral repeat classes is neutrally evolving, we estimate that the proportion of nonneutrally evolving sequence is 30% of human ancestral repeats and 45% of the aligned portion of the genome. However, we also question whether any of the classes represent neutrally evolving sequences and argue that a plausible alternative is that they reflect variable structure-function constraints operating throughout the genomes of complex organisms.


Assuntos
Drosophila/genética , Mamíferos/genética , Animais , Sequência Conservada , Evolução Molecular , Genoma Humano , Humanos , Recombinação Genética , Alinhamento de Sequência
8.
R Soc Open Sci ; 7(3): 191315, 2020 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-32269786

RESUMO

The behaviour of many processes in science and engineering can be accurately described by dynamical system models consisting of a set of ordinary differential equations (ODEs). Often these models have several unknown parameters that are difficult to estimate from experimental data, in which case Bayesian inference can be a useful tool. In principle, exact Bayesian inference using Markov chain Monte Carlo (MCMC) techniques is possible; however, in practice, such methods may suffer from slow convergence and poor mixing. To address this problem, several approaches based on approximate Bayesian computation (ABC) have been introduced, including Markov chain Monte Carlo ABC (MCMC ABC) and sequential Monte Carlo ABC (SMC ABC). While the system of ODEs describes the underlying process that generates the data, the observed measurements invariably include errors. In this paper, we argue that several popular ABC approaches fail to adequately model these errors because the acceptance probability depends on the choice of the discrepancy function and the tolerance without any consideration of the error term. We observe that the so-called posterior distributions derived from such methods do not accurately reflect the epistemic uncertainties in parameter values. Moreover, we demonstrate that these methods provide minimal computational advantages over exact Bayesian methods when applied to two ODE epidemiological models with simulated data and one with real data concerning malaria transmission in Afghanistan.

9.
Epidemics ; 32: 100393, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32674025

RESUMO

Modern data and computational resources, coupled with algorithmic and theoretical advances to exploit these, allow disease dynamic models to be parameterised with increasing detail and accuracy. While this enhances models' usefulness in prediction and policy, major challenges remain. In particular, lack of identifiability of a model's parameters may limit the usefulness of the model. While lack of parameter identifiability may be resolved through incorporation into an inference procedure of prior knowledge, formulating such knowledge is often difficult. Furthermore, there are practical challenges associated with acquiring data of sufficient quantity and quality. Here, we discuss recent progress on these issues.


Assuntos
Doenças Transmissíveis/epidemiologia , Política de Saúde , Modelos Teóricos , Saúde Pública/estatística & dados numéricos , Teorema de Bayes , Humanos , Modelos Biológicos
10.
Genet Epidemiol ; 32(6): 513-9, 2008 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-18357613

RESUMO

The probabilities that two individuals share 0, 1, or 2 alleles identical by descent (IBD) at a given genotyped marker locus are quantities of fundamental importance for disease gene and quantitative trait mapping and in family-based tests of association. Until recently, genotyped markers were sufficiently sparse that founder haplotypes could be modelled as having been drawn from a population in linkage equilibrium for the purpose of estimating IBD probabilities. However, with the advent of high-throughput single nucleotide polymorphism genotyping assays, this is no longer a reasonable assumption. Indeed, the imminent arrival of individual sequencing will enable high-density single nucleotide polymorphism genotyping on a scale for which current algorithms are not equipped. In this paper, we present a simple new model in which founder haplotypes are modelled as a Markov chain. Another important innovation is that genotyping errors are explicitly incorporated into the model. We compare results obtained using the new model to those obtained using the popular genetic linkage analysis package Merlin, with and without using the cluster model of linkage disequilibrium that is incorporated into that program. We find that the new model results in accuracy approaching that of Merlin with haplotype blocks, but achieves this with orders of magnitude faster run times. Moreover, the new algorithm scales linearly with number of markers, irrespective of density, whereas Merlin scales supralinearly. We also confirm a previous finding that ignoring linkage disequilibrium in founder haplotypes can cause errors in the calculation of IBD probabilities.


Assuntos
Modelos Genéticos , Modelos Estatísticos , Polimorfismo de Nucleotídeo Único , Algoritmos , Alelos , Análise por Conglomerados , Simulação por Computador , Marcadores Genéticos , Haplótipos , Humanos , Desequilíbrio de Ligação , Cadeias de Markov , Característica Quantitativa Herdável
11.
Hum Genet ; 125(5-6): 591-604, 2009 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-19296132

RESUMO

Migraine is a painful disorder for which the etiology remains obscure. Diagnosis is largely based on International Headache Society criteria. However, no feature occurs in all patients who meet these criteria, and no single symptom is required for diagnosis. Consequently, this definition may not accurately reflect the phenotypic heterogeneity or genetic basis of the disorder. Such phenotypic uncertainty is typical for complex genetic disorders and has encouraged interest in multivariate statistical methods for classifying disease phenotypes. We applied three popular statistical phenotyping methods-latent class analysis, grade of membership and grade of membership "fuzzy" clustering (Fanny)-to migraine symptom data, and compared heritability and genome-wide linkage results obtained using each approach. Our results demonstrate that different methodologies produce different clustering structures and non-negligible differences in subsequent analyses. We therefore urge caution in the use of any single approach and suggest that multiple phenotyping methods be used.


Assuntos
Doenças em Gêmeos/genética , Ligação Genética , Padrões de Herança , Transtornos de Enxaqueca/genética , Modelos Estatísticos , Análise por Conglomerados , Feminino , Humanos , Masculino , Fenótipo , Gêmeos/genética
12.
Hum Genet ; 126(2): 277-88, 2009 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-19390863

RESUMO

Definition of disease phenotype is a necessary preliminary to research into genetic causes of a complex disease. Clinical diagnosis of migraine is currently based on diagnostic criteria developed by the International Headache Society. Previously, we examined the natural clustering of these diagnostic symptoms using latent class analysis (LCA) and found that a four-class model was preferred. However, the classes can be ordered such that all symptoms progressively intensify, suggesting that a single continuous variable representing disease severity may provide a better model. Here, we compare two models: item response theory and LCA, each constructed within a Bayesian context. A deviance information criterion is used to assess model fit. We phenotyped our population sample using these models, estimated heritability and conducted genome-wide linkage analysis using Merlin-qtl. LCA with four classes was again preferred. After transformation, phenotypic trait values derived from both models are highly correlated (correlation = 0.99) and consequently results from subsequent genetic analyses were similar. Heritability was estimated at 0.37, while multipoint linkage analysis produced genome-wide significant linkage to chromosome 7q31-q33 and suggestive linkage to chromosomes 1 and 2. We argue that such continuous measures are a powerful tool for identifying genes contributing to migraine susceptibility.


Assuntos
Transtornos de Enxaqueca/diagnóstico , Transtornos de Enxaqueca/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Teorema de Bayes , Análise por Conglomerados , Doenças em Gêmeos , Feminino , Ligação Genética , Predisposição Genética para Doença , Humanos , Escore Lod , Masculino , Pessoa de Meia-Idade , Transtornos de Enxaqueca/fisiopatologia , Fenótipo
13.
Sci Rep ; 9(1): 8938, 2019 06 20.
Artigo em Inglês | MEDLINE | ID: mdl-31222114

RESUMO

Accurate delimitation of the geographic range of a species is important for control of biological invasions, conservation of threatened species, and understanding species range dynamics under environmental change. However, estimating range boundaries is challenging because monitoring methods are imperfect, the area that might contain individuals is often incompletely surveyed, and species may have patchy distributions. In these circumstances, large areas can be surveyed without finding individuals despite occupancy extending beyond surveyed areas, resulting in underestimation of range limits. We developed a delimitation method that can be applied with imperfect survey data and patchy distributions. The approach is to construct polygons indicative of the geographic range of a species. Each polygon is associated with a specific probability such that each interior point of the polygon has at least that posterior probability of being interior to the true boundary according to a Bayesian model. The method uses the posterior distribution of latent quantities derived from an agent-based Bayesian model and calculates the posterior distribution of the range as a derived quantity from Markov chain Monte Carlo samples. An application of this method described here informed the Australian campaign to eradicate red imported fire ants (Solenopsis invicta).

14.
J Comput Biol ; 15(4): 407-30, 2008 May.
Artigo em Inglês | MEDLINE | ID: mdl-18435570

RESUMO

Evolutionary conservation is an important indicator of function and a major component of bioinformatic methods to identify non-protein-coding genes. We present a new Bayesian method for segmenting pairwise alignments of eukaryotic genomes while simultaneously classifying segments into slowly and rapidly evolving fractions. We also describe an information criterion similar to the Akaike Information Criterion (AIC) for determining the number of classes. Working with pairwise alignments enables detection of differences in conservation patterns among closely related species. We analyzed three whole-genome and three partial-genome pairwise alignments among eight Drosophila species. Three distinct classes of conservation level were detected. Sequences comprising the most slowly evolving component were consistent across a range of species pairs, and constituted approximately 62-66% of the D. melanogaster genome. Almost all (>90%) of the aligned protein-coding sequence is in this fraction, suggesting much of it (comprising the majority of the Drosophila genome, including approximately 56% of non-protein-coding sequences) is functional. The size and content of the most rapidly evolving component was species dependent, and varied from 1.6% to 4.8%. This fraction is also enriched for protein-coding sequence (while containing significant amounts of non-protein-coding sequence), suggesting it is under positive selection. We also classified segments according to conservation and GC content simultaneously. This analysis identified numerous sub-classes of those identified on the basis of conservation alone, but was nevertheless consistent with that classification. Software, data, and results available at www.maths.qut.edu.au/-keithj/. Genomic segments comprising the conservation classes available in BED format.


Assuntos
Drosophila melanogaster/genética , Evolução Molecular , Alinhamento de Sequência , Análise de Sequência de DNA , Animais , Genoma , Humanos , Dados de Sequência Molecular
15.
Methods Mol Biol ; 452: 207-29, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-18566767

RESUMO

Whole-genome comparisons among mammalian and other eukaryotic organisms have revealed that they contain large quantities of conserved non-protein-coding sequence. Although some of the functions of this non-coding DNA have been identified, there remains a large quantity of conserved genomic sequence that is of no known function. Moreover, the task of delineating the conserved sequences is non-trivial, particularly when some sequences are conserved in only a small number of lineages. Sequence segmentation is a statistical technique for identifying putative functional elements in genomes based on atypical sequence characteristics, such as conservation levels relative to other genomes, GC content, SNP frequency, and potentially many others. The publicly available program changept and associated programs use Bayesian multiple change-point analysis to delineate classes of genomic segments with similar characteristics, potentially representing new classes of non-coding RNAs (contact web site: http://silmaril.math.sci.qut.edu.au/~keith/) .


Assuntos
Composição de Bases , Genoma , Polimorfismo de Nucleotídeo Único , RNA não Traduzido/genética , Análise de Sequência/métodos , Software , Animais , Teorema de Bayes , Mamíferos
16.
PLoS One ; 13(12): e0208927, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30596668

RESUMO

Time series segmentation aims to identify segment boundary points in a time series, and to determine the dynamical properties corresponding to each segment. To segment time series data, this article presents a Bayesian change-point model in which the data within segments follows an autoregressive moving average (ARMA) model. A prior distribution is defined for the number of change-points, their positions, segment means and error terms. To quantify uncertainty about the location of change-points, the resulting posterior probability distributions are sampled using the Generalized Gibbs sampler Markov chain Monte Carlo technique. This methodology is illustrated by applying it to simulated data and to real data known as the well-log time series data. This well-log data records the measurements of nuclear magnetic response of underground rocks during the drilling of a well. Our approach has high sensitivity, and detects a larger number of change-points than have been identified by comparable methods in the existing literature.


Assuntos
Teorema de Bayes , Espectroscopia de Ressonância Magnética/estatística & dados numéricos , Modelos Estatísticos , Humanos , Cadeias de Markov , Método de Monte Carlo
17.
Methods Mol Biol ; 1525: 293-312, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-27896726

RESUMO

Many biological sequences have a segmental structure that can provide valuable clues to their content, structure, and function. The program changept is a tool for investigating the segmental structure of a sequence, and can also be applied to multiple sequences in parallel to identify a common segmental structure, thus providing a method for integrating multiple data types to identify functional elements in genomes. In the previous edition of this book, a command line interface for changept is described. Here we present a graphical user interface for this package, called changeptGUI. This interface also includes tools for pre- and post-processing of data and results to facilitate investigation of the number and characteristics of segment classes.


Assuntos
Biologia Computacional/métodos , Genoma/genética , Software , Interface Usuário-Computador
18.
PLoS One ; 12(3): e0173331, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28288164

RESUMO

Small molecule inhibitors, such as lapatinib, are effective against breast cancer in clinical trials, but tumor cells ultimately acquire resistance to the drug. Maintaining sensitization to drug action is essential for durable growth inhibition. Recently, adaptive reprogramming of signaling circuitry has been identified as a major cause of acquired resistance. We developed a computational framework using a Bayesian statistical approach to model signal rewiring in acquired resistance. We used the p1-model to infer potential aberrant gene-pairs with differential posterior probabilities of appearing in resistant-vs-parental networks. Results were obtained using matched gene expression profiles under resistant and parental conditions. Using two lapatinib-treated ErbB2-positive breast cancer cell-lines: SKBR3 and BT474, our method identified similar dysregulated signaling pathways including EGFR-related pathways as well as other receptor-related pathways, many of which were reported previously as compensatory pathways of EGFR-inhibition via signaling cross-talk. A manual literature survey provided strong evidence that aberrant signaling activities in dysregulated pathways are closely related to acquired resistance in EGFR tyrosine kinase inhibitors. Our approach predicted literature-supported dysregulated pathways complementary to both node-centric (SPIA, DAVID, and GATHER) and edge-centric (ESEA and PAGI) methods. Moreover, by proposing a novel pattern of aberrant signaling called V-structures, we observed that genes were dysregulated in resistant-vs-sensitive conditions when they were involved in the switch of dependencies from targeted to bypass signaling events. A literature survey of some important V-structures suggested they play a role in breast cancer metastasis and/or acquired resistance to EGFR-TKIs, where the mRNA changes of TGFBR2, LEF1 and TP53 in resistant-vs-sensitive conditions were related to the dependency switch from targeted to bypass signaling links. Our results suggest many signaling pathway structures are compromised in acquired resistance, and V-structures of aberrant signaling within/among those pathways may provide further insights into the bypass mechanism of targeted inhibition.


Assuntos
Antineoplásicos/uso terapêutico , Teorema de Bayes , Neoplasias da Mama/tratamento farmacológico , Resistencia a Medicamentos Antineoplásicos/genética , Regulação Neoplásica da Expressão Gênica , Neoplasias da Mama/genética , Feminino , Humanos , Probabilidade
19.
J Comput Biol ; 13(7): 1369-83, 2006 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-17037964

RESUMO

Eukaryotic genomes display segmental patterns of variation in various properties, including GC content and degree of evolutionary conservation. DNA segmentation algorithms are aimed at identifying statistically significant boundaries between such segments. Such algorithms may provide a means of discovering new classes of functional elements in eukaryotic genomes. This paper presents a model and an algorithm for Bayesian DNA segmentation and considers the feasibility of using it to segment whole eukaryotic genomes. The algorithm is tested on a range of simulated and real DNA sequences, and the following conclusions are drawn. Firstly, the algorithm correctly identifies non-segmented sequence, and can thus be used to reject the null hypothesis of uniformity in the property of interest. Secondly, estimates of the number and locations of change-points produced by the algorithm are robust to variations in algorithm parameters and initial starting conditions and correspond to real features in the data. Thirdly, the algorithm is successfully used to segment human chromosome 1 according to GC content, thus demonstrating the feasibility of Bayesian segmentation of eukaryotic genomes. The software described in this paper is available from the author's website (www.uq.edu.au/ approximately uqjkeith/) or upon request to the author.


Assuntos
Algoritmos , Biologia Computacional/métodos , Células Eucarióticas/metabolismo , Genoma/genética , Sequência de Bases , Complexo Principal de Histocompatibilidade/genética , Modelos Genéticos
20.
Nucleic Acids Res ; 32(3): e35, 2004 Feb 18.
Artigo em Inglês | MEDLINE | ID: mdl-14973330

RESUMO

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.


Assuntos
Mutagênese/genética , Análise de Sequência de DNA/métodos , Sequência Rica em At/genética , Algoritmos , Animais , Sequência de Bases , DNA Mitocondrial/genética , DNA de Protozoário/química , DNA de Protozoário/genética , Dictyostelium/genética , Sequência Rica em GC/genética , Humanos , Dados de Sequência Molecular , RNA de Transferência de Treonina/genética , Homologia de Sequência do Ácido Nucleico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA