Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 42
Filtrar
1.
Nucleic Acids Res ; 51(1): e6, 2023 01 11.
Artigo em Inglês | MEDLINE | ID: mdl-36395816

RESUMO

With more and more data being collected, modern network representations exploit the complementary nature of different data sources as well as similarities across patients. We here introduce the Variation of information fused Layers of Networks algorithm (ViLoN), a novel network-based approach for the integration of multiple molecular profiles. As a key innovation, it directly incorporates prior functional knowledge (KEGG, GO). In the constructed network of patients, patients are represented by networks of pathways, comprising genes that are linked by common functions and joint regulation in the disease. Patient stratification remains a key challenge both in the clinic and for research on disease mechanisms and treatments. We thus validated ViLoN for patient stratification on multiple data type combinations (gene expression, methylation, copy number), showing substantial improvements and consistently competitive performance for all. Notably, the incorporation of prior functional knowledge was critical for good results in the smaller cohorts (rectum adenocarcinoma: 90, esophageal carcinoma: 180), where alternative methods failed.


Assuntos
Algoritmos , Neoplasias Esofágicas , Humanos , Neoplasias Esofágicas/tratamento farmacológico , Neoplasias Esofágicas/genética , Neoplasias Esofágicas/patologia , Redes Reguladoras de Genes , Estudos de Coortes
2.
PLoS Comput Biol ; 19(6): e1011188, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37327238

RESUMO

In clinical neuroscience, epileptic seizures have been associated with the sudden emergence of coupled activity across the brain. The resulting functional networks-in which edges indicate strong enough coupling between brain regions-are consistent with the notion of percolation, which is a phenomenon in complex networks corresponding to the sudden emergence of a giant connected component. Traditionally, work has concentrated on noise-free percolation with a monotonic process of network growth, but real-world networks are more complex. We develop a class of random graph hidden Markov models (RG-HMMs) for characterizing percolation regimes in noisy, dynamically evolving networks in the presence of edge birth and edge death. This class is used to understand the type of phase transitions undergone in a seizure, and in particular, distinguishing between different percolation regimes in epileptic seizures. We develop a hypothesis testing framework for inferring putative percolation mechanisms. As a necessary precursor, we present an EM algorithm for estimating parameters from a sequence of noisy networks only observed at a longitudinal subsampling of time points. Our results suggest that different types of percolation can occur in human seizures. The type inferred may suggest tailored treatment strategies and provide new insights into the fundamental science of epilepsy.


Assuntos
Epilepsia , Convulsões , Humanos , Encéfalo , Transição de Fase , Algoritmos
3.
J Phys Chem A ; 128(30): 6116-6123, 2024 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-39008894

RESUMO

We demonstrate the use of gradient-boosted ensemble models that accurately predict emission wavelengths in benzobis[1,2-d:4,5-d']oxazole (BBO) based fluorescent emitters. We have curated a database of 50 molecules from previously published data by the Jeffries-EL group using density functional theory (DFT) computed ground and excited state features. We consider two machine learning (ML) models based on (i) whole cruciform molecules and (ii) their constituent fragment molecules. Both ML models provide accurate predictions with root-mean-square errors between 30 and 36 nm, competitive with state-of-the-art deep learning models trained on orders of magnitude more molecules, and this accuracy holds even when tested on four new BBO emitters unseen by the models. We also provide an interpretable feature importance analysis and discuss the relevant relationships between DFT and changes in predicted emission wavelength.

4.
Clin Infect Dis ; 76(3): e400-e408, 2023 02 08.
Artigo em Inglês | MEDLINE | ID: mdl-35616119

RESUMO

BACKGROUND: The Omicron variant of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is highly transmissible in vaccinated and unvaccinated populations. The dynamics that govern its establishment and propensity toward fixation (reaching 100% frequency in the SARS-CoV-2 population) in communities remain unknown. Here, we describe the dynamics of Omicron at 3 institutions of higher education (IHEs) in the greater Boston area. METHODS: We use diagnostic and variant-specifying molecular assays and epidemiological analytical approaches to describe the rapid dominance of Omicron following its introduction into 3 IHEs with asymptomatic surveillance programs. RESULTS: We show that the establishment of Omicron at IHEs precedes that of the state and region and that the time to fixation is shorter at IHEs (9.5-12.5 days) than in the state (14.8 days) or region. We show that the trajectory of Omicron fixation among university employees resembles that of students, with a 2- to 3-day delay. Finally, we compare cycle threshold values in Omicron vs Delta variant cases on college campuses and identify lower viral loads among college affiliates who harbor Omicron infections. CONCLUSIONS: We document the rapid takeover of the Omicron variant at IHEs, reaching near-fixation within the span of 9.5-12.5 days despite lower viral loads, on average, than the previously dominant Delta variant. These findings highlight the transmissibility of Omicron, its propensity to rapidly dominate small populations, and the ability of robust asymptomatic surveillance programs to offer early insights into the dynamics of pathogen arrival and spread.


Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , SARS-CoV-2/genética , Universidades , Boston
5.
PLoS Comput Biol ; 18(9): e1010434, 2022 09.
Artigo em Inglês | MEDLINE | ID: mdl-36048890

RESUMO

The reproductive number is an important metric that has been widely used to quantify the infectiousness of communicable diseases. The time-varying instantaneous reproductive number is useful for monitoring the real-time dynamics of a disease to inform policy making for disease control. Local estimation of this metric, for instance at a county or city level, allows for more targeted interventions to curb transmission. However, simultaneous estimation of local reproductive numbers must account for potential sources of heterogeneity in these time-varying quantities-a key element of which is human mobility. We develop a statistical method that incorporates human mobility between multiple regions for estimating region-specific instantaneous reproductive numbers. The model also can account for exogenous cases imported from outside of the regions of interest. We propose two approaches to estimate the reproductive numbers, with mobility data used to adjust incidence in the first approach and to inform a formal priori distribution in the second (Bayesian) approach. Through a simulation study, we show that region-specific reproductive numbers can be well estimated if human mobility is reasonably well approximated by available data. We use this approach to estimate the instantaneous reproductive numbers of COVID-19 for 14 counties in Massachusetts using CDC case report data and the human mobility data collected by SafeGraph. We found that, accounting for mobility, our method produces estimates of reproductive numbers that are distinct across counties. In contrast, independent estimation of county-level reproductive numbers tends to produce similar values, as trends in county case-counts for the state are fairly concordant. These approaches can also be used to estimate any heterogeneity in transmission, for instance, age-dependent instantaneous reproductive number estimates. As people are more mobile and interact frequently in ways that permit transmission, it is important to account for this in the estimation of the reproductive number.


Assuntos
COVID-19 , Doenças Transmissíveis , Teorema de Bayes , COVID-19/epidemiologia , Humanos , Reprodução , SARS-CoV-2
6.
Am J Public Health ; 112(2): 277-283, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35080960

RESUMO

Objectives. To develop an approach to project quarantine needs during an outbreak, particularly for communally housed individuals who interact with outside individuals. Methods. We developed a method that uses basic surveillance data to do short-term projections of future quarantine needs. The development of this method was rigorous, but it is conceptually simple and easy to implement and allows one to anticipate potential superspreading events. We demonstrate how this method can be used with data from the fall 2020 semester of a large urban university in Boston, Massachusetts, that provided quarantine housing for students living on campus in response to the COVID-19 pandemic. Our approach accounted for potentially infectious interactions between individuals living in university housing and those who did not. Results. Our approach was able to accurately project 10-day-ahead quarantine utilization for on-campus students in a large urban university. Our projections were most accurate when we anticipated weekend superspreading events around holidays. Conclusions. We provide an easy-to-use software tool to project quarantine utilization for institutions that can account for mixing with outside populations. This software tool has potential application for universities, corrections facilities, and the military. (Am J Public Health. 2022;112(2):277-283. https://doi.org/10.2105/AJPH.2021.306573).


Assuntos
Previsões/métodos , Quarentena/tendências , Software , Boston/epidemiologia , Necessidades e Demandas de Serviços de Saúde/tendências , Habitação/tendências , Humanos , Universidades
7.
PLoS Comput Biol ; 17(1): e1008545, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-33503024

RESUMO

We investigate the source detection problem in epidemiology, which is one of the most important issues for control of epidemics. Mathematically, we reformulate the problem as one of identifying the relevant component in a multivariate Gaussian mixture model. Focusing on the study of cholera and diseases with similar modes of transmission, we calibrate the parameters of our mixture model using human mobility networks within a stochastic, spatially explicit epidemiological model for waterborne disease. Furthermore, we adopt a Bayesian perspective, so that prior information on source location can be incorporated (e.g., reflecting the impact of local conditions). Posterior-based inference is performed, which permits estimates in the form of either individual locations or regions. Importantly, our estimator only requires first-arrival times of the epidemic by putative observers, typically located only at a small proportion of nodes. The proposed method is demonstrated within the context of the 2000-2002 cholera outbreak in the KwaZulu-Natal province of South Africa.


Assuntos
Transmissão de Doença Infecciosa , Epidemias , Modelos Estatísticos , Vigilância em Saúde Pública/métodos , Teorema de Bayes , Cólera/epidemiologia , Cólera/prevenção & controle , Cólera/transmissão , Biologia Computacional , Busca de Comunicante , Surtos de Doenças/prevenção & controle , Surtos de Doenças/estatística & dados numéricos , Transmissão de Doença Infecciosa/prevenção & controle , Transmissão de Doença Infecciosa/estatística & dados numéricos , Epidemias/prevenção & controle , Epidemias/estatística & dados numéricos , Humanos , Dinâmica Populacional , África do Sul , Viagem
8.
Philos Trans A Math Phys Eng Sci ; 380(2233): 20210303, 2022 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-35965456

RESUMO

A valuable metric in understanding local infectious disease dynamics is the local time-varying reproduction number, i.e. the expected number of secondary local cases caused by each infected individual. Accurate estimation of this quantity requires distinguishing cases arising from local transmission from those imported from elsewhere. Realistically, we can expect identification of cases as local or imported to be imperfect. We study the propagation of such errors in estimation of the local time-varying reproduction number. In addition, we propose a Bayesian framework for estimation of the true local time-varying reproduction number when identification errors exist. And we illustrate the practical performance of our estimator through simulation studies and with outbreaks of COVID-19 in Hong Kong and Victoria, Australia. This article is part of the theme issue 'Technical challenges of modelling real-life epidemics and examples of overcoming these'.


Assuntos
COVID-19 , Doenças Transmissíveis , Teorema de Bayes , COVID-19/epidemiologia , Doenças Transmissíveis/epidemiologia , Surtos de Doenças , Humanos , Reprodução
9.
Genet Epidemiol ; 44(4): 352-367, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32100372

RESUMO

We propose a novel variant set test for rare-variant association studies, which leverages multiple single-nucleotide variant (SNV) annotations. Our approach optimizes a convex combination of different sequence kernel association test (SKAT) statistics, where each statistic is constructed from a different annotation and combination weights are optimized through a multiple kernel learning algorithm. The combination test statistic is evaluated empirically through data splitting. In simulations, we find our method preserves type I error at α=2.5×10-6 and has greater power than SKAT(-O) when SNV weights are not misspecified and sample sizes are large ( N≥5,000 ). We utilize our method in the Framingham Heart Study (FHS) to identify SNV sets associated with fasting glucose. While we are unable to detect any genome-wide significant associations between fasting glucose and 4-kb windows of rare variants ( p<10-7 ) in 6,419 FHS participants, our method identifies suggestive associations between fasting glucose and rare variants near ROCK2 ( p=2.1×10-5 ) and within CPLX1 ( p=5.3×10-5 ). These two genes were previously reported to be involved in obesity-mediated insulin resistance and glucose-induced insulin secretion by pancreatic beta-cells, respectively. These findings will need to be replicated in other cohorts and validated by functional genomic studies.


Assuntos
Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Proteínas Adaptadoras de Transporte Vesicular/genética , Algoritmos , Glicemia/análise , Estudo de Associação Genômica Ampla , Humanos , Resistência à Insulina , Células Secretoras de Insulina/citologia , Células Secretoras de Insulina/metabolismo , Estudos Longitudinais , Modelos Estatísticos , Proteínas do Tecido Nervoso/genética , Obesidade/genética , Obesidade/patologia , Quinases Associadas a rho/genética
10.
Biometrics ; 74(4): 1351-1361, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-29772079

RESUMO

Cellular mechanism-of-action is of fundamental concern in many biological studies. It is of particular interest for identifying the cause of disease and learning the way in which treatments act against disease. However, pinpointing such mechanisms is difficult, due to the fact that small perturbations to the cell can have wide-ranging downstream effects. Given a snapshot of cellular activity, it can be challenging to tell where a disturbance originated. The presence of an ever-greater variety of high-throughput biological data offers an opportunity to examine cellular behavior from multiple angles, but also presents the statistical challenge of how to effectively analyze data from multiple sources. In this setting, we propose a method for mechanism-of-action inference by extending network filtering to multi-attribute data. We first estimate a joint Gaussian graphical model across multiple data types using penalized regression and filter for network effects. We then apply a set of likelihood ratio tests to identify the most likely site of the original perturbation. In addition, we propose a conditional testing procedure to allow for detection of multiple perturbations. We demonstrate this methodology on paired gene expression and methylation data from The Cancer Genome Atlas (TCGA).


Assuntos
Biometria/métodos , Simulação por Computador/estatística & dados numéricos , Biologia de Sistemas/métodos , Fenômenos Fisiológicos Celulares , Biologia Computacional/métodos , Metilação de DNA , Interpretação Estatística de Dados , Perfilação da Expressão Gênica , Humanos , Neoplasias/genética , Análise de Regressão
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA