Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
BMC Infect Dis ; 17(1): 521, 2017 07 26.
Artigo em Inglês | MEDLINE | ID: mdl-28747154

RESUMO

BACKGROUND: Mixing patterns of human populations play a crucial role in shaping the spreading paths of infectious diseases. The diffusion of mobile and wearable devices able to record close proximity interactions represents a great opportunity for gathering detailed data on social interactions and mixing patterns in human populations. The aim of this study is to investigate how social interactions are affected by the onset of symptomatic conditions and to what extent the heterogeneity in human behavior can reflect a different risk of infection. METHODS: We study the relation between individuals' social behavior and the onset of different symptoms, by making use of data collected in 2009 among students sharing a dormitory in a North America university campus. The dataset combines Bluetooth proximity records between study participants with self-reported daily records on their health state. Specifically, we investigate whether individuals' social activity significantly changes during different symptomatic conditions, including those defining Influenza-like illness, and highlight to what extent possible heterogeneities in social behaviors among individuals with similar age and daily routines may be responsible for a different risk of infection for influenza. RESULTS: Our results suggest that symptoms associated with Influenza-like illness can be responsible of a reduction of about 40% in the average duration of contacts and of 30% in the daily time spent in social interactions, possibly driven by the onset of fever. However, differences in the number of daily contacts were found to be not statistically significant. In addition, we found that individuals who experienced clinical influenza during the study period were characterized by a significantly higher social activity. In particular, both the number of person-to-person contacts and the time spent in social interactions emerged as significant risk factors for influenza infection. CONCLUSIONS: Our findings highlight that Influenza-like illness can remarkably reduce the social activity of individuals and strengthen the idea that the heterogeneity in social habits among individuals can significantly contribute in shaping differences among the individuals' risk of infection.


Assuntos
Influenza Humana/etiologia , Relações Interpessoais , Comportamento Social , Adulto , Febre/etiologia , Febre/psicologia , Habitação , Humanos , Influenza Humana/epidemiologia , Influenza Humana/psicologia , Influenza Humana/transmissão , Fatores de Risco , Autorrelato , Estudantes , Universidades
2.
BMC Bioinformatics ; 17(Suppl 16): 447, 2016 Dec 13.
Artigo em Inglês | MEDLINE | ID: mdl-28105912

RESUMO

BACKGROUND: Functional genomic and epigenomic research relies fundamentally on sequencing based methods like ChIP-seq for the detection of DNA-protein interactions. These techniques return large, high dimensional data sets with visually complex structures, such as multi-modal peaks extended over large genomic regions. Current tools for visualisation and data exploration represent and leverage these complex features only to a limited extent. RESULTS: We present DGW, an open source software package for simultaneous alignment and clustering of multiple epigenomic marks. DGW uses Dynamic Time Warping to adaptively rescale and align genomic distances which allows to group regions of interest with similar shapes, thereby capturing the structure of epigenomic marks. We demonstrate the effectiveness of the approach in a simulation study and on a real epigenomic data set from the ENCODE project. CONCLUSIONS: Our results show that DGW automatically recognises and aligns important genomic features such as transcription start sites and splicing sites from histone marks. DGW is available as an open source Python package.


Assuntos
Simulação por Computador , Epigenômica/métodos , Genoma Humano , Código das Histonas , Software , Imunoprecipitação da Cromatina , Análise por Conglomerados , DNA/metabolismo , Proteínas de Ligação a DNA/metabolismo , Epigênese Genética , Humanos , Leucemia/genética
3.
Bioinformatics ; 29(3): 407-8, 2013 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-23242262

RESUMO

UNLABELLED: We introduce a novel implementation in ANSI C of the MINE family of algorithms for computing maximal information-based measures of dependence between two variables in large datasets, with the aim of a low memory footprint and ease of integration within bioinformatics pipelines. We provide the libraries minerva (with the R interface) and minepy for Python, MATLAB, Octave and C++. The C solution reduces the large memory requirement of the original Java implementation, has good upscaling properties and offers a native parallelization for the R interface. Low memory requirements are demonstrated on the MINE benchmarks as well as on large ( = 1340) microarray and Illumina GAII RNA-seq transcriptomics datasets. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available for download under GPL3 licence at http://minepy.sourceforge.net for minepy and through the CRAN repository http://cran.r-project.org for the R package minerva. All software is multiplatform (MS Windows, Linux and OSX).


Assuntos
Software , Algoritmos , Biologia Computacional , Mineração de Dados , Perfilação da Expressão Gênica , Metagenoma
4.
Front Pharmacol ; 14: 1272091, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38239195

RESUMO

Introduction: Understanding drug exposure at disease target sites is pivotal to profiling new drug candidates in terms of tolerability and efficacy. Such quantification is particularly tedious for anti-tuberculosis (TB) compounds as the heterogeneous pulmonary microenvironment due to the infection may alter lung permeability and affect drug disposition. Murine models have been a longstanding support in TB research so far and are here used as human surrogates to unveil the distribution of several anti-TB compounds at the site-of-action via a novel and centralized PBPK design framework. Methods: As an intermediate approach between data-driven pharmacokinetic (PK) models and whole-body physiologically based (PB) PK models, we propose a parsimonious framework for PK investigation (minimal PBPK approach) that retains key physiological processes involved in TB disease, while reducing computational costs and prior knowledge requirements. By lumping together pulmonary TB-unessential organs, our minimal PBPK model counts 9 equations compared to the 36 of published full models, accelerating the simulation more than 3-folds in Matlab 2022b. Results: The model has been successfully tested and validated against 11 anti-TB compounds-rifampicin, rifapentine, pyrazinamide, ethambutol, isoniazid, moxifloxacin, delamanid, pretomanid, bedaquiline, OPC-167832, GSK2556286 - showing robust predictability power in recapitulating PK dynamics in mice. Structural inspections on the proposed design have ensured global identifiability and listed free fraction in plasma and blood-to-plasma ratio as top sensitive parameters for PK metrics. The platform-oriented implementation allows fast comparison of the compounds in terms of exposure and target attainment. Discrepancies in plasma and lung levels for the latest BPaMZ and HPMZ regimens have been analyzed in terms of their impact on preclinical experiment design and on PK/PD indices. Conclusion: The framework we developed requires limited drug- and species-specific information to reconstruct accurate PK dynamics, delivering a unified viewpoint on anti-TB drug distribution at the site-of-action and a flexible fit-for-purpose tool to accelerate model-informed drug design pipelines and facilitate translation into the clinic.

5.
Cancers (Basel) ; 13(14)2021 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-34298641

RESUMO

High-throughput technologies make it possible to produce a large amount of data representing different biological layers, examples of which are genomics, proteomics, metabolomics and transcriptomics. Omics data have been individually investigated to understand the molecular bases of various diseases, but this may not be sufficient to fully capture the molecular mechanisms and the multilayer regulatory processes underlying complex diseases, especially cancer. To overcome this problem, several multi-omics integration methods have been introduced but a commonly agreed standard of analysis is still lacking. In this paper, we present MOUSSE, a novel normalization-free pipeline for unsupervised multi-omics integration. The main innovations are the use of rank-based subject-specific signatures and the use of such signatures to derive subject similarity networks. A separate similarity network was derived for each omics, and the resulting networks were then carefully merged in a way that considered their informative content. We applied it to analyze survival in ten different types of cancer. We produced a meaningful clusterization of the subjects and obtained a higher average classification score than ten state-of-the-art algorithms tested on the same data. As further validation, we extracted from the subject-specific signatures a list of relevant features used for the clusterization and investigated their biological role in survival. We were able to verify that, according to the literature, these features are highly involved in cancer progression and differential survival.

6.
Methods Mol Biol ; 1883: 323-346, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30547407

RESUMO

Reconstructing a gene regulatory network from one or more sets of omics measurements has been a major task of computational biology in the last 20 years. Despite an overwhelming number of algorithms proposed to solve the network inference problem either in the general scenario or in an ad-hoc tailored situation, assessing the stability of reconstruction is still an uncharted territory and exploratory studies mainly tackled theoretical aspects. We introduce here empirical stability, which is induced by variability of reconstruction as a function of data subsampling. By evaluating differences between networks that are inferred using different subsets of the same data we obtain quantitative indicators of the robustness of the algorithm, of the noise level affecting the data, and, overall, of the reliability of the reconstructed graph. We show that empirical stability can be used whenever no ground truth is available to compute a direct measure of the similarity between the inferred structure and the true network. The main ingredient here is a suite of indicators, called NetSI, providing statistics of distances between graphs generated by a given algorithm fed with different data subsets, where the chosen metric is the Hamming-Ipsen-Mikhailov (HIM) distance evaluating dissimilarity of graph topologies with shared nodes. Operatively, the NetSI family is demonstrated here on synthetic and high-throughput datasets, inferring graphs at different resolution levels (topology, direction, weight), showing how the stability indicators can be effectively used for the quantitative comparison of the stability of different reconstruction algorithms.


Assuntos
Algoritmos , Biologia Computacional/métodos , Redes Reguladoras de Genes , Modelos Genéticos , Biologia Computacional/instrumentação , Conjuntos de Dados como Assunto , Perfilação da Expressão Gênica/instrumentação , Perfilação da Expressão Gênica/métodos , Genoma Humano/genética , Genômica/instrumentação , Genômica/métodos , Humanos , Proteômica/instrumentação , Proteômica/métodos
7.
PLoS One ; 11(3): e0152648, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27031641

RESUMO

When modeling coexpression networks from high-throughput time course data, Pearson Correlation Coefficient (PCC) is one of the most effective and popular similarity functions. However, its reliability is limited since it cannot capture non-linear interactions and time shifts. Here we propose to overcome these two issues by employing a novel similarity function, Dynamic Time Warping Maximal Information Coefficient (DTW-MIC), combining a measure taking care of functional interactions of signals (MIC) and a measure identifying time lag (DTW). By using the Hamming-Ipsen-Mikhailov (HIM) metric to quantify network differences, the effectiveness of the DTW-MIC approach is demonstrated on a set of four synthetic and one transcriptomic datasets, also in comparison to TimeDelay ARACNE and Transfer Entropy.


Assuntos
Biologia Computacional/métodos , Algoritmos , Escherichia coli/genética , Escherichia coli/metabolismo , Redes Reguladoras de Genes , Humanos , Redes e Vias Metabólicas , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Linfócitos T/metabolismo
8.
PLoS One ; 9(2): e89815, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24587057

RESUMO

The number of available algorithms to infer a biological network from a dataset of high-throughput measurements is overwhelming and keeps growing. However, evaluating their performance is unfeasible unless a 'gold standard' is available to measure how close the reconstructed network is to the ground truth. One measure of this is the stability of these predictions to data resampling approaches. We introduce NetSI, a family of Network Stability Indicators, to assess quantitatively the stability of a reconstructed network in terms of inference variability due to data subsampling. In order to evaluate network stability, the main NetSI methods use a global/local network metric in combination with a resampling (bootstrap or cross-validation) procedure. In addition, we provide two normalized variability scores over data resampling to measure edge weight stability and node degree stability, and then introduce a stability ranking for edges and nodes. A complete implementation of the NetSI indicators, including the Hamming-Ipsen-Mikhailov (HIM) network distance adopted in this paper is available with the R package nettools. We demonstrate the use of the NetSI family by measuring network stability on four datasets against alternative network reconstruction methods. First, the effect of sample size on stability of inferred networks is studied in a gold standard framework on yeast-like data from the Gene Net Weaver simulator. We also consider the impact of varying modularity on a set of structurally different networks (50 nodes, from 2 to 10 modules), and then of complex feature covariance structure, showing the different behaviours of standard reconstruction methods based on Pearson correlation, Maximum Information Coefficient (MIC) and False Discovery Rate (FDR) strategy. Finally, we demonstrate a strong combined effect of different reconstruction methods and phenotype subgroups on a hepatocellular carcinoma miRNA microarray dataset (240 subjects), and we validate the analysis on a second dataset (166 subjects) with good reproducibility.


Assuntos
Modelos Biológicos , Redes Neurais de Computação , Algoritmos , Carcinoma Hepatocelular/genética , Redes Reguladoras de Genes , Humanos , Neoplasias Hepáticas/genética , MicroRNAs/genética , Leveduras/fisiologia
9.
Nat Biotechnol ; 32(9): 926-32, 2014 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25150839

RESUMO

The concordance of RNA-sequencing (RNA-seq) with microarrays for genome-wide analysis of differential gene expression has not been rigorously assessed using a range of chemical treatment conditions. Here we use a comprehensive study design to generate Illumina RNA-seq and Affymetrix microarray data from the same liver samples of rats exposed in triplicate to varying degrees of perturbation by 27 chemicals representing multiple modes of action (MOAs). The cross-platform concordance in terms of differentially expressed genes (DEGs) or enriched pathways is linearly correlated with treatment effect size (R(2)0.8). Furthermore, the concordance is also affected by transcript abundance and biological complexity of the MOA. RNA-seq outperforms microarray (93% versus 75%) in DEG verification as assessed by quantitative PCR, with the gain mainly due to its improved accuracy for low-abundance transcripts. Nonetheless, classifiers to predict MOAs perform similarly when developed using data from either platform. Therefore, the endpoint studied and its biological complexity, transcript abundance and the genomic application are important factors in transcriptomic research and for clinical and regulatory decision making.


Assuntos
Análise de Sequência com Séries de Oligonucleotídeos , RNA Mensageiro/genética , Análise de Sequência de RNA , Animais , Ratos
10.
PLoS One ; 7(5): e36540, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22615778

RESUMO

The outcome of a functional genomics pipeline is usually a partial list of genomic features, ranked by their relevance in modelling biological phenotype in terms of a classification or regression model. Due to resampling protocols or to a meta-analysis comparison, it is often the case that sets of alternative feature lists (possibly of different lengths) are obtained, instead of just one list. Here we introduce a method, based on permutations, for studying the variability between lists ("list stability") in the case of lists of unequal length. We provide algorithms evaluating stability for lists embedded in the full feature set or just limited to the features occurring in the partial lists. The method is demonstrated by finding and comparing gene profiles on a large prostate cancer dataset, consisting of two cohorts of patients from different countries, for a total of 455 samples.


Assuntos
Biologia Computacional , Matemática , Algoritmos
11.
PLoS One ; 6(12): e28646, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-22216103

RESUMO

RegnANN is a novel method for reverse engineering gene networks based on an ensemble of multilayer perceptrons. The algorithm builds a regressor for each gene in the network, estimating its neighborhood independently. The overall network is obtained by joining all the neighborhoods. RegnANN makes no assumptions about the nature of the relationships between the variables, potentially capturing high-order and non linear dependencies between expression patterns. The evaluation focuses on synthetic data mimicking plausible submodules of larger networks and on biological data consisting of submodules of Escherichia coli. We consider Barabasi and Erdös-Rényi topologies together with two methods for data generation. We verify the effect of factors such as network size and amount of data to the accuracy of the inference algorithm. The accuracy scores obtained with RegnANN is methodically compared with the performance of three reference algorithms: ARACNE, CLR and KELLER. Our evaluation indicates that RegnANN compares favorably with the inference methods tested. The robustness of RegnANN, its ability to discover second order correlations and the agreement between results obtained with this new methods on both synthetic and biological data are promising and they stimulate its application to a wider range of problems.


Assuntos
Redes Reguladoras de Genes , Redes Neurais de Computação , Escherichia coli/fisiologia
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa