Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Netw Sci (Camb Univ Press) ; 10(2): 131-145, 2022 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-36217370

RESUMEN

Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes or proteins, using a network of gene coexpression data that includes functional annotations. Signed distance correlation has proved useful for the construction of unweighted gene coexpression networks. However, transforming correlation values into unweighted networks may lead to a loss of important biological information related to the intensity of the correlation. Here we introduce a principled method to construct weighted gene coexpression networks using signed distance correlation. These networks contain weighted edges only between those pairs of genes whose correlation value is higher than a given threshold. We analyse data from different organisms and find that networks generated with our method based on signed distance correlation are more stable and capture more biological information compared to networks obtained from Pearson correlation. Moreover, we show that signed distance correlation networks capture more biological information than unweighted networks based on the same metric. While we use biological data sets to illustrate the method, the approach is general and can be used to construct networks in other domains. Code and data are available on https://github.com/javier-pardodiaz/sdcorGCN.

2.
Phys Rev E ; 105(5-2): 059902, 2022 May.
Artículo en Inglés | MEDLINE | ID: mdl-35706326

RESUMEN

This corrects the article DOI: 10.1103/PhysRevE.100.062304.

3.
J Comput Biol ; 29(7): 752-768, 2022 07.
Artículo en Inglés | MEDLINE | ID: mdl-35588362

RESUMEN

Nitrogen uptake in legumes is facilitated by bacteria such as Rhizobium leguminosarum. For this bacterium, gene expression data are available, but functional gene annotation is less well developed than for other model organisms. More annotations could lead to a better understanding of the pathways for growth, plant colonization, and nitrogen fixation in R. leguminosarum. In this study, we present a pipeline that combines novel scores from gene coexpression network analysis in a principled way to identify the genes that are associated with certain growth conditions or highly coexpressed with a predefined set of genes of interest. This association may lead to putative functional annotation or to a prioritized list of genes for further study.


Asunto(s)
Rhizobium leguminosarum , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Fijación del Nitrógeno/genética , Rhizobium leguminosarum/genética , Rhizobium leguminosarum/metabolismo
4.
Bioinformatics ; 2021 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-33523234

RESUMEN

MOTIVATION: Even within well studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information. RESULTS: We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information. SUPPLEMENTARY INFORMATION: Supplementary Information and code are available at Bioinformatics and https://github.com/javier-pardodiaz/sdcorGCN online.

5.
J R Soc Interface ; 18(174): 20200898, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33468022

RESUMEN

The potential impact of automation on the labour market is a topic that has generated significant interest and concern amongst scholars, policymakers and the broader public. A number of studies have estimated occupation-specific risk profiles by examining how suitable associated skills and tasks are for automation. However, little work has sought to take a more holistic view on the process of labour reallocation and how employment prospects are impacted as displaced workers transition into new jobs. In this article, we develop a data-driven model to analyse how workers move through an empirically derived occupational mobility network in response to automation scenarios. At a macro level, our model reproduces the Beveridge curve, a key stylized fact in the labour market. At a micro level, our model provides occupation-specific estimates of changes in short and long-term unemployment corresponding to specific automation shocks. We find that the network structure plays an important role in determining unemployment levels, with occupations in particular areas of the network having few job transition opportunities. In an automation scenario where low wage occupations are more likely to be automated than high wage occupations, the network effects are also more likely to increase the long-term unemployment of low-wage occupations.


Asunto(s)
Empleo , Ocupaciones , Automatización , Humanos , Desempleo
6.
J R Soc Interface ; 16(151): 20180661, 2019 02 28.
Artículo en Inglés | MEDLINE | ID: mdl-30958184

RESUMEN

We introduce a tensor-based clustering method to extract sparse, low-dimensional structure from high-dimensional, multi-indexed datasets. This framework is designed to enable detection of clusters of data in the presence of structural requirements which we encode as algebraic constraints in a linear program. Our clustering method is general and can be tailored to a variety of applications in science and industry. We illustrate our method on a collection of experiments measuring the response of genetically diverse breast cancer cell lines to an array of ligands. Each experiment consists of a cell line-ligand combination, and contains time-course measurements of the early signalling kinases MAPK and AKT at two different ligand dose levels. By imposing appropriate structural constraints and respecting the multi-indexed structure of the data, the analysis of clusters can be optimized for biological interpretation and therapeutic understanding. We then perform a systematic, large-scale exploration of mechanistic models of MAPK-AKT crosstalk for each cluster. This analysis allows us to quantify the heterogeneity of breast cancer cell subtypes, and leads to hypotheses about the signalling mechanisms that mediate the response of the cell lines to ligands.


Asunto(s)
Algoritmos , Neoplasias de la Mama/metabolismo , Sistema de Señalización de MAP Quinasas , Modelos Biológicos , Neoplasias de la Mama/patología , Análisis por Conglomerados , Femenino , Humanos , Quinasas de Proteína Quinasa Activadas por Mitógenos/metabolismo , Proteínas Proto-Oncogénicas c-akt/metabolismo
7.
Phys Rev E ; 100(6-1): 062304, 2019 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-31962461

RESUMEN

The analysis and characterization of human mobility using population-level mobility models is important for numerous applications, ranging from the estimation of commuter flows in cities to modeling trade flows between countries. However, almost all of these applications have focused on large spatial scales, which typically range between intracity scales and intercountry scales. In this paper, we investigate population-level human mobility models on a much smaller spatial scale by using them to estimate customer mobility flow between supermarket zones. We use anonymized, ordered customer-basket data to infer empirical mobility flow in supermarkets, and we apply variants of the gravity and intervening-opportunities models to fit this mobility flow and estimate the flow on unseen data. We find that a doubly-constrained gravity model and an extended radiation model (which is a type of intervening-opportunities model) can successfully estimate 65%-70% of the flow inside supermarkets. Using a gravity model as a case study, we then investigate how to reduce congestion in supermarkets using mobility models. We model each supermarket zone as a queue, and we use a gravity model to identify store layouts with low congestion, which we measure either by the maximum number of visits to a zone or by the total mean queue size. We then use a simulated-annealing algorithm to find store layouts with lower congestion than a supermarket's original layout. In these optimized store layouts, we find that popular zones are often in the perimeter of a store. Our research gives insight both into how customers move in supermarkets and into how retailers can arrange stores to reduce congestion. It also provides a case study of human mobility on small spatial scales.

8.
NPJ Syst Biol Appl ; 4: 32, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30131869

RESUMEN

Cells adapt their metabolic fluxes in response to changes in the environment. We present a framework for the systematic construction of flux-based graphs derived from organism-wide metabolic networks. Our graphs encode the directionality of metabolic flows via edges that represent the flow of metabolites from source to target reactions. The methodology can be applied in the absence of a specific biological context by modelling fluxes probabilistically, or can be tailored to different environmental conditions by incorporating flux distributions computed through constraint-based approaches such as Flux Balance Analysis. We illustrate our approach on the central carbon metabolism of Escherichia coli and on a metabolic model of human hepatocytes. The flux-dependent graphs under various environmental conditions and genetic perturbations exhibit systemic changes in their topological and community structure, which capture the re-routing of metabolic flows and the varying importance of specific reactions and pathways. By integrating constraint-based models and tools from network science, our framework allows the study of context-specific metabolic responses at a system level beyond standard pathway descriptions.

9.
R Soc Open Sci ; 4(7): 170154, 2017 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-28791141

RESUMEN

We examine the relationship between social structure and sentiment through the analysis of a large collection of tweets about the Irish Marriage Referendum of 2015. We obtain the sentiment of every tweet with the hashtags #marref and #marriageref that was posted in the days leading to the referendum, and construct networks to aggregate sentiment and use it to study the interactions among users. Our analysis shows that the sentiment of outgoing mention tweets is correlated with the sentiment of incoming mentions, and there are significantly more connections between users with similar sentiment scores than among users with opposite scores in the mention and follower networks. We combine the community structure of the follower and mention networks with the activity level of the users and sentiment scores to find groups that support voting 'yes' or 'no' in the referendum. There were numerous conversations between users on opposing sides of the debate in the absence of follower connections, which suggests that there were efforts by some users to establish dialogue and debate across ideological divisions. Our analysis shows that social structure can be integrated successfully with sentiment to analyse and understand the disposition of social media users around controversial or polarizing issues. These results have potential applications in the integration of data and metadata to study opinion dynamics, public opinion modelling and polling.

10.
Digit Health ; 3: 2055207616688841, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-29942579

RESUMEN

Social media are being increasingly used for health promotion, yet the landscape of users, messages and interactions in such fora is poorly understood. Studies of social media and diabetes have focused mostly on patients, or public agencies addressing it, but have not looked broadly at all of the participants or the diversity of content they contribute. We study Twitter conversations about diabetes through the systematic analysis of 2.5 million tweets collected over 8 months and the interactions between their authors. We address three questions. (1) What themes arise in these tweets? (2) Who are the most influential users? (3) Which type of users contribute to which themes? We answer these questions using a mixed-methods approach, integrating techniques from anthropology, network science and information retrieval such as thematic coding, temporal network analysis and community and topic detection. Diabetes-related tweets fall within broad thematic groups: health information, news, social interaction and commercial. At the same time, humorous messages and references to popular culture appear consistently, more than any other type of tweet. We classify authors according to their temporal 'hub' and 'authority' scores. Whereas the hub landscape is diffuse and fluid over time, top authorities are highly persistent across time and comprise bloggers, advocacy groups and NGOs related to diabetes, as well as for-profit entities without specific diabetes expertise. Top authorities fall into seven interest communities as derived from their Twitter follower network. Our findings have implications for public health professionals and policy makers who seek to use social media as an engagement tool and to inform policy design.

11.
J R Soc Interface ; 13(121)2016 08.
Artículo en Inglés | MEDLINE | ID: mdl-27581482

RESUMEN

Cellular signal transduction usually involves activation cascades, the sequential activation of a series of proteins following the reception of an input signal. Here, we study the classic model of weakly activated cascades and obtain analytical solutions for a variety of inputs. We show that in the special but important case of optimal gain cascades (i.e. when the deactivation rates are identical) the downstream output of the cascade can be represented exactly as a lumped nonlinear module containing an incomplete gamma function with real parameters that depend on the rates and length of the cascade, as well as parameters of the input signal. The expressions obtained can be applied to the non-identical case when the deactivation rates are random to capture the variability in the cascade outputs. We also show that cascades can be rearranged so that blocks with similar rates can be lumped and represented through our nonlinear modules. Our results can be used both to represent cascades in computational models of differential equations and to fit data efficiently, by reducing the number of equations and parameters involved. In particular, the length of the cascade appears as a real-valued parameter and can thus be fitted in the same manner as Hill coefficients. Finally, we show how the obtained nonlinear modules can be used instead of delay differential equations to model delays in signal transduction.


Asunto(s)
Modelos Biológicos , Transducción de Señal/fisiología , Animales , Humanos
12.
PLoS Comput Biol ; 12(8): e1005055, 2016 08.
Artículo en Inglés | MEDLINE | ID: mdl-27494178

RESUMEN

We exploit flow propagation on the directed neuronal network of the nematode C. elegans to reveal dynamically relevant features of its connectome. We find flow-based groupings of neurons at different levels of granularity, which we relate to functional and anatomical constituents of its nervous system. A systematic in silico evaluation of the full set of single and double neuron ablations is used to identify deletions that induce the most severe disruptions of the multi-resolution flow structure. Such ablations are linked to functionally relevant neurons, and suggest potential candidates for further in vivo investigation. In addition, we use the directional patterns of incoming and outgoing network flows at all scales to identify flow profiles for the neurons in the connectome, without pre-imposing a priori categories. The four flow roles identified are linked to signal propagation motivated by biological input-response scenarios.


Asunto(s)
Caenorhabditis elegans/fisiología , Conectoma , Modelos Neurológicos , Red Nerviosa/fisiología , Animales , Biología Computacional
13.
J R Soc Interface ; 11(101): 20140940, 2014 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-25297320

RESUMEN

Directionality is a crucial ingredient in many complex networks in which information, energy or influence are transmitted. In such directed networks, analysing flows (and not only the strength of connections) is crucial to reveal important features of the network that might go undetected if the orientation of connections is ignored. We showcase here a flow-based approach for community detection through the study of the network of the most influential Twitter users during the 2011 riots in England. Firstly, we use directed Markov Stability to extract descriptions of the network at different levels of coarseness in terms of interest communities, i.e. groups of nodes within which flows of information are contained and reinforced. Such interest communities reveal user groupings according to location, profession, employer and topic. The study of flows also allows us to generate an interest distance, which affords a personalized view of the attention in the network as viewed from the vantage point of any given user. Secondly, we analyse the profiles of incoming and outgoing long-range flows with a combined approach of role-based similarity and the novel relaxed minimum spanning tree algorithm to reveal that the users in the network can be classified into five roles. These flow roles go beyond the standard leader/follower dichotomy and differ from classifications based on regular/structural equivalence. We then show that the interest communities fall into distinct informational organigrams characterized by a different mix of user roles reflecting the quality of dialogue within them. Our generic framework can be used to provide insight into how flows are generated, distributed, preserved and consumed in directed networks.


Asunto(s)
Internet , Modelos Teóricos , Tumultos , Apoyo Social , Femenino , Humanos , Masculino , Reino Unido
14.
BMC Syst Biol ; 6: 146, 2012 Nov 25.
Artículo en Inglés | MEDLINE | ID: mdl-23176679

RESUMEN

BACKGROUND: Stomata are tiny pores in plant leaves that regulate gas and water exchange between the plant and its environment. Abscisic acid and ethylene are two well-known elicitors of stomatal closure when acting independently. However, when stomata are presented with a combination of both signals, they fail to close. RESULTS: Toshed light on this unexplained behaviour, we have collected time course measurements of stomatal aperture and hydrogen peroxide production in Arabidopsis thaliana guard cells treated with abscisic acid, ethylene, and a combination of both. Our experiments show that stomatal closure is linked to sustained high levels of hydrogen peroxide in guard cells. When treated with a combined dose of abscisic acid and ethylene, guard cells exhibit increased antioxidant activity that reduces hydrogen peroxide levels and precludes closure. We construct a simplified model of stomatal closure derived from known biochemical pathways that captures the experimentally observed behaviour. CONCLUSIONS: Our experiments and modelling results suggest a distinct role for two antioxidant mechanisms during stomatal closure: a slower, delayed response activated by a single stimulus (abscisic acid 'or' ethylene) and another more rapid 'and' mechanism that is only activated when both stimuli are present. Our model indicates that the presence of this rapid 'and' mechanism in the antioxidant response is key to explain the lack of closure under a combined stimulus.


Asunto(s)
Ácido Abscísico/metabolismo , Arabidopsis/citología , Arabidopsis/fisiología , Etilenos/metabolismo , Modelos Biológicos , Hojas de la Planta/anatomía & histología , Estrés Fisiológico , Arabidopsis/anatomía & histología , Arabidopsis/metabolismo , Hojas de la Planta/citología , Hojas de la Planta/metabolismo , Hojas de la Planta/fisiología , Especies Reactivas de Oxígeno/metabolismo , Transducción de Señal , Factores de Tiempo
15.
Phys Biol ; 9(3): 036001, 2012 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-22551942

RESUMEN

The mitogen-activated protein kinase (MAPK) family of proteins is involved in regulating cellular fates such as proliferation, differentiation and apoptosis. In particular, the dynamics of the Erk/Mek system, which has become the canonical example for MAPK signaling systems, have attracted considerable attention. Erk is encoded by two genes, Erk1 and Erk2, that until recently had been considered equivalent as they differ only subtly at the sequence level. However, these proteins exhibit radically different trafficking between cytoplasm and nucleus and this fact may have functional implications. Here we use spatially resolved data on Erk1/2 to develop and analyze spatio-temporal models of these cascades, and we discuss how sensitivity analysis can be used to discriminate between mechanisms. Our models elucidate some of the factors governing the interplay between signaling processes and the Erk1/2 localization in different cellular compartments, including competition between Erk1 and Erk2. Our approach is applicable to a wide range of signaling systems, such as activation cascades, where translocation of molecules occurs. Our study provides a first model of Erk1 and Erk2 activation and their nuclear shuttling dynamics, revealing a role in the regulation of the efficiency of nuclear signaling.


Asunto(s)
Núcleo Celular/metabolismo , Proteína Quinasa 1 Activada por Mitógenos/metabolismo , Proteína Quinasa 3 Activada por Mitógenos/metabolismo , Transporte Activo de Núcleo Celular , Animales , Activación Enzimática , Células HeLa , Humanos , Sistema de Señalización de MAP Quinasas , Ratones , Proteína Quinasa 1 Activada por Mitógenos/análisis , Proteína Quinasa 3 Activada por Mitógenos/análisis , Modelos Biológicos , Células 3T3 NIH
16.
J R Soc Interface ; 9(73): 1925-33, 2012 Aug 07.
Artículo en Inglés | MEDLINE | ID: mdl-22262815

RESUMEN

Estimating parameters from data is a key stage of the modelling process, particularly in biological systems where many parameters need to be estimated from sparse and noisy datasets. Over the years, a variety of heuristics have been proposed to solve this complex optimization problem, with good results in some cases yet with limitations in the biological setting. In this work, we develop an algorithm for model parameter fitting that combines ideas from evolutionary algorithms, sequential Monte Carlo and direct search optimization. Our method performs well even when the order of magnitude and/or the range of the parameters is unknown. The method refines iteratively a sequence of parameter distributions through local optimization combined with partial resampling from a historical prior defined over the support of all previous iterations. We exemplify our method with biological models using both simulated and real experimental data and estimate the parameters efficiently even in the absence of a priori knowledge about the parameters.


Asunto(s)
Algoritmos , Modelos Biológicos , Método de Montecarlo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...