Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Brief Bioinform ; 14(3): 279-92, 2013 May.
Artículo en Inglés | MEDLINE | ID: mdl-22772837

RESUMEN

The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters.


Asunto(s)
Algoritmos , Expresión Génica , Animales , Análisis por Conglomerados , Análisis Factorial , Humanos , Modelos Teóricos , Análisis de Secuencia por Matrices de Oligonucleótidos
2.
Nat Commun ; 12(1): 726, 2021 02 09.
Artículo en Inglés | MEDLINE | ID: mdl-33563980

RESUMEN

Human mobility is a primary driver of infectious disease spread. However, existing data is limited in availability, coverage, granularity, and timeliness. Data-driven forecasts of disease dynamics are crucial for decision-making by health officials and private citizens alike. In this work, we focus on a machine-learned anonymized mobility map (hereon referred to as AMM) aggregated over hundreds of millions of smartphones and evaluate its utility in forecasting epidemics. We factor AMM into a metapopulation model to retrospectively forecast influenza in the USA and Australia. We show that the AMM model performs on-par with those based on commuter surveys, which are sparsely available and expensive. We also compare it with gravity and radiation based models of mobility, and find that the radiation model's performance is quite similar to AMM and commuter flows. Additionally, we demonstrate our model's ability to predict disease spread even across state boundaries. Our work contributes towards developing timely infectious disease forecasting at a global scale using human mobility datasets expanding their applications in the area of infectious disease epidemiology.


Asunto(s)
Predicción/métodos , Gripe Humana/epidemiología , Aprendizaje Automático , Australia/epidemiología , Humanos , Gripe Humana/prevención & control , Gripe Humana/transmisión , Modelos Teóricos , Ciudad de Nueva York/epidemiología , Dinámica Poblacional , Reproducibilidad de los Resultados , Teléfono Inteligente
3.
Nat Commun ; 10(1): 4817, 2019 10 23.
Artículo en Inglés | MEDLINE | ID: mdl-31645563

RESUMEN

The recent trend of rapid urbanization makes it imperative to understand urban characteristics such as infrastructure, population distribution, jobs, and services that play a key role in urban livability and sustainability. A healthy debate exists on what constitutes optimal structure regarding livability in cities, interpolating, for instance, between mono- and poly-centric organization. Here anonymous and aggregated flows generated from three hundred million users, opted-in to Location History, are used to extract global Intra-urban trips. We develop a metric that allows us to classify cities and to establish a connection between mobility organization and key urban indicators. We demonstrate that cities with strong hierarchical mobility structure display an extensive use of public transport, higher levels of walkability, lower pollutant emissions per capita and better health indicators. Our framework outperforms previous metrics, is highly scalable and can be deployed with little cost, even in areas without resources for traditional data collection.

5.
Methods Mol Biol ; 1375: 55-74, 2016.
Artículo en Inglés | MEDLINE | ID: mdl-26626937

RESUMEN

Rapid development and increasing popularity of gene expression microarrays have resulted in a number of studies on the discovery of co-regulated genes. One important way of discovering such co-regulations is the query-based search since gene co-expressions may indicate a shared role in a biological process. Although there exist promising query-driven search methods adapting clustering, they fail to capture many genes that function in the same biological pathway because microarray datasets are fraught with spurious samples or samples of diverse origin, or the pathways might be regulated under only a subset of samples. On the other hand, a class of clustering algorithms known as biclustering algorithms which simultaneously cluster both the items and their features are useful while analyzing gene expression data, or any data in which items are related in only a subset of their samples. This means that genes need not be related in all samples to be clustered together. Because many genes only interact under specific circumstances, biclustering may recover the relationships that traditional clustering algorithms can easily miss. In this chapter, we briefly summarize the literature using biclustering for querying co-regulated genes. Then we present a novel biclustering approach and evaluate its performance by a thorough experimental analysis.


Asunto(s)
Análisis por Conglomerados , Biología Computacional/métodos , Perfilación de la Expresión Génica/métodos , Algoritmos , Bases de Datos Genéticas , Regulación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Genes BRCA1 , Genes BRCA2 , Genes p53 , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA