RESUMO
The impact of micro-level people's activities on urban macro-level indicators is a complex question that has been the subject of much interest among researchers and policymakers. Transportation preferences, consumption habits, communication patterns and other individual-level activities can significantly impact large-scale urban characteristics, such as the potential for innovation generation of the city. Conversely, large-scale urban characteristics can also constrain and determine the activities of their inhabitants. Therefore, understanding the interdependence and mutual reinforcement between micro- and macro-level factors is critical to defining effective public policies. The increasing availability of digital data sources, such as social media and mobile phones, has opened up new opportunities for the quantitative study of this interdependency. This paper aims to detect meaningful city clusters on the basis of a detailed analysis of the spatiotemporal activity patterns for each city. The study is carried out on a worldwide city dataset of spatiotemporal activity patterns obtained from geotagged social media data. Clustering features are obtained from unsupervised topic analyses of activity patterns. Our study compares state-of-the-art clustering models, selecting the model achieving a 2.7% greater Silhouette Score than the next-best model. Three well-separated city clusters are identified. Additionally, the study of the distribution of the City Innovation Index over these three city clusters shows discrimination of low performing from high performing cities relative to innovation. Low performing cities are identified in one well-separated cluster. Therefore, it is possible to correlate micro-scale individual-level activities to large-scale urban characteristics.
Assuntos
Meios de Transporte , Humanos , Cidades , Análise por Conglomerados , Fatores de TempoRESUMO
Non-alcoholic fatty liver disease (NAFLD) affects 25% of the population worldwide, and its prevalence is anticipated to increase globally. While most NAFLD patients are asymptomatic, NAFLD may progress to fibrosis, cirrhosis, cardiovascular disease, and diabetes. Research reports, with daunting results, show the challenge that NAFLD's burden causes to global population health. The current process for identifying fibrosis risk levels is inefficient, expensive, does not cover all potential populations, and does not identify the risk in time. Instead of invasive liver biopsies, we implemented a non-invasive fibrosis assessment process calculated from clinical data (accessed via EMRs/EHRs). We stratified patients' risks for fibrosis from 2007 to 2017 by modeling the risk in 5579 individuals. The process involved time-series machine learning models (Hidden Markov Models and Group-Based Trajectory Models) profiled fibrosis risk by modeling patients' latent medical status resulted in three groups. The high-risk group had abnormal lab test values and a higher prevalence of chronic conditions. This study can help overcome the inefficient, traditional process of detecting fibrosis via biopsies (that are also medically unfeasible due to their invasive nature, the medical resources involved, and costs) at early stages. Thus longitudinal risk assessment may be used to make population-specific medical recommendations targeting early detection of high risk patients, to avoid the development of fibrosis disease and its complications as well as decrease healthcare costs.
Assuntos
Hepatopatia Gordurosa não Alcoólica , Humanos , Fígado , Cirrose Hepática , Aprendizado de Máquina , Hepatopatia Gordurosa não Alcoólica/complicações , Hepatopatia Gordurosa não Alcoólica/diagnóstico , Hepatopatia Gordurosa não Alcoólica/epidemiologia , Medição de Risco , Fatores de TempoRESUMO
Automatic modulation classification (AMC) plays a fundamental role in common communication systems. Existing clustering models typically handle fewer modulation types with lower classification accuracies and more computational resources. This paper proposes a hierarchical self-organizing map (SOM) based on a feature space composed of high-order cumulants (HOC) and amplitude moment features. This SOM with two stacked layers can identify intrinsic differences among samples in the feature space without the need to set thresholds. This model can roughly cluster the multiple amplitude-shift keying (MASK), multiple phase-shift keying (MPSK), and multiple quadrature amplitude keying (MQAM) samples in the root layer and then finely distinguish the samples with different orders in the leaf layers. We creatively implement a discrete transformation method based on modified activation functions. This method causes MQAM samples to cluster in the leaf layer with more distinct boundaries between clusters and higher classification accuracies. The simulation results demonstrate the superior performance of the proposed hierarchical SOM on AMC problems when compared with other clustering models. Our proposed method can manage more categories of modulation signals and obtain higher classification accuracies while using fewer computational resources.
Assuntos
Algoritmos , Análise por Conglomerados , Simulação por ComputadorRESUMO
Current theories hold that brain function is highly related to long-range physical connections through axonal bundles, namely extrinsic connectivity. However, obtaining a groupwise cortical parcellation based on extrinsic connectivity remains challenging. Current parcellation methods are computationally expensive; need tuning of several parameters or rely on ad-hoc constraints. Furthermore, none of these methods present a model for the cortical extrinsic connectivity of the cortex. To tackle these problems, we propose a parsimonious model for the extrinsic connectivity and an efficient parceling technique based on clustering of tractograms. Our technique allows the creation of single subject and groupwise parcellations of the whole cortex. The parcellations obtained with our technique are in agreement with structural and functional parcellations in the literature. In particular, the motor and sensory cortex are subdivided in agreement with the human homunculus of Penfield. We illustrate this by comparing our resulting parcels with the motor strip mapping included in the Human Connectome Project data.
Assuntos
Córtex Cerebral/diagnóstico por imagem , Imagem de Tensor de Difusão/métodos , Modelos Teóricos , Neuroimagem/métodos , Adulto , Feminino , Humanos , MasculinoRESUMO
The vast potential sequence diversity of TCRs and their ligands has presented an historic barrier to computational prediction of TCR epitope specificity, a holy grail of quantitative immunology. One common approach is to cluster sequences together, on the assumption that similar receptors bind similar epitopes. Here, we provide the first independent evaluation of widely used clustering algorithms for TCR specificity inference, observing some variability in predictive performance between models, and marked differences in scalability. Despite these differences, we find that different algorithms produce clusters with high degrees of similarity for receptors recognising the same epitope. Our analysis strengthens the case for use of clustering models to identify signals of common specificity from large repertoires, whilst highlighting scope for improvement of complex models over simple comparators.