Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 47
Filtrar
1.
Entropy (Basel) ; 25(3)2023 Mar 10.
Artigo em Inglês | MEDLINE | ID: mdl-36981376

RESUMO

We introduce a new non-black-box method of extracting multiple areas in a high-dimensional big data space where data points that satisfy specific conditions are highly concentrated. First, we extract one-dimensional areas where the data that satisfy specific conditions are mostly gathered by using the Bayesian method. Second, we construct higher-dimensional areas where the densities of focused data points are higher than the simple combination of the results for one dimension, and then we verify the results through data validation. Third, we apply this method to estimate the set of significant factors shared in successful firms with growth rates in sales at the top 1% level using 156-dimensional data of corporate financial reports for 12 years containing about 320,000 firms. We also categorize high-growth firms into 15 groups of different sets of factors.

2.
Entropy (Basel) ; 24(2)2022 Jan 29.
Artigo em Inglês | MEDLINE | ID: mdl-35205509

RESUMO

Traders who instantly react to changes in the financial market and place orders in milliseconds are called high-frequency traders (HFTs). HFTs have recently become more prevalent and attracting attention in the study of market microstructures. In this study, we used data to track the order history of individual HFTs in the USD/JPY forex market to reveal how individual HFTs interact with the order book and what strategies they use to place their limit orders. Specifically, we introduced an 8-dimensional multivariate Hawkes process that included the excitations due to the occurrence of limit orders, cancel orders, and executions in the order book change, and performed maximum likelihood estimations of the limit order processes for 134 HFTs. As a result, we found that the limit order generation processes of 104 of the 134 HFTs were modeled by a multivariate Hawkes process. In this analysis of the EBS market, the HFTs whose strategies were modeled by the Hawkes process were categorized into three groups according to their excitation mechanisms: (1) those excited by executions; (2) those that were excited by the occurrences or cancellations of limit orders; and (3) those that were excited by their own orders.

3.
Entropy (Basel) ; 23(2)2021 Jan 29.
Artigo em Inglês | MEDLINE | ID: mdl-33573072

RESUMO

Although the sizes of business firms have been a subject of intensive research, the definition of a "size" of a firm remains unclear. In this study, we empirically characterize in detail the scaling relations between size measures of business firms, analyzing them based on allometric scaling. Using a large dataset of Japanese firms that tracked approximately one million firms annually for two decades (1994-2015), we examined up to the trivariate relations between corporate size measures: annual sales, capital stock, total assets, and numbers of employees and trading partners. The data were examined using a multivariate generalization of a previously proposed method for analyzing bivariate scalings. We found that relations between measures other than the capital stock are marked by allometric scaling relations. Power-law exponents for scalings and distributions of multiple firm size measures were mostly robust throughout the years but had fluctuations that appeared to correlate with national economic conditions. We established theoretical relations between the exponents. We expect these results to allow direct estimation of the effects of using alternative size measures of business firms in regression analyses, to facilitate the modeling of firms, and to enhance the current theoretical understanding of complex systems.

4.
Entropy (Basel) ; 23(2)2021 Feb 19.
Artigo em Inglês | MEDLINE | ID: mdl-33669835

RESUMO

The Sigma-Pi structure investigated in this work consists of the sum of products of an increasing number of identically distributed random variables. It appears in stochastic processes with random coefficients and also in models of growth of entities such as business firms and cities. We study the Sigma-Pi structure with Bernoulli random variables and find that its probability distribution is always bounded from below by a power-law function regardless of whether the random variables are mutually independent or duplicated. In particular, we investigate the case in which the asymptotic probability distribution has always upper and lower power-law bounds with the same tail-index, which depends on the parameters of the distribution of the random variables. We illustrate the Sigma-Pi structure in the context of a simple growth model with successively born entities growing according to a stochastic proportional growth law, taking both Bernoulli, confirming the theoretical results, and half-normal random variables, for which the numerical results can be rationalized using insights from the Bernoulli case. We analyze the interdependence among entities represented by the product terms within the Sigma-Pi structure, the possible presence of memory in growth factors, and the contribution of each product term to the whole Sigma-Pi structure. We highlight the influence of the degree of interdependence among entities in the number of terms that effectively contribute to the total sum of sizes, reaching the limiting case of a single term dominating extreme values of the Sigma-Pi structure when all entities grow independently.

5.
Entropy (Basel) ; 22(3)2020 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-33286041

RESUMO

From the viewpoint of statistical physics, ecosystems in the real world are very attractive targets of research as examples of far-from thermal equilibrium systems where various kinds of components are coming in and out continuously while keeping the whole systems quasi-stationary. As a fortunate example of a fully-observable ecosystem, we analyzed the comprehensive data of convenience stores where approximately 5% of the commodity species is replaced by new ones daily. The share of stores for each species fluctuates significantly; however, the entire distribution of shares is fairly stationary and follows the log-uniform distribution, that is, the power law distribution with exponent 0. We introduce an empirical time evolution model of shares and firstly deduce that the key mechanism of realizing this stationary distribution is random multiplicative diffusion in finite size spaces. Our model based on the general stochastic process is expected to be applicable to various dynamic systems, especially complex systems with highly nonlinear interactions.

6.
Entropy (Basel) ; 22(2)2020 Feb 12.
Artigo em Inglês | MEDLINE | ID: mdl-33285984

RESUMO

Complexity and information theory are two very valuable but distinct fields of research, yet sharing the same roots. Here, we develop a complexity framework inspired by the allometric scaling laws of living biological systems in order to evaluate the structural features of networks. This is done by aligning the fundamental building blocks of information theory (entropy and mutual information) with the core concepts in network science such as the preferential attachment and degree correlations. In doing so, we are able to articulate the meaning and significance of mutual information as a comparative analysis tool for network activity. When adapting and applying the framework to the specific context of the business ecosystem of Japanese firms, we are able to highlight the key structural differences and efficiency levels of the economic activities within each prefecture in Japan. Moreover, we propose a method to quantify the distance of an economic system to its efficient free market configuration by distinguishing and quantifying two particular types of mutual information, total and structural.

7.
Phys Rev Lett ; 120(13): 138301, 2018 Mar 30.
Artigo em Inglês | MEDLINE | ID: mdl-29694225

RESUMO

A microscopic model is established for financial Brownian motion from the direct observation of the dynamics of high-frequency traders (HFTs) in a foreign exchange market. Furthermore, a theoretical framework parallel to molecular kinetic theory is developed for the systematic description of the financial market from microscopic dynamics of HFTs. We report first on a microscopic empirical law of traders' trend-following behavior by tracking the trajectories of all individuals, which quantifies the collective motion of HFTs but has not been captured in conventional order-book models. We next introduce the corresponding microscopic model of HFTs and present its theoretical solution paralleling molecular kinetic theory: Boltzmann-like and Langevin-like equations are derived from the microscopic dynamics via the Bogoliubov-Born-Green-Kirkwood-Yvon hierarchy. Our model is the first microscopic model that has been directly validated through data analysis of the microscopic dynamics, exhibiting quantitative agreements with mesoscopic and macroscopic empirical results.

8.
Entropy (Basel) ; 20(11)2018 Nov 06.
Artigo em Inglês | MEDLINE | ID: mdl-33266576

RESUMO

We introduce a systematic method to estimate an economic indicator from the Japanese government by analyzing big Japanese blog data. Explanatory variables are monthly word frequencies. We adopt 1352 words in the section of economics and industry of the Nikkei thesaurus for each candidate word to illustrate the economic index. From this large volume of words, our method automatically selects the words which have strong correlation with the economic indicator and resolves some difficulties in statistics such as the spurious correlation and overfitting. As a result, our model reasonably illustrates the real economy index. The announcement of an economic index from government usually has a time lag, while our proposed method can be real time.

9.
Entropy (Basel) ; 20(7)2018 Jul 07.
Artigo em Inglês | MEDLINE | ID: mdl-33265601

RESUMO

We use the definition of statistical symmetry as the invariance of a probability distribution under a given transformation and apply the concept to the underlying probability distribution of stochastic processes. To measure the degree of statistical asymmetry, we take the Kullback-Leibler divergence of a given probability distribution with respect to the corresponding transformed one and study it for the Gaussian autoregressive process using transformations on the temporal correlations' structure. We then illustrate the employment of this notion as a time series analysis tool by measuring local statistical asymmetries of foreign exchange market price data for three transformations that capture distinct autocorrelation behaviors of the series-independence, non-negative correlations and Markovianity-obtaining a characterization of price movements in terms of each statistical symmetry.

10.
Phys Rev Lett ; 112(9): 098703, 2014 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-24655287

RESUMO

We introduce a novel description of the dynamics of the order book of financial markets as that of an effective colloidal Brownian particle embedded in fluid particles. The analysis of comprehensive market data enables us to identify all motions of the fluid particles. Correlations between the motions of the Brownian particle and its surrounding fluid particles reflect specific layering interactions; in the inner layer the correlation is strong and with short memory, while in the outer layer it is weaker and with long memory. By interpreting and estimating the contribution from the outer layer as a drag resistance, we demonstrate the validity of the fluctuation-dissipation relation in this nonmaterial Brownian motion process.

11.
Sci Rep ; 14(1): 13380, 2024 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-38862614

RESUMO

Human mobility in an urban area is complicated; the origins, destinations, and transportation modes of each person differ. The quantitative description of urban human mobility has recently attracted the attention of researchers, and it highly related to urban science problems. Herein, combined with physics inspiration, we introduce a revised electric circuit model (RECM) in which moving people are regarded as charged particles and analogical concepts of electromagnetism such as human conductivity and human potential enable us to capture the characteristics of urban human mobility. We introduce the unit system, ensure the uniqueness of the calculation result, and reduce the computation cost of the algorithm to 1/10,000 compared with the original ECM, making the model more universal and easier to use. We compared features including human conductivity and potential between different major cities in Japan to show our improvement of the universality and the application range of the model. Furthermore, based on inspiration of physics, we propose a route generation model (RGM) to simulate a human flow pattern that automatically determines suitable routes between a given origin and destination as a source and sink, respectively. These discoveries are expected to lead to new approaches to the solution of urban science problems.

12.
Sci Rep ; 14(1): 10487, 2024 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-38714817

RESUMO

The convenience store industry in Japan holds immense significance, making a thorough comprehension of customer purchase behaviour invaluable for companies aiming to gain insights into their customer base. In this paper, we propose a novel application of a Markov network model to simulate purchases guided by stopping probabilities calculated from real data. Each node in the Markov network represents different product categories available for purchase. Additionally, we introduce the concept of a "driving force," quantifying the influence of purchasing product A on the likelihood of purchasing product B, compared to random purchasing. For instance, our analysis reveals that the inclusion of nutrient bars in a purchase set leads to, on average, a 13% reduction in tobacco purchases compared to random patterns. To validate our approach, we compare the simulated macro-level purchase behaviours with real point of Sale (POS) data obtained from a prominent convenience store giant, 7-Eleven. The dataset is comprised of roughly 54 million receipts, in which we focus on the product categories existing in this dataset rather than individual products. Our model successfully replicates the purchase size distribution for 99.9% of all purchases and the purchase counts across various product categories, demonstrating its efficacy in capturing broad purchase patterns.

13.
Sci Rep ; 14(1): 4628, 2024 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-38409204

RESUMO

We develop a single two-layered model framework that captures and replicates both the statistical properties of the network as well as those of the intrinsic quantities of the agents. Our model framework consists of two distinct yet connected elements that were previously only studied in isolation, namely methods related to temporal network structures and those associated with money transport flows. Within this context, the network structure emerges from the first layer and its topological structure is transferred to the second layer associated with the money transactions. In this manner, we can explain how the micro-level dynamics of the agents within the network lead to the exogenous manifestation of the aggregated system statistical data en-wrapping the very same agents within the system. This is done by capturing the essential dynamics of collective motion in complex networks that enable the simultaneous emergence of tent-shaped distributions in growth rates within the agents, together with the emergence of scaling properties within the network in the study. We can validate the model framework and dynamics by applying these to the context of the real-world inter-firm trading network of firms in Japan and comparing the results of the statistical distributions at both network and agent levels in a temporal manner. In particular, we compare our results to the fundamental quantities supporting the seven empirical laws observed in data: the degree distribution, the mean degree growth rate over time, the age distribution of the firms, the preferential attachment, the sales distribution in steady states, their growth rates, their scaling relations generated by the model. We find these results to be nearly identical to the real-world data. The framework has the potential to be transformed into a forecasting tool to support decision-makers on financial and prudential policies.

14.
Sci Rep ; 13(1): 22298, 2023 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-38102222

RESUMO

We analyze the time series of hashtag numbers of social media data. We observe that the usage distribution of hashtags is characterized by a fat-tailed distribution with a size-dependent power law exponent and we find that there is a clear dependency between the growth rate distributions of hashtags and size of hashtags usage. We propose a generalized random multiplicative process model with a theory that explains the size dependency of the fat-tailed distribution. Numerical simulations show that our model reproduces these size-dependent properties nicely. We expect that our model is useful for understanding the mechanism of fat-tailed distributions in various fields of science and technology.

15.
Phys Rev Lett ; 108(16): 168701, 2012 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-22680760

RESUMO

We propose a new type of stochastic network evolution model based on annihilation, creation, and coagulation of nodes, together with the preferential attachment rule. The system reaches a unique quasistatistically steady state in which the distribution of links follows a power law, lifetime of nodes follows an exponential distribution, and the mean number of links grows exponentially with time. The master equation of the model is solved analytically by applying Smoluchowski's coagulation equation for aerosols. The results indicate that coagulation of nodes in complex networks and mean field analysis of aerosols are similar in both the growth dynamics with irreversible processes and in the steady state statistics. We confirm that the basic properties of the model are consistent with the empirical results of a business transaction network having about 1×10(6) firms.

16.
Phys Rev E ; 106(6-1): 064304, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-36671187

RESUMO

We propose a data-driven stochastic method that allows the simulation of a complex system's long-term evolution. Given a large amount of historical data on trajectories in a multi-dimensional phase space, our method simulates the time evolution of a system based on a random selection of partial trajectories in the data without detailed knowledge of the system dynamics. We apply this method to a large data set of time evolution of approximately one million business firms for a quarter century. Accordingly, from simulations starting from arbitrary initial conditions, we obtain a stationary distribution in three-dimensional log-size phase space, which satisfies the allometric scaling laws of three variables. Furthermore, universal distributions of fluctuation around the scaling relations are consistent with the empirical data.


Assuntos
Simulação por Computador , Processos Estocásticos
17.
Sci Rep ; 12(1): 17888, 2022 10 25.
Artigo em Inglês | MEDLINE | ID: mdl-36284166

RESUMO

During the COVID-19 pandemic, governments faced difficulties in implementing mobility restriction measures, as no clear quantitative relationship between human mobility and infection spread in large cities is known. We developed a model that enables quantitative estimations of the infection risk for individual places and activities by using smartphone GPS data for the Tokyo metropolitan area. The effective reproduction number is directly calculated from the number of infectious social contacts defined by the square of the population density at each location. The difference in the infection rate of daily activities is considered, where the 'stay-out' activity, staying at someplace neither home nor workplace, is more than 28 times larger than other activities. Also, the contribution to the infection strongly depends on location. We imply that the effective reproduction number is sufficiently suppressed if the highest-risk locations or activities are restricted. We also discuss the effects of the Delta variant and vaccination.


Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , Número Básico de Reprodução , SARS-CoV-2 , Pandemias
18.
Sci Rep ; 12(1): 9918, 2022 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-35705582

RESUMO

Owing to the big data the extension of physical laws on nonmaterial has seen numerous successes, and human mobility is one of the scientific frontier topics. Recent GPS technology has made it possible to trace detailed trajectories of millions of people, macroscopic approaches such as the gravity law for human flow between cities and microscopic approaches of individual origin-destination distributions are attracting much attention. However, we need a more general basic model with wide applicability to realize traffic forecasting and urban planning of metropolis fully utilizing the GPS data. Here, based on a novel idea of treating moving people as charged particles, we introduce a method to map macroscopic human flows into currents on an imaginary electric circuit defined over a metropolitan area. Conductance is found to be nearly proportional to the maximum current in each location and synchronized human flows in the morning and evening are well described by the temporal changes of electric potential. Surprisingly, the famous fluctuation-dissipation theorem holds, namely, the variances of currents are proportional to the conductivities akin to an ordinary material.


Assuntos
Urbanização , Cidades , Sistemas de Informação Geográfica , Humanos
19.
Sci Rep ; 12(1): 3120, 2022 02 24.
Artigo em Inglês | MEDLINE | ID: mdl-35210492

RESUMO

Bursts and collective emotion have been widely studied in social physics field where researchers use mathematical models to understand human social dynamics. However, few researches recognize and separately analyze the internal and external influence on burst behaviors. To bridge this gap, we introduce a non-parametric approach to classify an interevent time series into five scenarios: random arrival, endogenous burst, endogenous non-burst, exogenous burst and exogenous non-burst. In order to process large-scale social media data, we first segment the interevent time series into sections by detecting change points. Then we use the rule-based algorithm to classify the time series based on its distribution. To validate our model, we analyze 27.2 million COVID-19 related comments collected from Chinese social media between January to October 2020. We adopt the emotion category called Profile of Mood States which consists of six emotions: Anger, Depression, Fatigue, Vigor, Tension and Confusion. This enables us to compare the burst features of different collective emotions during the COVID-19 period. The burst detection and classification approach introduced in this paper can also be applied to analyzing other complex systems, including but not limited to social media, financial market and signal processing.


Assuntos
COVID-19
20.
Appl Netw Sci ; 6(1): 75, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34660884

RESUMO

To prevent the spread of the COVID-19 pandemic, governments in various countries have severely restricted the movement of people. The large amount of detailed human location data obtained from mobile phone users is useful for understanding the change of flow patterns of people under the effect of pandemic. In this paper, we observe the synchronized human flow during the COVID-19 pandemic using Global Positioning System data of about 1 million people obtained from mobile phone users. We apply the drainage basin analysis method which we introduced earlier for characterization of macroscopic human flow patterns to observe the effect of the spreading pandemic. Before the pandemic the afternoon basin size distribution has been approximated by an exponential distribution, however, the distribution of Tokyo and Sapporo, which were most affected by the first wave of COVID-19, deviated significantly from the exponential distribution. On the other hand, during the morning rush hour, the scaling law holds universally, i.e., in all cities, even though the number of moving people in the basin has decreased significantly. The fact that these scaling laws, which are closely related to the three-dimensionality structure of the city and the fractal structure of the transportation network, have not changed indicates that the macroscopic human flow features are determined mainly by the means of transport and the basic structure of cities which are invariant of the pandemic.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA