Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 51
Filter
1.
An Acad Bras Cienc ; 96(2): e20230126, 2024.
Article in English | MEDLINE | ID: mdl-38808875

ABSTRACT

A statistical analysis of maximum temperature from twelve weather stations in parts of Guinea is provided. Using maximum likelihood estimation, maximum temperature data was fitted by the Generalized Extreme Value distribution. Data from all of the twelve stations were adequately fit by the Generalized Extreme Value distribution. Return level estimates are provided. Significant trends in maximum temperature were found for four of the stations. The four stations exhibited significant positive trends at the 5% significance level.


Subject(s)
Models, Statistical , Temperature , Guinea , Likelihood Functions
2.
PLoS One ; 19(3): e0299164, 2024.
Article in English | MEDLINE | ID: mdl-38478502

ABSTRACT

In the dynamic landscape of financial markets, accurate forecasting of stock indices remains a pivotal yet challenging task, essential for investors and policymakers alike. This study is motivated by the need to enhance the precision of predicting the Shanghai Composite Index's opening price spread, a critical measure reflecting market volatility and investor sentiment. Traditional time series models like ARIMA have shown limitations in capturing the complex, nonlinear patterns inherent in stock price movements, prompting the exploration of advanced methodologies. The aim of this research is to bridge the gap in forecasting accuracy by developing a hybrid model that integrates the strengths of ARIMA with deep learning techniques, specifically Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks. This novel approach leverages the ARIMA model's proficiency in linear trend analysis and the deep learning models' capability in modeling nonlinear dependencies, aiming to provide a comprehensive tool for market prediction. Utilizing a comprehensive dataset covering the period from December 20, 1990, to June 2, 2023, the study develops and assesses the efficacy of ARIMA, LSTM, GRU, ARIMA-LSTM, and ARIMA-GRU models in forecasting the Shanghai Composite Index's opening price spread. The evaluation of these models is based on key statistical metrics, including Mean Squared Error (MSE) and Mean Absolute Error (MAE), to gauge their predictive accuracy. The findings indicate that the hybrid models, ARIMA-LSTM and ARIMA-GRU, perform better in forecasting the opening price spread of the Shanghai Composite Index than their standalone counterparts. This outcome suggests that combining traditional statistical methods with advanced deep learning algorithms can enhance stock market prediction. The research contributes to the field by providing evidence of the potential benefits of integrating different modeling approaches for financial forecasting, offering insights that could inform investment strategies and financial decision-making.


Subject(s)
Algorithms , Benchmarking , China , Investments , Memory, Long-Term , Forecasting
3.
Afr J Reprod Health ; 27(2): 57-66, 2023 Feb.
Article in English | MEDLINE | ID: mdl-37584940

ABSTRACT

Maternal mortality is a critical measure for quality of health system in any country and hence many countries have made concerted efforts to check its occurrence. Various stakeholders involved in the management of health system in Ghana have been tasked to ensure women do not die whilst giving birth. This study was conducted on a sample of 1,052 women selected from all the ten administrative regions of Ghana in which 188 maternal deaths occurred. Bayesian logistic modeling was used. Age at death, marital status, age, season, region, place of death, place of residence, religion and ethnicity emerged as the most significant determinants of maternal mortality in Ghana. It was realized that high numbers of maternal deaths were recorded in the least developed regions in the northern region. It is therefore important for stakeholders to devise a road map of getting health workers to accept postings to the rural areas and also provide well resourced health facilities to stem this menace.


Subject(s)
Maternal Death , Maternal Mortality , Humans , Pregnancy , Female , Ghana/epidemiology , Bayes Theorem , Marital Status
4.
Sci Rep ; 13(1): 11443, 2023 07 15.
Article in English | MEDLINE | ID: mdl-37454188

ABSTRACT

Tuberculosis, an airborne disease, is the deadliest human infectious disease caused by one single agent. The African region is among the most affected and most burdensome area in terms of tuberculosis cases. In this paper, we modeled the number of new cases of tuberculosis for 2000-2021 by integer time series. For each African country, we fitted twenty different models and selected the model that best fitted the data. The twenty models were mostly based on the number of new cases following either the Poisson or negative binomial distribution with the rate parameter allowed to vary linearly or quadratically with respect to year. The best fitted models were used to give predictions for 2022-2031.


Subject(s)
Tuberculosis , Humans , Time Factors , Tuberculosis/epidemiology , Africa/epidemiology , Models, Statistical
5.
Sci Rep ; 13(1): 10814, 2023 07 04.
Article in English | MEDLINE | ID: mdl-37402872

ABSTRACT

We provide an extreme value analysis of daily new cases of COVID-19. We use data from Benin, Burkina Faso, Cabo Verde, Cote d'Ivoire, Gambia, Ghana, Guinea, Guinea-Bissau, Liberia, Mali, Mauritania, Niger, Nigeria, Senegal, Sierra Leone and Togo, covering a period of 37 months. Extreme values were defined as monthly maximums of daily new cases. The generalized extreme value distribution was fitted to them with two of its three parameters allowed to vary linearly or quadratically with respect to month number. Ten of the sixteen countries were found to exhibit significant downward trends in monthly maximums. The adequacy of fits was assessed by probability plots and the Kolmogorov-Smirnov test. The fitted models were used to derive quantiles of the monthly maximum of new cases as well as their limits when the month number is taken to infinity.


Subject(s)
COVID-19 , Humans , COVID-19/epidemiology , Africa, Western/epidemiology , Burkina Faso , Senegal , Niger , Mali
6.
PLoS One ; 18(6): e0287011, 2023.
Article in English | MEDLINE | ID: mdl-37310978

ABSTRACT

We carry out a time series analysis on the yearly crop yield data in six east African countries (Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda) using the autoregressive integrated moving average (ARIMA) model. We describe the upper tail of the yearly crop yield data in those countries using the power law, lognormal, Fréchet and stretched exponential distributions. The forecast of the fitted ARIMA models suggests that the majority of the crops in different countries will experience neither an increase nor a decrease in yield from 2019 to 2028. A few exceptional cases correspond to significant increase in the yield of sorghum and coffee in Burundi and Rwanda, respectively, and significant decrease in the yield of beans in Burundi, Kenya and Rwanda. Based on Vuong's similarity test p-value, we find that the power law distribution captured the upper tails of yield distribution better than other distributions with just one exceptional case in Uganda, suggesting that these crops have the tendency for producing high yield. We find that only sugar cane in Somalia and sweet potato in Tanzania have the potential of producing extremely high yield. We describe the yield behaviour of these two crops as black swan, where the "rich getting richer" or the "preferential attachment" could be the underlying generating process. Other crops in Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda can only produce high but not extremely high yields. Various climate adaptation/smart strategies (use of short-duration pigeon pea varieties, use of cassava mosaic disease resistant cassava varieties, use of improved maize varieties, intensive manuring with a combination of green and poultry manure, early planting, etc) that could be adapted to increase yields in east Africa are suggested. The paper could be useful for future agricultural planning and rates calibration in crop risk insurance.


Subject(s)
Crops, Agricultural , Humans , Burundi , Edible Grain , Rwanda , Tanzania , Time Factors , Crop Production , Africa, Eastern
7.
PLoS One ; 18(5): e0285183, 2023.
Article in English | MEDLINE | ID: mdl-37146020

ABSTRACT

Although many data sets are discrete and heavy tailed (for example, number of claims and claim amounts if recorded as rounded values), not many discrete heavy tailed distributions are available in the literature. In this paper, we discuss thirteen known discrete heavy tailed distributions, propose nine new discrete heavy tailed distributions and give expressions for their probability mass functions, cumulative distribution functions, hazard rate functions, reversed hazard rate functions, means, variances, moment generating functions, entropies and quantile functions. Tail behaviour and a measure of asymmetry are used to compare the known and new discrete heavy tailed distributions. The better fits of the discrete heavy tailed distributions over their continuous counterparts as assessed by probability plots are illustrated using three data sets. Finally, a simulated study is performed to assess the finite sample performance of the maximum likelihood estimators used in the data application section.


Subject(s)
Likelihood Functions , Statistical Distributions
8.
Heliyon ; 9(4): e14771, 2023 Apr.
Article in English | MEDLINE | ID: mdl-37035386

ABSTRACT

Objective: Zimbabwe is one of the poorest countries in the world, just emerging from a broken health care system. The objective is to figure out variables affecting hospital charge for Zimbabwe. Material and methods: The variables used are sex, smoking status, number of children, region, age and body mass index. The first six of these are factors and the remaining are covariates. A mixture model was fitted to describe the dependence of hospital charge on these variables. Results: A mixture model with five components each having a reversed Gumbel distribution was found to give an adequate fit. Both the covariates and all but one of the factors were found to be significant. Estimates of value at risk of hospital charge are given for all combinations of the factors. Conclusions: The results suggest that the hospital charge could be higher for females, higher for smokers, higher if the patient had more children and higher if the patient is older. Further, estimates of value at risk given suggest, for example, that a 90 year old female not smoking and having no children and an average body mass index will have a hospital charge less than Z$49851 with probability 0.999.

9.
Entropy (Basel) ; 25(3)2023 Mar 20.
Article in English | MEDLINE | ID: mdl-36981422

ABSTRACT

Entropies are useful measures of variation. However, explicit expressions for entropies available in the literature are limited. In this paper, we provide a comprehensive collection of explicit expressions for four of the most common entropies for over sixty continuous univariate distributions. Most of the derived expressions are new. The explicit expressions involve known special functions.

10.
J Appl Stat ; 49(16): 4097-4121, 2022.
Article in English | MEDLINE | ID: mdl-36353294

ABSTRACT

Process capability indices (PCIs) are most effective devices/techniques used in industries for determining the quality of products and performance of manufacturing processes. In this article, we consider the PCI Cpc which is based on the proportion of conformance and is applicable to normally as well as non-normally and continuous as well as discrete distributed processes. In order to estimate the PCI Cpc when the process follows exponentiated exponential distribution, we have used five classical methods of estimation. The performances of these classical estimators are compared with respect to their biases and mean squared errors (MSEs) of the index Cpc through simulation study. Also, the confidence intervals for the index Cpc are constructed using five bootstrap confidence interval (BCIs) methods. Monte Carlo simulation study has been carried out to compare the performances of these five BCIs in terms of their average width and coverage probabilities. Besides, net sensitivity (NS) analysis for the given PCI Cpc is considered. We use two data sets related to electronic and food industries and two failure time data sets to illustrate the performance of the proposed methods of estimation and BCIs. Additionally, we have developed PCI Cpc using aforementioned methods for generalized Rayleigh distribution.

11.
Comput Econ ; : 1-34, 2022 Nov 03.
Article in English | MEDLINE | ID: mdl-36345292

ABSTRACT

Research on the exchange rate volatility and dynamic conditional correlation of African currencies/financial markets interdependence appears to be limited. In this paper, we employ GARCH models to characterize the exchange rate volatility of eight major African currencies. The variation of interdependence with respect to time is described using the DCC-GARCH model. From the results of the DCC, remarkable variations in correlations through time across these countries are observed with the correlations varying from low to moderate, suggesting that African economies are generally governed by certain economic factors and are vastly regulated. These regulations, including exchange rate misalignment led to sluggish and negative growth in most of the African countries. For instance, persistent misalignment can cause high levels of inflation, for example, undervaluation. Overvaluation can lead to trade imbalances and they can in turn create macroeconomic instability and balance of payment problems. Given these results, we suggest that policy makers should revamp and adopt state resilience so as to reduce the negative effect of exchange rate misalignment on economic growth.

12.
Sankhya Ser A ; : 1-28, 2022 Sep 09.
Article in English | MEDLINE | ID: mdl-36105539

ABSTRACT

The mathematical modeling of the coronavirus disease-19 (COVID-19) pandemic has been attempted by a large number of researchers from the very beginning of cases worldwide. The purpose of this research work is to find and classify the modelling of COVID-19 data by determining the optimal statistical modelling to evaluate the regular count of new COVID-19 fatalities, thus requiring discrete distributions. Some discrete models are checked and reviewed, such as Binomial, Poisson, Hypergeometric, discrete negative binomial, beta-binomial, Skellam, beta negative binomial, Burr, discrete Lindley, discrete alpha power inverse Lomax, discrete generalized exponential, discrete Marshall-Olkin Generalized exponential, discrete Gompertz-G-exponential, discrete Weibull, discrete inverse Weibull, exponentiated discrete Weibull, discrete Rayleigh, and new discrete Lindley. The probability mass function and the hazard rate function are addressed. Discrete models are discussed based on the maximum likelihood estimates for the parameters. A numerical analysis uses the regular count of new casualties in the countries of Angola,Ethiopia, French Guiana, El Salvador, Estonia, and Greece. The empirical findings are interpreted in-depth.

13.
Sci Rep ; 12(1): 7698, 2022 05 11.
Article in English | MEDLINE | ID: mdl-35546167

ABSTRACT

The first statistical analysis of maximum rainfall in Zimbabwe is provided. The data are from 103 stations spread across the different climatic regions of Zimbabwe. More than 90% of the stations had at least 50 years of data. The generalized extreme value distribution was fitted to maximum rainfall by the method of maximum likelihood. Probability plots, quantile plots and Kolmogorov-Smirnov tests showed that the generalized extreme value distribution provided an adequate fit for all stations. The vast majority of stations do not exhibit significant trends in rainfall. Twelve of the stations exhibit negative trends and three of the stations exhibit positive trends in rainfall. Estimates of return levels are given for 2, 5, 10, 20, 50 and 100 years.


Subject(s)
Models, Statistical , Rain , Probability , Zimbabwe
14.
Healthc Anal (N Y) ; 2: 100086, 2022 Nov.
Article in English | MEDLINE | ID: mdl-37520619

ABSTRACT

The COVID-19 series is obviously one of the most volatile time series with lots of spikes and oscillations. The conventional integer-valued auto-regressive time series models (INAR) may be limited to account for such features in COVID-19 series such as severe over-dispersion, excess of zeros, periodicity, harmonic shapes and oscillations. This paper proposes alternative formulations of the classical INAR process by considering the class of high-ordered INAR models with harmonic innovation distributions. Interestingly, the paper further explores the bivariate extension of these high-ordered INARs. South Africa and Mauritius' COVID-19 series are re-scrutinized under the optic of these new INAR processes. Some simulation experiments are also executed to validate the new models and their estimation procedures.

15.
BMC Res Notes ; 14(1): 331, 2021 Aug 27.
Article in English | MEDLINE | ID: mdl-34452631

ABSTRACT

OBJECTIVE: In Africa, most countries continue to battle COVID-19 with cases of newly infected still being recorded. In this note, we investigate how socioeconomic and demographic factors affected individuals awareness on the methods for controlling/preventing the spread of COVID-19 in some parts of Africa at the onset of the pandemic. RESULTS: Based on regression modelling, we find that having full awareness does not depend on religious affiliation. Men, urban dwelling, holding bachelors or higher degrees, operating multiple social media accounts or being employed are associated with having full awareness of the recommended practices for the prevention and control of COVID-19 at the early stage of the pandemic. No occupation, business or older people are associated with not having full awareness.


Subject(s)
COVID-19 , Social Media , Africa , Aged , Demography , Humans , Male , Pandemics , SARS-CoV-2 , Socioeconomic Factors
16.
Sci Rep ; 11(1): 12309, 2021 06 10.
Article in English | MEDLINE | ID: mdl-34112895

ABSTRACT

We provide a survival analysis of cancer patients in Zimbabwe. Our results show that young cancer patients have lower but not significant hazard rate compared to old cancer patients. Male cancer patients have lower but not significant hazard rate compared to female cancer patients. Race and marital status are significant risk factors for cancer patients in Zimbabwe.


Subject(s)
Cancer Survivors , Neoplasms/epidemiology , Socioeconomic Factors , Adult , Aged , Female , Humans , Kaplan-Meier Estimate , Male , Marital Status , Middle Aged , Neoplasms/pathology , Risk Factors , Zimbabwe/epidemiology
17.
Physica A ; 563: 125460, 2021 Feb 01.
Article in English | MEDLINE | ID: mdl-33162665

ABSTRACT

At the end of 2019, the current novel coronavirus emerged as a severe acute respiratory disease that has now become a worldwide pandemic. Future generations will look back on this difficult period and see how our society as a whole united and rose to this challenge. Many reports have suggested that this new virus is becoming comparable to the Spanish flu pandemic of 1918. We provide a statistical study on the modelling and analysis of the daily incidence of COVID-19 in eighteen countries around the world. In particular, we investigate whether it is possible to fit count regression models to the number of daily new cases of COVID-19 in various countries and make short term predictions of these numbers. The results suggest that the biggest advantage of these methods is that they are simplistic and straightforward allowing us to obtain preliminary results and an overall picture of the trends in the daily confirmed cases of COVID-19 around the world. The best fitting count regression model for modelling the number of new daily COVID-19 cases of all countries analysed was shown to be a negative binomial distribution with log link function. Whilst the results cannot solely be used to determine and influence policy decisions, they provide an alternative to more specialised epidemiological models and can help to support or contradict results obtained from other analysis.

18.
PLoS One ; 15(10): e0239652, 2020.
Article in English | MEDLINE | ID: mdl-33006975

ABSTRACT

In this paper, we propose six Student's t based compound distributions where the scale parameter is randomized using functional forms of the half normal, Fréchet, Lomax, Burr III, inverse gamma and generalized gamma distributions. For each of the proposed distribution, we give expressions for the probability density function, cumulative distribution function, moments and characteristic function. GARCH models with innovations taken to follow the compound distributions are fitted to the data using the method of maximum likelihood. For the sample data considered, we see that all but two of the proposed distributions perform better than two popular distributions. Finally, we perform a simulation study to examine the accuracy of the best performing model.


Subject(s)
Financial Management/statistics & numerical data , Models, Economic , Computer Simulation , Humans , Investments/statistics & numerical data , Likelihood Functions , Models, Statistical , Statistical Distributions
19.
J Appl Stat ; 47(5): 947-949, 2020.
Article in English | MEDLINE | ID: mdl-35707328

ABSTRACT

Mazucheli et al. [On the one parameter unit-Lindley distribution and its associated regression model for proportion data. J. Appl. Stat. 46 (2019), pp. 700-714] introduced the unit Lindley distribution. The expressions given for the moments and incomplete moments are either not correct or not in closed form. We derive closed form expressions moments and incomplete moments. The expressions are elementary except for the exponential integral.

20.
PLoS One ; 14(8): e0221487, 2019.
Article in English | MEDLINE | ID: mdl-31450236

ABSTRACT

Several lifetime distributions have played an important role to fit survival data. However, for some of these models, the computation of maximum likelihood estimators is quite difficult due to presence of flat regions in the search space, among other factors. Several well-known derivative-based optimization tools are unsuitable for obtaining such estimates. To circumvent this problem, we introduce the AdequacyModel computational library version 2.0.0 for the R statistical environment with two major contributions: a general optimization technique based on the Particle Swarm Optimization (PSO) method (with a minor modification of the original algorithm) and a set of statistical measures for assessment of the adequacy of the fitted model. This library is very useful for researchers in probability and statistics and has been cited in various papers in these areas. It serves as the basis for the Newdistns library (version 2.1) published in an impact journal in the area of computational statistics, see https://CRAN.R-project.org/package=Newdistns. It is also the basis of the Wrapped library (version 2.0), see https://CRAN.R-project.org/package=Wrapped. A third package making use of the AdequacyModel library can be found in https://CRAN.R-project.org/package=sglg. In addition, the proposed library has proved to be very useful for maximizing log-likelihood functions with complex search regions. The library provides a greater control of the optimization process by introducing a stop criterion based on a minimum number of iterations and the variance of a given proportion of optimal values. We emphasize that the new library can be used not only in statistics but in physics and mathematics as proved in several examples throughout the paper.


Subject(s)
Probability , Software , Algorithms , Computer Simulation , Monte Carlo Method
SELECTION OF CITATIONS
SEARCH DETAIL
...