Search | BVS Regional Portal

Unsupervised Liu-type shrinkage estimators for mixture of regression models.

Ghanem, Elsayed; Hatefi, Armin; Usefi, Hamid.

Stat Methods Med Res ; : 9622802241259175, 2024 Aug 28.

Article in English | MEDLINE | ID: mdl-39193788

ABSTRACT

The mixture of probabilistic regression models is one of the most common techniques to incorporate the information of covariates into learning of the population heterogeneity. Despite its flexibility, unreliable estimates can occur due to multicollinearity among covariates. In this paper, we develop Liu-type shrinkage methods through an unsupervised learning approach to estimate the model coefficients in the presence of multicollinearity. We evaluate the performance of our proposed methods via classification and stochastic versions of the expectation-maximization algorithm. We show using numerical simulations that the proposed methods outperform their Ridge and maximum likelihood counterparts. Finally, we apply our methods to analyze the bone mineral data of women aged 50 and older.
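As a rough illustration of the shrinkage idea mentioned in this abstract, the sketch below computes a Liu-type estimate for a single linear regression with two nearly collinear covariates. It is not the authors' unsupervised method for mixtures of regressions. It assumes the commonly cited two-parameter form (X'X + kI)^{-1}(X'y - d b*), where b* is a pilot estimate (here the ridge estimate). The function names, the simulated data, and the values of k and d are all hypothetical.

```python
import numpy as np

def ridge(X, y, k):
    """Ridge estimate: (X'X + kI)^{-1} X'y."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + k * np.eye(p), X.T @ y)

def liu_type(X, y, k, d, beta_star=None):
    """Liu-type estimate: (X'X + kI)^{-1} (X'y - d * beta_star).

    beta_star is a pilot estimate; by assumption it defaults to ridge.
    Setting d = 0 recovers the ridge estimate.
    """
    p = X.shape[1]
    if beta_star is None:
        beta_star = ridge(X, y, k)
    return np.linalg.solve(X.T @ X + k * np.eye(p), X.T @ y - d * beta_star)

# Simulated data with two nearly collinear covariates (illustrative only).
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = x1 + 0.01 * rng.normal(size=n)
X = np.column_stack([x1, x2])
y = X @ np.array([1.0, 2.0]) + rng.normal(size=n)

beta_ridge = ridge(X, y, k=1.0)
beta_liu = liu_type(X, y, k=1.0, d=0.5)
print("ridge:   ", beta_ridge)
print("Liu-type:", beta_liu)
```

In a mixture setting, one would presumably apply such an update within each component using membership weights from the EM steps, but that extension is not shown here.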

Bayesian mixture modelling with ranked set samples.

Alvandi, Amirhossein; Omidvar, Sedigheh; Hatefi, Armin; Jafari Jozani, Mohammad; Ozturk, Omer; Nematollahi, Nader.

Stat Med ; 43(19): 3723-3741, 2024 Aug 30.

Article in English | MEDLINE | ID: mdl-38890118

ABSTRACT

We consider the Bayesian estimation of the parameters of a finite mixture model from independent order statistics arising from imperfect ranked set sampling designs. As a cost-effective method, ranked set sampling enables us to incorporate easily attainable characteristics, as ranking information, into data collection and Bayesian estimation. To handle the special structure of the ranked set samples, we develop a Bayesian estimation approach exploiting the Expectation-Maximization (EM) algorithm in estimating the ranking parameters and Metropolis within Gibbs Sampling to estimate the parameters of the underlying mixture model. Our findings show that the proposed RSS-based Bayesian estimation method outperforms the commonly used Bayesian counterpart using simple random sampling. The developed method is finally applied to estimate the bone disorder status of women aged 50 and older.

Subjects

Algorithms, Bayes Theorem, Statistical Models, Humans, Female, Middle Aged, Aged, Computer Simulation, Monte Carlo Method, Likelihood Functions, Markov Chains

Efficient estimators with categorical ranked set samples: estimation procedures for osteoporosis.

Hatefi, Armin; Alvandi, Amirhossein.

J Appl Stat ; 49(4): 803-818, 2022.

Article in English | MEDLINE | ID: mdl-35707814

ABSTRACT

Ranked set sampling (RSS) design as a cost-effective sampling is a powerful tool in situations where measuring the variable of interest is costly and time-consuming; however, ranking information about sampling units can be obtained easily through inexpensive and easy to measure characteristics at little or no cost. In this paper, we study RSS data for analysis of an ordinal population. First, we compare the problem of non-representative extreme samples under RSS and commonly-used simple random sampling. Using RSS data with tie information, we propose non-parametric and maximum likelihood estimators for population parameters. Through extensive numerical studies, we investigate the effect of various factors including ranking ability, tie generating mechanisms, the number of categories and population setting on the performance of the estimators. Finally, we apply the proposed methods to the bone disorder data to estimate the proportions of patients with osteopenia and osteoporosis status.
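The ranked set sampling design described in this abstract can be sketched as follows. This is a minimal illustration of a balanced RSS scheme, assuming ranking is done on an inexpensive concomitant variable that is correlated with the costly variable of interest. It does not handle ties or ordinal categories as the paper does, and the function name, simulated population, and parameter values are hypothetical.

```python
import numpy as np

def ranked_set_sample(population, concomitant, set_size, cycles, rng):
    """Draw a balanced ranked set sample.

    In each cycle, for every rank r = 0, ..., set_size - 1, a fresh set of
    `set_size` units is drawn, the set is ranked by the cheap concomitant,
    and only the unit holding rank r is measured on the costly variable.
    """
    n = len(population)
    measured = []
    for _ in range(cycles):
        for r in range(set_size):
            idx = rng.choice(n, size=set_size, replace=False)
            ranked = idx[np.argsort(concomitant[idx])]
            measured.append(population[ranked[r]])
    return np.array(measured)

# Simulated population: y is costly to measure, x is a cheap, noisy proxy.
rng = np.random.default_rng(1)
y = rng.normal(loc=10.0, scale=2.0, size=5000)
x = y + rng.normal(scale=1.0, size=5000)

rss = ranked_set_sample(y, x, set_size=3, cycles=50, rng=rng)
srs = rng.choice(y, size=rss.size, replace=False)  # simple random sample of equal size
print("RSS mean:", rss.mean(), "SRS mean:", srs.mean())
```

Both samples measure the same number of units; the RSS design differs only in using the ranking information to decide which units to measure.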

Estimation of ordinal population with multi-observer ranked set samples using ties information.

Alvandi, Amirhossein; Hatefi, Armin.

Stat Methods Med Res ; 30(8): 1960-1975, 2021 08.

Article in English | MEDLINE | ID: mdl-34218747

ABSTRACT

In many surveys, we often deal with situations where measuring the study variable is expensive; however, there are easy-to-measure characteristics which can be used as ranking information to obtain more representative samples from the population. Ranked set sampling is successfully employed in these cases as an alternative to commonly used simple random sampling. When the data is ordinal categorical, it is common to apply the ordinal logistic regression approach to ranked set sampling data for the estimation of parameters. This technique first depends on the information of training data. Besides, one is not capable of using the ranking information in the estimation process. In this paper, we propose a ranked set sampling scheme in which ranking information from multiple sources can be combined and incorporated efficiently into both data collection and estimation. The ranked set sampling data is used for non-parametric and maximum likelihood estimation of ordinal categorical population. Through extensive simulation studies, the performance of estimators is evaluated. The methods are finally applied to analyze bone disorder data and obesity data.

Subjects

Obesity, Computer Simulation, Humans, Logistic Models

An improved procedure for estimation of malignant breast cancer prevalence using partially rank ordered set samples with multiple concomitants.

Hatefi, Armin; Jafari Jozani, Mohammad.

Stat Methods Med Res ; 26(6): 2552-2566, 2017 Dec.

Article in English | MEDLINE | ID: mdl-26311819

ABSTRACT

Rank-based sampling designs are widely used in situations where measuring the variable of interest is costly but a small number of sampling units (set) can be easily ranked prior to taking the final measurements on them and this can be done at little cost. When the variable of interest is binary, a common approach for ranking the sampling units is to estimate the probabilities of success through a logistic regression model. However, this requires training samples for model fitting. Also, in this approach once a sampling unit has been measured, the extra rank information obtained in the ranking process is not used further in the estimation process. To address these issues, in this paper, we propose to use the partially rank-ordered set sampling design with multiple concomitants. In this approach, instead of fitting a logistic regression model, a soft ranking technique is employed to obtain a vector of weights for each measured unit that represents the probability or the degree of belief associated with its rank among a small set of sampling units. We construct an estimator which combines the rank information and the observed partially rank-ordered set measurements themselves. The proposed methodology is applied to a breast cancer study to estimate the proportion of patients with malignant (cancerous) breast tumours in a given population. Through extensive numerical studies, the performance of the estimator is evaluated under various concomitants with different ranking potentials (i.e. good, intermediate and bad) and tie structures among the ranks. We show that the precision of the partially rank-ordered set estimator is better than its counterparts under simple random sampling and ranked set sampling designs and, hence, the sample size required to achieve a desired precision is reduced.

Subjects

Biostatistics/methods, Breast Neoplasms/epidemiology, Breast Neoplasms/diagnosis, Breast Neoplasms/pathology, Factual Databases/statistics & numerical data, Female, Humans, Logistic Models, Statistical Models, Prevalence, Sample Size, Sampling Studies, Nonparametric Statistics
