Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 56
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Demography ; 58(2): 773-784, 2021 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-33834231

RESUMO

We revisit a novel causal model published in Demography by Hicks et al. (2018), designed to assess whether exposure to neighborhood disadvantage over time affects children's reading and math skills. Here, we provide corrected and new results. Reconsideration of the model in the original article raised concerns about bias due to exposure-induced confounding (i.e., past exposures directly affecting future exposures) and true state dependence (i.e., past exposures affecting confounders of future exposures). Through simulation, we show that our originally proposed propensity function approach displays modest bias due to exposure-induced confounding but no bias from true state dependence. We suggest a correction based on residualized values and show that this new approach corrects for the observed bias. We contrast this revised method with other causal modeling approaches using simulation. Finally, we reproduce the substantive models from Hicks et al. (2018) using the new residuals-based adjustment procedure. With the correction, our findings are essentially identical to those reported originally. We end with some conclusions regarding approaches to causal modeling.


Assuntos
Características da Vizinhança , Características de Residência , Viés , Criança , Humanos , Matemática , Projetos de Pesquisa
2.
Demography ; 55(1): 1-31, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-29192386

RESUMO

Prior research has suggested that children living in a disadvantaged neighborhood have lower achievement test scores, but these studies typically have not estimated causal effects that account for neighborhood choice. Recent studies used propensity score methods to account for the endogeneity of neighborhood exposures, comparing disadvantaged and nondisadvantaged neighborhoods. We develop an alternative propensity function approach in which cumulative neighborhood effects are modeled as a continuous treatment variable. This approach offers several advantages. We use our approach to examine the cumulative effects of neighborhood disadvantage on reading and math test scores in Los Angeles. Our substantive results indicate that recency of exposure to disadvantaged neighborhoods may be more important than average exposure for children's test scores. We conclude that studies of child development should consider both average cumulative neighborhood exposure and the timing of this exposure.


Assuntos
Sucesso Acadêmico , Áreas de Pobreza , Características de Residência/estatística & dados numéricos , Meio Social , Adolescente , Criança , Desenvolvimento Infantil , Pré-Escolar , Feminino , Humanos , Estudos Longitudinais , Los Angeles , Masculino , Matemática , Leitura , Fatores Sexuais , Isolamento Social , Fatores Socioeconômicos , Estresse Psicológico/epidemiologia , Fatores de Tempo , Violência/estatística & dados numéricos
3.
Epidemiology ; 26(6): 846-52, 2015 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-26258908

RESUMO

BACKGROUND: Respondent-driven sampling is used worldwide to estimate the population prevalence of characteristics, such as HIV/AIDS and associated risk factors in hard-to-reach populations. Estimating the total size of these populations is of great interest to national and international organizations; however, reliable measures of population size often do not exist. METHODS: Successive sampling-population size estimation (SS-PSE) along with network size imputation allows population size estimates to be made without relying on separate studies or additional data (as in network scale-up, multiplier, and capture-recapture methods), which may be biased. RESULTS: Ten population size estimates were calculated for people who inject drugs, female sex workers, men who have sex with other men, and migrants from sub-Saharan Africa in six different cities in Morocco. SS-PSE estimates fell within or very close to the likely values provided by experts and the estimates from previous studies using other methods. CONCLUSIONS: SS-PSE is an effective method for estimating the size of hard-to-reach populations that leverages important information within respondent-driven sampling studies. The addition of a network size imputation method helps to smooth network sizes allowing for more accurate results. However, caution should be used particularly when there is reason to believe that clustered subgroups may exist within the population of interest or when the sample size is small in relation to the population.


Assuntos
Bissexualidade/estatística & dados numéricos , Usuários de Drogas/estatística & dados numéricos , Infecções por HIV/epidemiologia , Homossexualidade Masculina/estatística & dados numéricos , Densidade Demográfica , Profissionais do Sexo/estatística & dados numéricos , Abuso de Substâncias por Via Intravenosa/epidemiologia , Adulto , Feminino , Humanos , Masculino , Marrocos/epidemiologia , Prevalência , Tamanho da Amostra , Estudos de Amostragem , Comportamento Sexual , Inquéritos e Questionários
4.
Biometrics ; 71(1): 258-266, 2015 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-25585794

RESUMO

The study of hard-to-reach populations presents significant challenges. Typically, a sampling frame is not available, and population members are difficult to identify or recruit from broader sampling frames. This is especially true of populations at high risk for HIV/AIDS. Respondent-driven sampling (RDS) is often used in such settings with the primary goal of estimating the prevalence of infection. In such populations, the number of people at risk for infection and the number of people infected are of fundamental importance. This article presents a case-study of the estimation of the size of the hard-to-reach population based on data collected through RDS. We study two populations of female sex workers and men-who-have-sex-with-men in El Salvador. The approach is Bayesian and we consider different forms of prior information, including using the UNAIDS population size guidelines for this region. We show that the method is able to quantify the amount of information on population size available in RDS samples. As separate validation, we compare our results to those estimated by extrapolating from a capture-recapture study of El Salvadorian cities. The results of our case-study are largely comparable to those of the capture-recapture study when they differ from the UNAIDS guidelines. Our method is widely applicable to data from RDS studies and we provide a software package to facilitate this.


Assuntos
Interpretação Estatística de Dados , Infecções por HIV/epidemiologia , Homossexualidade Masculina/estatística & dados numéricos , Modelos Estatísticos , Medição de Risco/métodos , População Urbana/estatística & dados numéricos , Simulação por Computador , El Salvador/epidemiologia , Métodos Epidemiológicos , Humanos , Masculino , Prevalência , Reprodutibilidade dos Testes , Tamanho da Amostra , Sensibilidade e Especificidade
5.
J Urban Health ; 92(6): 1052-64, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-26392276

RESUMO

African-American men who have sex with men (AA MSM) have been disproportionately infected with and affected by HIV and other STIs in San Francisco and the USA. The true scope and scale of the HIV epidemic in this population has not been quantified, in part because the size of this population remains unknown. We used the successive sampling population size estimation (SS-PSE) method, a new Bayesian approach to population size estimation that incorporates network size data routinely collected in respondent-driven sampling (RDS) studies, to estimate the number of AA MSM in San Francisco. This method was applied to data from a 2009 RDS study of AA MSM. An estimate from a separate study of local AA MSM was used to model the prior distribution of the population size. Two-hundred and fifty-six AA MSM were included in the RDS survey. The estimated population size was 4917 (95% CI 1267-28,771), using a flat prior estimated 1882 (95% CI 919-2463) as a lower acceptable bound, and a large prior estimated 6762 (95% CI 1994-13,863) as an acceptable upper bound. Point estimates from the SS-PSE were consistent with estimates from multiplier methods using external data. The SS-PSE method is easily integrated into RDS studies and therefore provides a simple and appealing tool to rapidly produce estimates of the size of key populations otherwise difficult to reach and enumerate.


Assuntos
Negro ou Afro-Americano/estatística & dados numéricos , Infecções por HIV/epidemiologia , Homossexualidade Masculina/estatística & dados numéricos , Comportamento Sexual/estatística & dados numéricos , Adolescente , Adulto , Fatores Etários , Idoso , Teorema de Bayes , Humanos , Masculino , Pessoa de Meia-Idade , Estudos de Amostragem , São Francisco/epidemiologia , Fatores Socioeconômicos , Adulto Jovem
6.
Influenza Other Respir Viruses ; 18(4): e13277, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38544454

RESUMO

BACKGROUND: Following the first locally transmitted case in Sukhbaatar soum, Selenge Province, we aimed to investigate the ultimate scale of the epidemic in the scenario of uninterrupted transmission. METHODS: This was a prospective case study following the locally modified WHO FFX cases generic protocol. A rapid response team collected data from November 14 to 29, 2020. We created a stochastic process to draw many transmission chains from this greater distribution to better understand and make inferences regarding the outbreak under investigation. RESULTS: The majority of the cases involved household transmissions (35, 52.2%), work transmissions (20, 29.9%), index (5, 7.5%), same apartment transmissions (2, 3.0%), school transmissions (2, 3.0%), and random contacts between individuals transmissions (1, 1.5%). The posterior means of the basic reproduction number of both the asymptomatic cases R 0 Asy $$ {R}_0^{Asy} $$ and the presymptomatic cases R 0 Pre $$ {R}_0^{Pre} $$ (1.35 [95% CrI 0.88-1.86] and 1.29 [95% CrI 0.67-2.10], respectively) were lower than that of the symptomatic cases (2.00 [95% Crl 1.38-2.76]). CONCLUSION: Our study highlights the heterogeneity of COVID-19 transmission across different symptom statuses and underscores the importance of early identification and isolation of symptomatic cases in disease control. Our approach, which combines detailed contact tracing data with advanced statistical methods, can be applied to other infectious diseases, facilitating a more nuanced understanding of disease transmission dynamics.


Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , SARS-CoV-2 , Mongólia , Busca de Comunicante , Surtos de Doenças/prevenção & controle
7.
J Stat Softw ; 52(2): i02, 2013 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-24307887

RESUMO

Exponential-family random graph models (ERGMs) represent a powerful and flexible class of models for the statistical analysis of networks. statnet is a suite of software packages that implement these models. This paper details how the capabilities for ERGM modeling can be expanded and customized by programming additional network statistics that may be included in ERGMs. We describe a template R package called ergm.userterms that can be modified for this purpose. It is designed to make this process as straightforward as possible. We also explain some of the internal workings of statnet that will help users develop their own network analysis capabilities.

8.
J Comput Graph Stat ; 32(2): 388-401, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37608920

RESUMO

Exponential-family Random Graph Models (ERGMs) have long been at the forefront of the analysis of relational data. The exponential-family form allows complex network dependencies to be represented. Models in this class are interpretable, flexible and have a strong theoretical foundation. The availability of powerful user-friendly open-source software allows broad accessibility and use. However, ERGMs sometimes suffer from a serious condition known as near-degeneracy, in which the model exhibits unrealistic probabilistic behavior or a severe lack-of-fit to real network data. Recently, Fellows and Handcock (2017) proposed a new model class, the Tapered ERGM, which circumvents the issue of near-degeneracy while maintaining the desirable features of ERGMs. However, the question of how to determine the proper amount of tapering needed for any model was heretofore left unanswered. This paper develops a new methodology for how to determine the necessary level of tapering and as such provides a new approach to inference for the Tapered ERGM class. Noting that a Tapered ERGM can always be made non-degenerate, we offer data-driven approaches for determining the amount of tapering necessary. The mean-value parameter estimates are unaffected by tapering, and we show that the natural parameter estimates are numerically weakly varying by the level of tapering. We then apply the Tapered ERGM to two published networks to demonstrate its effectiveness in cases where typical ERGMs fail and present the case for Tapered ERGMs replacing ERGMs entirely.

9.
Metron ; 81(1): 21-35, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37284420

RESUMO

Networked populations consist of inhomogeneous individuals connected via relational ties. The individuals typically vary in multivariate attributes. In some cases primary interest focuses on individual attributes and in others the understanding of the social structure of the ties. In many circumstances both are of interest, as is their relationship. In this paper we consider this last, most general, case. We model the joint distribution of social ties and individual attributes when the population is only partially observed. Of central interest is when the population is surveyed using a network sampling design. A second situation is when data about a subset of the ties and/or the individual attributes is unintentionally missing. Exponential-family random network models (ERNM)s are capable of specifying a joint statistical representation of both the ties of a network and individual attributes. This class of models allow the nodal attributes to be modeled as stochastic processes, expanding the range and realism of exponential-family approaches to network modeling. In this paper we develop a theory of inference for ERNMs when only part of the network is observed, as well as specific methodology for partially observed networks, including non-ignorable mechanisms for network-based sampling designs. In particular, we consider data collected via contact tracing, of considerable importance to infectious disease epidemiology and public health.

10.
J R Stat Soc Ser A Stat Soc ; 186(4): 682-706, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-38145242

RESUMO

Many demographic problems require models for partnership formation. We consider a model for matchings within a bipartite population where individuals have utility for people based on observed and unobserved characteristics. It represents both the availability of potential partners of different types and the preferences of individuals for such people. We develop an estimator for the preference parameters based on sample survey data on partnerships and population composition. We conduct simulation studies based on the Survey of Income and Program Participation showing that the estimator recovers preference parameters that are invariant under different population availabilities and has the correct confidence coverage.

11.
PLoS One ; 18(8): e0287368, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37594936

RESUMO

PURPOSE: Digital methods to augment traditional contact tracing approaches were developed and deployed globally during the COVID-19 pandemic. These "Exposure Notification (EN)" systems present new opportunities to support public health interventions. To date, there have been attempts to model the impact of such systems, yet no reports have explored the value of real-time system data for predictive epidemiological modeling. METHODS: We investigated the potential to short-term forecast COVID-19 caseloads using data from California's implementation of the Google Apple Exposure Notification (GAEN) platform, branded as CA Notify. CA Notify is a digital public health intervention leveraging resident's smartphones for anonymous EN. We extended a published statistical model that uses prior case counts to investigate the possibility of predicting short-term future case counts and then added EN activity to test for improved forecast performance. Additional predictive value was assessed by comparing the pandemic forecasting models with and without EN activity to the actual reported caseloads from 1-7 days in the future. RESULTS: Observation of time series presents noticeable evidence for temporal association of system activity and caseloads. Incorporating earlier ENs in our model improved prediction of the caseload counts. Using Bayesian inference, we found nonzero influence of EN terms with probability one. Furthermore, we found a reduction in both the mean absolute percentage error and the mean squared prediction error, the latter of at least 5% and up to 32% when using ENs over the model without. CONCLUSIONS: This preliminary investigation suggests smartphone based ENs can significantly improve the accuracy of short-term forecasting. These predictive models can be readily deployed as local early warning systems to triage resources and interventions.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , COVID-19/epidemiologia , Teorema de Bayes , Notificação de Doenças , Pandemias
12.
J R Stat Soc Ser A Stat Soc ; 185(2): 566-587, 2022 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-35756390

RESUMO

Exponential-family Random Graph models (ERGM) are widely used in social network analysis when modelling data on the relations between actors. ERGMs are typically interpreted as a snapshot of a network at a given point in time or in a final state. The recently proposed Latent Order Logistic model (LOLOG) directly allows for a latent network formation process. We assess the real-world performance of these models when applied to typical networks modelled by researchers. Specifically, we model data from an ensemble of articles in the journal Social Networks with published ERGM fits, and compare the ERGM fit to a comparable LOLOG fit. We demonstrate that the LOLOG models are, in general, in qualitative agreement with the ERGM models, and provide at least as good a model fit. In addition they are typically faster and easier to fit to data, without the tendency for degeneracy that plagues ERGMs. Our results support the general use of LOLOG models in circumstances where ERGMs are considered.

13.
Artigo em Inglês | MEDLINE | ID: mdl-36276174

RESUMO

Problem: Quantifying mortality from coronavirus disease (COVID-19) is difficult, especially in countries with limited resources. Comparing mortality data between countries is also challenging, owing to differences in methods for reporting mortality. Context: Tracking all-cause mortality (ACM) and comparing it with expected ACM from pre-pandemic data can provide an estimate of the overall burden of mortality related to the COVID-19 pandemic and support public health decision-making. This study validated an ACM calculator to estimate excess mortality during the COVID-19 pandemic. Action: The ACM calculator was developed as a tool for computing expected ACM and excess mortality at national and subnational levels. It was developed using R statistical software, was based on a previously described model that used non-parametric negative binomial regression and was piloted in several countries. Goodness-of-fit was validated by forecasting 2019 mortality from 2015-2018 data. Outcome: Three key lessons were identified from piloting the tool: using the calculator to compare reported provisional ACM with expected ACM can avoid potential false conclusions from comparing with historical averages alone; using disaggregated data at the subnational level can detect excess mortality by avoiding dilution of total numbers at the national level; and interpretation of results should consider system-related performance indicators. Discussion: Timely tracking of ACM to estimate excess mortality is important for the response to COVID-19. The calculator can provide countries with a way to analyse and visualize ACM and excess mortality at national and subnational levels.


Assuntos
COVID-19 , Pandemias , Humanos , SARS-CoV-2
14.
Stat Methodol ; 8(4): 319-339, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21691424

RESUMO

Exponential-family random graph models (ERGMs) provide a principled way to model and simulate features common in human social networks, such as propensities for homophily and friend-of-a-friend triad closure. We show that, without adjustment, ERGMs preserve density as network size increases. Density invariance is often not appropriate for social networks. We suggest a simple modification based on an offset which instead preserves the mean degree and accommodates changes in network composition asymptotically. We demonstrate that this approach allows ERGMs to be applied to the important situation of egocentrically sampled data. We analyze data from the National Health and Social Life Survey (NHSLS).

15.
J Surv Stat Methodol ; 9(1): 94-120, 2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-33521154

RESUMO

Respondent-driven sampling (RDS) is commonly used to study hard-to-reach populations since traditional methods are unable to efficiently survey members due to the typically highly stigmatized nature of the population. The number of people in these populations is of primary global health and demographic interest and is usually hard to estimate. However, due to the nature of RDS, current methods of population size estimation are insufficient. We introduce a new method of estimating population size that uses concepts from capture-recapture methods while modeling RDS as a successive sampling process. We assess its statistical validity using information from the CDC's National HIV Behavioral Surveillance system in 2009 and 2012.

16.
Demogr Res ; 23(6): 117-152, 2010 Jul 20.
Artigo em Inglês | MEDLINE | ID: mdl-21113421

RESUMO

In this paper we explore patterns of economic transfers between adults within household and family networks in a village in Malawi's Rumphi district, using data from the 2006 round of the Malawi Longitudinal Study of Families and Health. We fit Exponential-family Random Graph Models (ERGMs) to assess individual, relational, and higher-order network effects. The network effects of cyclic giving, reciprocity, and in-degree and out-degree distribution suggest a network with a tendency away from the formation of hierarchies or "hubs." Effects of age, sex, working status, education, health status, and kinship relation are also considered.

17.
Soc Networks ; 31(1): 52-62, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-23170041

RESUMO

The statistical modeling of social network data is difficult due to the complex dependence structure of the tie variables. Statistical exponential families of distributions provide a flexible way to model such dependence. They enable the statistical characteristics of the network to be encapsulated within an exponential family random graph (ERG) model. For a long time, however, likelihood-based estimation was only feasible for ERG models assuming dyad independence. For more realistic and complex models inference has been based on the pseudo-likelihood. Recent advances in computational methods have made likelihood-based inference practical, and comparison of the different estimators possible.In this paper, we present methodology to enable estimators of ERG model parameters to be compared. We use this methodology to compare the bias, standard errors, coverage rates and efficiency of maximum likelihood and maximum pseudo-likelihood estimators. We also propose an improved pseudo-likelihood estimation method aimed at reducing bias. The comparison is performed using simulated social network data based on two versions of an empirically realistic network model, the first representing Lazega's law firm data and the second a modified version with increased transitivity. The framework considers estimation of both the natural and the mean-value parameters.The results clearly show the superiority of the likelihood-based estimators over those based on pseudo-likelihood, with the bias-reduced pseudo-likelihood out-performing the general pseudo-likelihood. The use of the mean value parameterization provides insight into the differences between the estimators and when these differences will matter in practice.

18.
Soc Networks ; 31(3): 204-213, 2009 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-20191087

RESUMO

Social network data often involve transitivity, homophily on observed attributes, clustering, and heterogeneity of actor degrees. We propose a latent cluster random effects model to represent all of these features, and we describe a Bayesian estimation method for it. The model is applicable to both binary and non-binary network data. We illustrate the model using two real datasets. We also apply it to two simulated network datasets with the same, highly skewed, degree distribution, but very different network behavior: one unstructured and the other with transitivity and clustering. Models based on degree distributions, such as scale-free, preferential attachment and power-law models, cannot distinguish between these very different situations, but our model does.

19.
Comput Math Organ Theory ; 15(4): 294-302, 2009 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-26612976

RESUMO

Networks are being increasingly used to represent relational data. As the patterns of relations tends to be complex, many probabilistic models have been proposed to capture the structural properties of the process that generated the networks. Two features of network phenomena not captured by the simplest models is the variation in the number of relations individual entities have and the clustering of their relations. In this paper we present a statistical model within the curved exponential family class that can represent both arbitrary degree distributions and an average clustering coefficient. We present two tunable parameterizations of the model and give their interpretation. We also present a Markov Chain Monte Carlo (MCMC) algorithm that can be used to generate networks from this model.

20.
J Stat Softw ; 242008 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-28804272

RESUMO

latentnet is a package to fit and evaluate statistical latent position and cluster models for networks. Hoff, Raftery, and Handcock (2002) suggested an approach to modeling networks based on positing the existence of an latent space of characteristics of the actors. Relationships form as a function of distances between these characteristics as well as functions of observed dyadic level covariates. In latentnet social distances are represented in a Euclidean space. It also includes a variant of the extension of the latent position model to allow for clustering of the positions developed in Handcock, Raftery, and Tantrum (2007). The package implements Bayesian inference for the models based on an Markov chain Monte Carlo algorithm. It can also compute maximum likelihood estimates for the latent position model and a two-stage maximum likelihood method for the latent position cluster model. For latent position cluster models, the package provides a Bayesian way of assessing how many groups there are, and thus whether or not there is any clustering (since if the preferred number of groups is 1, there is little evidence for clustering). It also estimates which cluster each actor belongs to. These estimates are probabilistic, and provide the probability of each actor belonging to each cluster. It computes four types of point estimates for the coefficients and positions: maximum likelihood estimate, posterior mean, posterior mode and the estimator which minimizes Kullback-Leibler divergence from the posterior. You can assess the goodness-of-fit of the model via posterior predictive checks. It has a function to simulate networks from a latent position or latent position cluster model.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA