ABSTRACT
Female sex workers (FSW) are affected by individual, network, and structural risks, making them vulnerable to poor health and well-being. HIV prevention strategies and local community-based programs can rely on estimates of the number of FSW to plan and implement differentiated HIV prevention and treatment services. However, there are limited systematic assessments of the number of FSW in countries across sub-Saharan Africa to facilitate the identification of prevention and treatment gaps. Here we provide estimated population sizes of FSW and the corresponding uncertainties for almost all sub-national areas in sub-Saharan Africa. We first performed a literature review of FSW size estimates and then developed a Bayesian hierarchical model to synthesize these size estimates, resolving competing size estimates in the same area and producing estimates in areas without any data. We estimated that there are 2.5 million (95% uncertainty interval 1.9 to 3.1) FSW aged 15 to 49 in sub-Saharan Africa. This represents a proportion as percent of all women of childbearing age of 1.1% (95% uncertainty interval 0.8 to 1.3%). The analyses further revealed substantial differences between the proportions of FSW among adult females at the sub-national level and studied the relationship between these heterogeneities and many predictors. Ultimately, achieving the vision of no new HIV infections by 2030 necessitates dramatic improvements in our delivery of evidence-based services for sex workers across sub-Saharan Africa.
Subject(s)
Acquired Immunodeficiency Syndrome , HIV Infections , Sex Workers , Adult , Humans , Female , HIV Infections/epidemiology , HIV Infections/prevention & control , Bayes Theorem , Africa South of the Sahara/epidemiologyABSTRACT
Dynamic models have been successfully used in producing estimates of HIV epidemics at the national level due to their epidemiological nature and their ability to estimate prevalence, incidence, and mortality rates simultaneously. Recently, HIV interventions and policies have required more information at sub-national levels to support local planning, decision-making and resource allocation. Unfortunately, many areas lack sufficient data for deriving stable and reliable results, and this is a critical technical barrier to more stratified estimates. One solution is to borrow information from other areas within the same country. However, directly assuming hierarchical structures within the HIV dynamic models is complicated and computationally time-consuming. In this article, we propose a simple and innovative way to incorporate hierarchical information into the dynamical systems by using auxiliary data. The proposed method efficiently uses information from multiple areas within each country without increasing the computational burden. As a result, the new model improves predictive ability and uncertainty assessment.
Subject(s)
Epidemics , HIV Infections , Models, Statistical , Humans , HIV Infections/epidemiology , Epidemics/statistics & numerical data , PrevalenceABSTRACT
Often of primary interest in the analysis of multivariate data are the copula parameters describing the dependence among the variables, rather than the univariate marginal distributions. Since the ranks of a multivariate dataset are invariant to changes in the univariate marginal distributions, rank-based estimators are natural candidates for semiparametric copula estimation. Asymptotic information bounds for such estimators can be obtained from an asymptotic analysis of the rank likelihood, i.e. the probability of the multivariate ranks. In this article, we obtain limiting normal distributions of the rank likelihood for Gaussian copula models. Our results cover models with structured correlation matrices, such as exchangeable or circular correlation models, as well as unstructured correlation matrices. For all Gaussian copula models, the limiting distribution of the rank likelihood ratio is shown to be equal to that of a parametric likelihood ratio for an appropriately chosen multivariate normal model. This implies that the semiparametric information bounds for rank-based estimators are the same as the information bounds for estimators based on the full data, and that the multivariate normal distributions are least favorable.
ABSTRACT
We consider a class of network models, in which the connection probability depends on ultrahigh-dimensional nodal covariates (homophily) and node-specific popularity (degree heterogeneity). A Bayesian method is proposed to select nodal features in both dense and sparse networks under a mild assumption on popularity parameters. The proposed approach is implemented via Gibbs sampling. To alleviate the computational burden for large sparse networks, we further develop a working model in which parameters are updated based on a dense sub-graph at each step. Model selection consistency is established for both models, in the sense that the probability of the true model being selected converges to one asymptotically, even when the dimension grows with the network size at an exponential rate. The performance of the proposed models and estimation procedures are illustrated through Monte Carlo studies and three real world examples.
ABSTRACT
OBJECTIVES: Given the plausible mechanisms and the lacking of empirical evidence, the study aims to investigate how gestational sleep behaviors and the development of sleep disorders, such as restless legs syndrome, influence ultrasonographic measures of fetal growth. METHODS: The study included 2457 pregnant women from the NICHD Fetal Growth Studies - Singletons (2009-2013), who were recruited between 8-13 gestational weeks and followed up to five times during pregnancy. Women were categorized into six groups based on their total sleep hours and napping frequency. The trajectory of estimated fetal weight from 10-40weeks was derived from three ultrasonographic measures. Linear mixed effect models were applied to model the estimated fetal weight in relation to self-reported sleep-napping behaviors and restless legs syndrome status, adjusting for age, race and ethnicity, education, parity, prepregnancy body mass index category, infant sex, and prepregnancy sleep-napping behavior. RESULTS: From enrollment to near delivery, pregnant women's total sleep duration and nap frequency declined and restless legs syndrome symptoms frequency increased generally. No significant differences in estimated fetal weight were observed by sleep-napping group or by restless legs syndrome status. Results remained similar in sensitivity analyses and stratified analyses by women's prepregnancy body mass index category (normal vs. overweight/obese) or by infant sex. CONCLUSIONS: Our data indicate that there is no association between sleep during pregnancy-assessed as total sleep duration and napping frequency, nor restless legs syndrome symptoms-and fetal growth from weeks 10 to 40 in healthy pregnant women.
Subject(s)
Fetal Development , Restless Legs Syndrome , Sleep , Ultrasonography, Prenatal , Humans , Female , Pregnancy , Adult , Sleep/physiology , Time Factors , Fetal Development/physiology , Pregnancy Complications , Young Adult , Sleep DurationABSTRACT
The increase of ocean noise documented in the North Pacific has sparked concern on whether the observed increases are a global or regional phenomenon. This work provides evidence of low frequency sound increases in the Indian Ocean. A decade (2002-2012) of recordings made off the island of Diego Garcia, UK in the Indian Ocean was parsed into time series according to frequency band and sound level. Quarterly sound level comparisons between the first and last years were also performed. The combination of time series and temporal comparison analyses over multiple measurement parameters produced results beyond those obtainable from a single parameter analysis. The ocean sound floor has increased over the past decade in the Indian Ocean. Increases were most prominent in recordings made south of Diego Garcia in the 85-105 Hz band. The highest sound level trends differed between the two sides of the island; the highest sound levels decreased in the north and increased in the south. Rate, direction, and magnitude of changes among the multiple parameters supported interpretation of source functions driving the trends. The observed sound floor increases are consistent with concurrent increases in shipping, wind speed, wave height, and blue whale abundance in the Indian Ocean.
Subject(s)
Acoustics , Environmental Monitoring/methods , Noise , Water , Animals , Balaenoptera/physiology , Geologic Sediments , Indian Ocean , Linear Models , Noise, Transportation , Population Density , Ships , Signal Processing, Computer-Assisted , Sound Spectrography , Time Factors , Vocalization, Animal , Water Movements , WindABSTRACT
Aggregated relational data (ARD), formed from "How many X's do you know?" questions, is a powerful tool for learning important network characteristics with incomplete network data. Compared to traditional survey methods, ARD is attractive as it does not require a sample from the target population and does not ask respondents to self-reveal their own status. This is helpful for studying hard-to-reach populations like female sex workers who may be hesitant to reveal their status. From December 2008 to February 2009, the Kiev International Institute of Sociology (KIIS) collected ARD from 10,866 respondents to estimate the size of HIV-related groups in Ukraine. To analyze this data, we propose a new ARD model which incorporates respondent and group covariates in a regression framework and includes a bias term that is correlated between groups. We also introduce a new scaling procedure utilizing the correlation structure to further reduce biases. The resulting size estimates of those most-at-risk of HIV infection can improve the HIV response efficiency in Ukraine. Additionally, the proposed model allows us to better understand two network features without the full network data: 1. What characteristics affect who respondents know, and 2. How is knowing someone from one group related to knowing people from other groups. These features can allow researchers to better recruit marginalized individuals into the prevention and treatment programs. Our proposed model and several existing NSUM models are implemented in the networkscaleup R package.
ABSTRACT
Motivated by a study of United Nations voting behaviors, we introduce a regression model for a series of networks that are correlated over time. Our model is a dynamic extension of the additive and multiplicative effects network model (AMEN) of Hoff (2021). In addition to incorporating a temporal structure, the model accommodates two types of missing data thus allows the size of the network to vary over time. We demonstrate via simulations the necessity of various components of the model. We apply the model to the United Nations General Assembly voting data from 1983 to 2014 (Voeten, 2013) to answer interesting research questions regarding international voting behaviors. In addition to finding important factors that could explain the voting behaviors, the model-estimated additive effects, multiplicative effects, and their movements reveal meaningful foreign policy positions and alliances of various countries.
ABSTRACT
PURPOSE: Early warning in the travel origins is crucial to prevent disease spreading. When travel origins have delays in reporting disease outbreaks, the exported cases could be used to estimate the epidemic. METHODS: We developed a Bayesian model to jointly estimate the epidemic prevalence and detection delay using the exported cases and their arrival and detection dates. We used simulation studies to discuss potential biases generated by the exported cases. We proposed a hypothesis testing framework to determine the epidemic severity. RESULTS: We applied the method to the early phase of the COVID-19 epidemic of Wuhan, United States, Italy, and Iran and found that the indicators estimated from the exported cases were consistent with the domestic data under certain scenarios. The exported cases could generate various biases if not modeled properly. We presented the required number of exported cases for determining different severity levels of the outbreak. CONCLUSIONS: The exported case data is a good addition to the domestic data but also has its drawbacks. Utilizing the diagnosis resources from all countries, we advocate that countries work collaboratively to strengthen the global infectious disease surveillance system.
Subject(s)
COVID-19 , Communicable Diseases , Epidemics , Humans , COVID-19/epidemiology , Bayes Theorem , Disease Outbreaks , Communicable Diseases/epidemiology , China/epidemiologyABSTRACT
To combat the HIV/AIDS pandemic effectively, targeted interventions among certain key populations play a critical role. Examples of such key populations include sex workers, people who inject drugs, and men who have sex with men. While having accurate estimates for the size of these key populations is important, any attempt to directly contact or count members of these populations is difficult. As a result, indirect methods are used to produce size estimates. Multiple approaches for estimating the size of such populations have been suggested but often give conflicting results. It is, therefore, necessary to have a principled way to combine and reconcile these estimates. To this end, we present a Bayesian hierarchical model for estimating the size of key populations that combines multiple estimates from different sources of information. The proposed model makes use of multiple years of data and explicitly models the systematic error in the data sources used. We use the model to estimate the size of people who inject drugs in Ukraine. We evaluate the appropriateness of the model and compare the contribution of each data source to the final estimates.
ABSTRACT
The stochastic block model (SBM) and degree-corrected block model (DCBM) are network models often selected as the fundamental setting in which to analyze the theoretical properties of community detection methods. We consider the problem of spectral clustering of SBM and DCBM networks under a local form of edge differential privacy. Using a randomized response privacy mechanism called the edge-flip mechanism, we develop theoretical guarantees for differentially private community detection, demonstrating conditions under which this strong privacy guarantee can be upheld while achieving spectral clustering convergence rates that match the known rates without privacy. We prove the strongest theoretical results are achievable for dense networks (those with node degree linear in the number of nodes), while weak consistency is achievable under mild sparsity (node degree greater than n). We empirically demonstrate our results on a number of network examples.
ABSTRACT
Certain subpopulations like female sex workers (FSW), men who have sex with men (MSM), and people who inject drugs (PWID) often have higher prevalence of HIV/AIDS and are difficult to map directly due to stigma, discrimination, and criminalization. Fine-scale mapping of those populations contributes to the progress towards reducing the inequalities and ending the AIDS epidemic. In 2016 and 2017, the PLACE surveys were conducted at 3,290 venues in 20 out of the total 28 districts in Malawi to estimate the FSW sizes. These venues represent a presence-only data set where, instead of knowing both where people live and do not live (presence-absence data), only information about visited locations is available. In this study, we develop a Bayesian model for presence-only data and utilize the PLACE data to estimate the FSW size and uncertainty interval at a 1.5 × 1.5-km resolution for all of Malawi. The estimates can also be aggregated to any desirable level (city/district/region) for implementing targeted HIV prevention and treatment programs in FSW communities, which have been successful in lowering the incidence of HIV and other sexually transmitted infections.
ABSTRACT
The objective of this study was to determine if there is evidence for a causative link between sex under the influence of drugs or alcohol and risky sex for men in substance abuse treatment. Men in treatment participating in a multisite HIV prevention protocol who reported on baseline, 3, or 6 months computerized assessments the details of their most recent sexual events, and who reported having sexual events under the influence and not under the influence, and who reported most recent events that did and did not include condom use served as participants (n = 37). Safe sex was not significantly more likely to happen when participants were under the influence of drugs or alcohol during their most recent sexual event (48.3%) than when they were not under the influence (49%, p = .82). In this high-risk in treatment sample, a causative link between sex under the influence of drugs or alcohol and sexual risk behavior was not supported.
Subject(s)
Risk-Taking , Sexual Behavior/drug effects , Substance-Related Disorders/psychology , Unsafe Sex/drug effects , Adult , Female , Humans , Male , Substance Abuse Treatment Centers/statistics & numerical dataABSTRACT
Estimating the size of hard-to-reach populations is an important problem for many fields. The Network Scale-up Method (NSUM) is a relatively new approach to estimate the size of these hard-to-reach populations by asking respondents the question, "How many X's do you know," where X is the population of interest (e.g. "How many female sex workers do you know?"). The answers to these questions form Aggregated Relational Data (ARD). The NSUM has been used to estimate the size of a variety of subpopulations, including female sex workers, drug users, and even children who have been hospitalized for choking. Within the Network Scale-up methodology, there are a multitude of estimators for the size of the hidden population, including direct estimators, maximum likelihood estimators, and Bayesian estimators. In this article, we first provide an in-depth analysis of ARD properties and the techniques to collect the data. Then, we comprehensively review different estimation methods in terms of the assumptions behind each model, the relationships between the estimators, and the practical considerations of implementing the methods. We apply many of the models discussed in the review to one canonical data set and compare their performance and unique features, presented in the supplementary materials. Finally, we provide a summary of the dominant methods and an extensive list of the applications, and discuss the open problems and potential research directions in this area.
ABSTRACT
OBJECTIVE: Artifacts limit the application of proton resonance frequency (PRF) thermometry for on-site, individualized heating evaluations of implantable medical devices such as deep brain stimulation (DBS) for use in magnetic resonance imaging (MRI). Its properties are unclear and the research on how to choose an unaffected measurement region is insufficient. METHODS: The properties of PRF signals around the metallic DBS electrode were investigated through simulations and phantom experiments considering electromagnetic interferences from material susceptibility and the radio frequency (RF) interactions. A threshold method on phase difference ΔÏ was used to define a measurement area to estimate heating at the electrode surface. Its performance was compared to that of the Bayesian magnitude method and probe measurements. RESULTS: The B0 magnetic field inhomogeneity due to the electrode susceptibility was the main influencing factor on PRF compared to the RF artifact. ΔÏ around the electrode followed normal distribution but was distorted. Underestimation occurred at places with high temperature rises. The noise was increased and could be well estimated from magnitude images using a modified NEMA method. The ΔÏ-threshold method based on this knowledge outperformed the Bayesian magnitude method by more than 42% in estimation error of the electrode heating. CONCLUSION: The findings favor the use of PRF with the proposed approach as a reliable method for electrode heating estimation. SIGNIFICANCE: This study clarified the influence of device artifacts and could improve the performance of PRF thermometry for individualized heating assessments of patients with implants under MRI.
Subject(s)
Artifacts , Thermometry , Bayes Theorem , Heating , Humans , Magnetic Resonance Imaging , Phantoms, Imaging , Prostheses and Implants , ProtonsABSTRACT
PURPOSE: Human immunodeficiency virus (HIV) risks are heterogeneous in nature even in generalized epidemics. However, data are often missing for those at highest risk of HIV, including female sex workers. Statistical models may be used to address data gaps where direct, empiric estimates do not exist. METHODS: We proposed a new size estimation method that combines multiple data sources (the Malawi Biological and Behavioral Surveillance Survey, the Priorities for Local AIDS Control Efforts study, and the Malawi Demographic Household Survey). We used factor analysis to extract information from auxiliary variables and constructed a linear mixed effects model for predicting population size for all districts of Malawi. RESULTS: On average, the predicted proportion of female sex workers among women of reproductive age across all districts was about 0.58%. The estimated proportions seemed reasonable in comparing with a recent study Priorities for Local AIDS Control Efforts II (PLACE II). Compared with using a single data source, we observed increased precision and better geographic coverage. CONCLUSIONS: We illustrate how size estimates from different data sources may be combined for prediction. Applying this approach to other subpopulations in Malawi and to countries where size estimate data are lacking can ultimately inform national modeling processes and estimate the distribution of risks and priorities for HIV prevention and treatment programs.
Subject(s)
Sex Workers , Factor Analysis, Statistical , Female , HIV Infections/epidemiology , HIV Infections/prevention & control , Humans , Malawi/epidemiology , Sex Workers/statistics & numerical dataABSTRACT
Mitigating soil nitrous oxide (N2O) emissions is essential for staying below a 2 °C warming threshold. However, accurate assessments of mitigation potential are limited by uncertainty and variability in direct emission factors (EFs). To assess where and why EFs differ, we created high-resolution maps of crop-specific EFs based on 1,507 georeferenced field observations. Here, using a data-driven approach, we show that EFs vary by two orders of magnitude over space. At global and regional scales, such variation is primarily driven by climatic and edaphic factors rather than the well-recognized management practices. Combining spatially explicit EFs with N surplus information, we conclude that global mitigation potential without compromising crop production is 30% (95% confidence interval, 17-53%) of direct soil emissions of N2O, equivalent to the entire direct soil emissions of China and the United States combined. Two-thirds (65%) of the mitigation potential could be achieved on one-fifth of the global harvested area, mainly located in humid subtropical climates and across gleysols and acrisols. These findings highlight the value of a targeted policy approach on global hotspots that could deliver large N2O mitigation as well as environmental and food co-benefits.
ABSTRACT
When using multiple data sources in an analysis, it is important to understand the influence of each data source on the analysis and the consistency of the data sources with each other and the model. We suggest the use of a retrospective value of information framework in order to address such concerns. Value of information methods can be computationally difficult. We illustrate the use of computational methods that allow these methods to be applied even in relatively complicated settings. In illustrating the proposed methods, we focus on an application in estimating the size of hard to reach populations. Specifically, we consider estimating the number of injection drug users in Ukraine by combining all available data sources spanning over half a decade and numerous sub-national areas in the Ukraine. This application is of interest to public health researchers as this hard to reach population that plays a large role in the spread of HIV. We apply a Bayesian hierarchical model and evaluate the contribution of each data source in terms of absolute influence, expected influence, and level of surprise. Finally we apply value of information methods to inform suggestions on future data collection.
ABSTRACT
BACKGROUND: Travel is a potent force in the emergence of disease. We discussed how the traveler case reports could aid in a timely detection of a disease outbreak. METHODS: Using the traveler data, we estimated a few indicators of the epidemic that affected decision making and policy, including the exponential growth rate, the doubling time, and the probability of severe cases exceeding the hospital capacity, in the initial phase of the COVID-19 epidemic in multiple countries. We imputed the arrival dates when they were missing. We compared the estimates from the traveler data to the ones from domestic data. We quantitatively evaluated the influence of each case report and knowing the arrival date on the estimation. FINDINGS: We estimated the travel origin's daily exponential growth rate and examined the date from which the growth rate was consistently above 0.1 (equivalent to doubling time < 7 days). We found those dates were very close to the dates that critical decisions were made such as city lock-downs and national emergency announcement. Using only the traveler data, if the assumed epidemic start date was relatively accurate and the traveler sample was representative of the general population, the growth rate estimated from the traveler data was consistent with the domestic data. We also discussed situations that the traveler data could lead to biased estimates. From the data influence study, we found more recent travel cases had a larger influence on each day's estimate, and the influence of each case report got smaller as more cases became available. We provided the minimum number of exported cases needed to determine whether the local epidemic growth rate was above a certain level, and developed a user-friendly Shiny App to accommodate various scenarios.
ABSTRACT
Trabecular bone structure has been used to investigate the relationship between skeletal form and locomotor behavior on the premise that trabecular bone remodels in response to loading during an animal's lifetime. The aim of this study is to characterize human distal femoral trabecular bone structure in comparison to three non-human primate taxa and relate the patterns of trabecular structural variation in the distal femur to knee posture during habitual locomotor behavior. A whole-epiphysis approach was applied using microCT scans of the distal femora of extant Homo sapiens, Pan troglodytes, Pongo pygmaeus, and Papio spp. (N = 48). Bone volume fraction (BV/TV) was quantified in the epiphysis and analyzed with both whole-condyle and a novel sector analysis. The results indicate high trabecular bone structural variation within and between species. The sector analysis reveals the most distinctive patterns in the stereotypically loaded human knee, with a pattern of high BV/TV distally. In general, Pan, Pongo, and Papio show evidence of flexed knee postures, typical of their locomotor behaviors, with regions of high BV/TV posteriorly within the condyles. The pairwise comparisons confirm the unique pattern in Homo and reveal a shared high BV/TV region in the patellar groove of both Homo and Papio. The distinct pattern found in Homo relative to the other primate taxa suggests a plastic response to unique loading patterns during bipedal locomotion. Results may facilitate resolving the antiquity of habitual bipedality in the hominin fossil record. This analysis also presents new approaches for statistical analysis of whole-epiphysis trabecular bone structure. Anat Rec, 2018. © 2018 American Association for Anatomy.