Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
JAMA ; 331(20): 1765-1767, 2024 05 28.
Artigo em Inglês | MEDLINE | ID: mdl-38691367

RESUMO

This study compares the race and ethnicity of reproductive-age females between states that implemented restrictive abortion policies after the Dobbs v Jackson Women's Health Organization decision and states that did not.


Assuntos
Etnicidade , Feminino , Humanos , Estados Unidos , Gravidez , Adulto , Aborto Induzido/legislação & jurisprudência , Aborto Induzido/estatística & dados numéricos , Grupos Raciais , Adolescente , Adulto Jovem , Aborto Legal/legislação & jurisprudência , Governo Estadual
2.
Sci Rep ; 14(1): 9180, 2024 04 22.
Artigo em Inglês | MEDLINE | ID: mdl-38649687

RESUMO

Individual-level assessment of health and well-being permits analysis of community well-being and health risk evaluations across several dimensions of health. It also enables comparison and rankings of reported health and well-being for large geographical areas such as states, metropolitan areas, and counties. However, there is large variation in reported well-being within such large spatial units underscoring the importance of analyzing well-being at more granular levels, such as ZIP codes. In this paper, we address this problem by modeling well-being data to generate ZIP code tabulation area (ZCTA)-level rankings through spatially informed statistical modeling. We build regression models for individual-level overall well-being index and scores from five subscales (Physical, Financial, Social, Community, Purpose) using individual-level demographic characteristics as predictors while including a ZCTA-level spatial effect. The ZCTA neighborhood information is incorporated by using a graph Laplacian matrix; this enables estimation of the effect of a ZCTA on well-being using individual-level data from that ZCTA as well as by borrowing information from neighboring ZCTAs. We deploy our model on well-being data for the U.S. states of Massachusetts and Georgia. We find that our model can capture the effects of demographic features while also offering spatial effect estimates for all ZCTAs, including ones with no observations, under certain conditions. These spatial effect estimates provide community health and well-being rankings of ZCTAs, and our method can be deployed more generally to model other outcomes that are spatially dependent as well as data from other states or groups of states.


Assuntos
Características de Residência , Humanos , Masculino , Feminino , Características da Vizinhança , Adulto , Pessoa de Meia-Idade , Nível de Saúde , Modelos Estatísticos , Idoso
3.
J Subst Use Addict Treat ; 157: 209186, 2024 02.
Artigo em Inglês | MEDLINE | ID: mdl-37866438

RESUMO

INTRODUCTION: Social determinants of health (SDoH), such as socioeconomic status, education level, and food insecurity, are believed to influence the opioid crisis. While global SDoH indices such as the CDC's Social Vulnerability Index (SVI) and Area Deprivation Index (ADI) combine the explanatory power of multiple social factors for understanding health outcomes, they may be less applicable to the specific challenges of opioid misuse and associated outcomes. This study develops a novel index tailored to opioid misuse outcomes, tests the efficacy of this index in predicting drug overdose deaths across contexts, and compares the explanatory power of this index to other SDoH indices. METHODS: Focusing on four HEALing Communities Study (HCS) states (Kentucky, Massachusetts, New York and Ohio; encompassing 4269 ZIP codes), we identified multilevel SDoH potentially associated with opioid misuse and aggregated publicly available data for each measure. We then leveraged a random forest model to develop a composite measure that predicts age-adjusted drug overdose mortality rates based on SDoH. We used this composite measure to understand HCS and non-HCS communities in terms of overdose risk across areas of varying racial composition. Finally, we compared variance in drug overdose deaths explained by this index to variance explained by the SVI and ADI. RESULTS: Our composite measure included 28 SDoH measures and explained approximately 89 % percent of variance in age-adjusted drug overdose mortality across HCS states. Health care measures, including emergency department visits and primary care provider availability, were top predictors within the index. Index accuracy was robust within and outside of HCS communities and states. This measure identified high levels of overdose mortality risk in segregated communities. CONCLUSIONS: Existing SDoH indices fail to explain much variation in area-level overdose mortality rates. Having tailored composite indices can help us to identify places in which residents are at highest risk based on their composite contexts. A comprehensive index can also help to develop effective community interventions for programs such as HCS by considering the context in which people live.


Assuntos
Overdose de Drogas , Transtornos Relacionados ao Uso de Opioides , Humanos , Determinantes Sociais da Saúde , Fatores Sociais , Massachusetts/epidemiologia
4.
Addict Behav ; 148: 107868, 2024 01.
Artigo em Inglês | MEDLINE | ID: mdl-37774527

RESUMO

Prepulse inhibition (PPI) is a measure of sensorimotor filtering thought to shield the processing of initial weaker auditory stimuli from interruption by a later startle response. Previous studies have shown smoking withdrawal to have a negative impact on sensorimotor filtering, particularly in individuals with psychopathology. Because tobacco use may alleviate sensory and sensorimotor filtering deficits, we examined whether smoking withdrawal-induced changes in PPI were associated with maintenance of smoking abstinence in trauma-exposed individuals with and without PTSD who were attempting to quit smoking. Thirty-eight individuals (n = 24 with current or past PTSD; 14 trauma-exposed healthy controls) made an acute biochemically-verified smoking cessation attempt supported by 8 days of contingency management (CM) and cognitive behavioral therapy (CBT) for smoking. Participants completed a PPI task at the pre-quit baseline, 2 days post-quit, and 5 days post-quit. Post-quit changes in PPI were compared between those who remained abstinent for the first 8-days of the quit attempt and those who lapsed back to smoking. PPI changes induced by biochemically-verified smoking abstinence were associated with maintenance of abstinence across the 8-day CM/CBT-supported quit attempt. As compared to those who maintained tobacco abstinence, participants who lapsed to smoking had significantly lower PPI at 2 and 5 days post-quit relative to baseline. Thus, among trauma-exposed individuals, decreases in PPI during acute smoking cessation supported by CM/CBT are associated with lapse back to smoking. Interventions that improve PPI during early smoking abstinence may facilitate smoking cessation among such individuals who are at high risk for chronic, refractory tobacco use.


Assuntos
Abandono do Hábito de Fumar , Tabagismo , Humanos , Fumar/terapia , Fumar/psicologia , Fumar Tabaco , Abandono do Hábito de Fumar/psicologia , Tabagismo/psicologia , Produtos do Tabaco
5.
PLoS One ; 18(9): e0291118, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37682911

RESUMO

This study measures associations between COVID-19 deaths and sociodemographic factors (wealth, insurance coverage, urban residence, age, state population) for states in Nigeria across two waves of the COVID-19 pandemic: February 27th 2020 to October 24th 2020 and October 25th 2020 to July 25th 2021. Data sources include 2018 Nigeria Demographic and Health Survey and Nigeria Centre for Disease Control (NCDC) COVID-19 daily reports. It uses negative binomial models to model deaths, and stratifies results by respondent gender. It finds that overall mortality rates were concentrated within three states: Lagos, Edo and Federal Capital Territory (FCT) Abuja. Urban residence and insurance coverage are positively associated with differences in deaths for the full sample. The former, however, is significant only during the early stages of the pandemic. Associative differences in gender-stratified models suggest that wealth was a stronger protective factor for men and insurance a stronger protective factor for women. Associative strength between sociodemographic measures and deaths varies by gender and pandemic wave, suggesting that the pandemic impacted men and women in unique ways, and that the effectiveness of interventions should be evaluated for specific waves or periods.


Assuntos
COVID-19 , Cobertura do Seguro , Fatores Sociodemográficos , População Urbana , COVID-19/mortalidade , Humanos , Nigéria/epidemiologia , Fatores Etários , Masculino , Feminino
6.
PLOS Glob Public Health ; 3(7): e0000878, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37490461

RESUMO

Female genital mutilation/cutting (FGM/C) describes several procedures that involve injury to the vulva or vagina for nontherapeutic reasons. Though at least 200 million women and girls living in 30 countries have undergone FGM/C, there is a paucity of studies focused on public perception of FGM/C. We used machine learning methods to characterize discussion of FGM/C on Twitter in English from 2015 to 2020. Twitter has emerged in recent years as a source for seeking and sharing health information and misinformation. We extracted text metadata from user profiles to characterize the individuals and locations involved in conversations about FGM/C. We extracted major discussion themes from posts using correlated topic modeling. Finally, we extracted features from posts and applied random forest models to predict user engagement. The volume of tweets addressing FGM/C remained fairly stable across years. Conversation was mostly concentrated among the United States and United Kingdom through 2017, but shifted to Nigeria and Kenya in 2020. Some of the discussion topics associated with FGM/C across years included Islam, International Day of Zero Tolerance, current news stories, education, activism, male circumcision, human rights, and feminism. Tweet length and follower count were consistently strong predictors of engagement. Our findings suggest that (1) discussion about FGM/C has not evolved significantly over time, (2) the majority of the conversation about FGM/C on English-speaking Twitter is advocating for an end to the practice, (3) supporters of Donald Trump make up a substantial voice in the conversation about FGM/C, and (4) understanding the nuances in how people across cultures refer to and discuss FGM/C could be important for the design of public health communication and intervention.

7.
J Expo Sci Environ Epidemiol ; 33(2): 237-243, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-35145207

RESUMO

BACKGROUND/OBJECTIVE: Lack of access to resources such as medical facilities and grocery stores is related to poor health outcomes and inequities, particularly in an environmental justice framework. There can be substantial differences in quantifying "access" to such resources, depending on the geospatial method used to generate distance estimates. METHODS: We compared three methods for calculating distance to the nearest grocery store to illustrate differential access at the census block-group level in the Atlanta metropolitan area, including: Euclidean distance estimation, service areas incorporating roadways and other factors, and cost distance for every point on the map. RESULTS: We found notable differences in access across the three estimation techniques, implying a high potential for exposure misclassification by estimation method. There was a lack of nuanced exposure in the highest- and lowest-access areas using the Euclidean distance method. We found an Intraclass Correlation Coefficient (ICC) of 0.69 (0.65, 0.73), indicating moderate agreement between estimation methods. SIGNIFICANCE: As compared with Euclidean distance, service areas and cost distance may represent a more meaningful characterization of "access" to resources. Each method has tradeoffs in computational resources required versus potential improvement in exposure classification. Careful consideration of the method used for determining "access" will reduce subsequent misclassifications.


Assuntos
Disparidades nos Níveis de Saúde , Características da Vizinhança , Determinantes Sociais da Saúde , Humanos , Censos , Georgia , Geografia Médica
8.
Patterns (N Y) ; 3(8): 100547, 2022 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-35721836

RESUMO

In this study, we measured the association between county characteristics and changes in healthy-food, fast-food, and alcohol tweets during the COVID-19 pandemic in the United States. Our analytic dataset consisted of 1,282,316 geotagged tweets that referenced food consumption posted before (63.2%) and during (36.8%) the pandemic and included all US states. We found the share of healthy-food tweets increased by 20.5% during the pandemic compared with pre-pandemic, while fast-food and alcohol tweets decreased by 9.4% and 11.4%, respectively. We also observed that time spent at home and more grocery stores per capita were associated with increased odds of healthy-food tweets and decreased odds of fast-food tweets. More liquor stores per capita was associated with increased odds of alcohol tweets. Our results highlight the potential impact of the pandemic on nutrition and alcohol consumption and the association between the built environment and health behaviors.

10.
BMC Public Health ; 22(1): 747, 2022 04 14.
Artigo em Inglês | MEDLINE | ID: mdl-35421958

RESUMO

BACKGROUND: There is a need to evaluate how the choice of time interval contributes to the lack of consistency of SDoH variables that appear as important to COVID-19 disease burden within an analysis for both case counts and death counts. METHODS: This study identified SDoH variables associated with U.S county-level COVID-19 cumulative case and death incidence for six different periods: the first 30, 60, 90, 120, 150, and 180 days since each county had COVID-19 one case per 10,000 residents. The set of SDoH variables were in the following domains: resource deprivation, access to care/health resources, population characteristics, traveling behavior, vulnerable populations, and health status. A generalized variance inflation factor (GVIF) analysis was used to identify variables with high multicollinearity. For each dependent variable, a separate model was built for each of the time periods. We used a mixed-effect generalized linear modeling of counts normalized per 100,000 population using negative binomial regression. We performed a Kolmogorov-Smirnov goodness of fit test, an outlier test, and a dispersion test for each model. Sensitivity analysis included altering the county start date to the day each county reached 10 COVID-19 cases per 10,000. RESULTS: Ninety-seven percent (3059/3140) of the counties were represented in the final analysis. Six features proved important for both the main and sensitivity analysis: adults-with-college-degree, days-sheltering-in-place-at-start, prior-seven-day-median-time-home, percent-black, percent-foreign-born, over-65-years-of-age, black-white-segregation, and days-since-pandemic-start. These variables belonged to the following categories: COVID-19 related, vulnerable populations, and population characteristics. Our diagnostic results show that across our outcomes, the models of the shorter time periods (30 days, 60 days, and 900 days) have a better fit. CONCLUSION: Our findings demonstrate that the set of SDoH features that are significant for COVID-19 outcomes varies based on the time from the start date of the pandemic and when COVID-19 was present in a county. These results could assist researchers with variable selection and inform decision makers when creating public health policy.


Assuntos
COVID-19 , Segregação Social , Adulto , COVID-19/epidemiologia , Humanos , Políticas , SARS-CoV-2 , Determinantes Sociais da Saúde , Estados Unidos/epidemiologia
11.
BMC Res Notes ; 15(1): 64, 2022 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-35177096

RESUMO

OBJECTIVE: Electronic health records (EHR) hold promise for conducting large-scale analyses linking individual characteristics to health outcomes. However, these data often contain a large number of missing values at both the patient and visit level due to variation in data collection across facilities, providers, and clinical need. This study proposes a stepwise framework for imputing missing values within a visit-level EHR dataset that combines informative missingness and conditional imputation in a scalable manner that may be parallelized for efficiency. RESULTS: For this study we use a subset of data from AMPATH representing information from 530,812 clinic visits from 16,316 Human Immunodeficiency Virus (HIV) positive women across Western Kenya who have given birth. We apply this process to a set of 84 clinical, social and economic variables and are able to impute values for 84.6% of variables with missing data with an average reduction in missing data of approximately 35.6%. We validate the use of this imputed dataset by predicting National Hospital Insurance Fund (NHIF) enrollment with 94.8% accuracy.


Assuntos
Registros Eletrônicos de Saúde , Coleta de Dados , Feminino , Humanos , Quênia
12.
J Med Internet Res ; 22(12): e24425, 2020 12 15.
Artigo em Inglês | MEDLINE | ID: mdl-33264102

RESUMO

BACKGROUND: The epidemic of misinformation about COVID-19 transmission, prevention, and treatment has been going on since the start of the pandemic. However, data on the exposure and impact of misinformation is not readily available. OBJECTIVE: We aim to characterize and compare the start, peak, and doubling time of COVID-19 misinformation topics across 8 countries using an exponential growth model usually employed to study infectious disease epidemics. METHODS: COVID-19 misinformation topics were selected from the World Health Organization Mythbusters website. Data representing exposure was obtained from the Google Trends application programming interface for 8 English-speaking countries. Exponential growth models were used in modeling trends for each country. RESULTS: Searches for "coronavirus AND 5G" started at different times but peaked in the same week for 6 countries. Searches for 5G also had the shortest doubling time across all misinformation topics, with the shortest time in Nigeria and South Africa (approximately 4-5 days). Searches for "coronavirus AND ginger" started at the same time (the week of January 19, 2020) for several countries, but peaks were incongruent, and searches did not always grow exponentially after the initial week. Searches for "coronavirus AND sun" had different start times across countries but peaked at the same time for multiple countries. CONCLUSIONS: Patterns in the start, peak, and doubling time for "coronavirus AND 5G" were different from the other misinformation topics and were mostly consistent across countries assessed, which might be attributable to a lack of public understanding of 5G technology. Understanding the spread of misinformation, similarities and differences across different contexts can help in the development of appropriate interventions for limiting its impact similar to how we address infectious disease epidemics. Furthermore, the rapid proliferation of misinformation that discourages adherence to public health interventions could be predictive of future increases in disease cases.


Assuntos
COVID-19/epidemiologia , Comunicação , COVID-19/virologia , Humanos , Pandemias , SARS-CoV-2/isolamento & purificação
13.
Int J Health Policy Manag ; 9(7): 269-273, 2020 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-32613798

RESUMO

The editorial materials in top medical and public health journals are opportunities for experts to offer thoughts that might influence the trajectory of the field. To date, while some studies have examined gender bias in the publication of editorial materials in medical journals, none have studied public health journals. In this perspective, we studied the gender ratio of the editorial materials published in the top health and medical sciences journals between 2008 and early 2018 to test whether gender bias exists. We studied a total of 59 top journals in health and medical sciences. Overall, while there is a trend of increasing proportion of female first authors, there is still a greater proportion of male than female first authors. The average male-to-female first author ratio during the study period across all journals was 2.08. Ensuring equal access and exposure through journal editorials is a critical step, albeit only one step of a longer journey, towards gender balance in health and medical sciences research. Editors of top journals have a key role to play in pushing the fields towards more balanced gender equality, and we strongly urge editors to rethink the strategies for inviting authors for editorial materials.


Assuntos
Medicina , Publicações Periódicas como Assunto , Autoria , Feminino , Humanos , Masculino , Sexismo
14.
Paediatr Perinat Epidemiol ; 34(5): 544-552, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-31912544

RESUMO

BACKGROUND: Experiences typically considered private, such as, miscarriages and preterm births are being discussed publicly on social media and Internet discussion websites. These data can provide timely illustrations of how individuals discuss miscarriages and preterm births, as well as insights into the wellbeing of women who have experienced a miscarriage. OBJECTIVES: To characterise how users discuss the topic of miscarriage and preterm births on Twitter, analyse trends and drivers, and describe the perceived emotional state of women who have experienced a miscarriage. METHODS: We obtained 291 443 Twitter postings on miscarriages and preterm births from January 2017 through December 2018. Latent Dirichlet Allocation (LDA) was used to identify major topics of discussion. We applied time series decomposition methods to assess temporal trends and identify major drivers of discussion. Furthermore, four coders labelled the emotional content of 7282 personal miscarriage disclosure tweets into the following non-mutually exclusive categories: grief/sadness/depression, anger, relief, isolation, annoyance, and neutral. RESULTS: Topics in our data fell into eight groups: celebrity disclosures, Michelle Obama's disclosure, politics, healthcare, preterm births, loss and anxiety, flu vaccine and ectopic pregnancies. Political discussions around miscarriages were largely due to a misunderstanding between abortions and miscarriages. Grief and annoyance were the most commonly expressed emotions within the miscarriage self-disclosures; 50.6% (95% confidence interval [CI] 49.1, 52.2) and 16.2% (95% CI 15.2, 17.3). Postings increased with celebrity disclosures, pharmacists' refusal of prescribed medications and outrage over the high rate of preterm births in the United States. Miscarriage disclosures by celebrities also led to disclosures by women who had similar experiences. CONCLUSIONS: This study suggests that increase in discussions of miscarriage on social media are associated with several factors, including celebrity disclosures. Additionally, there is a misunderstanding of the potential physical, emotional and psychological impacts on individuals who lose a pregnancy due to a miscarriage.


Assuntos
Aborto Espontâneo , Nascimento Prematuro , Mídias Sociais , Emoções , Pessoas Famosas , Feminino , Pesar , Custos de Cuidados de Saúde , Humanos , Gravidez , Autorrevelação , Saúde da Mulher/legislação & jurisprudência
15.
BMJ Open Sport Exerc Med ; 5(1): e000567, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31423323

RESUMO

OBJECTIVES: We examined the use of data from social media for surveillance of physical activity prevalence in the USA. METHODS: We obtained data from the social media site Twitter from April 2015 to March 2016. The data consisted of 1 382 284 geotagged physical activity tweets from 481 146 users (55.7% men and 44.3% women) in more than 2900 counties. We applied machine learning and statistical modelling to demonstrate sex and regional variations in preferred exercises, and assessed the association between reports of physical activity on Twitter and population-level inactivity prevalence from the US Centers for Disease Control and Prevention. RESULTS: The association between physical inactivity tweet patterns and physical activity prevalence varied by sex and region. Walking was the most popular physical activity for both men and women across all regions (15.94% (95% CI 15.85% to 16.02%) and 18.74% (95% CI 18.64% to 18.88%) of tweets, respectively). Men and women mentioned performing gym-based activities at approximately the same rates (4.68% (95% CI 4.63% to 4.72%) and 4.13% (95% CI 4.08% to 4.18%) of tweets, respectively). CrossFit was most popular among men (14.91% (95% CI 14.52% to 15.31%)) among gym-based tweets, whereas yoga was most popular among women (26.66% (95% CI 26.03% to 27.19%)). Men mentioned engaging in higher intensity activities than women. Overall, counties with higher physical activity tweets also had lower leisure-time physical inactivity prevalence for both sexes. CONCLUSIONS: The regional-specific and sex-specific activity patterns captured on Twitter may allow public health officials to identify changes in health behaviours at small geographical scales and to design interventions best suited for specific populations.

16.
Palgrave Commun ; 5(1)2019.
Artigo em Inglês | MEDLINE | ID: mdl-32661492

RESUMO

Obesity is a global epidemic affecting millions. Implementation of interventions to curb obesity rates requires timely surveillance. In this study, we estimated sex-specific obesity prevalence using social media, search queries, demographics and built environment variables. We collected 3,817,125 and 1,382,284 geolocated tweets on food and exercise respectively, from Twitter's streaming API from April 2015 to March 2016. We also obtained searches related to physical activity and diet from Google Search Trends for the same time period. Next, we inferred the gender of Twitter users using machine learning methods and applied mixed-effects state-level linear regression models to estimate obesity prevalence. We observed differences in discussions of physical activity and foods, with males reporting higher intensity physical activities and lower caloric foods across 40 and 48 states, respectively. Additionally, counties with the highest percentage of exercise and food tweets had lower male and female obesity prevalence. Lastly, our models separately captured overall male and female spatial trends in obesity prevalence. The average correlation between actual and estimated obesity prevalence was 0.789 (95% CI, 0.785, 0.786) and 0.830 (95% CI, 0.830, 0.831) for males and females, respectively. Social media can provide timely community-level data on health information seeking and changes in behaviors, sentiments and norms. Social media data can also be combined with other data types such as, demographics, built environment variables, diet and physical activity indicators from other digital sources (e.g., mobile applications and wearables) to monitor health behaviors at different geographic scales, and to supplement delayed estimates from traditional surveillance systems.

17.
Demography ; 55(5): 1979-1999, 2018 10.
Artigo em Inglês | MEDLINE | ID: mdl-30276667

RESUMO

The digital traces that we leave online are increasingly fruitful sources of data for social scientists, including those interested in demographic research. The collection and use of digital data also presents numerous statistical, computational, and ethical challenges, motivating the development of new research approaches to address these burgeoning issues. In this article, we argue that researchers with formal training in demography-those who have a history of developing innovative approaches to using challenging data-are well positioned to contribute to this area of work. We discuss the benefits and challenges of using digital trace data for social and demographic research, and we review examples of current demographic literature that creatively use digital trace data to study processes related to fertility, mortality, and migration. Focusing on Facebook data for advertisers-a novel "digital census" that has largely been untapped by demographers-we provide illustrative and empirical examples of how demographic researchers can manage issues such as bias and representation when using digital trace data. We conclude by offering our perspective on the road ahead regarding demography and its role in the data revolution.


Assuntos
Big Data , Coleta de Dados/métodos , Demografia/métodos , Pesquisa , Mídias Sociais/estatística & dados numéricos , Viés , Coeficiente de Natalidade/tendências , Coleta de Dados/ética , Demografia/ética , Ética em Pesquisa , Humanos , Mortalidade/tendências , Privacidade , Grupos Raciais/estatística & dados numéricos , Mídias Sociais/ética
18.
Sociol Methods Res ; 46(3): 390-421, 2017 08.
Artigo em Inglês | MEDLINE | ID: mdl-29033471

RESUMO

Despite recent and growing interest in using Twitter to examine human behavior and attitudes, there is still significant room for growth regarding the ability to leverage Twitter data for social science research. In particular, gleaning demographic information about Twitter users-a key component of much social science research-remains a challenge. This article develops an accurate and reliable data processing approach for social science researchers interested in using Twitter data to examine behaviors and attitudes, as well as the demographic characteristics of the populations expressing or engaging in them. Using information gathered from Twitter users who state an intention to not vote in the 2012 presidential election, we describe and evaluate a method for processing data to retrieve demographic information reported by users that is not encoded as text (e.g., details of images) and evaluate the reliability of these techniques. We end by assessing the challenges of this data collection strategy and discussing how large-scale social media data may benefit demographic researchers.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA