Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 192
Filtrar
1.
J Med Internet Res ; 26: e47923, 2024 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-38488839

RESUMO

BACKGROUND: Patient health data collected from a variety of nontraditional resources, commonly referred to as real-world data, can be a key information source for health and social science research. Social media platforms, such as Twitter (Twitter, Inc), offer vast amounts of real-world data. An important aspect of incorporating social media data in scientific research is identifying the demographic characteristics of the users who posted those data. Age and gender are considered key demographics for assessing the representativeness of the sample and enable researchers to study subgroups and disparities effectively. However, deciphering the age and gender of social media users poses challenges. OBJECTIVE: This scoping review aims to summarize the existing literature on the prediction of the age and gender of Twitter users and provide an overview of the methods used. METHODS: We searched 15 electronic databases and carried out reference checking to identify relevant studies that met our inclusion criteria: studies that predicted the age or gender of Twitter users using computational methods. The screening process was performed independently by 2 researchers to ensure the accuracy and reliability of the included studies. RESULTS: Of the initial 684 studies retrieved, 74 (10.8%) studies met our inclusion criteria. Among these 74 studies, 42 (57%) focused on predicting gender, 8 (11%) focused on predicting age, and 24 (32%) predicted a combination of both age and gender. Gender prediction was predominantly approached as a binary classification task, with the reported performance of the methods ranging from 0.58 to 0.96 F1-score or 0.51 to 0.97 accuracy. Age prediction approaches varied in terms of classification groups, with a higher range of reported performance, ranging from 0.31 to 0.94 F1-score or 0.43 to 0.86 accuracy. The heterogeneous nature of the studies and the reporting of dissimilar performance metrics made it challenging to quantitatively synthesize results and draw definitive conclusions. CONCLUSIONS: Our review found that although automated methods for predicting the age and gender of Twitter users have evolved to incorporate techniques such as deep neural networks, a significant proportion of the attempts rely on traditional machine learning methods, suggesting that there is potential to improve the performance of these tasks by using more advanced methods. Gender prediction has generally achieved a higher reported performance than age prediction. However, the lack of standardized reporting of performance metrics or standard annotated corpora to evaluate the methods used hinders any meaningful comparison of the approaches. Potential biases stemming from the collection and labeling of data used in the studies was identified as a problem, emphasizing the need for careful consideration and mitigation of biases in future studies. This scoping review provides valuable insights into the methods used for predicting the age and gender of Twitter users, along with the challenges and considerations associated with these methods.


Assuntos
Mídias Sociais , Humanos , Adulto Jovem , Adulto , Reprodutibilidade dos Testes , Redes Neurais de Computação , Aprendizado de Máquina
2.
J Med Internet Res ; 26: e50652, 2024 Mar 25.
Artigo em Inglês | MEDLINE | ID: mdl-38526542

RESUMO

We manually annotated 9734 tweets that were posted by users who reported their pregnancy on Twitter, and used them to train, evaluate, and deploy deep neural network classifiers (F1-score=0.93) to detect tweets that report having a child with attention-deficit/hyperactivity disorder (678 users), autism spectrum disorders (1744 users), delayed speech (902 users), or asthma (1255 users), demonstrating the potential of Twitter as a complementary resource for assessing associations between pregnancy exposures and childhood health outcomes on a large scale.


Assuntos
Asma , Transtorno do Espectro Autista , Mídias Sociais , Criança , Feminino , Gravidez , Humanos , Asma/epidemiologia , Redes Neurais de Computação
3.
J Am Med Inform Assoc ; 31(4): 991-996, 2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38218723

RESUMO

OBJECTIVE: The aim of the Social Media Mining for Health Applications (#SMM4H) shared tasks is to take a community-driven approach to address the natural language processing and machine learning challenges inherent to utilizing social media data for health informatics. In this paper, we present the annotated corpora, a technical summary of participants' systems, and the performance results. METHODS: The eighth iteration of the #SMM4H shared tasks was hosted at the AMIA 2023 Annual Symposium and consisted of 5 tasks that represented various social media platforms (Twitter and Reddit), languages (English and Spanish), methods (binary classification, multi-class classification, extraction, and normalization), and topics (COVID-19, therapies, social anxiety disorder, and adverse drug events). RESULTS: In total, 29 teams registered, representing 17 countries. In general, the top-performing systems used deep neural network architectures based on pre-trained transformer models. In particular, the top-performing systems for the classification tasks were based on single models that were pre-trained on social media corpora. CONCLUSION: To facilitate future work, the datasets-a total of 61 353 posts-will remain available by request, and the CodaLab sites will remain active for a post-evaluation phase.


Assuntos
Mídias Sociais , Humanos , Mineração de Dados/métodos , Aprendizado de Máquina , Processamento de Linguagem Natural , Redes Neurais de Computação
4.
medRxiv ; 2023 Nov 21.
Artigo em Inglês | MEDLINE | ID: mdl-38045356

RESUMO

Background: Preterm birth, defined as birth at <37 weeks of gestation, is the leading cause of neonatal death globally and, together with low birthweight, the second leading cause of infant mortality in the United States. There is mounting evidence that COVID-19 infection during pregnancy is associated with an increased risk of preterm birth; however, data remain limited by trimester of infection. The ability to study COVID-19 infection during the earlier stages of pregnancy has been limited by available sources of data. The objective of this study was to use self-reports in large-scale, longitudinal social media data to assess the association between trimester of COVID-19 infection and preterm birth. Methods: In this retrospective cohort study, we used natural language processing and machine learning, followed by manual validation, to identify pregnant Twitter users and to search their longitudinal collection of publicly available tweets for reports of COVID-19 infection during pregnancy and, subsequently, a preterm birth or term birth (i.e., a gestational age ≥37 weeks) outcome. Among the users who reported their pregnancy on Twitter, we also identified a 1:1 age-matched control group, consisting of users with a due date prior to January 1, 2020-that is, without COVID-19 infection during pregnancy. We calculated the odds ratios (ORs) with 95% confidence intervals (CIs) to compare the overall rates of preterm birth for pregnancies with and without COVID-19 infection and by timing of infection: first trimester (weeks 1-13), second trimester (weeks 1427), or third trimester (weeks 28-36). Results: Through August 2022, we identified 298 Twitter users who reported COVID-19 infection during pregnancy, a preterm birth or term birth outcome, and maternal age: 94 (31.5%) with first-trimester infection, 110 (36.9%) second-trimester infection, and 95 (31.9%) third-trimester infection. In total, 26 (8.8%) of these 298 users reported preterm birth: 8 (8.5%) were infected during the first trimester, 7 (6.4%) were infected during the second trimester, and 12 (12.6%) were infected during the third trimester. In the 1:1 age-matched control group, 13 (4.4%) of the 298 users reported preterm birth. Overall, the risk of preterm birth was significantly higher for pregnancies with COVID-19 infection compared to those without (OR 2.1, 95% CI 1.06-4.16). In particular, the risk of preterm birth was significantly higher for pregnancies with COVID-19 infection during the third trimester (OR 3.17, CI 1.39-7.21). Conclusion: The results of our study suggest that COVID-19 infection particularly during the third trimester is associated with an increased risk of preterm birth.

5.
medRxiv ; 2023 Nov 08.
Artigo em Inglês | MEDLINE | ID: mdl-37986776

RESUMO

The aim of the Social Media Mining for Health Applications (#SMM4H) shared tasks is to take a community-driven approach to address the natural language processing and machine learning challenges inherent to utilizing social media data for health informatics. The eighth iteration of the #SMM4H shared tasks was hosted at the AMIA 2023 Annual Symposium and consisted of five tasks that represented various social media platforms (Twitter and Reddit), languages (English and Spanish), methods (binary classification, multi-class classification, extraction, and normalization), and topics (COVID-19, therapies, social anxiety disorder, and adverse drug events). In total, 29 teams registered, representing 18 countries. In this paper, we present the annotated corpora, a technical summary of the systems, and the performance results. In general, the top-performing systems used deep neural network architectures based on pre-trained transformer models. In particular, the top-performing systems for the classification tasks were based on single models that were pre-trained on social media corpora. To facilitate future work, the datasets-a total of 61,353 posts-will remain available by request, and the CodaLab sites will remain active for a post-evaluation phase.

8.
Morphologie ; 107(356): 12-21, 2023 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-35184941

RESUMO

PURPOSE: In this study, the purpose was to uncover the views of medical students about online anatomy education adopted during the COVID-19 pandemic period. It was also aimed to determine whether medical school students found online education suitable for anatomy lectures and which materials they desired to use during teaching anatomy practice lectures in this process. METHODS: A survey form that was prepared with the Google Survey application was administered to the Medical Faculty Term 1 and 2 students who received anatomy courses at Istanbul Yeni Yüzyil University in the spring semester of the 2019-2020 academic year. RESULTS: A total of 180 students, 53.89% of whom were 1st graders and 46.11% 2nd graders participated in the study, and 43.89% of the students stated that they found online education suitable for anatomy theoretical courses, and 12.78% for anatomy practice courses. Also, 43.75% of Term 1 and 41.77% of Term 2 students stated that the pandemic negatively affected the teaching of anatomy theoretical courses. It was found that students considered that anatomy practice courses were more affected by the pandemic before and during the pandemic (P<0.001). CONCLUSIONS: This study uncovered that the pandemic process negatively affected anatomy education and students made more use of face-to-face education. We believe that the results obtained in the study will shed light on the views of anatomists on the teaching of anatomy in the online education process.


Assuntos
Anatomia , COVID-19 , Estudantes de Medicina , Humanos , Pandemias , Docentes de Medicina , COVID-19/epidemiologia
9.
Anim Welf ; 32: e65, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38487439

RESUMO

Over the last several decades an alternative to current methods of stunning cattle has been developed. This system, DTS: Diathermic Syncope®, has been suggested to the Jewish and Muslim communities as a means to achieve pre-cut stunning in conformity with both religious and EU regulations without a need to resort to a derogation that permits an exemption from the EU requirement to pre-stun all animals undergoing slaughter. The developer's contention is that the system induces fainting, and thus should be acceptable to all groups, including the kosher (Jewish) and Halal (Muslim) consumer. A review of the system based on publications and reports from the developer itself suggests that in reality the system selectively heats the brain, leading to an epileptic-type seizure with tonic-clonic phases and unconsciousness lasting several minutes. It does not induce a (benign) faint, and use of the system might cause structural brain damage. Thus, this system is unlikely to be acceptable under Jewish religious law and its animal welfare value can be questioned.

10.
JMIR Aging ; 5(3): e39547, 2022 Sep 16.
Artigo em Inglês | MEDLINE | ID: mdl-36112408

RESUMO

BACKGROUND: More than 6 million people in the United States have Alzheimer disease and related dementias, receiving help from more than 11 million family or other informal caregivers. A range of traditional interventions has been developed to support family caregivers; however, most of them have not been implemented in practice and remain largely inaccessible. While recent studies have shown that family caregivers of people with dementia use Twitter to discuss their experiences, methods have not been developed to enable the use of Twitter for interventions. OBJECTIVE: The objective of this study is to develop an annotated data set and benchmark classification models for automatically identifying a cohort of Twitter users who have a family member with dementia. METHODS: Between May 4 and May 20, 2021, we collected 10,733 tweets, posted by 8846 users, that mention a dementia-related keyword, a linguistic marker that potentially indicates a diagnosis, and a select familial relationship. Three annotators annotated 1 random tweet per user to distinguish those that indicate having a family member with dementia from those that do not. Interannotator agreement was 0.82 (Fleiss kappa). We used the annotated tweets to train and evaluate support vector machine and deep neural network classifiers. To assess the scalability of our approach, we then deployed automatic classification on unlabeled tweets that were continuously collected between May 4, 2021, and March 9, 2022. RESULTS: A deep neural network classifier based on a BERT (bidirectional encoder representations from transformers) model pretrained on tweets achieved the highest F1-score of 0.962 (precision=0.946 and recall=0.979) for the class of tweets indicating that the user has a family member with dementia. The classifier detected 128,838 tweets that indicate having a family member with dementia, posted by 74,290 users between May 4, 2021, and March 9, 2022-that is, approximately 7500 users per month. CONCLUSIONS: Our annotated data set can be used to automatically identify Twitter users who have a family member with dementia, enabling the use of Twitter on a large scale to not only explore family caregivers' experiences but also directly target interventions at these users.

11.
JMIR Form Res ; 6(6): e36771, 2022 Jun 30.
Artigo em Inglês | MEDLINE | ID: mdl-35771614

RESUMO

BACKGROUND: Despite the fact that medication is taken during more than 90% of pregnancies, the fetal risk for most medications is unknown, and the majority of medications have no data regarding safety in pregnancy. OBJECTIVE: Using ß-blockers as a proof-of-concept, the primary objective of this study was to assess the utility of Twitter data for a cohort study design-in particular, whether we could identify (1) Twitter users who have posted tweets reporting that they took medication during pregnancy and (2) their associated pregnancy outcomes. METHODS: We searched for mentions of ß-blockers in 2.75 billion tweets posted by 415,690 users who announced their pregnancy on Twitter. We manually reviewed the matching tweets to first determine if the user actually took the ß-blocker mentioned in the tweet. Then, to help determine if the ß-blocker was taken during pregnancy, we used the time stamp of the tweet reporting intake and drew upon an automated natural language processing (NLP) tool that estimates the date of the user's prenatal time period. For users who posted tweets indicating that they took or may have taken the ß-blocker during pregnancy, we drew upon additional NLP tools to help identify tweets that report their pregnancy outcomes. Adverse pregnancy outcomes included miscarriage, stillbirth, birth defects, preterm birth (<37 weeks gestation), low birth weight (<5 pounds and 8 ounces at delivery), and neonatal intensive care unit (NICU) admission. Normal pregnancy outcomes included gestational age ≥37 weeks and birth weight ≥5 pounds and 8 ounces. RESULTS: We retrieved 5114 tweets, posted by 2339 users, that mention a ß-blocker, and manually identified 2332 (45.6%) tweets, posted by 1195 (51.1%) of the users, that self-report taking the ß-blocker. We were able to estimate the date of the prenatal time period for 356 pregnancies among 334 (27.9%) of these 1195 users. Among these 356 pregnancies, we identified 257 (72.2%) during which the ß-blocker was or may have been taken. We manually verified an adverse pregnancy outcome-preterm birth, NICU admission, low birth weight, birth defects, or miscarriage-for 38 (14.8%) of these 257 pregnancies. We manually verified a gestational age ≥37 weeks for 198 (90.4%) and a birth weight ≥5 pounds and 8 ounces for 50 (22.8%) of the 219 pregnancies for which we did not identify an adverse pregnancy outcome. CONCLUSIONS: Our ability to detect pregnancy outcomes for Twitter users who posted tweets reporting that they took or may have taken a ß-blocker during pregnancy suggests that Twitter can be a complementary resource for cohort studies of drug safety in pregnancy.

12.
Digit Health ; 8: 20552076221097508, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35574580

RESUMO

Objective: Given the uncertainty about the trends and extent of the rapidly evolving COVID-19 outbreak, and the lack of extensive testing in the United Kingdom, our understanding of COVID-19 transmission is limited. We proposed to use Twitter to identify personal reports of COVID-19 to assess whether this data can help inform as a source of data to help us understand and model the transmission and trajectory of COVID-19. Methods: We used natural language processing and machine learning framework. We collected tweets (excluding retweets) from the Twitter Streaming API that indicate that the user or a member of the user's household had been exposed to COVID-19. The tweets were required to be geo-tagged or have profile location metadata in the UK. Results: We identified a high level of agreement between personal reports from Twitter and lab-confirmed cases by geographical region in the UK. Temporal analysis indicated that personal reports from Twitter appear up to 2 weeks before UK government lab-confirmed cases are recorded. Conclusions: Analysis of tweets may indicate trends in COVID-19 in the UK and provide signals of geographical locations where resources may need to be targeted or where regional policies may need to be put in place to further limit the spread of COVID-19. It may also help inform policy makers of the restrictions in lockdown that are most effective or ineffective.

13.
JMIR Public Health Surveill ; 8(4): e32405, 2022 04 25.
Artigo em Inglês | MEDLINE | ID: mdl-35468092

RESUMO

BACKGROUND: Pre-exposure prophylaxis (PrEP) is highly effective at preventing the acquisition of HIV. There is a substantial gap, however, between the number of people in the United States who have indications for PrEP and the number of them who are prescribed PrEP. Although Twitter content has been analyzed as a source of PrEP-related data (eg, barriers), methods have not been developed to enable the use of Twitter as a platform for implementing PrEP-related interventions. OBJECTIVE: Men who have sex with men (MSM) are the population most affected by HIV in the United States. Therefore, the objectives of this study were to (1) develop an automated natural language processing (NLP) pipeline for identifying men in the United States who have reported on Twitter that they are gay, bisexual, or MSM and (2) assess the extent to which they demographically represent MSM in the United States with new HIV diagnoses. METHODS: Between September 2020 and January 2021, we used the Twitter Streaming Application Programming Interface (API) to collect more than 3 million tweets containing keywords that men may include in posts reporting that they are gay, bisexual, or MSM. We deployed handwritten, high-precision regular expressions-designed to filter out noise and identify actual self-reports-on the tweets and their user profile metadata. We identified 10,043 unique users geolocated in the United States and drew upon a validated NLP tool to automatically identify their ages. RESULTS: By manually distinguishing true- and false-positive self-reports in the tweets or profiles of 1000 (10%) of the 10,043 users identified by our automated pipeline, we established that our pipeline has a precision of 0.85. Among the 8756 users for which a US state-level geolocation was detected, 5096 (58.2%) were in the 10 states with the highest numbers of new HIV diagnoses. Among the 6240 users for which a county-level geolocation was detected, 4252 (68.1%) were in counties or states considered priority jurisdictions by the Ending the HIV Epidemic initiative. Furthermore, the age distribution of the users reflected that of MSM in the United States with new HIV diagnoses. CONCLUSIONS: Our automated NLP pipeline can be used to identify MSM in the United States who may be at risk of acquiring HIV, laying the groundwork for using Twitter on a large scale to directly target PrEP-related interventions at this population.


Assuntos
Infecções por HIV , Minorias Sexuais e de Gênero , Mídias Sociais , Infecções por HIV/epidemiologia , Infecções por HIV/prevenção & controle , Homossexualidade Masculina , Humanos , Masculino , Processamento de Linguagem Natural , Estados Unidos/epidemiologia
14.
PLoS One ; 17(1): e0262087, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35077484

RESUMO

Advancing the utility of social media data for research applications requires methods for automatically detecting demographic information about social media study populations, including users' age. The objective of this study was to develop and evaluate a method that automatically identifies the exact age of users based on self-reports in their tweets. Our end-to-end automatic natural language processing (NLP) pipeline, ReportAGE, includes query patterns to retrieve tweets that potentially mention an age, a classifier to distinguish retrieved tweets that self-report the user's exact age ("age" tweets) and those that do not ("no age" tweets), and rule-based extraction to identify the age. To develop and evaluate ReportAGE, we manually annotated 11,000 tweets that matched the query patterns. Based on 1000 tweets that were annotated by all five annotators, inter-annotator agreement (Fleiss' kappa) was 0.80 for distinguishing "age" and "no age" tweets, and 0.95 for identifying the exact age among the "age" tweets on which the annotators agreed. A deep neural network classifier, based on a RoBERTa-Large pretrained transformer model, achieved the highest F1-score of 0.914 (precision = 0.905, recall = 0.942) for the "age" class. When the age extraction was evaluated using the classifier's predictions, it achieved an F1-score of 0.855 (precision = 0.805, recall = 0.914) for the "age" class. When it was evaluated directly on the held-out test set, it achieved an F1-score of 0.931 (precision = 0.873, recall = 0.998) for the "age" class. We deployed ReportAGE on a collection of more than 1.2 billion tweets, posted by 245,927 users, and predicted ages for 132,637 (54%) of them. Scaling the detection of exact age to this large number of users can advance the utility of social media data for research applications that do not align with the predefined age groupings of extant binary or multi-class classification approaches.


Assuntos
Coleta de Dados/métodos , Adolescente , Adulto , Humanos , Processamento de Linguagem Natural , Redes Neurais de Computação , Autorrelato , Mídias Sociais , Adulto Jovem
15.
JMIR Form Res ; 6(1): e33792, 2022 Jan 06.
Artigo em Inglês | MEDLINE | ID: mdl-34870607

RESUMO

BACKGROUND: COVID-19 during pregnancy is associated with an increased risk of maternal death, intensive care unit admission, and preterm birth; however, many people who are pregnant refuse to receive COVID-19 vaccination because of a lack of safety data. OBJECTIVE: The objective of this preliminary study was to assess whether Twitter data could be used to identify a cohort for epidemiologic studies of COVID-19 vaccination in pregnancy. Specifically, we examined whether it is possible to identify users who have reported (1) that they received COVID-19 vaccination during pregnancy or the periconception period, and (2) their pregnancy outcomes. METHODS: We developed regular expressions to search for reports of COVID-19 vaccination in a large collection of tweets posted through the beginning of July 2021 by users who have announced their pregnancy on Twitter. To help determine if users were vaccinated during pregnancy, we drew upon a natural language processing (NLP) tool that estimates the timeframe of the prenatal period. For users who posted tweets with a timestamp indicating they were vaccinated during pregnancy, we drew upon additional NLP tools to help identify tweets that reported their pregnancy outcomes. RESULTS: We manually verified the content of tweets detected automatically, identifying 150 users who reported on Twitter that they received at least one dose of COVID-19 vaccination during pregnancy or the periconception period. We manually verified at least one reported outcome for 45 of the 60 (75%) completed pregnancies. CONCLUSIONS: Given the limited availability of data on COVID-19 vaccine safety in pregnancy, Twitter can be a complementary resource for potentially increasing the acceptance of COVID-19 vaccination in pregnant populations. The results of this preliminary study justify the development of scalable methods to identify a larger cohort for epidemiologic studies.

16.
Preprint em Inglês | medRxiv | ID: ppmedrxiv-21263653

RESUMO

BackgroundCoronavirus Disease 2019 (Covid-19) during pregnancy is associated with an increased risk of maternal death, intensive care unit (ICU) admission, and preterm birth; however, many people who are pregnant refuse to receive Covid-19 vaccination because of a lack of safety data. ObjectiveThe objective of this preliminary study was to assess whether we could identify (1) users who have reported on Twitter that they received Covid-19 vaccination during pregnancy or the periconception period, and (2) reports of their pregnancy outcomes. MethodsWe searched for reports of Covid-19 vaccination in a large collection of tweets posted by users who have announced their pregnancy on Twitter. To help determine if users were vaccinated during pregnancy, we drew upon a natural language processing (NLP) tool that estimates the timeframe of the prenatal period. For users who posted tweets with a timestamp indicating they were vaccinated during pregnancy, we drew upon additional NLP tools to help identify tweets that report their pregnancy outcomes. ResultsUpon manually verifying the content of tweets detected automatically, we identified 150 users who reported on Twitter that they received at least one dose of Covid-19 vaccination during pregnancy or the periconception period. Among the 60 completed pregnancies, we manually verified at least one reported outcome for 45 (75%) of them. ConclusionsGiven the limited availability of data on Covid-19 vaccine safety in pregnancy, Twitter can be a complementary resource for potentially increasing the acceptance of Covid-19 vaccination in pregnant populations. Directions for future work include developing machine learning algorithms to detect a larger number of users for observational studies.

17.
J Med Internet Res ; 23(1): e25314, 2021 01 22.
Artigo em Inglês | MEDLINE | ID: mdl-33449904

RESUMO

BACKGROUND: In the United States, the rapidly evolving COVID-19 outbreak, the shortage of available testing, and the delay of test results present challenges for actively monitoring its spread based on testing alone. OBJECTIVE: The objective of this study was to develop, evaluate, and deploy an automatic natural language processing pipeline to collect user-generated Twitter data as a complementary resource for identifying potential cases of COVID-19 in the United States that are not based on testing and, thus, may not have been reported to the Centers for Disease Control and Prevention. METHODS: Beginning January 23, 2020, we collected English tweets from the Twitter Streaming application programming interface that mention keywords related to COVID-19. We applied handwritten regular expressions to identify tweets indicating that the user potentially has been exposed to COVID-19. We automatically filtered out "reported speech" (eg, quotations, news headlines) from the tweets that matched the regular expressions, and two annotators annotated a random sample of 8976 tweets that are geo-tagged or have profile location metadata, distinguishing tweets that self-report potential cases of COVID-19 from those that do not. We used the annotated tweets to train and evaluate deep neural network classifiers based on bidirectional encoder representations from transformers (BERT). Finally, we deployed the automatic pipeline on more than 85 million unlabeled tweets that were continuously collected between March 1 and August 21, 2020. RESULTS: Interannotator agreement, based on dual annotations for 3644 (41%) of the 8976 tweets, was 0.77 (Cohen κ). A deep neural network classifier, based on a BERT model that was pretrained on tweets related to COVID-19, achieved an F1-score of 0.76 (precision=0.76, recall=0.76) for detecting tweets that self-report potential cases of COVID-19. Upon deploying our automatic pipeline, we identified 13,714 tweets that self-report potential cases of COVID-19 and have US state-level geolocations. CONCLUSIONS: We have made the 13,714 tweets identified in this study, along with each tweet's time stamp and US state-level geolocation, publicly available to download. This data set presents the opportunity for future work to assess the utility of Twitter data as a complementary resource for tracking the spread of COVID-19.


Assuntos
COVID-19/epidemiologia , COVID-19/transmissão , Conjuntos de Dados como Assunto , Processamento de Linguagem Natural , Mídias Sociais/estatística & dados numéricos , COVID-19/diagnóstico , Surtos de Doenças/estatística & dados numéricos , Humanos , Estudos Longitudinais , SARS-CoV-2 , Autorrelato , Fala , Estados Unidos/epidemiologia
18.
Hepatol Int ; 15(1): 191-201, 2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-32949377

RESUMO

BACKGROUND: Primary sclerosing cholangitis (PSC) is a chronic, progressive liver disease known for its frequent concurrence with inflammatory bowel disease. PSC can progress to cirrhosis, end-stage liver disease, hepatobiliary cancer, and/or colorectal cancer. The etiopathogenesis of PSC remains poorly understood, and, as such, pharmacotherapy has yet to be definitively established. Little is known about the salivary microbiome in PSC and PSC-IBD. This study aimed to evaluate the oral microbiome of patients with PSC, with association to these patient's fecal microbial composition. METHODS: Saliva, fecal samples and Food Frequency Questionnaires were collected from 35 PSC patients with or without concomitant inflammatory bowel disease and 30 age- and BMI-matched healthy volunteers. 16S rRNA gene sequencing was performed using Illumina MiSeq platform. RESULTS: The salivary microbial signature of PSC was significantly altered as compared to healthy controls, independent of concomitant IBD, and was comprised of 19 significantly altered species, of which, eight species were consistently overrepresented in both fecal and saliva of patients with PSC, including Veillonella, Scardovia and Streptococcus. CONCLUSIONS: PSC is characterized by microbial dysbiosis in the gut and the salivary microbiome, independently from IBD. The PSC dysbiotic signature includes a reduction in autochthonous bacteria and an increased relative abundance of pathogenic bacteria, including an invasion of oral bacteria to the gut. PSC is a strong modulator of the microbial profile, in the gut and the oral microbiome. These results may lead to the development of biomarkers for screening and early diagnosis or the development of personalized medicine in PSC.


Assuntos
Colangite Esclerosante , Microbioma Gastrointestinal , Disbiose , Humanos , Doenças Inflamatórias Intestinais , RNA Ribossômico 16S/genética
19.
Data Brief ; 32: 106249, 2020 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-32944604

RESUMO

Despite the prevalence in the United States of miscarriage [1], stillbirth [2], and infant mortality associated with preterm birth and low birthweight [3], their causes remain largely unknown [4], [5], [6]. To advance the use of social media data as a complementary resource for epidemiology of adverse pregnancy outcomes, we present a data set of 6487 tweets that mention miscarriage, stillbirth, preterm birth or premature labor, low birthweight, neonatal intensive care, or fetal/infant loss in general. These tweets are a subset of 22,912 tweets retrieved by applying hand-written regular expressions to a database containing more than 400 million public tweets posted by more than 100,000 women who have announced their pregnancy on Twitter [7]. Two professional annotators labeled the 6487 tweets in a binary fashion, distinguishing those potentially reporting that the user has personally experienced the outcome ("outcome" tweets) from those that merely mention the outcome ("non-outcome" tweets). Inter-annotator agreement was κ = 0.90 (Cohen's kappa). The tweets annotated as "outcome" include 1318 women reporting miscarriage, 94 stillbirth, 591 preterm birth or premature labor, 171 low birthweight, 453 neonatal intensive care, and 356 fetal/infant loss in general. These "outcome" tweets can be used to explore patient experiences and perceptions of adverse pregnancy outcomes, and can direct researchers to the users' broader timelines-tweets posted by a user over time-for observational studies. Our past work demonstrates the analysis of timelines for selecting a study population [8] and conducting a case-control study [9] of users reporting that their child has a birth defect. For larger-scale studies, the full annotated corpus can be used to train supervised machine learning algorithms to automatically identify additional users reporting adverse pregnancy outcomes on Twitter. We used the annotated corpus to train feature-engineered and deep learning-based classifiers presented in "A natural language processing pipeline to advance the use of Twitter data for digital epidemiology of adverse pregnancy outcomes" [10].

20.
Conscious Cogn ; 85: 103020, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32932098

RESUMO

A long-standing controversy in social attention debates whether gaze-of-another induces reflexive shifts of one's own attention. In attempting to resolve this controversy, we utilized a novel Stroop task, the PAT Stroop, in which pro- and anti-saccade (PAT) responses are made to competing gaze and peripheral stimuli. The first experiment demonstrated a "Stroop effect" for peripheral stimuli, i.e. peripheral distractors interfered with gaze triggers, but gaze distractors did not interfere with peripheral triggers. These results were replicated in the second experiment, which also negated the possibility that the mere display and practice of the "clean PAT" influenced the results. Thus, the use a new PAT Stroop task demonstrated reflexive supremacy of peripheral stimuli over gaze stimuli. This novel variant of the Stroop task demonstrated similar characteristics to the classic color naming Stroop - i.e. an asymmetrical pattern, and again showed the utility and versatility of stoop-like tasks in probing mental tasks.


Assuntos
Sinais (Psicologia) , Movimentos Sacádicos , Atenção , Humanos , Tempo de Reação , Teste de Stroop
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA