Text mining for identifying topics in the literatures about adolescent substance use and depression.
Wang, Shi-Heng; Ding, Yijun; Zhao, Weizhong; Huang, Yung-Hsiang; Perkins, Roger; Zou, Wen; Chen, James J.
Affiliation
  • Wang SH; Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, HFT-20, Jefferson, AR, 72079, USA.
  • Ding Y; Graduate Institute of Biostatistics, China Medical University, No. 91, Xueshi Rd, Taichung City, 40402, Taiwan.
  • Zhao W; Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, HFT-20, Jefferson, AR, 72079, USA.
  • Huang YH; Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, HFT-20, Jefferson, AR, 72079, USA.
  • Perkins R; National Applied Research Laboratories, National Center for High-Performance Computing, No. 7, R&D 6th Rd., Hsinchu Science Park, Hsinchu City, 30076, Taiwan.
  • Zou W; Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, HFT-20, Jefferson, AR, 72079, USA.
  • Chen JJ; Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, HFT-20, Jefferson, AR, 72079, USA. wen.zou@fda.hhs.gov.
BMC Public Health ; 16: 279, 2016 Mar 19.
Article in English | MEDLINE | ID: mdl-26993983
ABSTRACT

BACKGROUND:

Adolescent substance use and adolescent depression are both major public health problems, and they tend to co-occur. Thousands of articles on adolescent substance use or depression have been published, and extracting information by hand from such a large accumulated collection is labor-intensive and time-consuming. Topic modeling offers a computational tool for finding relevant topics by capturing meaningful structure in a collection of documents.

METHODS:

In this study, 17,723 PubMed abstracts on adolescent substance use and depression, published from 2000 to 2014, were downloaded, and latent Dirichlet allocation (LDA) was applied to mine the text. Word clouds were used to display the content of each topic and to show the distribution of vocabulary words over it.
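
As an illustration of what applying LDA to a set of abstracts involves, the following is a minimal sketch of LDA fitted by collapsed Gibbs sampling, written in plain Python. It is not the authors' implementation: the abstract does not state the software, the number of topics, the hyperparameters, or the preprocessing used. The function name `fit_lda`, the values of `alpha` and `beta`, and the toy tokenized documents are all assumptions made for illustration.

```python
import random


def fit_lda(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA by collapsed Gibbs sampling (illustrative sketch only).

    docs: list of tokenized documents (each a list of word strings).
    Returns (theta, top_words): per-document topic proportions and the
    three highest-count words of each topic.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]               # words in doc d assigned to topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # times word w is assigned to topic k
    topic_total = [0] * num_topics                              # total words assigned to topic k
    assignments = []

    # Start from a random topic assignment for every word occurrence.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                # Remove the current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                # Resample the topic from its conditional distribution.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1

    # Smoothed per-document topic proportions.
    theta = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for doc, counts in zip(docs, doc_topic)
    ]
    top_words = [
        [vocab[i] for i in sorted(range(vocab_size), key=lambda i: -topic_word[k][i])[:3]]
        for k in range(num_topics)
    ]
    return theta, top_words


# Toy tokenized "abstracts", invented for illustration (not data from the study).
toy_docs = [
    ["alcohol", "use", "peer", "alcohol", "family"],
    ["depression", "symptom", "brain", "depression"],
    ["alcohol", "peer", "use", "family"],
    ["brain", "depression", "symptom", "brain"],
]
theta, top_words = fit_lda(toy_docs, num_topics=2)
```

Each row of `theta` is one document's estimated mixture over topics, and each entry of `top_words` lists the words a word cloud for that topic would emphasize. A real analysis would normally add preprocessing such as stop-word removal and would use an established library.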

RESULTS:

The LDA topics recaptured the PubMed search keywords and also uncovered related issues: intervention programs; links between adolescent substance use and adolescent depression, such as sexual experience and violence; and risk factors for adolescent substance use, such as family factors and peer networks. Trend analysis of how topic proportions changed over time identified brain research as a hot issue, based on the coefficient of the trend test.
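
The abstract does not say which trend test was used. As a sketch of the general idea only, the code below assumes the simplest option: average each topic's proportion over the documents published in each year, then take the least-squares slope of that yearly proportion against the year, so that a positive slope marks a topic with a growing share of the literature. The function names and the example numbers are hypothetical.

```python
from collections import defaultdict


def yearly_topic_proportions(doc_years, theta):
    """Average the per-document topic proportions within each publication year."""
    grouped = defaultdict(list)
    for year, row in zip(doc_years, theta):
        grouped[year].append(row)
    return {
        year: [sum(col) / len(rows) for col in zip(*rows)]
        for year, rows in sorted(grouped.items())
    }


def trend_slope(years, proportions):
    """Least-squares slope of a topic's proportion regressed on year."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(proportions) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, proportions))
    return sxy / sxx


# Hypothetical publication years and topic proportions for five documents.
doc_years = [2000, 2000, 2001, 2002, 2002]
theta = [[0.9, 0.1], [0.7, 0.3], [0.6, 0.4], [0.3, 0.7], [0.5, 0.5]]

by_year = yearly_topic_proportions(doc_years, theta)
years = list(by_year)
slopes = [
    trend_slope(years, [by_year[y][k] for y in years])
    for k in range(len(theta[0]))
]
```

Under this assumption, the topic with the largest positive entry in `slopes` would be flagged as the fastest-growing one. A formal analysis would also test whether the slope differs significantly from zero.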

CONCLUSIONS:

Topic modeling can segregate a large collection of articles into distinct themes, and it can serve as a tool for understanding the literature, not only by recapturing known facts but also by discovering other relevant topics.

Full text: 1 Collections: 01-international Health context: 2_ODS3 Database: MEDLINE Main subject: Substance-Related Disorders / Depression / Data Mining Type of study: Prognostic_studies / Risk_factors_studies Limits: Adolescent / Humans Language: En Journal: BMC Public Health Publication year: 2016 Document type: Article