Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add more filters








Database
Language
Publication year range
1.
Heliyon ; 10(11): e32464, 2024 Jun 15.
Article in English | MEDLINE | ID: mdl-38947458

ABSTRACT

Climate change is one of the most pressing global issues of our time, and understanding public perception and awareness of the topic is crucial for developing effective policies to mitigate its effects. While traditional survey methods have been used to gauge public opinion, advances in natural language processing (NLP) and data visualization techniques offer new opportunities to analyze user-generated content from social media and blog posts. In this study, a new dataset of climate change-related texts was collected from social media sources and various blogs. The dataset was analyzed using BERTopic and LDA to identify and visualize the most important topics related to climate change. The study also used sentence similarity to determine the similarities in the comments written and which topic categories they belonged to. The performance of different techniques for keyword extraction and text representation, including OpenAI, Maximal Marginal Relevance (MMR), and KeyBERT, was compared for topic modeling with BERTopic. It was seen that the best coherence score and topic diversity metric were obtained with OpenAI-based BERTopic. The results provide insights into the public's attitudes and perceptions towards climate change, which can inform policy development and contribute to efforts to reduce activities that cause climate change.

2.
Heliyon ; 9(5): e15670, 2023 May.
Article in English | MEDLINE | ID: mdl-37187909

ABSTRACT

Since Turkish is an agglutinative language and contains reduplication, idiom, and metaphor words, Turkish texts are sources of information with extremely rich meanings. For this reason, the processing and classification of Turkish texts according to their characteristics is both time-consuming and difficult. In this study, the performances of pre-trained language models for multi-text classification using Autotrain were compared in a 250 K Turkish dataset that we created. The results showed that the BERTurk (uncased, 128 k) language model on the dataset showed higher accuracy performance with a training time of 66 min compared to the other models and the CO2 emission was quite low. The ConvBERTurk mC4 (uncased) model is also the best-performing second language model. As a result of this study, we have provided a deeper understanding of the capabilities of pre-trained language models for Turkish on machine learning.

3.
Chemometr Intell Lab Syst ; 230: 104680, 2022 Nov 15.
Article in English | MEDLINE | ID: mdl-36213553

ABSTRACT

Although some people do not have any chronic disease or are not in the risky age group for Covid-19, they are more vulnerable to the coronavirus. As the reason for this situation, some experts focus on the immune system of the person, while others think that the genetic history of patients may play a role. It is critical to detect corona from DNA signals as early as possible to determine the relationship between Covid-19 and genes. Thus, the effect on the severe course of the disease of variations in the genes associated with the corona disease will be revealed. In this study, a novel intelligent computer approach is proposed to identify coronavirus from nucleotide signals for the first time. The proposed method presents a multilayered feature extraction structure to extract the most effective features using an Entropy-based mapping technique, Discrete Wavelet Transform (DWT), statistical feature extractor, and Singular Value Decomposition (SVD), together. Then 94 distinctive features are selected by the ReliefF technique. Support vector machine (SVM) and k nearest neighborhood (k-NN) are chosen as classifiers. The method achieved the highest classification accuracy rate of 98.84% with an SVM classifier to detect Covid-19 from DNA signals. The proposed method is ready to be tested with a different database in the diagnosis of Covid-19 using RNA or other signals.

4.
Chemometr Intell Lab Syst ; 229: 104640, 2022 Oct 15.
Article in English | MEDLINE | ID: mdl-36042844

ABSTRACT

Although the coronavirus epidemic spread rapidly with the Omicron variant, it lost its lethality rate with the effect of vaccine and immunity. The hospitalization and intense demand decreased. However, there is no definite information about when this disease will end or how dangerous the different variants could be. In addition, it is not possible to end the risk of variants that will continue to circulate among animals in nature. After this stage, drug-virus interactions should be examined in order to be able to prepare against possible new types of viruses and variants and to rapidly-produce drugs or vaccines against possible viruses. Despite experimental methods that are expensive, laborious, and time-consuming, geometric deep learning(GDL) is an alternative method that can be used to make this process faster and cheaper. In this study, we propose a new model based on geometric deep learning for the prediction of drug-virus interaction against COVID-19. First, we use the antiviral drug data in the SMILES molecular structure representation to generate too many features and better describe the structure of chemical species. Then the data is converted into a molecular representation and then into a graphical structure that the GDL model can understand. The node feature vectors are transferred to a different space with the Message Passing Neural Network (MPNN) for the training process to take place. We develop a geometric neural network architecture where the graph embedding values are passed through the fully connected layer and the prediction is actualized. The results indicate that the proposed method outperforms existing methods with 97% accuracy in predicting drug-virus interactions.

SELECTION OF CITATIONS
SEARCH DETAIL