Search | VHL Search Portal

Methicillin-resistant Staphylococcus aureus nasal colonization among HIV-infected patients in Taiwan: prevalence, molecular characteristics and associated factors with nasal carriage.

Hsu, Yi-Yu; Wu, David; Hung, Chien-Ching; Huang, Shie-Shian; Yuan, Fang-Hsueh; Lee, Ming-Hsun; Huang, Ching-Tai; Shie, Shian-Sen; Huang, Po-Yen; Yang, Chien-Chang; Cheng, Chun-Wen; Leu, Hsieh-Shong; Wu, Ting-Shu; Huang, Yhu-Chering.

BMC Infect Dis ; 20(1): 254, 2020 Mar 30.

Article in English | MEDLINE | ID: mdl-32228480

ABSTRACT

BACKGROUND: To evaluate nasal carriage, antibiotic susceptibility and molecular characteristics of methicillin-resistant Staphylococcus aureus (MRSA), as well as the risk factors of MRSA colonization, in human immunodeficiency virus (HIV)-infected patients in northern Taiwan. METHODS: From September 2014 to November 2015, HIV-infected patients seeking outpatient care at four hospitals were eligible for this study. A nasal specimen was obtained from each subject for the detection of S. aureus and a questionnaire was completed by each subject. MRSA isolates once identified were characterized. RESULTS: Of 553 patients surveyed, methicillin-susceptible S. aureus (MSSA) was detected in 119 subjects (21.5%) and MRSA in 19 subjects (3.4%). Female gender, injection drug use, smoking, hepatitis C virus carrier, cancer and antibiotic use within 1 year were positively associated with MRSA colonization. By multivariate analysis, only cancer (adjust odds ratio (aOR) 7.78, [95% confidence interval (CI), 1.909-31.731]) and antibiotic use within 1 year (aOR 3.89, [95% CI, 1.219-12.433]) were significantly associated with MRSA colonization. Ten isolates were characterized as sequence type (ST) 59/staphylococcal chromosome cassette (SCC) IV or VT, endemic community strains in Taiwan, four isolates as ST 8/SCCmec IV (USA 300) and one isolate as ST 239/SCCmec IIIA, a hospital strain. All the community-associated MRSA isolates were susceptible to trimethoprim-sulfamethoxazole (TMP-SMX). CONCLUSIONS: Nasal MRSA carriage in HIV-infected patients seeking outpatient care was low (3.4%) in northern Taiwan. Most of the colonizing isolates were genetically endemic community strains and exhibited high susceptibility to TMP-SMX and fluoroquinolones. Cancer and antibiotic use within 1 year were associated with MRSA colonization.

Subject(s)

HIV Infections/microbiology , Methicillin-Resistant Staphylococcus aureus/drug effects , Nasal Mucosa/microbiology , Staphylococcal Infections/epidemiology , Adult , Anti-Bacterial Agents/pharmacology , Female , HIV Infections/epidemiology , Humans , Male , Methicillin-Resistant Staphylococcus aureus/isolation & purification , Microbial Sensitivity Tests , Middle Aged , Prevalence , Risk Factors , Staphylococcal Infections/microbiology , Substance Abuse, Intravenous/complications , Taiwan/epidemiology , Trimethoprim, Sulfamethoxazole Drug Combination/pharmacology

The overview of the BioRED (Biomedical Relation Extraction Dataset) track at BioCreative VIII.

Islamaj, Rezarta; Lai, Po-Ting; Wei, Chih-Hsuan; Luo, Ling; Almeida, Tiago; Jonker, Richard A A; Conceição, Sofia I R; Sousa, Diana F; Phan, Cong-Phuoc; Chiang, Jung-Hsien; Li, Jiru; Pan, Dinghao; Meesawad, Wilailack; Tsai, Richard Tzong-Han; Sarol, M Janina; Hong, Gibong; Valiev, Airat; Tutubalina, Elena; Lee, Shao-Man; Hsu, Yi-Yu; Li, Mingjie; Verspoor, Karin; Lu, Zhiyong.

Database (Oxford) ; 20242024 Aug 08.

Article in English | MEDLINE | ID: mdl-39114977

ABSTRACT

The BioRED track at BioCreative VIII calls for a community effort to identify, semantically categorize, and highlight the novelty factor of the relationships between biomedical entities in unstructured text. Relation extraction is crucial for many biomedical natural language processing (NLP) applications, from drug discovery to custom medical solutions. The BioRED track simulates a real-world application of biomedical relationship extraction, and as such, considers multiple biomedical entity types, normalized to their specific corresponding database identifiers, as well as defines relationships between them in the documents. The challenge consisted of two subtasks: (i) in Subtask 1, participants were given the article text and human expert annotated entities, and were asked to extract the relation pairs, identify their semantic type and the novelty factor, and (ii) in Subtask 2, participants were given only the article text, and were asked to build an end-to-end system that could identify and categorize the relationships and their novelty. We received a total of 94 submissions from 14 teams worldwide. The highest F-score performances achieved for the Subtask 1 were: 77.17% for relation pair identification, 58.95% for relation type identification, 59.22% for novelty identification, and 44.55% when evaluating all of the above aspects of the comprehensive relation extraction. The highest F-score performances achieved for the Subtask 2 were: 55.84% for relation pair, 43.03% for relation type, 42.74% for novelty, and 32.75% for comprehensive relation extraction. The entire BioRED track dataset and other challenge materials are available at https://ftp.ncbi.nlm.nih.gov/pub/lu/BC8-BioRED-track/ and https://codalab.lisn.upsaclay.fr/competitions/13377 and https://codalab.lisn.upsaclay.fr/competitions/13378. Database URL: https://ftp.ncbi.nlm.nih.gov/pub/lu/BC8-BioRED-track/https://codalab.lisn.upsaclay.fr/competitions/13377https://codalab.lisn.upsaclay.fr/competitions/13378.

Subject(s)

Data Mining , Natural Language Processing , Humans , Data Mining/methods , Databases, Factual , Semantics

Using deep learning to identify translational research in genomic medicine beyond bench to bedside.

Hsu, Yi-Yu; Clyne, Mindy; Wei, Chih-Hsuan; Khoury, Muin J; Lu, Zhiyong.

Database (Oxford) ; 20192019 01 01.

Article in English | MEDLINE | ID: mdl-30753477

ABSTRACT

Tracking scientific research publications on the evaluation, utility and implementation of genomic applications is critical for the translation of basic research to impact clinical and population health. In this work, we utilize state-of-the-art machine learning approaches to identify translational research in genomics beyond bench to bedside from the biomedical literature. We apply the convolutional neural networks (CNNs) and support vector machines (SVMs) to the bench/bedside article classification on the weekly manual annotation data of the Public Health Genomics Knowledge Base database. Both classifiers employ salient features to determine the probability of curation-eligible publications, which can effectively reduce the workload of manual triage and curation process. We applied the CNNs and SVMs to an independent test set (n = 400), and the models achieved the F-measure of 0.80 and 0.74, respectively. We further tested the CNNs, which perform better results, on the routine annotation pipeline for 2 weeks and significantly reduced the effort and retrieved more appropriate research articles. Our approaches provide direct insight into the automated curation of genomic translational research beyond bench to bedside. The machine learning classifiers are found to be helpful for annotators to enhance the efficiency of manual curation.

Subject(s)

Deep Learning , Genomics , Translational Research, Biomedical , Neural Networks, Computer , ROC Curve , Reproducibility of Results , Statistics as Topic , Support Vector Machine , Task Performance and Analysis

Assisting document triage for human kinome curation via machine learning.

Hsu, Yi-Yu; Wei, Chih-Hsuan; Lu, Zhiyong.

Database (Oxford) ; 20182018 01 01.

Article in English | MEDLINE | ID: mdl-30239677

ABSTRACT

In the era of data explosion, the increasing frequency of published articles presents unorthodox challenges to fulfill specific curation requirements for bio-literature databases. Recognizing these demands, we designed a document triage system with automatic methods that can improve efficiency to retrieve the most relevant articles in curation workflows and reduce workloads for biocurators. Since the BioCreative VI (2017), we have implemented texting mining processing in our system in hopes of providing higher effectiveness for curating articles related to human kinase proteins. We tested several machine learning methods together with state-of-the-art concept extraction tools. For features, we extracted rich co-occurrence and linguistic information to model the curation process of human kinome articles by the neXtProt database. As shown in the official evaluation on the human kinome curation task in BioCreative VI, our system can effectively retrieve 5.2 and 6.5 kinase articles with the relevant disease (DIS) and biological process (BP) information, respectively, among the top 100 returned results. Comparing to neXtA5, our system demonstrates significant improvements in prioritizing kinome-related articles as follows: our system achieves 0.458 and 0.109 for the DIS axis whereas the neXtA5's best-reported mean average precision (MAP) and maximum precision observed are 0.41 and 0.04. Our system also outperforms the neXtA5 in retrieving BP axis with 0.195 for MAP and the neXtA5's reported value was 0.11. These results suggest that our system may be able to assist neXtProt biocurators in practice.

Subject(s)

Data Mining , Documentation , Machine Learning , Protein Kinases/metabolism , Proteome/metabolism , Humans , Statistics as Topic

Overview of the BioCreative VI text-mining services for Kinome Curation Track.

Gobeill, Julien; Gaudet, Pascale; Dopp, Daniel; Morrone, Adam; Kahanda, Indika; Hsu, Yi-Yu; Wei, Chih-Hsuan; Lu, Zhiyong; Ruch, Patrick.

Database (Oxford) ; 20182018 01 01.

Article in English | MEDLINE | ID: mdl-30329035

ABSTRACT

The text-mining services for kinome curation track, part of BioCreative VI, proposed a competition to assess the effectiveness of text mining to perform literature triage. The track has exploited an unpublished curated data set from the neXtProt database. This data set contained comprehensive annotations for 300 human protein kinases. For a given protein and a given curation axis [diseases or gene ontology (GO) biological processes], participants' systems had to identify and rank relevant articles in a collection of 5.2 M MEDLINE citations (task 1) or 530 000 full-text articles (task 2). Explored strategies comprised named-entity recognition and machine-learning frameworks. For that latter approach, participants developed methods to derive a set of negative instances, as the databases typically do not store articles that were judged as irrelevant by curators. The supervised approaches proposed by the participating groups achieved significant improvements compared to the baseline established in a previous study and compared to a basic PubMed search.

Subject(s)

Data Mining , Protein Kinases/metabolism , Databases, Factual , Humans , Periodicals as Topic

AuDis: an automatic CRF-enhanced disease normalization in biomedical text.

Lee, Hsin-Chun; Hsu, Yi-Yu; Kao, Hung-Yu.

Database (Oxford) ; 20162016.

Article in English | MEDLINE | ID: mdl-27278815

ABSTRACT

Diseases play central roles in many areas of biomedical research and healthcare. Consequently, aggregating the disease knowledge and treatment research reports becomes an extremely critical issue, especially in rapid-growth knowledge bases (e.g. PubMed). We therefore developed a system, AuDis, for disease mention recognition and normalization in biomedical texts. Our system utilizes an order two conditional random fields model. To optimize the results, we customize several post-processing steps, including abbreviation resolution, consistency improvement and stopwords filtering. As the official evaluation on the CDR task in BioCreative V, AuDis obtained the best performance (86.46% of F-score) among 40 runs (16 unique teams) on disease normalization of the DNER sub task. These results suggest that AuDis is a high-performance recognition system for disease recognition and normalization from biomedical literature.Database URL: http://ikmlab.csie.ncku.edu.tw/CDR2015/AuDis.html.

Subject(s)

Biomedical Research/standards , Computational Biology/methods , Data Mining/methods , Data Mining/standards , Disease , Databases, Factual , Humans , Internet , Natural Language Processing , Vocabulary, Controlled

Curatable Named-Entity Recognition Using Semantic Relations.

Hsu, Yi-Yu; Kao, Hung-Yu.

IEEE/ACM Trans Comput Biol Bioinform ; 12(4): 785-92, 2015.

Article in English | MEDLINE | ID: mdl-26357317

ABSTRACT

Named-entity recognition (NER) plays an important role in the development of biomedical databases. However, the existing NER tools produce multifarious named-entities which may result in both curatable and non-curatable markers. To facilitate biocuration with a straightforward approach, classifying curatable named-entities is helpful with regard to accelerating the biocuration workflow. Co-occurrence Interaction Nexus with Named-entity Recognition (CoINNER) is a web-based tool that allows users to identify genes, chemicals, diseases, and action term mentions in the Comparative Toxicogenomic Database (CTD). To further discover interactions, CoINNER uses multiple advanced algorithms to recognize the mentions in the BioCreative IV CTD Track. CoINNER is developed based on a prototype system that annotated gene, chemical, and disease mentions in PubMed abstracts at BioCreative 2012 Track I (literature triage). We extended our previous system in developing CoINNER. The pre-tagging results of CoINNER were developed based on the state-of-the-art named entity recognition tools in BioCreative III. Next, a method based on conditional random fields (CRFs) is proposed to predict chemical and disease mentions in the articles. Finally, action term mentions were collected by latent Dirichlet allocation (LDA). At the BioCreative IV CTD Track, the best F-measures reached for gene/protein, chemical/drug and disease NER were 54 percent while CoINNER achieved a 61.5 percent F-measure. System URL: http://ikmbio.csie.ncku.edu.tw/coinner/ introduction.htm.

Subject(s)

Computational Biology/methods , Data Mining/methods , Semantics , Algorithms , Pattern Recognition, Automated , Toxicogenetics

CoIN: a network analysis for document triage.

Hsu, Yi-Yu; Kao, Hung-Yu.

Database (Oxford) ; 2013: bat076, 2013.

Article in English | MEDLINE | ID: mdl-24218542

ABSTRACT

In recent years, there was a rapid increase in the number of medical articles. The number of articles in PubMed has increased exponentially. Thus, the workload for biocurators has also increased exponentially. Under these circumstances, a system that can automatically determine in advance which article has a higher priority for curation can effectively reduce the workload of biocurators. Determining how to effectively find the articles required by biocurators has become an important task. In the triage task of BioCreative 2012, we proposed the Co-occurrence Interaction Nexus (CoIN) for learning and exploring relations in articles. We constructed a co-occurrence analysis system, which is applicable to PubMed articles and suitable for gene, chemical and disease queries. CoIN uses co-occurrence features and their network centralities to assess the influence of curatable articles from the Comparative Toxicogenomics Database. The experimental results show that our network-based approach combined with co-occurrence features can effectively classify curatable and non-curatable articles. CoIN also allows biocurators to survey the ranking lists for specific queries without reviewing meaningless information. At BioCreative 2012, CoIN achieved a 0.778 mean average precision in the triage task, thus finishing in second place out of all participants. Database URL: http://ikmbio.csie.ncku.edu.tw/coin/home.php.

Subject(s)

Data Mining , Documentation , Databases, Factual , Peer Review, Research , PubMed , User-Computer Interface

Using a search engine-based mutually reinforcing approach to assess the semantic relatedness of biomedical terms.

Hsu, Yi-Yu; Chen, Hung-Yu; Kao, Hung-Yu.

PLoS One ; 8(11): e77868, 2013.

Article in English | MEDLINE | ID: mdl-24348899

ABSTRACT

BACKGROUND: Determining the semantic relatedness of two biomedical terms is an important task for many text-mining applications in the biomedical field. Previous studies, such as those using ontology-based and corpus-based approaches, measured semantic relatedness by using information from the structure of biomedical literature, but these methods are limited by the small size of training resources. To increase the size of training datasets, the outputs of search engines have been used extensively to analyze the lexical patterns of biomedical terms. METHODOLOGY/PRINCIPAL FINDINGS: In this work, we propose the Mutually Reinforcing Lexical Pattern Ranking (ReLPR) algorithm for learning and exploring the lexical patterns of synonym pairs in biomedical text. ReLPR employs lexical patterns and their pattern containers to assess the semantic relatedness of biomedical terms. By combining sentence structures and the linking activities between containers and lexical patterns, our algorithm can explore the correlation between two biomedical terms. CONCLUSIONS/SIGNIFICANCE: The average correlation coefficient of the ReLPR algorithm was 0.82 for various datasets. The results of the ReLPR algorithm were significantly superior to those of previous methods.

Subject(s)

Biomedical Research , Search Engine/standards , Semantics , Algorithms , Computational Biology , Pattern Recognition, Automated

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL