Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
J Biomed Inform ; 137: 104264, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36462599

RESUMO

The demand for the privacy-preserving survival analysis of medical data integrated from multiple institutions or countries has been increased. However, sharing the original medical data is difficult because of privacy concerns, and even if it could be achieved, we have to pay huge costs for cross-institutional or cross-border communications. To tackle these difficulties of privacy-preserving survival analysis on multiple parties, this study proposes a novel data collaboration Cox proportional hazards (DC-COX) model based on a data collaboration framework for horizontally and vertically partitioned data. By integrating dimensionality-reduced intermediate representations instead of the original data, DC-COX obtains a privacy-preserving survival analysis without iterative cross-institutional communications or huge computational costs. DC-COX enables each local party to obtain an approximation of the maximum likelihood model parameter, the corresponding statistic, such as the p-value, and survival curves for subgroups. Based on a bootstrap technique, we introduce a dimensionality reduction method to improve the efficiency of DC-COX. Numerical experiments demonstrate that DC-COX can compute a model parameter and the corresponding statistics with higher performance than the local party analysis. Particularly, DC-COX demonstrates outstanding performance in essential feature selection based on the p-value compared with the existing methods including the federated learning-based method.


Assuntos
Comunicação , Privacidade , Modelos de Riscos Proporcionais , Análise de Sobrevida
2.
J Biomed Inform ; 115: 103692, 2021 03.
Artigo em Inglês | MEDLINE | ID: mdl-33548543

RESUMO

OBJECTIVE: The goal of this work was to capture diseases in patients by comprehending the fine-grained medical conditions and disease progression manifested by transitions in medical conditions. We realize this by introducing our earlier work on a state-of-the-art knowledge presentation, which defines a disease as a causal chain of abnormal states (CCAS). Here, we propose a framework, EHR2CCAS, for constructing a system to map electronic health record (EHR) data to CCAS. MATERIALS AND METHODS: EHR2CCAS is a framework consisting of modules that access heterogeneous EHR to estimate the presence of abnormal states in a CCAS for a patient in a given time window. EHR2CCAS applies expert-driven (rule-based) and data-driven (machine learning) methods to identify abnormal states from structured and unstructured EHR data. It features data-driven approaches for unlocking clinical texts and imputations based on the EHR temporal properties and the causal CCAS structure. This study presents the CCAS of chronic kidney disease as an example. A mapping system between the EHR from the University of Tokyo Hospital and CCAS of chronic kidney disease was constructed and evaluated against expert annotation. RESULTS: The system achieved high prediction performance in identifying abnormal states that had strong agreement among annotators. Our handling of narrative varieties in texts and our imputation of the presence of an abnormal state markedly improved the prediction performance. EHR2CCAS presents patient data describing the temporal presence of abnormal states in CCAS, which is useful in individual disease progression management. Further analysis of the differentiation of transition among abnormal states outputted by EHR2CCAS can contribute to detecting disease subtypes. CONCLUSION: This work represents the first step toward combining disease knowledge and EHR to extract abnormality related to a disease defined as fine-grained abnormal states and transitions among them. This can aid in disease progression management and deep phenotyping.


Assuntos
Registros Eletrônicos de Saúde , Insuficiência Renal Crônica , Causalidade , Humanos , Conhecimento , Aprendizado de Máquina , Insuficiência Renal Crônica/diagnóstico
3.
Comput Inform Nurs ; 39(11): 828-834, 2021 05 12.
Artigo em Inglês | MEDLINE | ID: mdl-33990502

RESUMO

In Japan, nursing records are not easily put to secondary use because nursing documentation is not standardized. In recent years, electronic health records have necessitated the creation of Japanese nursing terminology. The purpose of this study was to develop and evaluate an automatic classification system for narrative nursing records using natural language processing technology and machine learning. We collected a week's worth of narrative nursing records from an academic hospital. The authors independently annotated the text data, dividing it into morphemes, the smallest meaningful unit in a language. During preprocessing when creating feature quantities, we used a Japanese tokenizer, MeCab, an open-source morphological parser, and the bag-of-words model. A support vector machine was adopted as a classifier for machine learning. The accuracy was 0.96 and 0.86 on the training set and test set, respectively, and the F value was 0.82. Our findings provide useful information regarding the development of an automatic classification system for Japanese nursing records using nursing terminology and natural language processing techniques.


Assuntos
Processamento de Linguagem Natural , Registros de Enfermagem , Registros Eletrônicos de Saúde , Eletrônica , Humanos , Japão , Aprendizado de Máquina
4.
Cognition ; 246: 105758, 2024 05.
Artigo em Inglês | MEDLINE | ID: mdl-38442587

RESUMO

We propose a method to achieve better wisdom of crowds by utilizing anchoring effects. In this method, people are first asked to make a comparative judgment such as "Is the number of new COVID-19 infections one month later more or less than 10 (or 200,000)?" As in this example, two sufficiently different anchors (e.g., "10" or "200,000") are set in the comparative judgment. After this comparative judgment, people are asked to make their own estimates. These estimates are then aggregated. We hypothesized that the aggregated estimates using this method would be more accurate than those without anchor presentation. To examine the effectiveness of the proposed method, we conducted three studies: a computer simulation and two behavioral experiments (numerical estimation of perceptual stimuli and estimation of new COVID-19 infections by physicians). Through computer simulations, we could identify situations in which the proposed method is effective. Although the proposed method is not always effective (e.g., when a group can make fairly accurate estimations), on average, the proposed method is more likely to achieve better wisdom of crowds. In particular, when a group cannot make accurate estimations (i.e., shows biases such as overestimation or underestimation), the proposed method can achieve better wisdom of crowds. The results of the behavioral experiments were consistent with the computer simulation findings. The proposed method achieved better wisdom of crowds. We discuss new insights into anchoring effects and methods for inducing diverse opinions from group members.


Assuntos
COVID-19 , Julgamento , Humanos , Simulação por Computador , Aglomeração
5.
J Multidiscip Healthc ; 17: 2069-2081, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38736534

RESUMO

Purpose: The aim of this pilot study was to first aggregate and then integrate the medical records of various healthcare professionals involved with breast cancer patients to reveal if and how patient-centered information is documented in multidisciplinary cancer care. Patients and Methods: We aggregated 20 types of medical records from various healthcare professionals such as physicians, nurses and allied healthcare professionals (AHPs) throughout three breast cancer patients' care pathways in a department of breast surgery at a university hospital. Purposeful sampling was used, and three cases were examined. The number of integrated type of records was 14, 14, 17 in case 1, 2 and 3, respectively. We manually annotated and analyzed them exploratively using a thematic analysis. The tags were produced using both a deductive template approach and a data-driven inductive approach. All records were then given tags. We defined patient-centered information related tags and biomedical information related tags and then analyzed for if and how patient-centered information was documented. Results: The number of patient-centered information related tags accounted for 30%, 30% and 20% of the total in case 1, 2 and 3, respectively. In all cases, patient-centered information was distributed across various medical records. The Progress Note written by doctors provided much of the patient-centered information, while other records contained information not described elsewhere in the Progress Notes. The records of nurses and AHPs included more patient-centered information than the doctors' notes. Each piece of patient-centered information was documented in fragments providing from each of the healthcare professionals' viewpoints. Conclusion: The documented information throughout the breast cancer care pathway in the cases examined was dominated by biomedical information. However, our findings suggest that integrating fragmented patient-centered information from various healthcare professionals' medical records produces holistic patient-centered information from multiple perspectives and thus may facilitate an enhanced multidisciplinary patient-centered care.


An important paradigm shift within healthcare is the shift toward patient-centered care and away from disease-centered treatment. Patient-centered care is based on shared decision-making, respecting an individual patient's preferences, needs and values, and considering social context and best available research evidence to improve the quality of care. A multidisciplinary team (MDT) approach plays an important role in patient-centered care and MDTs are already adopted into daily oncology practices in many countries, especially in breast cancer care. Previous studies have shown that an effective MDT needs more patient-centered information but often that patient-centered information is notably absent from medical records. We investigated if and how patient-centered information such as psychosocial entries exists in patient records. For this purpose, we performed an exploratory pilot study in which the patient records of three patients with breast cancer, including two patients with advanced stage disease, were studied throughout their care pathway. We observed that the documentation of patient-centered information was fragmented and scattered across various medical records written by multidisciplinary professionals. Moreover, these pieces of scattered information were recorded from different perspectives and viewpoints. Our findings point to a significant role that healthcare informatics could play, as integrating the various healthcare professionals' electronic health record could likely produce multifaceted and more holistic patient-centered information which could be shared and used in shared decision-making and MDTs with a view to considering both patient and clinical perspectives, potentially improving the quality of care.

6.
Sci Rep ; 13(1): 11820, 2023 07 21.
Artigo em Inglês | MEDLINE | ID: mdl-37479701

RESUMO

Recent studies showed that machine learning models such as gradient-boosting decision tree (GBDT) can predict diabetes with high accuracy from big data. In this study, we asked whether highly accurate prediction of diabetes is possible even from small data by expanding the amount of data through data collaboration (DC) analysis, a modern framework for integrating and analyzing data accumulated at multiple institutions while ensuring confidentiality. To this end, we focused on data from two institutions: health checkup data of 1502 citizens accumulated in Tsukuba City and health history data of 1399 patients collected at the University of Tsukuba Hospital. When using only the health checkup data, the ROC-AUC and Recall for logistic regression (LR) were 0.858 ± 0.014 and 0.970 ± 0.019, respectively, while those for GBDT were 0.856 ± 0.014 and 0.983 ± 0.016, respectively. When using also the health history data through DC analysis, these values for LR improved to 0.875 ± 0.013 and 0.993 ± 0.009, respectively, while those for GBDT deteriorated because of the low compatibility with a method used for confidential data sharing (although DC analysis brought improvements). Even in a situation where health checkup data of only 324 citizens are available, the ROC-AUC and Recall for LR were 0.767 ± 0.025 and 0.867 ± 0.04, respectively, thanks to DC analysis, indicating an 11% and 12% improvement. Thus, we concluded that the answer to the above question was "Yes" for LR but "No" for GBDT for the data set tested in this study.


Assuntos
Diabetes Mellitus , Humanos , Diabetes Mellitus/diagnóstico , Diabetes Mellitus/epidemiologia , Aprendizado de Máquina , Modelos Logísticos
7.
Stud Health Technol Inform ; 290: 168-172, 2022 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-35672993

RESUMO

Electronic health records should efficiently store the information required for clinical decision-making and contain progress notes that reference this information. However, beyond the inclusion of subjective data, objective data, assessment, and plan framework, the content required to make progress notes useful for readers with diverse specialties has not been clarified. Moreover, the documentation burden that including additional content places on medical doctors (MDs) has not been determined. We conducted a questionnaire with 74 MDs, nurses, and other clinical professionals to determine whether they found progress notes with varying specific contents useful. In addition, the degree of the burden of writing progress notes that contain specific content was measured when 25 MDs were instructed to add specific content. Our results reveal that progress notes are more useful for clinical reasoning for readers other than MDs when more specific information is included; this can be achieved without increasing the documentation burden.


Assuntos
Documentação , Médicos , Comunicação , Registros Eletrônicos de Saúde , Humanos , Redação
8.
Stud Health Technol Inform ; 290: 1066-1067, 2022 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-35673208

RESUMO

We compared the distribution of laboratory test values with several parametric statistical distributions to show that a lognormal distribution can represent the distribution of laboratory test values. Then, we estimated the distributions of laboratory test values of four datasets including only three published values: two endpoints of reference interval (RI) and one median.


Assuntos
Distribuições Estatísticas , Valores de Referência
9.
Sci Rep ; 12(1): 8167, 2022 05 17.
Artigo em Inglês | MEDLINE | ID: mdl-35581220

RESUMO

When asked for numerical estimations, people can respond by stating their estimates (e.g., writing down a number) or indicating a number on a scale. Although these methods are logically the same, such differences may affect the responses to the numerical estimations. In this study, we examined how differences in response format affected responses to numerical estimations using two behavioral experiments. We found that participants showed a round number bias (i.e., people answered estimates with round numbers) when simply stating a number and the distribution of responses tended to be less diverse. In contrast, this tendency was not observed when the participants responded using a scale. Participants provided more diverse estimates when they answered using a scale. Furthermore, we analyzed how this difference in response distribution was related to the wisdom of crowds (the aggregated judgment is as accurate as, or sometimes better than, the best individual judgment in the group) using computer simulations. The results indicated that round number bias affected the achievement of the wisdom of crowds. Particularly, when the group size was small, biased responses resulted in less effective achievement. Our findings suggest that using an appropriate scale is a low-cost method for eliminating round number bias and efficiently achieving the wisdom of crowds.


Assuntos
Aglomeração , Julgamento , Viés , Simulação por Computador , Humanos , Julgamento/fisiologia
10.
Int J Med Inform ; 124: 90-96, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30784432

RESUMO

OBJECTIVES: Electronic health record (EHR)-based phenotyping is an automated technique for identifying patients diagnosed with a particular disease using EHR data. However, EHR-based phenotyping has difficulties in achieving satisfactorily high performance because clinical notes include disease mentions that ultimately signify something other than the patient's diagnosis (such as differential diagnosis or screening). Our objective is to quantify the influence of such disease mentions on EHR-based phenotyping performance. METHODS: Physicians manually reviewed whether the disease mentions indicated the patients' diseases in 487,300 clinical notes of 4,430 patients. Particular focus was placed on disease mentions that did not signify the patient's diagnosis even though they did not have any syntactic modifier or indicator in the same sentences. Patients were then classified according to whether their clinical notes included such disease mentions. RESULTS: Among the patients whose clinical notes included disease mentions without any modifier or indicator, the proportion of patients whose disease mentions signified the patients' diagnosis was 78.1% (on average). This value can be interpreted as the bias of disease mentions that did not signify the patient's diagnosis on the precision of EHR-based phenotyping by extracting disease mentions from clinical notes. CONCLUSION: This study quantified the bias occurred owing to disease mentions that incorrectly signify a patient's diagnosis in the value of precision of EHR-based phenotyping from four dataset types. The results of this study will help researchers in diverse research environments with different available data types.


Assuntos
Diagnóstico , Registros Eletrônicos de Saúde , Diagnóstico Diferencial , Difusão de Inovações , Humanos , Padrões de Prática Médica , Reprodutibilidade dos Testes
11.
Stud Health Technol Inform ; 245: 432-436, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-29295131

RESUMO

Phenotyping is an automated technique for identifying patients diagnosed with a particular disease based on electronic health records (EHRs). To evaluate phenotyping algorithms, which should be reproducible, the annotation of EHRs as a gold standard is critical. However, we have found that the different types of EHRs cannot be definitively annotated into CASEs or CONTROLs. The influence of such "possible patients" on phenotyping algorithms is unknown. To assess these issues, for four chronic diseases, we annotated EHRs by using information not directly referring to the diseases and developed two types of phenotyping algorithms for each disease. We confirmed that each disease included different types of possible patients. The performance of phenotyping algorithms differed depending on whether possible patients were considered as CASEs, and this was independent of the type of algorithms. Our results indicate that researchers must share annotation criteria for classifying the possible patients to reproduce phenotyping algorithms.


Assuntos
Algoritmos , Registros Eletrônicos de Saúde , Fenótipo , Humanos
12.
J Diabetes Sci Technol ; 11(4): 791-799, 2017 07.
Artigo em Inglês | MEDLINE | ID: mdl-27932531

RESUMO

BACKGROUND: Phenotyping is an automated technique that can be used to distinguish patients based on electronic health records. To improve the quality of medical care and advance type 2 diabetes mellitus (T2DM) research, the demand for T2DM phenotyping has been increasing. Some existing phenotyping algorithms are not sufficiently accurate for screening or identifying clinical research subjects. OBJECTIVE: We propose a practical phenotyping framework using both expert knowledge and a machine learning approach to develop 2 phenotyping algorithms: one is for screening; the other is for identifying research subjects. METHODS: We employ expert knowledge as rules to exclude obvious control patients and machine learning to increase accuracy for complicated patients. We developed phenotyping algorithms on the basis of our framework and performed binary classification to determine whether a patient has T2DM. To facilitate development of practical phenotyping algorithms, this study introduces new evaluation metrics: area under the precision-sensitivity curve (AUPS) with a high sensitivity and AUPS with a high positive predictive value. RESULTS: The proposed phenotyping algorithms based on our framework show higher performance than baseline algorithms. Our proposed framework can be used to develop 2 types of phenotyping algorithms depending on the tuning approach: one for screening, the other for identifying research subjects. CONCLUSIONS: We develop a novel phenotyping framework that can be easily implemented on the basis of proper evaluation metrics, which are in accordance with users' objectives. The phenotyping algorithms based on our framework are useful for extraction of T2DM patients in retrospective studies.


Assuntos
Diabetes Mellitus Tipo 2/diagnóstico , Registros Eletrônicos de Saúde , Máquina de Vetores de Suporte , Área Sob a Curva , Humanos , Fenótipo , Curva ROC , Sensibilidade e Especificidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA