RESUMO
In this study, we update the evaluation of the Russian GPT3 model presented in our previous paper in predicting the length of stay (LOS) in neurosurgery. We aimed to assess the performance the Russian GPT-3 (ruGPT-3) language model in LOS prediction using narrative medical records in neurosurgery compared to doctors' and patients' expectations. Doctors appeared to have the most realistic LOS expectations (MAE = 2.54), while the model's predictions (MAE = 3.53) were closest to the patients' (MAE = 3.47) but inferior to them (p = 0.011). A detailed analysis showed a solid quality of ruGPT-3 performance based on narrative clinical texts. Considering our previous findings obtained with recurrent neural networks and FastText vector representation, we estimate the new result as important but probably improveable.
Assuntos
Neurocirurgia , Humanos , Idioma , Tempo de Internação , Processamento de Linguagem Natural , Procedimentos NeurocirúrgicosRESUMO
Automated abstracts classification could significantly facilitate scientific literature screening. The classification of short texts could be based on their statistical properties. This research aimed to evaluate the quality of short medical abstracts classification primarily based on text statistical features. Twelve experiments with machine learning models over the sets of text features were performed on a dataset of 671 article abstracts. Each experiment was repeated 300 times to estimate the classification quality, ending up with 3600 tests total. We achieved the best result (F1 = 0.775) using a random forest machine learning model with keywords and three-dimensional Word2Vec embeddings. The classification of scientific abstracts might be implemented using straightforward and computationally inexpensive methods presented in this paper. The approach we described is expected to facilitate literature selection by researchers.
Assuntos
Aprendizado de Máquina , Processamento de Linguagem NaturalRESUMO
Gliomas are the most common neuroepithelial brain tumors, different by various biological tissue types and prognosis. They could be graded with four levels according to the 2007 WHO classification. The emergence of non-invasive histological and molecular diagnostics for nervous system neoplasms can revolutionize the efficacy and safety of medical care and radically reduce healthcare costs. Our pilot study aimed to evaluate the diagnostic accuracy of deep learning (DL) in subtyping gliomas by WHO grades (I-IV) based on preoperative magnetic resonance imaging (MRI) from Burdenko Neurosurgery Center's database. A total of 707 MRI studies was included. A "3D classification" approach predicting tumor type for the entire patient's MRI data showed the best result (accuracy = 83%, ROC AUC = 0.95), consistent with that of other authors who used different methodologies. Our preliminary results proved the separability of MR T1 axial images with contrast enhancement by WHO grade using DL.
Assuntos
Neoplasias Encefálicas , Aprendizado Profundo , Glioma , Neoplasias Encefálicas/diagnóstico por imagem , Neoplasias Encefálicas/patologia , Glioma/diagnóstico por imagem , Glioma/patologia , Humanos , Imageamento por Ressonância Magnética/métodos , Gradação de Tumores , Projetos Piloto , Estudos RetrospectivosRESUMO
Our study aimed to create a machine learning model to predict patients' functional outcomes after microsurgical treatment of unruptured intracranial aneurysms (UIA). Data on 615 microsurgically treated patients with UIA were collected retrospectively from the Electronic Health Records at N.N. Burdenko Neurosurgery Center (Moscow, Russia). The dichotomized modified Rankin Scale (mRS) at the discharge was used as a target variable. Several machine learning models were utilized: a random forest upon decision trees (RF), logistic regression (LR), support vector machine (SVM). The best result with F1-score metric = 0.904 was produced by the SVM model with a label-encode method. The predictive modeling based on machine learning might be promising as a decision support tool in intracranial aneurysm surgery.
Assuntos
Aneurisma Intracraniano , Humanos , Aneurisma Intracraniano/cirurgia , Aprendizado de Máquina , Procedimentos Neurocirúrgicos/métodos , Estudos Retrospectivos , Máquina de Vetores de Suporte , Resultado do TratamentoRESUMO
Our study aimed to compare the capability of different word embeddings to capture the semantic similarity of clinical concepts related to complications in neurosurgery at the level of medical experts. Eighty-four sets of word embeddings (based on Word2vec, GloVe, FastText, PMI, and BERT algorithms) were benchmarked in a clustering task. FastText model showed the best close to the medical expertise capability to group medical terms by their meaning (adjusted Rand index = 0.682). Word embedding models can accurately reflect clinical concepts' semantic and linguistic similarities, promising their robust usage in medical domain-specific NLP tasks.
Assuntos
Neurocirurgia , Algoritmos , Análise por Conglomerados , Linguística , SemânticaRESUMO
The possibility of postoperative speech dysfunction prediction in neurosurgery based on intraoperative cortico-cortical evoked potentials (CCEP) might provide a new basis to refine the criteria for the extent of intracerebral tumor resection and preserve patients' quality of life. In this study, we aimed to test the quality of predicting postoperative speech dysfunction with machine learning based on the initial intraoperative CCEP before tumor removal. CCEP data were reported for 26 patients. We used several machine learning models to predict speech deterioration following neurosurgery: a random forest of decision trees, logistic regression, support vector machine with different types of the kernel (linear, radial, and polynomial). The best result with F1-score = 0.638 was obtained by a support vector machine with a polynomial kernel. Most models showed low specificity and high sensitivity (reached 0.993 for the best model). Our pilot study demonstrated the insufficient quality of speech dysfunction prediction by solely intraoperative CCEP recorded before glial tumor resection, grounding our further research of CCEP postresectional dynamics.
Assuntos
Qualidade de Vida , Fala , Córtex Cerebral , Potenciais Evocados , Humanos , Aprendizado de Máquina , Projetos PilotoRESUMO
In this study, we tested the quality of the information extraction algorithm proposed by our group to detect pulmonary embolism (PE) in medical cases through sentence labeling. Having shown a comparable result (F1 = 0.921) to the best machine learning method (random forest, F1 = 0.937), our approach proved not to miss the information of interest. Scoping the number of texts under review down to distinct sentences and introducing labeling rules contributes to the efficiency and quality of information extraction by experts and makes the challenging tasks of labeling large textual datasets solvable.
Assuntos
Registros Eletrônicos de Saúde , Embolia Pulmonar , Humanos , Armazenamento e Recuperação da Informação , Idioma , Aprendizado de Máquina , Processamento de Linguagem Natural , Embolia Pulmonar/diagnósticoRESUMO
Patients, relatives, doctors, and healthcare providers anticipate the evidence-based length of stay (LOS) prediction in neurosurgery. This study aimed to assess the quality of LOS prediction with the GPT3 language model upon the narrative medical records in neurosurgery comparing to doctors' and patients' expectations. We found no significant difference (p = 0.109) between doctors', patients', and model's predictions with neurosurgeons tending to be more accurate in prognosis. The modern neural network language models demonstrate feasibility in LOS prediction.
Assuntos
Neurocirurgia , Humanos , Idioma , Tempo de Internação , Motivação , Federação RussaRESUMO
Implementing the best research principles initiates an important shift in clinical research culture, improving efficiency and the level of evidence obtained. In this article, we share our own view on the best research practice and our experience introducing it into the scientific activities of the N.N. Burdenko National Medical Research Center of Neurosurgery (Moscow, Russian Federation). While being adherent to the principles described in the article, the percentage of publications in the international scientific journals in our Center has increased from 7% to 27%, with an overall gain in the number of articles by 2 times since 2014. We believe it is important that medical informatics professionals equally to medical experts involved in clinical research are familiar with the best research principles.
Assuntos
Pesquisa Biomédica , Neurocirurgia , Hospitais , Procedimentos Neurocirúrgicos , Federação RussaRESUMO
Automated text classification is a natural language processing (NLP) technology that could significantly facilitate scientific literature selection. A specific topical dataset of 630 article abstracts was obtained from the PubMed database. We proposed 27 parametrized options of PubMedBERT model and 4 ensemble models to solve a binary classification task on that dataset. Three hundred tests with resamples were performed in each classification approach. The best PubMedBERT model demonstrated F1-score = 0.857 while the best ensemble model reached F1-score = 0.853. We concluded that the short scientific texts classification quality might be improved using the latest state-of-art approaches.
Assuntos
Processamento de Linguagem Natural , PubMedRESUMO
The automated detection of adverse events in medical records might be a cost-effective solution for patient safety management or pharmacovigilance. Our group proposed an information extraction algorithm (IEA) for detecting adverse events in neurosurgery using documents written in a natural rich-in-morphology language. In this paper, we challenge to optimize and evaluate its performance for the detection of any extremity muscle weakness in clinical texts. Our algorithm shows the accuracy of 0.96 and ROC AUC = 0.96 and might be easily implemented in other medical domains.
Assuntos
Debilidade Muscular , Processamento de Linguagem Natural , Registros Eletrônicos de Saúde , Humanos , Armazenamento e Recuperação da Informação , FarmacovigilânciaRESUMO
The number of scientific publications is constantly growing to make their processing extremely time-consuming. We hypothesized that a user-defined literature tracking may be augmented by machine learning on article summaries. A specific dataset of 671 article abstracts was obtained and nineteen binary classification options using machine learning (ML) techniques on various text representations were proposed in a pilot study. 300 tests with resamples were performed for each classification option. The best classification option demonstrated AUC = 0.78 proving the concept in general and indicating a potential for solution improvement.
Assuntos
Aprendizado de Máquina , Humanos , Processamento de Linguagem Natural , Projetos PilotoRESUMO
Identifying adverse events in clinical documents is demanded in retrospective clinical research and prospective monitoring of treatment safety and cost-effectiveness. We proposed and evaluated a few methods of semi-automated muscle weakness detection in preoperative clinical notes for a larger project on predicting paresis by images. The combination of semi-expert and machine learning methods demonstrated maximized sensitivity = 0.860 and specificity = 0.919, and largest AUC = 0.943 with a 95% CI [0.874; 0.991], outperforming each method used individually. Our approaches are expected to be effective for autoshaping a well- verified training dataset for supervised machine learning.