RESUMEN
The importance of social media has seen a dramatic increase in recent times, but much about its influence in academia is still unknown. To date, no comparative studies analysing the effect of social media promotion on citation counts have been undertaken in neurosurgical publishing. We randomized 177 articles published in Acta Neurochirurgica from May to September 2020. The 89 articles in the intervention group received a standardized social media promotion through one post on our official Twitter/X account, whereas the 88 articles in the control group did not receive any social media promotion. Citation counts, website visits and PDF downloads were tracked at one and two years post-promotion. We found no significant difference in number of citations at one year post-promotion (Intervention: 1.85 ± 3.94 vs. Control: 2.67 ± 6.65, p = 0.322) or at two years (5.35 ± 7.39 vs. 7.09 ± 12.1, p = 0.249). Similarly, no difference was detected in website visits at one (587.46 ± 568.04 vs. 590.65 ± 636.25, p = 0.972) or two years (865.79 ± 855.80 vs. 896.31 ± 981.97, p = 0.826) and PDF downloads at one (183.40 ± 152.02 vs. 187.78 ± 199.01, p = 0.870) or two years (255.99 ± 218.97 vs. 260.97 ± 258.44, p = 0.890). In a randomized study, a structured promotion of general neurosurgical articles on Twitter/X did not significantly impact citation count, website visits, or PDF downloads compared to no social media promotion. Combined with published evidence to date, the impact of social media on citation counts in academic publishing ultimately remains unclear.
Asunto(s)
Neurocirugia , Edición , Medios de Comunicación Sociales , Humanos , Publicaciones Periódicas como AsuntoRESUMEN
BACKGROUND: Clinical prediction models (CPM), such as the SCOAP-CERTAIN tool, can be utilized to enhance decision-making for lumbar spinal fusion surgery by providing quantitative estimates of outcomes, aiding surgeons in assessing potential benefits and risks for each individual patient. External validation is crucial in CPM to assess generalizability beyond the initial dataset. This ensures performance in diverse populations, reliability and real-world applicability of the results. Therefore, we externally validated the tool for predictability of improvement in oswestry disability index (ODI), back and leg pain (BP, LP). METHODS: Prospective and retrospective data from multicenter registry was obtained. As outcome measure minimum clinically important change was chosen for ODI with ≥ 15-point and ≥ 2-point reduction for numeric rating scales (NRS) for BP and LP 12 months after lumbar fusion for degenerative disease. We externally validate this tool by calculating discrimination and calibration metrics such as intercept, slope, Brier Score, expected/observed ratio, Hosmer-Lemeshow (HL), AUC, sensitivity and specificity. RESULTS: We included 1115 patients, average age 60.8 ± 12.5 years. For 12-month ODI, area-under-the-curve (AUC) was 0.70, the calibration intercept and slope were 1.01 and 0.84, respectively. For NRS BP, AUC was 0.72, with calibration intercept of 0.97 and slope of 0.87. For NRS LP, AUC was 0.70, with calibration intercept of 0.04 and slope of 0.72. Sensitivity ranged from 0.63 to 0.96, while specificity ranged from 0.15 to 0.68. Lack of fit was found for all three models based on HL testing. CONCLUSIONS: Utilizing data from a multinational registry, we externally validate the SCOAP-CERTAIN prediction tool. The model demonstrated fair discrimination and calibration of predicted probabilities, necessitating caution in applying it in clinical practice. We suggest that future CPMs focus on predicting longer-term prognosis for this patient population, emphasizing the significance of robust calibration and thorough reporting.
Asunto(s)
Vértebras Lumbares , Fusión Vertebral , Humanos , Fusión Vertebral/métodos , Persona de Mediana Edad , Masculino , Femenino , Vértebras Lumbares/cirugía , Anciano , Estudios Retrospectivos , Resultado del Tratamiento , Evaluación de la Discapacidad , Degeneración del Disco Intervertebral/cirugía , Estudios Prospectivos , Reproducibilidad de los ResultadosRESUMEN
PURPOSE: Volumetric assessments, such as extent of resection (EOR) or residual tumor volume, are essential criterions in glioma resection surgery. Our goal is to develop and validate segmentation machine learning models for pre- and postoperative magnetic resonance imaging scans, allowing us to assess the percentagewise tumor reduction after intracranial surgery for gliomas. METHODS: For the development of the preoperative segmentation model (U-Net), MRI scans of 1053 patients from the Multimodal Brain Tumor Segmentation Challenge (BraTS) 2021 as well as from patients who underwent surgery at the University Hospital in Zurich were used. Subsequently, the model was evaluated on a holdout set containing 285 images from the same sources. The postoperative model was developed using 72 scans and validated on 45 scans obtained from the BraTS 2015 and Zurich dataset. Performance is evaluated using Dice Similarity score, Jaccard coefficient and Hausdorff 95%. RESULTS: We were able to achieve an overall mean Dice Similarity Score of 0.59 and 0.29 on the pre- and postoperative holdout sets, respectively. Our algorithm managed to determine correct EOR in 44.1%. CONCLUSION: Although our models are not suitable for clinical use at this point, the possible applications are vast, going from automated lesion detection to disease progression evaluation. Precise determination of EOR is a challenging task, but we managed to show that deep learning can provide fast and objective estimates.
Asunto(s)
Neoplasias Encefálicas , Aprendizaje Profundo , Glioma , Humanos , Glioma/diagnóstico por imagen , Glioma/cirugía , Glioma/patología , Neoplasias Encefálicas/diagnóstico por imagen , Neoplasias Encefálicas/cirugía , Neoplasias Encefálicas/patología , Algoritmos , Imagen por Resonancia Magnética/métodos , Procesamiento de Imagen Asistido por Computador/métodosRESUMEN
BACKGROUND: Indications and outcomes in lumbar spinal fusion for degenerative disease are notoriously heterogenous. Selected subsets of patients show remarkable benefit. However, their objective identification is often difficult. Decision-making may be improved with reliable prediction of long-term outcomes for each individual patient, improving patient selection and avoiding ineffective procedures. METHODS: Clinical prediction models for long-term functional impairment [Oswestry Disability Index (ODI) or Core Outcome Measures Index (COMI)], back pain, and leg pain after lumbar fusion for degenerative disease were developed. Achievement of the minimum clinically important difference at 12 months postoperatively was defined as a reduction from baseline of at least 15 points for ODI, 2.2 points for COMI, or 2 points for pain severity. RESULTS: Models were developed and integrated into a web-app ( https://neurosurgery.shinyapps.io/fuseml/ ) based on a multinational cohort [N = 817; 42.7% male; mean (SD) age: 61.19 (12.36) years]. At external validation [N = 298; 35.6% male; mean (SD) age: 59.73 (12.64) years], areas under the curves for functional impairment [0.67, 95% confidence interval (CI): 0.59-0.74], back pain (0.72, 95%CI: 0.64-0.79), and leg pain (0.64, 95%CI: 0.54-0.73) demonstrated moderate ability to identify patients who are likely to benefit from surgery. Models demonstrated fair calibration of the predicted probabilities. CONCLUSIONS: Outcomes after lumbar spinal fusion for degenerative disease remain difficult to predict. Although assistive clinical prediction models can help in quantifying potential benefits of surgery and the externally validated FUSE-ML tool may aid in individualized risk-benefit estimation, truly impacting clinical practice in the era of "personalized medicine" necessitates more robust tools in this patient population.
Asunto(s)
Fusión Vertebral , Dolor de Espalda/diagnóstico , Dolor de Espalda/etiología , Dolor de Espalda/cirugía , Femenino , Humanos , Vértebras Lumbares/cirugía , Masculino , Persona de Mediana Edad , Modelos Estadísticos , Pronóstico , Fusión Vertebral/métodos , Resultado del TratamientoRESUMEN
OBJECTIVE: What is considered "abnormal" in clinical testing is typically defined by simple thresholds derived from normative data. For instance, when testing using the five-repetition sit-to-stand (5R-STS) test, the upper limit of normal (ULN) from a population of spine-healthy volunteers (10.5 seconds) is used to identify objective functional impairment (OFI), but this fails to consider different properties of individuals (e.g., taller and shorter, older and younger). Therefore, the authors developed a personalized testing strategy to quantify patient-specific OFI using machine learning. METHODS: Patients with disc herniation, spinal stenosis, spondylolisthesis, or discogenic chronic low-back pain and a population of spine-healthy volunteers, from two prospective studies, were included. A machine learning model was trained on normative data to predict personalized "expected" test times and their confidence intervals and ULNs (99th percentiles) based on simple demographics. OFI was defined as a test time greater than the personalized ULN. OFI was categorized into types 1 to 3 based on a clustering algorithm. A web app was developed to deploy the model clinically. RESULTS: Overall, 288 patients and 129 spine-healthy individuals were included. The model predicted "expected" test times with a mean absolute error of 1.18 (95% CI 1.13-1.21) seconds and R2 of 0.37 (95% CI 0.34-0.41). Based on the implemented personalized testing strategy, 191 patients (66.3%) exhibited OFI. Type 1, 2, and 3 impairments were seen in 64 (33.5%), 91 (47.6%), and 36 (18.8%) patients, respectively. Increasing detected levels of OFI were associated with statistically significant increases in subjective functional impairment, extreme anxiety and depression symptoms, being bedridden, extreme pain or discomfort, inability to carry out activities of daily living, and a limited ability to work. CONCLUSIONS: In the era of "precision medicine," simple population-based thresholds may eventually not be adequate to monitor quality and safety in neurosurgery. Individualized assessment integrating machine learning techniques provides more detailed and objective clinical assessment. The personalized testing strategy demonstrated concurrent validity with quality-of-life measures, and the freely accessible web app (https://neurosurgery.shinyapps.io/5RSTS/) enabled clinical application.
Asunto(s)
Degeneración del Disco Intervertebral , Dolor de la Región Lumbar , Actividades Cotidianas , Humanos , Vértebras Lumbares , Aprendizaje Automático , Estudios ProspectivosRESUMEN
STUDY DESIGN: Heterogeneous data collection via a mix of prospective, retrospective, and ambispective methods. OBJECTIVE: To evaluate the effect of biological sex on patient-reported outcomes after spinal fusion surgery for lumbar degenerative disease. SUMMARY OF BACKGROUND DATA: Current literature suggests sex differences regarding clinical outcome after spine surgery may exist. Substantial methodological heterogeneity and limited comparability of studies warrants further investigation of sex-related differences in treatment outcomes. METHODS: We analyzed patients who underwent spinal fusion with or without pedicle screw insertion for lumbar degenerative disease included within a multinational study, comprising patients from 11 centers in 7 countries. Absolute values and change scores (change from pe-operative baseline to post-operative follow-up) for 12-month functional impairment (Oswestry disability index [ODI]) and back and leg pain severity (numeric rating scale [NRS]) were compared between male and female patients. Minimum clinically important difference (MCID) was defined as > 30% improvement. RESULTS: Six-hundred-sixty (59%) of 1115 included patients were female. Female patients presented with significantly baseline ODI (51.5 ± 17.2 vs. 47.8 ± 17.9, P<0.001) and back pain (6.96 ± 2.32 vs. 6.60 ± 2.30, P=0.010) and leg pain (6.49 ± 2.76 vs. 6.01 ± 2.76, P=0.005). At 12-months, female patients still reported significantly higher ODI (22.76 ± 16.97 vs. 20.50 ± 16.10, P=0.025), but not higher back (3.13 ± 2.38 vs. 3.00 ± 2.40, P=0.355) or leg pain (2.62 ± 2.55 vs. .34 ± 2.43, P=0.060). Change scores at 12 months did not differ significantly among male and female patients in ODI (∆ 1.31, 95% CI -3.88-1.25, P=0.315), back (∆ 0.22, 95% CI -0.57-0.12, P=0.197) and leg pain (∆ 0.16, 95% CI -0.56-0.24, P=0.439). MCID at 12-months was achieved in 330 (77.5%) male patients and 481 (76.3%) female patients (P=0.729) for ODI. CONCLUSION: Both sexes experienced a similar benefit from surgery in terms of relative improvement in scores for functional impairment and pain. Although female patients reported a higher degree of functional impairment and pain preoperatively, at 12 months only their average scores for functional impairment remained higher than those for their male counterparts, while absolute pain scores were similar for female and male patients.
RESUMEN
BACKGROUND AND OBJECTIVES: In medical imaging, a limited number of trained deep learning algorithms have been externally validated and released publicly. We hypothesized that a deep learning algorithm can be trained to identify and localize subarachnoid hemorrhage (SAH) on head computed tomography (CT) scans and that the trained model performs satisfactorily when tested using external and real-world data. METHODS: We used noncontrast head CT images of patients admitted to Helsinki University Hospital between 2012 and 2017. We manually segmented (i.e., delineated) SAH on 90 head CT scans and used the segmented CT scans together with 22 negative (no SAH) control CT scans in training an open-source convolutional neural network (U-Net) to identify and localize SAH. We then tested the performance of the trained algorithm by using external data sets (137 SAH and 1,242 control cases) collected in 2 foreign countries and also by creating a data set of consecutive emergency head CT scans (8 SAH and 511 control cases) performed during on-call hours in 5 different domestic hospitals in September 2021. We assessed the algorithm's capability to identify SAH by calculating patient- and slice-level performance metrics, such as sensitivity and specificity. RESULTS: In the external validation set of 1,379 cases, the algorithm identified 136 of 137 SAH cases correctly (sensitivity 99.3% and specificity 63.2%). Of the 49,064 axial head CT slices, the algorithm identified and localized SAH in 1845 of 2,110 slices with SAH (sensitivity 87.4% and specificity 95.3%). Of 519 consecutive emergency head CT scans imaged in September 2021, the algorithm identified all 8 SAH cases correctly (sensitivity 100.0% and specificity 75.3%). The slice-level (27,167 axial slices in total) sensitivity and specificity were 87.3% and 98.8%, respectively, as the algorithm identified and localized SAH in 58 of 77 slices with SAH. The performance of the algorithm can be tested on through a web service. DISCUSSION: We show that the shared algorithm identifies SAH cases with a high sensitivity and that the slice-level specificity is high. In addition to openly sharing a high-performing deep learning algorithm, our work presents infrequently used approaches in designing, training, testing, and reporting deep learning algorithms developed for medical imaging diagnostics. CLASSIFICATION OF EVIDENCE: This study provides Class III evidence that a deep learning algorithm correctly identifies the presence of subarachnoid hemorrhage on CT scan.
Asunto(s)
Aprendizaje Profundo , Hemorragia Subaracnoidea , Humanos , Hemorragia Subaracnoidea/diagnóstico por imagen , Tomografía Computarizada por Rayos X/métodos , Algoritmos , CabezaRESUMEN
Introduction As a possible treatment option for chronic lower back pain (CLBP) due to single-level degenerative disc disorder (DDD), the efficacy of anterior lumbar interbody fusion (ALIF) has been reviewed various times in the existing literature. Nevertheless, a scarcity of data exists pertaining to ALIF procedures carried out in a short-stay setting using an Enhanced Recovery after Surgery (ERAS) protocol, particularly concerning the safety. Methods Prospectively collected data are analyzed to study the efficacy and safety of short-stay ERAS ALIF in treatment of single-level DDD. Visual Analog Scale (VAS) in both back and leg pain along with the Oswestry Disability Index (ODI) were used to collect measure outcomes. The primary endpoint was a minimum clinically important difference (MCID) of ≥30% for the ODI at 12 months. Results Forty-four patients underwent surgery after failed long-term conservative treatment. MCID was achieved in 78%. Age was the only significant factor in association with MCID (p = 0.03), while gender, Modic changes, results of prognostic tests, prior surgery and smoking status had no significant influence on either MCID or change scores for any outcome measure. One complication in the form of transient new radiculopathy occurred in one patient (2.3%). Conclusion With overall positive outcomes in terms of both efficacy and safety, an ALIF procedure with subsequent implementation of an ERAS protocol in a short-stay setting can be an option for strictly selected patients with CLBP. Further study, however, possibly with a larger sample size, would be necessary to substantiate these findings.