Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 97
Filtrar
1.
Nat Commun ; 15(1): 3922, 2024 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-38724498

RESUMEN

Identification of differentially expressed proteins in a proteomics workflow typically encompasses five key steps: raw data quantification, expression matrix construction, matrix normalization, missing value imputation (MVI), and differential expression analysis. The plethora of options in each step makes it challenging to identify optimal workflows that maximize the identification of differentially expressed proteins. To identify optimal workflows and their common properties, we conduct an extensive study involving 34,576 combinatoric experiments on 24 gold standard spike-in datasets. Applying frequent pattern mining techniques to top-ranked workflows, we uncover high-performing rules that demonstrate optimality has conserved properties. Via machine learning, we confirm optimal workflows are indeed predictable, with average cross-validation F1 scores and Matthew's correlation coefficients surpassing 0.84. We introduce an ensemble inference to integrate results from individual top-performing workflows for expanding differential proteome coverage and resolve inconsistencies. Ensemble inference provides gains in pAUC (up to 4.61%) and G-mean (up to 11.14%) and facilitates effective aggregation of information across varied quantification approaches such as topN, directLFQ, MaxLFQ intensities, and spectral counts. However, further development and evaluation are needed to establish acceptable frameworks for conducting ensemble inference on multiple proteomics workflows.


Asunto(s)
Proteómica , Proteómica/métodos , Flujo de Trabajo , Aprendizaje Automático , Proteoma/metabolismo , Humanos , Algoritmos , Bases de Datos de Proteínas
5.
Postgrad Med J ; 100(1183): 344-349, 2024 Apr 22.
Artículo en Inglés | MEDLINE | ID: mdl-38272463

RESUMEN

Providing family updates is a common clinical task for medical trainees and practitioners working in hospital settings. Good clinical communication skills are essential in clinical care as it is associated with improved patient satisfaction, understanding of condition, treatment adherence, and better overall clinical outcomes. Moreover, poor communications are often the source of medical complaints. However, while patient-centred communication skills training has generally been incorporated into clinical education, there hitherto remains inadequate training on clinical communications with patients' families, which carry different nuances. In recent years, it is increasingly recognized that familial involvement in the care of hospitalized patients leads to better clinical and psychological outcomes. In fact, in Asian populations with more collectivistic cultures, families are generally highly involved in patient care and decision-making. Therefore, effective clinical communications and regular provision of family updates are essential to build therapeutic rapport, facilitate familial involvement in patient care, and also provide a more holistic understanding of the patient's background and psychosocial set-up. In this article, we herein describe a seven-step understand the clinical context, gather perspectives, deliver medical information, address questions, concerns and expectations, provide tentative plans, demonstrate empathy, postcommunication reflections model as a practical guide for medical trainees and practitioners in provision of structured and effective family updates in their clinical practice.


Asunto(s)
Comunicación , Relaciones Profesional-Familia , Humanos , Competencia Clínica , Empatía , Familia/psicología , Relaciones Médico-Paciente
6.
Postgrad Med J ; 100(1181): 196-202, 2024 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-38073326

RESUMEN

The term 'insight' is generically defined in English language as the ability to perceive deeper truths about people and situations. In clinical practice, patient insight is known to have important implications in treatment compliance and clinical outcomes, and can be assessed clinically by looking for the presence of illness awareness, correct attribution of symptoms to underlying condition, and acceptance of treatment. In this article, we suggest that cultivating insight is actually a highly important, yet often overlooked, component of medical training, which may explain why some consistently learn well, communicate effectively, and quickly attain clinical competency, while others struggle throughout their clinical training and may even be difficult to remediate. We herein define 'insight' in the context of medical training as having an astute perception of personal cognitive processes, motivations, emotions, and ability (strengths, weaknesses, and limitations) that should drive self-improvement and effective behavioural regulation. We then describe the utility of cultivating 'insight' in medical training through three lenses of (i) promoting self-regulated, lifelong clinical learning, (ii) improving clinical competencies and person-centred care, and (iii) enhancing physician mental health and well-being. In addition, we review educational pedagogies that are helpful to create a medical eco-system that promotes the cultivation of insight among its trainees and practitioners. Finally, we highlight several tell-tale signs of poor insight and discuss psychological and non-psychological interventions that may help those severely lacking in insight to become more amenable to change and remediation.


Asunto(s)
Educación Médica , Aprendizaje , Salud Mental , Humanos , Competencia Clínica , Atención Dirigida al Paciente
7.
Proteomics ; 24(1-2): e2200332, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-37876146

RESUMEN

This article summarizes the PROTREC method and investigates the impact that the different hyper-parameters have on the task of missing protein prediction using PROTREC. We evaluate missing protein recovery rates using different PROTREC score selection approaches (MAX, MIN, MEDIAN, and MEAN), different PROTREC score thresholds, as well as different complex size thresholds. In addition, we included two additional cancer datasets in our analysis and introduced a new validation method to check both the robustness of the PROTREC method as well as the correctness of our analysis. Our analysis showed that the missing protein recovery rate can be improved by adopting PROTREC score selection operations of MIN, MEDIAN, and MEAN instead of the default MAX. However, this may come at a cost of reduced numbers of proteins predicted and validated. The users should therefore choose their hyper-parameters carefully to find a balance in the accuracy-quantity trade-off. We also explored the possibility of combining PROTREC with a p-value-based method (FCS) and demonstrated that PROTREC is able to perform well independently without any help from a p-value-based method. Furthermore, we conducted a downstream enrichment analysis to understand the biological pathways and protein networks within the cancerous tissues using the recovered proteins. Missing protein recovery rate using PROTREC can be improved by selecting a different PROTREC score selection method. Different PROTREC score selection methods and other hyper-parameters such as PROTREC score threshold and complex size threshold introduce accuracy-quantity trade-off. PROTREC is able to perform well independently of any filtering using a p-value-based method. Verification of the PROTREC method on additional cancer datasets. Downstream Enrichment Analysis to understand the biological pathways and protein networks in cancerous tissues.


Asunto(s)
Algoritmos , Neoplasias , Humanos
8.
Sci Data ; 10(1): 858, 2023 12 02.
Artículo en Inglés | MEDLINE | ID: mdl-38042886

RESUMEN

Mass spectrometry-based proteomics plays a critical role in current biological and clinical research. Technical issues like data integration, missing value imputation, batch effect correction and the exploration of inter-connections amongst these technical issues, can produce errors but are not well studied. Although proteomic technologies have improved significantly in recent years, this alone cannot resolve these issues. What is needed are better algorithms and data processing knowledge. But to obtain these, we need appropriate proteomics datasets for exploration, investigation, and benchmarking. To meet this need, we developed MultiPro (Multi-purpose Proteome Resource), a resource comprising four comprehensive large-scale proteomics datasets with deliberate batch effects using the latest parallel accumulation-serial fragmentation in both Data-Dependent Acquisition (DDA) and Data Independent Acquisition (DIA) modes. Each dataset contains a balanced two-class design based on well-characterized and widely studied cell lines (A549 vs K562 or HCC1806 vs HS578T) with 48 or 36 biological and technical replicates altogether, allowing for investigation of a multitude of technical issues. These datasets allow for investigation of inter-connections between class and batch factors, or to develop approaches to compare and integrate data from DDA and DIA platforms.


Asunto(s)
Línea Celular , Proteoma , Proteómica , Algoritmos , Espectrometría de Masas , Proteoma/metabolismo , Humanos
9.
Front Public Health ; 11: 1301563, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-38089040

RESUMEN

Introduction: The potential for deployment of Artificial Intelligence (AI) technologies in various fields of medicine is vast, yet acceptance of AI amongst clinicians has been patchy. This research therefore examines the role of antecedents, namely trust, attitude, and beliefs in driving AI acceptance in clinical practice. Methods: We utilized online surveys to gather data from clinicians in the field of gastroenterology. Results: A total of 164 participants responded to the survey. Participants had a mean age of 44.49 (SD = 9.65). Most participants were male (n = 116, 70.30%) and specialized in gastroenterology (n = 153, 92.73%). Based on the results collected, we proposed and tested a model of AI acceptance in medical practice. Our findings showed that while the proposed drivers had a positive impact on AI tools' acceptance, not all effects were direct. Trust and belief were found to fully mediate the effects of attitude on AI acceptance by clinicians. Discussion: The role of trust and beliefs as primary mediators of the acceptance of AI in medical practice suggest that these should be areas of focus in AI education, engagement and training. This has implications for how AI systems can gain greater clinician acceptance to engender greater trust and adoption amongst public health systems and professional networks which in turn would impact how populations interface with AI. Implications for policy and practice, as well as future research in this nascent field, are discussed.


Asunto(s)
Inteligencia Artificial , Confianza , Adulto , Femenino , Humanos , Masculino , Escolaridad , Políticas , Tecnología , Gastroenterología , Endoscopía
10.
Comput Struct Biotechnol J ; 21: 4804-4815, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37841330

RESUMEN

The human microbiome is an emerging research frontier due to its profound impacts on health. High-throughput microbiome sequencing enables studying microbial communities but suffers from analytical challenges. In particular, the lack of dedicated preprocessing methods to improve data quality impedes effective minimization of biases prior to downstream analysis. This review aims to address this gap by providing a comprehensive overview of preprocessing techniques relevant to microbiome research. We outline a typical workflow for microbiome data analysis. Preprocessing methods discussed include quality filtering, batch effect correction, imputation of missing values, normalization, and data transformation. We highlight strengths and limitations of each technique to serve as a practical guide for researchers and identify areas needing further methodological development. Establishing robust, standardized preprocessing will be essential for drawing valid biological conclusions from microbiome studies.

11.
Asian J Psychiatr ; 89: 103796, 2023 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-37837946

RESUMEN

BACKGROUND: The peripheral blood is an attractive source of prognostic biomarkers for psychosis conversion. There is limited research on the transcriptomic changes associated with psychosis conversion in the peripheral whole blood. STUDY DESIGN: We performed RNA-sequencing of peripheral whole blood from 65 ultra-high-risk (UHR) participants and 70 healthy control participants recruited in the Longitudinal Youth-at-Risk Study (LYRIKS) cohort. 13 UHR participants converted in the study duration. Samples were collected at 3 timepoints, at 12-months interval across a 2-year period. We examined whether the genes differential with psychosis conversion contain schizophrenia risk loci. We then examined the functional ontologies and GWAS associations of the differential genes. We also identified the overlap between differentially expressed genes across different comparisons. STUDY RESULTS: Genes containing schizophrenia risk loci were not differentially expressed in the peripheral whole blood in psychosis conversion. The differentially expressed genes in psychosis conversion are enriched for ontologies associated with cellular replication. The differentially expressed genes in psychosis conversion are associated with non-neurological GWAS phenotypes reported to be perturbed in schizophrenia and psychosis but not schizophrenia and psychosis phenotypes themselves. We found minimal overlap between the genes differential with psychosis conversion and the genes that are differential between pre-conversion and non-conversion samples. CONCLUSION: The associations between psychosis conversion and peripheral blood-based biomarkers are likely to be indirect. Further studies to elucidate the mechanism behind potential indirect associations are needed.


Asunto(s)
Trastornos Psicóticos , Esquizofrenia , Adolescente , Humanos , Trastornos Psicóticos/genética , Esquizofrenia/genética , Estudios Longitudinales , Biomarcadores , ARN
12.
Brief Bioinform ; 24(6)2023 09 22.
Artículo en Inglés | MEDLINE | ID: mdl-37889118

RESUMEN

Selecting informative features, such as accurate biomarkers for disease diagnosis, prognosis and response to treatment, is an essential task in the field of bioinformatics. Medical data often contain thousands of features and identifying potential biomarkers is challenging due to small number of samples in the data, method dependence and non-reproducibility. This paper proposes a novel ensemble feature selection method, named Filter and Wrapper Stacking Ensemble (FWSE), to identify reproducible biomarkers from high-dimensional omics data. In FWSE, filter feature selection methods are run on numerous subsets of the data to eliminate irrelevant features, and then wrapper feature selection methods are applied to rank the top features. The method was validated on four high-dimensional medical datasets related to mental illnesses and cancer. The results indicate that the features selected by FWSE are stable and statistically more significant than the ones obtained by existing methods while also demonstrating biological relevance. Furthermore, FWSE is a generic method, applicable to various high-dimensional datasets in the fields of machine intelligence and bioinformatics.


Asunto(s)
Trastornos Mentales , Neoplasias , Humanos , Algoritmos , Inteligencia Artificial , Biomarcadores , Neoplasias/diagnóstico , Neoplasias/genética
13.
IJID Reg ; 8: 84-89, 2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37529630

RESUMEN

Objectives: In critically ill patients with COVID-19, distinct hyperinflammatory and hypoinflammatory phenotypes have been described, with different outcomes and responses to therapy. We investigated if similar phenotypes exist in non-severe illness. Methods: Consecutive patients with polymerase chain reaction (PCR) confirmed SARS-CoV-2 were examined. Baseline demographics and laboratory investigations were tabulated, including serum C-reactive protein. Patients were divided into those who were hyperinflammatory (defined as C-reactive protein >17 mg/l) or hypoinflammatory. Adverse outcomes, defined as requiring oxygenation, intensive care, or death, were recorded during the hospital stay. Clinical characteristics and outcomes were compared. Results: Of the 1781 patients examined, 276 (15.5%) had a hyperinflammatory phenotype. They were older (51.8 ± 17.2 vs 40.3 ± 13.8 years, P <0.001), had a lower PCR cycle threshold (PCR cycle threshold value 19.3 ± 6.3 vs 22.7 ± 15.4, P = 0.025) at presentation, and more medical comorbidities. The hyperinflammatory phenotype was independently associated with adverse clinical outcomes, even after adjusting for age, medical history and viral load on multivariable analyses (adjusted odds ratio 5.78, 95% confidence interval 2.86-11.63). Conclusion: Even in non-severe COVID-19, there are distinct hyper- and hypoinflammatory phenotypes, with the hyperinflammatory phenotype strongly associated with adverse clinical outcomes, that could be distinguished with a simple biomarker.

14.
Brief Bioinform ; 24(4)2023 07 20.
Artículo en Inglés | MEDLINE | ID: mdl-37419612

RESUMEN

Missing values (MVs) can adversely impact data analysis and machine-learning model development. We propose a novel mixed-model method for missing value imputation (MVI). This method, ProJect (short for Protein inJection), is a powerful and meaningful improvement over existing MVI methods such as Bayesian principal component analysis (PCA), probabilistic PCA, local least squares and quantile regression imputation of left-censored data. We rigorously tested ProJect on various high-throughput data types, including genomics and mass spectrometry (MS)-based proteomics. Specifically, we utilized renal cancer (RC) data acquired using DIA-SWATH, ovarian cancer (OC) data acquired using DIA-MS, bladder (BladderBatch) and glioblastoma (GBM) microarray gene expression dataset. Our results demonstrate that ProJect consistently performs better than other referenced MVI methods. It achieves the lowest normalized root mean square error (on average, scoring 45.92% less error in RC_C, 27.37% in RC_full, 29.22% in OC, 23.65% in BladderBatch and 20.20% in GBM relative to the closest competing method) and the Procrustes sum of squared error (Procrustes SS) (exhibits 79.71% less error in RC_C, 38.36% in RC full, 18.13% in OC, 74.74% in BladderBatch and 30.79% in GBM compared to the next best method). ProJect also leads with the highest correlation coefficient among all types of MV combinations (0.64% higher in RC_C, 0.24% in RC full, 0.55% in OC, 0.39% in BladderBatch and 0.27% in GBM versus the second-best performing method). ProJect's key strength is its ability to handle different types of MVs commonly found in real-world data. Unlike most MVI methods that are designed to handle only one type of MV, ProJect employs a decision-making algorithm that first determines if an MV is missing at random or missing not at random. It then employs targeted imputation strategies for each MV type, resulting in more accurate and reliable imputation outcomes. An R implementation of ProJect is available at https://github.com/miaomiao6606/ProJect.


Asunto(s)
Algoritmos , Genómica , Teorema de Bayes , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Espectrometría de Masas/métodos
15.
Drug Discov Today ; 28(9): 103661, 2023 09.
Artículo en Inglés | MEDLINE | ID: mdl-37301250

RESUMEN

In data-processing pipelines, upstream steps can influence downstream processes because of their sequential nature. Among these data-processing steps, batch effect (BE) correction (BEC) and missing value imputation (MVI) are crucial for ensuring data suitability for advanced modeling and reducing the likelihood of false discoveries. Although BEC-MVI interactions are not well studied, they are ultimately interdependent. Batch sensitization can improve the quality of MVI. Conversely, accounting for missingness also improves proper BE estimation in BEC. Here, we discuss how BEC and MVI are interconnected and interdependent. We show how batch sensitization can improve any MVI and bring attention to the idea of BE-associated missing values (BEAMs). Finally, we discuss how batch-class imbalance problems can be mitigated by borrowing ideas from machine learning.


Asunto(s)
Procesamiento Automatizado de Datos
16.
Comput Biol Chem ; 104: 107845, 2023 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-36889140

RESUMEN

The robustness of a breast cancer gene signature, the super-proliferation set (SPS), is initially tested and investigated on breast cancer cell lines from the Cancer Cell Line Encyclopaedia (CCLE). Previously, SPS was derived via a meta-analysis of 47 independent breast cancer gene signatures, benchmarked on survival information from clinical data in the NKI dataset. Here, relying on the stability of cell line data and associative prior knowledge, we first demonstrate through Principal Component Analysis (PCA) that SPS prioritizes survival information over secondary subtype information, surpassing both PAM50 and Boruta, an artificial intelligence-based feature-selection algorithm, in this regard. We can also extract higher resolution 'progression' information using SPS, dividing survival outcomes into several clinically relevant stages ('good', 'intermediate', and 'bad) based on different quadrants of the PCA scatterplot. Furthermore, by transferring these 'progression' annotations onto independent clinical datasets, we demonstrate the generalisability of our method on actual patient data. Finally, via the characteristic genetic profiles of each quadrant/stage, we identified efficacious drugs using their gene reversal scores that can shift signatures across quadrants/stages, in a process known as gene signature reversal. This confirms the power of meta-analytical approaches for gene signature inference in breast cancer, as well as the clinical benefit in translating these inferences onto real-world patient data for more targeted therapies.


Asunto(s)
Inteligencia Artificial , Neoplasias de la Mama , Humanos , Femenino , Perfilación de la Expresión Génica/métodos , Algoritmos , Neoplasias de la Mama/genética
17.
Brief Bioinform ; 24(2)2023 03 19.
Artículo en Inglés | MEDLINE | ID: mdl-36907650

RESUMEN

Proteomic studies characterize the protein composition of complex biological samples. Despite recent advancements in mass spectrometry instrumentation and computational tools, low proteome coverage and interpretability remains a challenge. To address this, we developed Proteome Support Vector Enrichment (PROSE), a fast, scalable and lightweight pipeline for scoring proteins based on orthogonal gene co-expression network matrices. PROSE utilizes simple protein lists as input, generating a standard enrichment score for all proteins, including undetected ones. In our benchmark with 7 other candidate prioritization techniques, PROSE shows high accuracy in missing protein prediction, with scores correlating strongly to corresponding gene expression data. As a further proof-of-concept, we applied PROSE to a reanalysis of the Cancer Cell Line Encyclopedia proteomics dataset, where it captures key phenotypic features, including gene dependency. We lastly demonstrated its applicability on a breast cancer clinical dataset, showing clustering by annotated molecular subtype and identification of putative drivers of triple-negative breast cancer. PROSE is available as a user-friendly Python module from https://github.com/bwbio/PROSE.


Asunto(s)
Proteoma , Proteómica , Proteómica/métodos , Proteoma/análisis
18.
J Bioinform Comput Biol ; 21(1): 2350005, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36891972

RESUMEN

Some prediction methods use probability to rank their predictions, while some other prediction methods do not rank their predictions and instead use [Formula: see text]-values to support their predictions. This disparity renders direct cross-comparison of these two kinds of methods difficult. In particular, approaches such as the Bayes Factor upper Bound (BFB) for [Formula: see text]-value conversion may not make correct assumptions for this kind of cross-comparisons. Here, using a well-established case study on renal cancer proteomics and in the context of missing protein prediction, we demonstrate how to compare these two kinds of prediction methods using two different strategies. The first strategy is based on false discovery rate (FDR) estimation, which does not make the same naïve assumptions as BFB conversions. The second strategy is a powerful approach which we colloquially call "home ground testing". Both strategies perform better than BFB conversions. Thus, we recommend comparing prediction methods by standardization to a common performance benchmark such as a global FDR. And where this is not possible, we recommend reciprocal "home ground testing".


Asunto(s)
Proteínas , Proteómica , Teorema de Bayes , Probabilidad
19.
PLoS Comput Biol ; 19(3): e1010961, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36930671

RESUMEN

In mass spectrometry (MS)-based proteomics, protein inference from identified peptides (protein fragments) is a critical step. We present ProInfer (Protein Inference), a novel protein assembly method that takes advantage of information in biological networks. ProInfer assists recovery of proteins supported only by ambiguous peptides (a peptide which maps to more than one candidate protein) and enhances the statistical confidence for proteins supported by both unique and ambiguous peptides. Consequently, ProInfer rescues weakly supported proteins thereby improving proteome coverage. Evaluated across THP1 cell line, lung cancer and RAW267.4 datasets, ProInfer always infers the most numbers of true positives, in comparison to mainstream protein inference tools Fido, EPIFANY and PIA. ProInfer is also adept at retrieving differentially expressed proteins, signifying its usefulness for functional analysis and phenotype profiling. Source codes of ProInfer are available at https://github.com/PennHui2016/ProInfer.


Asunto(s)
Algoritmos , Péptidos , Péptidos/química , Proteoma/análisis , Espectrometría de Masas , Proteómica/métodos , Bases de Datos de Proteínas , Programas Informáticos
20.
Sci Rep ; 13(1): 3003, 2023 02 21.
Artículo en Inglés | MEDLINE | ID: mdl-36810890

RESUMEN

Data analysis is complex due to a myriad of technical problems. Amongst these, missing values and batch effects are endemic. Although many methods have been developed for missing value imputation (MVI) and batch correction respectively, no study has directly considered the confounding impact of MVI on downstream batch correction. This is surprising as missing values are imputed during early pre-processing while batch effects are mitigated during late pre-processing, prior to functional analysis. Unless actively managed, MVI approaches generally ignore the batch covariate, with unknown consequences. We examine this problem by modelling three simple imputation strategies: global (M1), self-batch (M2) and cross-batch (M3) first via simulations, and then corroborated on real proteomics and genomics data. We report that explicit consideration of batch covariates (M2) is important for good outcomes, resulting in enhanced batch correction and lower statistical errors. However, M1 and M3 are error-generating: global and cross-batch averaging may result in batch-effect dilution, with concomitant and irreversible increase in intra-sample noise. This noise is unremovable via batch correction algorithms and produces false positives and negatives. Hence, careless imputation in the presence of non-negligible covariates such as batch effects should be avoided.


Asunto(s)
Algoritmos , Genómica , Proteómica , Interpretación Estadística de Datos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...