Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 19.124
Filtrar
1.
J Biomed Opt ; 30(Suppl 1): S13703, 2025 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-39034959

RESUMO

Significance: Standardization of fluorescence molecular imaging (FMI) is critical for ensuring quality control in guiding surgical procedures. To accurately evaluate system performance, two metrics, the signal-to-noise ratio (SNR) and contrast, are widely employed. However, there is currently no consensus on how these metrics can be computed. Aim: We aim to examine the impact of SNR and contrast definitions on the performance assessment of FMI systems. Approach: We quantified the SNR and contrast of six near-infrared FMI systems by imaging a multi-parametric phantom. Based on approaches commonly used in the literature, we quantified seven SNRs and four contrast values considering different background regions and/or formulas. Then, we calculated benchmarking (BM) scores and respective rank values for each system. Results: We show that the performance assessment of an FMI system changes depending on the background locations and the applied quantification method. For a single system, the different metrics can vary up to ∼ 35 dB (SNR), ∼ 8.65 a . u . (contrast), and ∼ 0.67 a . u . (BM score). Conclusions: The definition of precise guidelines for FMI performance assessment is imperative to ensure successful clinical translation of the technology. Such guidelines can also enable quality control for the already clinically approved indocyanine green-based fluorescence image-guided surgery.


Assuntos
Benchmarking , Imagem Molecular , Imagem Óptica , Imagens de Fantasmas , Razão Sinal-Ruído , Imagem Molecular/métodos , Imagem Molecular/normas , Imagem Óptica/métodos , Imagem Óptica/normas , Processamento de Imagem Assistida por Computador/métodos
2.
Bioinformatics ; 40(8)2024 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-39115813

RESUMO

MOTIVATION: Despite an increase in protein modelling accuracy following the development of AlphaFold2, there remains an accuracy gap between predicted and observed model quality assessment (MQA) scores. In CASP15, variations in AlphaFold2 model accuracy prediction were noticed for quaternary models of very similar observed quality. In this study, we compare plDDT and pTM to their observed counterparts the local distance difference test (lDDT) and TM-score for both tertiary and quaternary models to examine whether reliability is retained across the scoring range under normal modelling conditions and in situations where AlphaFold2 functionality is customized. We also explore plDDT and pTM ranking accuracy in comparison with the published independent MQA programmes ModFOLD9 and ModFOLDdock. RESULTS: plDDT was found to be an accurate descriptor of tertiary model quality compared to observed lDDT-Cα scores (Pearson r = 0.97), and achieved a ranking agreement true positive rate (TPR) of 0.34 with observed scores, which ModFOLD9 could not improve. However, quaternary structure accuracy was reduced (plDDT r = 0.67, pTM r = 0.70) and significant overprediction was seen with both scores for some lower quality models. Additionally, ModFOLDdock was able to improve upon AF2-Multimer model ranking compared to TM-score (TPR 0.34) and oligo-lDDT score (TPR 0.43). Finally, evidence is presented for increased variability in plDDT and pTM when using custom template recycling, which is more pronounced for quaternary structures. AVAILABILITY AND IMPLEMENTATION: The ModFOLD9 and ModFOLDdock quality assessment servers are available at https://www.reading.ac.uk/bioinf/ModFOLD/ and https://www.reading.ac.uk/bioinf/ModFOLDdock/, respectively. A docker image is available at https://hub.docker.com/r/mcguffin/multifold.


Assuntos
Benchmarking , Modelos Moleculares , Proteínas , Benchmarking/métodos , Proteínas/química , Software , Biologia Computacional/métodos , Conformação Proteica , Dobramento de Proteína
4.
Genome Biol ; 25(1): 225, 2024 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-39152456

RESUMO

BACKGROUND: Single-cell chromatin accessibility assays, such as scATAC-seq, are increasingly employed in individual and joint multi-omic profiling of single cells. As the accumulation of scATAC-seq and multi-omics datasets continue, challenges in analyzing such sparse, noisy, and high-dimensional data become pressing. Specifically, one challenge relates to optimizing the processing of chromatin-level measurements and efficiently extracting information to discern cellular heterogeneity. This is of critical importance, since the identification of cell types is a fundamental step in current single-cell data analysis practices. RESULTS: We benchmark 8 feature engineering pipelines derived from 5 recent methods to assess their ability to discover and discriminate cell types. By using 10 metrics calculated at the cell embedding, shared nearest neighbor graph, or partition levels, we evaluate the performance of each method at different data processing stages. This comprehensive approach allows us to thoroughly understand the strengths and weaknesses of each method and the influence of parameter selection. CONCLUSIONS: Our analysis provides guidelines for choosing analysis methods for different datasets. Overall, feature aggregation, SnapATAC, and SnapATAC2 outperform latent semantic indexing-based methods. For datasets with complex cell-type structures, SnapATAC and SnapATAC2 are preferred. With large datasets, SnapATAC2 and ArchR are most scalable.


Assuntos
Benchmarking , Cromatina , Análise de Célula Única , Análise de Célula Única/métodos , Cromatina/genética , Cromatina/metabolismo , Humanos , Biologia Computacional/métodos
5.
Sci Rep ; 14(1): 18334, 2024 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-39112664

RESUMO

The widespread adoption of robotic technologies in healthcare has opened up new perspectives for enhancing accuracy, effectiveness and quality of medical procedures and patients' care. Special attention has been given to the reliability of robots when operating in environments shared with humans and to the users' safety, especially in case of mobile platforms able to navigate autonomously. From the analysis of the literature, it emerges that navigation tests carried out in a hospital environment are preliminary and not standardized. This paper aims to overcome the limitations in the assessment of autonomous mobile robots navigating in hospital environments by proposing: (i) a structured benchmarking protocol composed of a set of standardized tests, taking into account conditions with increasing complexity, (ii) a set of quantitative performance metrics. The proposed approach has been used in a realistic setting to assess the performance of two robotic platforms, namely HOSBOT and TIAGo, with different technical features and developed for different applications in a clinical scenario.


Assuntos
Benchmarking , Hospitais , Robótica , Benchmarking/métodos , Robótica/métodos , Humanos
6.
J Environ Manage ; 367: 122053, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39111004

RESUMO

We introduce an analytical methodological framework that links knowledge generation efficiency with economic efficiency and the corresponding environmental impact for 199 European Regions during 2000-2018, using a benchmarking approach and especially a chain network DEA technique. A clear trade-off between knowledge generation efficiency and productive performance emerges. European regions which exhibit high innovation efficiency enjoy higher overall performance compared to their counterparts. In a second stage, we investigate the convergence patterns of the examined regions with respect to all the three facets of the estimated efficiency where the coexistence of multi-type convergence clubs is revealed.


Assuntos
Benchmarking , Europa (Continente) , Meio Ambiente
7.
Sci Data ; 11(1): 864, 2024 Aug 10.
Artigo em Inglês | MEDLINE | ID: mdl-39127718

RESUMO

Taxonomic classification is crucial in identifying organisms within diverse microbial communities when using metagenomics shotgun sequencing. While second-generation Illumina sequencing still dominates, third-generation nanopore sequencing promises improved classification through longer reads. However, extensive benchmarking studies on nanopore data are lacking. We systematically evaluated performance of bacterial taxonomic classification for metagenomics nanopore sequencing data for several commonly used classifiers, using standardized reference sequence databases, on the largest collection of publicly available data for defined mock communities thus far (nine samples), representing different research domains and application scopes. Our results categorize classifiers into three categories: low precision/high recall; medium precision/medium recall, and high precision/medium recall. Most fall into the first group, although precision can be improved without excessively penalizing recall with suitable abundance filtering. No definitive 'best' classifier emerges, and classifier selection depends on application scope and practical requirements. Although few classifiers designed for long reads exist, they generally exhibit better performance. Our comprehensive benchmarking provides concrete recommendations, supported by publicly available code for reassessment and fine-tuning by other scientists.


Assuntos
Bactérias , Benchmarking , Metagenômica , Metagenômica/métodos , Bactérias/genética , Bactérias/classificação , Sequenciamento por Nanoporos , Nanoporos , Microbiota
8.
J Chem Inf Model ; 64(15): 6162-6173, 2024 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-39087481

RESUMO

Proteolysis targeting chimeras (PROTACs) are bifunctional compounds that recruit an E3 ligase to a target protein to induce ubiquitination and degradation of the target. Rational optimization of PROTAC requires a structural model of the ternary complex. In the absence of an experimental structure, computational tools have emerged that attempt to predict PROTAC ternary complexes. Here, we systematically benchmark three commonly used tools: PRosettaC, MOE, and ICM. We find that these PROTAC-focused methods produce an array of ternary complex structures, including some that are observed experimentally, but also many that significantly deviate from the crystal structure. Molecular dynamics simulations show that PROTAC complexes may exist in a multiplicity of configurational states and question the use of experimentally observed structures as a reference for accurate predictions. The pioneering computational tools benchmarked here highlight the promises and challenges in the field and may be more valuable when guided by clear structural and biophysical data. The benchmarking data set that we provide may also be valuable for evaluating other and future computational tools for ternary complex modeling.


Assuntos
Benchmarking , Simulação de Dinâmica Molecular , Proteólise , Conformação Proteica , Proteínas/química , Ubiquitina-Proteína Ligases/química , Ubiquitina-Proteína Ligases/metabolismo
9.
Genome Biol ; 25(1): 212, 2024 Aug 09.
Artigo em Inglês | MEDLINE | ID: mdl-39123269

RESUMO

BACKGROUND: Spatial transcriptomics (ST) is advancing our understanding of complex tissues and organisms. However, building a robust clustering algorithm to define spatially coherent regions in a single tissue slice and aligning or integrating multiple tissue slices originating from diverse sources for essential downstream analyses remains challenging. Numerous clustering, alignment, and integration methods have been specifically designed for ST data by leveraging its spatial information. The absence of comprehensive benchmark studies complicates the selection of methods and future method development. RESULTS: In this study, we systematically benchmark a variety of state-of-the-art algorithms with a wide range of real and simulated datasets of varying sizes, technologies, species, and complexity. We analyze the strengths and weaknesses of each method using diverse quantitative and qualitative metrics and analyses, including eight metrics for spatial clustering accuracy and contiguity, uniform manifold approximation and projection visualization, layer-wise and spot-to-spot alignment accuracy, and 3D reconstruction, which are designed to assess method performance as well as data quality. The code used for evaluation is available on our GitHub. Additionally, we provide online notebook tutorials and documentation to facilitate the reproduction of all benchmarking results and to support the study of new methods and new datasets. CONCLUSIONS: Our analyses lead to comprehensive recommendations that cover multiple aspects, helping users to select optimal tools for their specific needs and guide future method development.


Assuntos
Algoritmos , Benchmarking , Análise por Conglomerados , Animais , Perfilação da Expressão Gênica/métodos , Transcriptoma , Humanos , Software , Alinhamento de Sequência/métodos
10.
Clin Transl Sci ; 17(8): e13911, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-39123290

RESUMO

Pharmacogenomics (PGx) investigates the influence of genetics on drug responses, enabling tailored treatments for personalized healthcare. This study assessed the accuracy of genotyping six genes using whole genome sequencing with four different computational tools and various sequencing depths. The effects of using different reference genomes (GRCh38 and GRCh37) and sequence aligners (BWA-MEM and Bowtie2) were also explored. The results showed generally minor variations in tool performance across most genes; however, more notable discrepancies were observed in the analysis of the complex CYP2D6 gene. Cyrius, a CYP2D6-specific tool, demonstrated the most robust performance, achieving the highest concordance rates for CYP2D6 in all instances, comparable to the consensus approach in most cases. There were rather small differences between the samples with 20× coverage depth and those with higher depth, but the decreased performance was more evident at lower depths, particularly at 5×. Additionally, variations in CYP2D6 results were observed when samples were aligned to different reference genomes using the same method, or to the same genome using different aligners, which led to reporting incorrect rare star alleles in several cases. These findings inform the selection of optimal PGx tools and methodologies as well as suggest that employing a consensus approach with two or more tools might be preferable for certain genes and tool combinations, especially at lower sequencing depths, to ensure accurate results. Additionally, we show how the upstream alignment can affect the performance of tools, an important factor to take into account.


Assuntos
Benchmarking , Citocromo P-450 CYP2D6 , Farmacogenética , Humanos , Citocromo P-450 CYP2D6/genética , Farmacogenética/normas , Farmacogenética/métodos , Citocromo P-450 CYP2C19/genética , Sequenciamento Completo do Genoma/normas , Sequenciamento Completo do Genoma/métodos , Técnicas de Genotipagem/métodos , Genótipo , Citocromo P-450 CYP2C9/genética , Citocromo P-450 CYP2A6/genética , Testes Farmacogenômicos/normas , Testes Farmacogenômicos/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Vitamina K Epóxido Redutases
12.
Radiology ; 312(2): e233337, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-39136561

RESUMO

Background Prostate MRI for the detection of clinically significant prostate cancer (csPCa) is standardized by the Prostate Imaging Reporting and Data System (PI-RADS), currently in version 2.1. A systematic review and meta-analysis infrastructure with a 12-month update cycle was established to evaluate the diagnostic performance of PI-RADS over time. Purpose To provide estimates of diagnostic accuracy and cancer detection rates (CDRs) of PI-RADS version 2.1 categories for prostate MRI, which is required for further evidence-based patient management. Materials and Methods A systematic search of PubMed, Embase, Cochrane Library, and multiple trial registers (English-language studies published from March 1, 2019, to August 30, 2022) was performed. Studies that reported data on diagnostic accuracy or CDRs of PI-RADS version 2.1 with csPCa as the primary outcome were included. For the meta-analysis, pooled estimates for sensitivity, specificity, and CDRs were derived from extracted data at the lesion level and patient level. Sensitivity and specificity for PI-RADS greater than or equal to 3 and PI-RADS greater than or equal to 4 considered as test positive were investigated. In addition to individual PI-RADS categories 1-5, subgroup analyses of subcategories (ie, 2+1, 3+0) were performed. Results A total of 70 studies (11 686 lesions, 13 330 patients) were included. At the patient level, with PI-RADS greater than or equal to 3 considered positive, meta-analysis found a 96% summary sensitivity (95% CI: 95, 98) and 43% specificity (95% CI: 33, 54), with an area under the summary receiver operating characteristic (SROC) curve of 0.86 (95% CI: 0.75, 0.93). For PI-RADS greater than or equal to 4, meta-analysis found an 89% sensitivity (95% CI: 85, 92) and 66% specificity (95% CI: 58, 74), with an area under the SROC curve of 0.89 (95% CI: 0.85, 0.92). CDRs were as follows: PI-RADS 1, 6%; PI-RADS 2, 5%; PI-RADS 3, 19%; PI-RADS 4, 54%; and PI-RADS 5, 84%. The CDR was 12% (95% CI: 7, 19) for transition zone 2+1 lesions and 19% (95% CI: 12, 29) for 3+0 lesions (P = .12). Conclusion Estimates of diagnostic accuracy and CDRs for PI-RADS version 2.1 categories are provided for quality benchmarking and to guide further evidence-based patient management. © RSNA, 2024 Supplemental material is available for this article. See also the editorial by Tammisetti and Jacobs in this issue.


Assuntos
Benchmarking , Imageamento por Ressonância Magnética , Neoplasias da Próstata , Humanos , Neoplasias da Próstata/diagnóstico por imagem , Masculino , Imageamento por Ressonância Magnética/métodos , Sensibilidade e Especificidade , Próstata/diagnóstico por imagem , Próstata/patologia
13.
Nat Commun ; 15(1): 6516, 2024 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-39095341

RESUMO

High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands of perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects severely limit community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we benchmark ten high-performing single-cell RNA sequencing (scRNA-seq) batch correction techniques, representing diverse approaches, using a newly released Cell Painting dataset, JUMP. We focus on five scenarios with varying complexity, ranging from batches prepared in a single lab over time to batches imaged using different microscopes in multiple labs. We find that Harmony and Seurat RPCA are noteworthy, consistently ranking among the top three methods for all tested scenarios while maintaining computational efficiency. Our proposed framework, benchmark, and metrics can be used to assess new batch correction methods in the future. This work paves the way for improvements that enable the community to make the best use of public Cell Painting data for scientific discovery.


Assuntos
Análise de Célula Única , Análise de Célula Única/métodos , Humanos , Processamento de Imagem Assistida por Computador/métodos , Análise de Sequência de RNA/métodos , Benchmarking
14.
Health Expect ; 27(4): e14169, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-39105687

RESUMO

INTRODUCTION: Outcome measurement instruments (OMIs) are used to gauge the effects of treatment. In post-stroke aphasia rehabilitation, benchmarks for meaningful change are needed to support the interpretation of patient outcomes. This study is part of a research programme to establish minimal important change (MIC) values (the smallest change above which patients perceive themselves as importantly changed) for core OMIs. As a first step in this process, the views of people with aphasia and clinicians were explored, and consensus was sought on a threshold for clinically meaningful change. METHODS: Sequential mixed-methods design was employed. Participants included people with post-stroke aphasia and speech pathologists. People with aphasia were purposively sampled based on time post-stroke, age and gender, whereas speech pathologists were sampled according to their work setting (hospital or community). Each participant attended a focus group followed by a consensus workshop with a survey component. Within the focus groups, experiences and methods for measuring meaningful change during aphasia recovery were explored. Qualitative data were transcribed and analysed using reflexive thematic analysis. In the consensus workshop, participants voted on thresholds for meaningful change in core outcome constructs of language, communication, emotional well-being and quality of life, using a six-point rating scale (much worse, slightly worse, no change, slightly improved, much improved and completely recovered). Consensus was defined a priori as 70% agreement. Voting results were reported using descriptive statistics. RESULTS: Five people with aphasia (n = 4, > 6 months after stroke; n = 5, < 65 years; n = 3, males) and eight speech pathologists (n = 4, hospital setting; n = 4, community setting) participated in one of four focus groups (duration: 92-112 min). Four themes were identified describing meaningful change as follows: (1) different for every single person; (2) small continuous improvements; (3) measured by progress towards personally relevant goals; and (4) influenced by personal factors. 'Slightly improved' was agreed as the threshold of MIC on the anchor-rating scale (75%-92%) within 6 months of stroke, whereas after 6 months there was a trend towards supporting 'much improved' (36%-66%). CONCLUSION: Our mixed-methods research with people with aphasia and speech pathologists provides novel evidence to inform the definition of MIC in aphasia rehabilitation. Future research will aim to establish MIC values for core OMIs. PATIENT OR PUBLIC CONTRIBUTION: This work is the result of engagement between people with lived experience of post-stroke aphasia, including people with aphasia, family members, clinicians and researchers. Engagement across the research cycle was sought to ensure that the research tasks were acceptable and easily understood by participants and that the outcomes of the study were relevant to the aphasia community. This engagement included the co-development of a plain English summary of the results. Advisors were remunerated in accordance with Health Consumers Queensland guidelines. Interview guides for clinicians were piloted by speech pathologists working in aphasia rehabilitation.


Assuntos
Afasia , Benchmarking , Grupos Focais , Reabilitação do Acidente Vascular Cerebral , Humanos , Afasia/reabilitação , Afasia/psicologia , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Qualidade de Vida , Avaliação de Resultados em Cuidados de Saúde , Adulto , Acidente Vascular Cerebral/complicações , Acidente Vascular Cerebral/psicologia , Pesquisa Qualitativa , Inquéritos e Questionários
15.
BMC Bioinformatics ; 25(1): 269, 2024 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-39164632

RESUMO

BACKGROUND: Fluorescence microscopy (FM) is an important and widely adopted biological imaging technique. Segmentation is often the first step in quantitative analysis of FM images. Deep neural networks (DNNs) have become the state-of-the-art tools for image segmentation. However, their performance on natural images may collapse under certain image corruptions or adversarial attacks. This poses real risks to their deployment in real-world applications. Although the robustness of DNN models in segmenting natural images has been studied extensively, their robustness in segmenting FM images remains poorly understood RESULTS: To address this deficiency, we have developed an assay that benchmarks robustness of DNN segmentation models using datasets of realistic synthetic 2D FM images with precisely controlled corruptions or adversarial attacks. Using this assay, we have benchmarked robustness of ten representative models such as DeepLab and Vision Transformer. We find that models with good robustness on natural images may perform poorly on FM images. We also find new robustness properties of DNN models and new connections between their corruption robustness and adversarial robustness. To further assess the robustness of the selected models, we have also benchmarked them on real microscopy images of different modalities without using simulated degradation. The results are consistent with those obtained on the realistic synthetic images, confirming the fidelity and reliability of our image synthesis method as well as the effectiveness of our assay. CONCLUSIONS: Based on comprehensive benchmarking experiments, we have found distinct robustness properties of deep neural networks in semantic segmentation of FM images. Based on the findings, we have made specific recommendations on selection and design of robust models for FM image segmentation.


Assuntos
Benchmarking , Processamento de Imagem Assistida por Computador , Microscopia de Fluorescência , Redes Neurais de Computação , Microscopia de Fluorescência/métodos , Benchmarking/métodos , Processamento de Imagem Assistida por Computador/métodos , Semântica , Aprendizado Profundo , Algoritmos , Humanos
16.
BMJ Health Care Inform ; 31(1)2024 Aug 09.
Artigo em Inglês | MEDLINE | ID: mdl-39122448

RESUMO

OBJECTIVE: Collaborate, Analyse, Research and Audit (CARA) project set out to provide an infrastructure to enable Irish general practitioners (GPs) to use their routinely collected patient management software (PMS) data to better understand their patient population, disease management and prescribing through data dashboards. This paper explains the design and development of the CARA infrastructure. METHODS: The first exemplar dashboard was developed with GPs and focused on antibiotic prescribing to develop and showcase the proposed infrastructure. The data integration process involved extracting, loading and transforming de-identified patient data into data models which connect to the interactive dashboards for GPs to visualise, compare and audit their data. RESULTS: The architecture of the CARA infrastructure includes two main sections: extract, load and transform process (ELT, de-identified patient data into data models) and a Representational State Transfer Application Programming Interface (REST API) (which provides the security barrier between the data models and their visualisation on the CARA dashboard). CARAconnect was created to facilitate the extraction and de-identification of patient data from the practice database. DISCUSSION: The CARA infrastructure allows seamless connectivity with and compatibility with the main PMS in Irish general practice and provides a reproducible template to access and visualise patient data. CARA includes two dashboards, a practice overview and a topic-specific dashboard (example focused on antibiotic prescribing), which includes an audit tool, filters (within practice) and between-practice comparisons. CONCLUSION: CARA supports evidence-based decision-making by providing GPs with valuable insights through interactive data dashboards to optimise patient care, identify potential areas for improvement and benchmark their performance against other practices.Supplementary file 1. Graphical abstract.


Assuntos
Benchmarking , Medicina Geral , Humanos , Medicina Geral/organização & administração , Irlanda , Registros Eletrônicos de Saúde , Software , Interface Usuário-Computador
17.
Am J Hum Genet ; 111(8): 1717-1735, 2024 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-39059387

RESUMO

Mendelian randomization (MR), which utilizes genetic variants as instrumental variables (IVs), has gained popularity as a method for causal inference between phenotypes using genetic data. While efforts have been made to relax IV assumptions and develop new methods for causal inference in the presence of invalid IVs due to confounding, the reliability of MR methods in real-world applications remains uncertain. Instead of using simulated datasets, we conducted a benchmark study evaluating 16 two-sample summary-level MR methods using real-world genetic datasets to provide guidelines for the best practices. Our study focused on the following crucial aspects: type I error control in the presence of various confounding scenarios (e.g., population stratification, pleiotropy, and family-level confounders like assortative mating), the accuracy of causal effect estimates, replicability, and power. By comprehensively evaluating the performance of compared methods over one thousand exposure-outcome trait pairs, our study not only provides valuable insights into the performance and limitations of the compared methods but also offers practical guidance for researchers to choose appropriate MR methods for causal inference.


Assuntos
Benchmarking , Estudo de Associação Genômica Ampla , Análise da Randomização Mendeliana , Análise da Randomização Mendeliana/métodos , Humanos , Estudo de Associação Genômica Ampla/métodos , Fenótipo , Variação Genética , Causalidade , Polimorfismo de Nucleotídeo Único , Modelos Genéticos
18.
Anal Methods ; 16(31): 5419-5425, 2024 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-39037041

RESUMO

Fourier-transform infrared (FTIR) spectroscopy is a simple, fast and inexpensive method with a history of use for bacterial analysis. However, due to the limitations placed on spatial resolution inherent to infrared wavelengths, analysis has generally been performed on bulk samples, leading to biological variance among individual cells to be buried in averaged spectra. This also increases the bacterial load necessary for analysis, which can be problematic in clinical settings where limiting incubation time is valuable. Optical photothermal-induced resonance (O-PTIR) spectroscopy is a novel method aiming to bypass this limitation using a secondary lower wavelength laser, allowing for infrared measurements of a single bacterium. Here, using Staphylococcus capitis, Staphylococcus epidermidis and Micrococcus luteus strains as a model and FTIR as a benchmark, we examined O-PTIR's ability to discriminate single-cell samples at the intergenetic, interspecific and intraspecific levels. When combined with chemometric analysis, we showed that O-PTIR is capable of discriminating different between genera, species and strains within species to a degree comparable with FTIR. Furthermore, small variations in the amide bands associated with differences in the protein structure can still be seen in spite of smaller sample sizes. This demonstrates the potential of O-PTIR for single-cell bacterial analysis and classification.


Assuntos
Análise de Célula Única , Espectroscopia de Infravermelho com Transformada de Fourier/métodos , Análise de Célula Única/métodos , Staphylococcus/química , Benchmarking , Micrococcus luteus , Staphylococcus epidermidis/química
19.
JMIR Ment Health ; 11: e57306, 2024 Jul 23.
Artigo em Inglês | MEDLINE | ID: mdl-39042893

RESUMO

BACKGROUND: Comprehensive session summaries enable effective continuity in mental health counseling, facilitating informed therapy planning. However, manual summarization presents a significant challenge, diverting experts' attention from the core counseling process. Leveraging advances in automatic summarization to streamline the summarization process addresses this issue because this enables mental health professionals to access concise summaries of lengthy therapy sessions, thereby increasing their efficiency. However, existing approaches often overlook the nuanced intricacies inherent in counseling interactions. OBJECTIVE: This study evaluates the effectiveness of state-of-the-art large language models (LLMs) in selectively summarizing various components of therapy sessions through aspect-based summarization, aiming to benchmark their performance. METHODS: We first created Mental Health Counseling-Component-Guided Dialogue Summaries, a benchmarking data set that consists of 191 counseling sessions with summaries focused on 3 distinct counseling components (also known as counseling aspects). Next, we assessed the capabilities of 11 state-of-the-art LLMs in addressing the task of counseling-component-guided summarization. The generated summaries were evaluated quantitatively using standard summarization metrics and verified qualitatively by mental health professionals. RESULTS: Our findings demonstrated the superior performance of task-specific LLMs such as MentalLlama, Mistral, and MentalBART evaluated using standard quantitative metrics such as Recall-Oriented Understudy for Gisting Evaluation (ROUGE)-1, ROUGE-2, ROUGE-L, and Bidirectional Encoder Representations from Transformers Score across all aspects of the counseling components. Furthermore, expert evaluation revealed that Mistral superseded both MentalLlama and MentalBART across 6 parameters: affective attitude, burden, ethicality, coherence, opportunity costs, and perceived effectiveness. However, these models exhibit a common weakness in terms of room for improvement in the opportunity costs and perceived effectiveness metrics. CONCLUSIONS: While LLMs fine-tuned specifically on mental health domain data display better performance based on automatic evaluation scores, expert assessments indicate that these models are not yet reliable for clinical application. Further refinement and validation are necessary before their implementation in practice.


Assuntos
Benchmarking , Aconselhamento , Humanos , Aconselhamento/métodos , Adulto , Transtornos Mentais/terapia , Feminino
20.
Microbiol Spectr ; 12(8): e0003224, 2024 Aug 06.
Artigo em Inglês | MEDLINE | ID: mdl-38980028

RESUMO

Time-lapse microscopy offers a powerful approach for analyzing cellular activity. In particular, this technique is valuable for assessing the behavior of bacterial populations, which can exhibit growth and intercellular interactions in a monolayer. Such time-lapse imaging typically generates large quantities of data, limiting the options for manual investigation. Several image-processing software packages have been developed to facilitate analysis. It can thus be a challenge to identify the software package best suited to a particular research goal. Here, we compare four software packages that support the analysis of 2D time-lapse images of cellular populations: CellProfiler, SuperSegger-Omnipose, DeLTA, and FAST. We compare their performance against benchmarked results on time-lapse observations of Escherichia coli populations. Performance varies across the packages, with each of the four outperforming the others in at least one aspect of the analysis. Not surprisingly, the packages that have been in development for longer showed the strongest performance. We found that deep learning-based approaches to object segmentation outperformed traditional approaches, but the opposite was true for frame-to-frame object tracking. We offer these comparisons, together with insight into usability, computational efficiency, and feature availability, as a guide to researchers seeking image-processing solutions. IMPORTANCE: Time-lapse microscopy provides a detailed window into the world of bacterial behavior. However, the vast amount of data produced by these techniques is difficult to analyze manually. We have analyzed four software tools designed to process such data and compared their performance, using populations of commonly studied bacterial species as our test subjects. Our findings offer a roadmap to scientists, helping them choose the right tool for their research. This comparison bridges a gap between microbiology and computational analysis, streamlining research efforts.


Assuntos
Escherichia coli , Processamento de Imagem Assistida por Computador , Software , Imagem com Lapso de Tempo , Imagem com Lapso de Tempo/métodos , Processamento de Imagem Assistida por Computador/métodos , Escherichia coli/crescimento & desenvolvimento , Escherichia coli/fisiologia , Benchmarking
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...