RESUMO
In our rapidly expanding landscape of artificial intelligence, synthetic data have become a topic of great promise and also some concern. This review aimed to provide pathologists and laboratory professionals with a primer on the role of synthetic data and how it may soon shape the landscape within our field. Using synthetic data presents many advantages but also introduces a milieu of new obstacles and limitations. This review aimed to provide pathologists and laboratory professionals with a primer on the general concept of synthetic data and its potential to transform our field. By leveraging synthetic data, we can help accelerate the development of various machine learning models and enhance our medical education and research/quality study needs. This review explored the methods for generating synthetic data, including rule-based, machine learning model-based and hybrid approaches, as they apply to applications within pathology and laboratory medicine. We also discussed the limitations and challenges associated with such synthetic data, including data quality, malicious use, and ethical bias/concerns and challenges. By understanding the potential benefits (ie, medical education, training artificial intelligence programs, and proficiency testing, etc) and limitations of this new data realm, we can not only harness its power to improve patient outcomes, advance research, and enhance the practice of pathology but also become readily aware of their intrinsic limitations.
Assuntos
Aprendizado de Máquina , Humanos , Patologia , Inteligência ArtificialRESUMO
In the realm of health care, numerous generative and nongenerative artificial intelligence and machine learning (AI-ML) tools have been developed and deployed. Simultaneously, manufacturers of medical devices are leveraging AI-ML. However, the adoption of AI in health care raises several concerns, including safety, security, ethical biases, accountability, trust, economic impact, and environmental effects. Effective regulation can mitigate some of these risks, promote fairness, establish standards, and advocate for more sustainable AI practices. Regulating AI tools not only ensures their safe and effective adoption but also fosters public trust. It is important that regulations remain flexible to accommodate rapid advances in this field to support innovation and also not to add additional burden to some of our preexisting and well-established frameworks. This study covers regional and global regulatory aspects of AI-ML including data privacy, software as a medical device, agency approval and clearance pathways, reimbursement, and laboratory-developed tests.
RESUMO
PURPOSE OF REVIEW: Immunocompromised patients are at high risk for infection. During the coronavirus disease (COVID-19) pandemic, immunocompromised patients exhibited increased odds of intensive care unit admission and death. Early pathogen identification is essential to mitigating infection related risk in immunocompromised patients. Artificial intelligence (AI) and machine learning (ML) have tremendous appeal to address unmet diagnostic needs. These AI/ML tools often rely on the wealth of data found in healthcare to enhance our ability to identify clinically significant patterns of disease. To this end, our review provides an overview of the current AI/ML landscape as it applies to infectious disease testing with emphasis on immunocompromised patients. RECENT FINDINGS: Examples include AI/ML for predicting sepsis in high risk burn patients. Likewise, ML is utilized to analyze complex host-response proteomic data to predict respiratory infections including COVID-19. These same approaches have also been applied for pathogen identification of bacteria, viruses, and hard to detect fungal microbes. Future uses of AI/ML may include integration of predictive analytics in point-of-care (POC) testing and data fusion applications. SUMMARY: Immunocompromised patients are at high risk for infections. AI/ML is transforming infectious disease testing and has great potential to address challenges encountered in the immune compromised population.
Assuntos
COVID-19 , Doenças Transmissíveis , Humanos , Inteligência Artificial , Proteômica , COVID-19/diagnóstico , Aprendizado de Máquina , Doenças Transmissíveis/diagnóstico , Teste para COVID-19RESUMO
This timely captivating topic is organized and presented in this special issue of the journal of Seminar in diagnostic pathology. This special issue will be dedicated to the utilization of machine learning within the digital pathology and laboratory medicine fields. Special thanks to all the authors whose contributions to this review series has not only enhanced our overall understanding of this exciting new field but will also enrich the reader's understanding of this important discipline.
Assuntos
Inteligência Artificial , Aprendizado de Máquina , HumanosRESUMO
Machine learning (ML) is becoming an integral aspect of several domains in medicine. Yet, most pathologists and laboratory professionals remain unfamiliar with such tools and are unprepared for their inevitable integration. To bridge this knowledge gap, we present an overview of key elements within this emerging data science discipline. First, we will cover general, well-established concepts within ML, such as data type concepts, data preprocessing methods, and ML study design. We will describe common supervised and unsupervised learning algorithms and their associated common machine learning terms (provided within a comprehensive glossary of terms that are discussed within this review). Overall, this review will offer a broad overview of the key concepts and algorithms in machine learning, with a focus on pathology and laboratory medicine. The objective is to provide an updated useful reference for those new to this field or those who require a refresher.
Assuntos
Inteligência Artificial , Aprendizado de Máquina , Humanos , AlgoritmosRESUMO
BACKGROUND: Artificial intelligence (AI) and machine learning (ML) are poised to transform infectious disease testing. Uniquely, infectious disease testing is technologically diverse spaces in laboratory medicine, where multiple platforms and approaches may be required to support clinical decision-making. Despite advances in laboratory informatics, the vast array of infectious disease data is constrained by human analytical limitations. Machine learning can exploit multiple data streams, including but not limited to laboratory information and overcome human limitations to provide physicians with predictive and actionable results. As a quickly evolving area of computer science, laboratory professionals should become aware of AI/ML applications for infectious disease testing as more platforms are become commercially available. CONTENT: In this review we: (a) define both AI/ML, (b) provide an overview of common ML approaches used in laboratory medicine, (c) describe the current AI/ML landscape as it relates infectious disease testing, and (d) discuss the future evolution AI/ML for infectious disease testing in both laboratory and point-of-care applications. SUMMARY: The review provides an important educational overview of AI/ML technique in the context of infectious disease testing. This includes supervised ML approaches, which are frequently used in laboratory medicine applications including infectious diseases, such as COVID-19, sepsis, hepatitis, malaria, meningitis, Lyme disease, and tuberculosis. We also apply the concept of "data fusion" describing the future of laboratory testing where multiple data streams are integrated by AI/ML to provide actionable clinical knowledge.
Assuntos
Inteligência Artificial , Doenças Transmissíveis , Aprendizado de Máquina , Doenças Transmissíveis/diagnóstico , HumanosRESUMO
AIMS: Machine learning (ML) binary classification in diagnostic histopathology is an area of intense investigation. Several assumptions, including training image quality/format and the number of training images required, appear to be similar in many studies irrespective of the paucity of supporting evidence. We empirically compared training image file type, training set size, and two common convolutional neural networks (CNNs) using transfer learning (ResNet50 and SqueezeNet). METHODS AND RESULTS: Thirty haematoxylin and eosin (H&E)-stained slides with carcinoma or normal tissue from three tissue types (breast, colon, and prostate) were photographed, generating 3000 partially overlapping images (1000 per tissue type). These lossless Portable Networks Graphics (PNGs) images were converted to lossy Joint Photographic Experts Group (JPG) images. Tissue type-specific binary classification ML models were developed by the use of all PNG or JPG images, and repeated with a subset of 500, 200, 100, 50, 30 and 10 images. Eleven models were generated for each tissue type, at each quantity of training images, for each file type, and for each CNN, resulting in 924 models. Internal accuracies and generalisation accuracies were compared. There was no meaningful significant difference in accuracies between PNG and JPG models. Models trained with more images did not invariably perform better. ResNet50 typically outperformed SqueezeNet. Models were generalisable within a tissue type but not across tissue types. CONCLUSIONS: Lossy JPG images were not inferior to lossless PNG images in our models. Large numbers of unique H&E-stained slides were not required for training optimal ML models. This reinforces the need for an evidence-based approach to best practices for histopathological ML.
Assuntos
Aprendizado Profundo , Histologia , Patologia Clínica , Aprendizado Profundo/estatística & dados numéricos , Diagnóstico por Computador/estatística & dados numéricos , Feminino , Técnicas Histológicas/estatística & dados numéricos , Histologia/estatística & dados numéricos , Humanos , Interpretação de Imagem Assistida por Computador/estatística & dados numéricos , Aprendizado de Máquina , Masculino , Redes Neurais de Computação , Patologia Clínica/estatística & dados numéricosRESUMO
Anemia is a public health problem in Mexico. This study sought to determine the prevalence and correlates of anemia among women and children residing in a rural farming region of Baja California, Mexico. An existing partnership between universities, non-governmental organizations, and an underserved Mexican community was utilized to perform cross-sectional data collection in 2004-2005 (Wave 1) and in 2011-2012 (Wave 2) among women (15-49 years) and their children (6-59 months). All participants completed a survey and underwent anemia testing. Blood smears were obtained to identify etiology. Nutrition education interventions and clinical health evaluations were offered between waves. Participants included 201 women and 99 children in Wave 1, and 146 women and 77 children in Wave 2. Prevalence of anemia significantly decreased from 42.3 to 23.3 % between Waves 1 and 2 in women (p < 0.001), from 46.5 to 30.2 % in children 24-59 months (p = 0.066), and from 71.4 to 45.8 % in children 6-23 months (p = 0.061). Among women in Wave 1, consumption of iron absorption enhancing foods (green vegetables and fruits high in vitamin C) was protective against anemia (p = 0.043). Women in Wave 2 who ate ≥4 servings of green, leafy vegetables per week were less likely to be anemic (p = 0.034). Microscopic examination of blood smears revealed microcytic, hypochromic red blood cells in 90 % of anemic children and 68.8 % of anemic women, consistent with iron deficiency anemia.
Assuntos
Anemia/epidemiologia , População Rural/estatística & dados numéricos , Adolescente , Adulto , Criança , Pré-Escolar , Estudos Transversais , Feminino , Humanos , Lactente , México/epidemiologia , Pessoa de Meia-Idade , Prevalência , Adulto JovemRESUMO
We present a silica nanoparticle (SNP) functionalized with polyphosphate (polyP) that accelerates the natural clotting process of the body. SNPs initiate the contact pathway of the blood-clotting system; short-chain polyP accelerates the common pathway by the rapid formation of thrombin, which enhances the overall blood-clotting system, both by accelerating fibrin generation and by facilitating the regulatory anticoagulation mechanisms essential for hemostasis. Analysis of the clotting properties of bare SNPs, bare polyP, and polyP-functionalized SNPs in plasma demonstrated that the attachment of polyP to SNPs to form polyP-SNPs creates a substantially enhanced synergistic effect that lowers clotting time and increases thrombin production at low concentrations. PolyP-SNP even retains its clotting function at ambient temperature. The polyP-SNP system has the potential to significantly improve trauma-treatment protocols and outcomes in hospital and prehospital settings.
Assuntos
Coagulação Sanguínea/efeitos dos fármacos , Nanopartículas , Polifosfatos/química , Dióxido de Silício/farmacologia , Fibrina/química , Hemorragia/tratamento farmacológico , Hemostasia , Espectroscopia de Ressonância Magnética , Tamanho da Partícula , Espectrofotometria Atômica , Temperatura , Trombina/química , Tempo de Coagulação do Sangue Total , Zircônio/químicaRESUMO
In this review, we embark on a comprehensive exploration of venous thromboembolism (VTE) in the context of medical history and its current practice within medicine. We delve into the landscape of artificial intelligence (AI), exploring its present utility and envisioning its transformative roles within VTE management, from prevention to screening and beyond. Central to our discourse is a forward-looking perspective on the integration of AI within VTE in medicine, advocating for rigorous study design, robust validation processes, and meticulous statistical analysis to gauge the efficacy of AI applications. We further illuminate the potential of large language models and generative AI in revolutionizing VTE care, while acknowledging their inherent limitations and proposing innovative solutions to overcome challenges related to data availability and integrity, including the strategic use of synthetic data. The critical importance of navigating ethical, legal, and privacy concerns associated with AI is underscored, alongside the imperative for comprehensive governance and policy frameworks to regulate its deployment in VTE treatment. We conclude on a note of cautious optimism, where we highlight the significance of proactively addressing the myriad challenges that accompany the advent of AI in healthcare. Through diligent design, stringent validation, extensive education, and prudent regulation, we can harness AI's potential to significantly enhance our understanding and management of VTE. As we stand on the cusp of a new era, our commitment to these principles will be instrumental in ensuring that the promise of AI is fully realized within the realm of VTE care.
Assuntos
Inteligência Artificial , Aprendizado de Máquina , Tromboembolia Venosa , Tromboembolia Venosa/terapia , Tromboembolia Venosa/diagnóstico , HumanosRESUMO
Healthcare data accessibility for machine learning (ML) is encumbered by a range of stringent regulations and limitations. Using synthetic data that mirrors the underlying properties in the real data is emerging as a promising solution to overcome these barriers. We propose a fully automated synthetic tabular neural generator (STNG), which comprises multiple synthetic data generators and integrates an Auto-ML module to validate and comprehensively compare the synthetic datasets generated from different approaches. An empirical study was conducted to demonstrate the performance of STNG using twelve different datasets. The results highlight STNG's robustness and its pivotal role in enhancing the accessibility of validated synthetic healthcare data, thereby offering a promising solution to a critical barrier in ML applications in healthcare.
Assuntos
Aprendizado de Máquina , Humanos , Redes Neurais de Computação , AlgoritmosRESUMO
CONTEXT.: Technology companies and research groups are increasingly exploring applications of generative artificial intelligence (GenAI) in pathology and laboratory medicine. Although GenAI holds considerable promise, it also introduces novel risks for patients, communities, professionals, and the scientific process. OBJECTIVE.: To summarize the current frameworks for the ethical development and management of GenAI within health care settings. DATA SOURCES.: The analysis draws from scientific journals, organizational websites, and recent guidelines on artificial intelligence ethics and regulation. CONCLUSIONS.: The literature on the ethical management of artificial intelligence in medicine is extensive but is still in its nascent stages because of the evolving nature of the technology. Effective and ethical integration of GenAI requires robust processes and shared accountability among technology vendors, health care organizations, regulatory bodies, medical professionals, and professional societies. As the technology continues to develop, a multifaceted ecosystem of safety mechanisms and ethical oversight is crucial to maximize benefits and mitigate risks.
RESUMO
CONTEXT.: Generative artificial intelligence (GAI) technologies are likely to dramatically impact health care workflows in clinical pathology (CP). Applications in CP include education, data mining, decision support, result summaries, and patient trend assessments. OBJECTIVE.: To review use cases of GAI in CP, with a particular focus on large language models. Specific examples are provided for the applications of GAI in the subspecialties of clinical chemistry, microbiology, hematopathology, and molecular diagnostics. Additionally, the review addresses potential pitfalls of GAI paradigms. DATA SOURCES.: Current literature on GAI in health care was reviewed broadly. The use case scenarios for each CP subspecialty review common data sources generated in each subspecialty. The potential for utilization of CP data in the GAI context was subsequently assessed, focusing on issues such as future reporting paradigms, impact on quality metrics, and potential for translational research activities. CONCLUSIONS.: GAI is a powerful tool with the potential to revolutionize health care for patients and practitioners alike. However, GAI must be implemented with much caution considering various shortcomings of the technology such as biases, hallucinations, practical challenges of implementing GAI in existing CP workflows, and end-user acceptance. Human-in-the-loop models of GAI implementation have the potential to revolutionize CP by delivering deeper, meaningful insights into patient outcomes both at an individual and population level.
RESUMO
OBJECTIVES: Recently, deep learning medical image analysis in orthopedics has become highly active. However, progress has been restricted by the absence of large-scale and standardized ground-truth images. To the best of our knowledge, this study is the first to propose an innovative solution, namely a deep few-shot image augmentation pipeline, that addresses this challenge by synthetically generating knee radiographs for training downstream tasks, with a specific focus on knee osteoarthritis Kellgren-Lawrence (KL) grading. MATERIALS AND METHODS: This study leverages a deep few-shot image augmentation pipeline to generate synthetic knee radiographs. Despite the limited availability of training samples, we demonstrate the capability of our proposed computational strategy to produce high-fidelity plain knee radiographs and use them to successfully train a KL grade classifier. RESULTS: Our experimental results showcase the effectiveness of the proposed computational pipeline. The generated synthetic radiographs exhibit remarkable fidelity, evidenced by the achieved average Frechet Inception Distance (FID) score of 26.33 for KL grading and 22.538 for bilateral knee radiographs. For KL grading classification, the classifier achieved a test Cohen's Kappa and accuracy of 0.451 and 0.727, respectively. Our computational strategy also resulted in a publicly and freely available imaging dataset of 86 000 synthetic knee radiographs. CONCLUSIONS: Our approach demonstrates the capability to produce top-notch synthetic knee radiographs and use them for KL grading classification, even when working with a constrained training dataset. The results obtained emphasize the effectiveness of the pipeline in augmenting datasets for knee osteoarthritis research, opening doors for broader applications in orthopedics, medical image analysis, and AI-powered diagnosis.
Assuntos
Aprendizado Profundo , Osteoartrite do Joelho , Humanos , Osteoartrite do Joelho/diagnóstico por imagem , Articulação do Joelho/diagnóstico por imagem , Radiografia , Processamento de Imagem Assistida por Computador/métodos , Interpretação de Imagem Radiográfica Assistida por Computador/métodosRESUMO
Accurate screening of COVID-19 infection status for symptomatic patients is a critical public health task. Although molecular and antigen tests now exist for COVID-19, in resource-limited settings, screening tests are often not available. Furthermore, during the early stages of the pandemic tests were not available in any capacity. We utilized an automated machine learning (ML) approach to train and evaluate thousands of models on a clinical dataset consisting of commonly available clinical and laboratory data, along with cytokine profiles for patients (n = 150). These models were then further tested for generalizability on an out-of-sample secondary dataset (n = 120). We were able to develop a ML model for rapid and reliable screening of patients as COVID-19 positive or negative using three approaches: commonly available clinical and laboratory data, a cytokine profile, and a combination of the common data and cytokine profile. Of the tens of thousands of models automatically tested for the three approaches, all three approaches demonstrated > 92% sensitivity and > 88 specificity while our highest performing model achieved 95.6% sensitivity and 98.1% specificity. These models represent a potential effective deployable solution for COVID-19 status classification for symptomatic patients in resource-limited settings and provide proof-of-concept for rapid development of screening tools for novel emerging infectious diseases.
Assuntos
COVID-19 , Citocinas , Aprendizado de Máquina , Humanos , COVID-19/diagnóstico , Citocinas/sangue , SARS-CoV-2/isolamento & purificação , SARS-CoV-2/imunologia , Programas de Rastreamento/métodos , Masculino , Feminino , Sensibilidade e Especificidade , Pessoa de Meia-Idade , Adulto , IdosoRESUMO
BACKGROUND: Asthma classification into different subphenotypes is important to guide personalized therapy and improve outcomes. OBJECTIVES: To further explore asthma heterogeneity through determination of multiple patient groups by using novel machine learning (ML) approaches and large-scale real-world data. METHODS: We used electronic health records of patients with asthma followed at the Cleveland Clinic between 2010 and 2021. We used k-prototype unsupervised ML to develop a clustering model where predictors were age, sex, race, body mass index, prebronchodilator and postbronchodilator spirometry measurements, and the usage of inhaled/systemic steroids. We applied elbow and silhouette plots to select the optimal number of clusters. These clusters were then evaluated through LightGBM's supervised ML approach on their cross-validated F1 score to support their distinctiveness. RESULTS: Data from 13,498 patients with asthma with available postbronchodilator spirometry measurements were extracted to identify 5 stable clusters. Cluster 1 included a young nonsevere asthma population with normal lung function and higher frequency of acute exacerbation (0.8 /patient-year). Cluster 2 had the highest body mass index (mean ± SD, 44.44 ± 7.83 kg/m2), and the highest proportion of females (77.5%) and Blacks (28.9%). Cluster 3 comprised patients with normal lung function. Cluster 4 included patients with lower percent of predicted FEV1 of 77.03 (12.79) and poor response to bronchodilators. Cluster 5 had the lowest percent of predicted FEV1 of 68.08 (15.02), the highest postbronchodilator reversibility, and the highest proportion of severe asthma (44.9%) and blood eosinophilia (>300 cells/µL) (34.8%). CONCLUSIONS: Using real-world data and unsupervised ML, we classified asthma into 5 clinically important subphenotypes where group-specific asthma treatment and management strategies can be designed and deployed.
Assuntos
Asma , Aprendizado de Máquina , Fenótipo , Humanos , Asma/tratamento farmacológico , Asma/diagnóstico , Asma/fisiopatologia , Asma/epidemiologia , Feminino , Masculino , Análise por Conglomerados , Adulto , Pessoa de Meia-Idade , Espirometria , Registros Eletrônicos de Saúde , Idoso , Adulto JovemRESUMO
Artificial Intelligence and machine-learning (ML) studies are increasingly populating the life science space and some have also started to integrate certain clinical decision support tasks. However, most of the activities within this space understandably remain within the investigational domain and are not yet ready for broad use in healthcare. In short, artificial intelligence/ML is still in an infancy stage within the healthcare arena, and we are nowhere near reaching its full potential. Various factors have contributed to this slow adoption rate within healthcare, which include but are not limited to data accessibility and integrity issues, paucity of specialized data science personnel, certain regulatory measures, and various voids within the ML operational platform domain. However, these obstacles and voids have also introduced us to certain opportunities to better understand this arena as we fully embark on this new journey, which undoubtedly will become a major part of our future patient care activities. Considering the aforementioned needs, this review will be concentrating on various ML studies within the coagulation and hemostasis space to better understand their shared study needs, findings, and limitations. However, the ML needs within this subspecialty of medicine are not unique and most of these needs, voids, and limitations also apply to the other medical disciplines. Therefore, this review will not only concentrate on introducing the audience to ML concepts and ML study design elements but also on where the future within this arena in medicine is leading us.
Assuntos
Inteligência Artificial , Sistemas de Apoio a Decisões Clínicas , Humanos , Aprendizado de Máquina , Coagulação Sanguínea , PrevisõesRESUMO
Innovations in infectious disease testing have improved our abilities to detect and understand the microbial world. The 2019 novel coronavirus infectious disease (COVID-19) pandemic introduced new innovations including non-prescription "over the counter" infectious disease tests, mass spectrometry-based detection of COVID-19 host response, and the implementation of artificial intelligence (AI) and machine learning (ML) to identify individuals infected by the severe acute respiratory syndrome - coronavirus - 2 (SARS-CoV-2). As the world recovers from the COVID-19 pandemic; these innovative solutions will give rise to a new era of infectious disease tests extending beyond the detection of SARS-CoV-2. To this end, the purpose of this review is to summarize current trends in infectious disease testing and discuss innovative applications specifically in the areas of POC testing, MS, molecular diagnostics, sample types, and AI/ML.
Assuntos
COVID-19 , Doenças Transmissíveis , Humanos , COVID-19/diagnóstico , SARS-CoV-2 , Pandemias , Inteligência ArtificialRESUMO
One of the core elements of Machine Learning (ML) is statistics and its embedded foundational rules and without its appropriate integration, ML as we know would not exist. Various aspects of ML platforms are based on statistical rules and most notably the end results of the ML model performance cannot be objectively assessed without appropriate statistical measurements. The scope of statistics within the ML realm is rather broad and cannot be adequately covered in a single review article. Therefore, here we will mainly focus on the common statistical concepts that pertain to supervised ML (i.e. classification and regression) along with their interdependencies and certain limitations.