RESUMEN
Rationale: Early pathogenesis of lung adenocarcinoma (LUAD) remains largely unknown. We found that, relative to wild-type littermates, the innate immunomodulator Lcn2 (lipocalin-2) was increased in normal airways from mice with knockout of the airway lineage gene Gprc5a (Gprc5a-/-) and that are prone to developing inflammation and LUAD. Yet, the role of LCN2 in lung inflammation and LUAD is poorly understood.Objectives: Delineate the role of Lcn2 induction in LUAD pathogenesis.Methods: Normal airway brushings, uninvolved lung tissues, and tumors from Gprc5a-/- mice before and after tobacco carcinogen exposure were analyzed by RNA sequencing. LCN2 mRNA was analyzed in public and in-house data sets of LUAD, lung squamous cancer (LUSC), chronic obstructive pulmonary disease (COPD), and LUAD/LUSC with COPD. LCN2 protein was immunohistochemically analyzed in a tissue microarray of 510 tumors. Temporal lung tumor development, gene expression programs, and host immune responses were compared between Gprc5a-/- and Gprc5a-/-/Lcn2-/- littermates.Measurements and Main Results:Lcn2 was progressively elevated during LUAD development and positively correlated with proinflammatory cytokines and inflammation gene sets. LCN2 was distinctively elevated in human LUADs, but not in LUSCs, relative to normal lungs and was associated with COPD among smokers and patients with LUAD. Relative to Gprc5a-/- mice, Gprc5a-/-/Lcn2-/- littermates exhibited significantly increased lung tumor development concomitant with reduced T-cell abundance (CD4+) and richness, attenuated antitumor immune gene programs, and increased immune cell expression of protumor inflammatory cytokines.Conclusions: Augmented LCN2 expression is a molecular feature of COPD-associated LUAD and counteracts LUAD development in vivo by maintaining antitumor immunity.
Asunto(s)
Adenocarcinoma del Pulmón/inmunología , Antineoplásicos/inmunología , Lipocalina 2/genética , Lipocalina 2/inmunología , Neoplasias Pulmonares/inmunología , Enfermedad Pulmonar Obstructiva Crónica/sangre , Enfermedad Pulmonar Obstructiva Crónica/fisiopatología , Animales , Biomarcadores/sangre , Femenino , Regulación de la Expresión Génica , Humanos , Lipocalina 2/sangre , Masculino , Ratones , ARN MensajeroRESUMEN
Clinicians spend a substantial part of their workday reviewing and writing electronic medical notes. Here we describe how the current, widely accepted paradigm for electronic medical notes represents a poor organizational framework for both the individual clinician and the broader medical team. As described in this viewpoint, the medical chart-including notes, labs, and imaging results-can be reconceptualized as a dynamic, fully collaborative workspace organized by topic rather than time, writer, or data type. This revised framework enables a more accurate and complete assessment of the current state of the patient and easy historical review, saving clinicians substantial time on both data input and retrieval. Collectively, this approach has the potential to improve health care delivery effectiveness and efficiency.
Asunto(s)
Documentación , Escritura , Registros Electrónicos de Salud , HumanosRESUMEN
INTRODUCTION: Machine learning (ML) and natural language processing have great potential to improve information extraction (IE) within electronic medical records (EMRs) for a wide variety of clinical search and summarization tools. Despite ML advancements, clinical adoption of real time IE tools for patient care remains low. Clinically motivated IE task definitions, publicly available annotated clinical datasets, and inclusion of subtasks such as coreference resolution and named entity normalization are critical for the development of useful clinical tools. MATERIALS AND METHODS: We provide a task definition and comprehensive annotation requirements for a clinically motivated symptom extraction task. Four annotators labeled symptom mentions within 1108 discharge summaries from two public clinical note datasets for the tasks of named entity recognition, coreference resolution, and named entity normalization; these annotations will be released to the public. Baseline human performance was assessed and two ML models were evaluated on the symptom extraction task. RESULTS: 16,922 symptom mentions were identified within the discharge summaries, with 11,944 symptom instances after coreference resolution and 1255 unique normalized answer forms. Human annotator performance averaged 92.2% F1. Recurrent network model performance was 85.6% F1 (recall 85.8%, precision 85.4%), and Transformer-based model performance was 86.3% F1 (recall 86.6%, precision 86.1%). Our models extracted vague symptoms, acronyms, typographical errors, and grouping statements. The models generalized effectively to a separate clinical note corpus and can run in real time. CONCLUSION: To our knowledge, this dataset will be the largest and most comprehensive publicly released, annotated dataset for clinically motivated symptom extraction, as it includes annotations for named entity recognition, coreference, and normalization for more than 1000 clinical documents. Our neural network models extracted symptoms from unstructured clinical free text at near human performance in real time. In this paper, we present a clinically motivated task definition, dataset, and simple supervised natural language processing models to demonstrate the feasibility of building clinically applicable information extraction tools.
Asunto(s)
Almacenamiento y Recuperación de la Información , Procesamiento de Lenguaje Natural , Registros Electrónicos de Salud , Humanos , Aprendizaje Automático , Redes Neurales de la ComputaciónRESUMEN
Importance: Duplicated text is a well-documented hazard in electronic medical records (EMRs), leading to wasted clinician time, medical error, and burnout. This study hypothesizes that text duplication is prevalent and increases with time and EMR size and that duplicate information is shared across authors. Objective: To examine the prevalence and scope of duplication behavior in clinical notes from a large academic health system and the factors associated with duplication. Design, Setting, and Participants: This retrospective, cross-sectional analysis of note length and content duplication rates used a set of 10 adjacent word tokens (ie, a 10-gram) sliding-window approach to identify spans of text duplicated exactly from earlier notes in a patient's record for all inpatient and outpatient notes written within the University of Pennsylvania Health System from January 1, 2015, through December 31, 2020. Text duplicated from a different author vs text duplicated from the same author was quantified. Furthermore, novel text and duplicated text per author for various note types and author types, as well as per patient record by number of notes in the record, were quantified. Information scatter, another documentation hazard, was defined as the inverse of novel text per note, and the association between information duplication and information scatter was graphed. Data analysis was performed from January to March 2022. Main Outcomes and Measures: Total, novel, and duplicate text by note type and note author were determined, as were the mean intra-author and inter-author duplication per note by type and author. Results: There were a total of 104â¯456â¯653 notes for 1â¯960â¯689 unique patients consisting of 32â¯991â¯489â¯889 words; 50.1% of the total text in the record (16â¯523â¯851â¯210 words) was duplicated from prior text written about the same patient. The duplication fraction increased year-over-year, from 33.0% for notes written in 2015 to 54.2% for notes written in 2020. Of the text duplicated, 54.1% came from text written by the same author, whereas 45.9% was duplicated from a different author. Records with more notes had more total duplicate text, approaching 60%. Note types with high information scatter tended to have low information overload, and vice versa, suggesting a trade-off between these 2 hazards under the current documentation paradigm. Conclusions and Relevance: Duplicate text casts doubt on the veracity of all information in the medical record, making it difficult to find and verify information in day-to-day clinical work. The findings of this cross-sectional study suggest that text duplication is a systemic hazard, requiring systemic interventions to fix, and simple solutions such as banning copy-paste may have unintended consequences, such as worsening information scatter. The note paradigm should be further examined as a major cause of duplication and scatter, and alternative paradigms should be evaluated.
Asunto(s)
Documentación , Registros Electrónicos de Salud , Estudios Transversales , Humanos , Prevalencia , Estudios RetrospectivosRESUMEN
BACKGROUND: Clinicians spend large amounts of their workday using electronic medical records (EMRs). Poorly designed documentation systems contribute to the proliferation of out-of-date information, increased time spent on medical records, clinician burnout, and medical errors. Beyond software interfaces, examining the underlying paradigms and organizational structures for clinical information may provide insights into ways to improve documentation systems. In particular, our attachment to the note as the major organizational unit for storing unstructured medical data may be a cause of many of the problems with modern clinical documentation. Notes, as currently understood, systematically incentivize information duplication and information scattering, both within a single clinician's notes over time and across multiple clinicians' notes. Therefore, it is worthwhile to explore alternative paradigms for unstructured data organization. OBJECTIVE: The aim of this study is to demonstrate the feasibility of building an EMR that does not use notes as the core organizational unit for unstructured data and which is designed specifically to disincentivize information duplication and information scattering. METHODS: We used specific design principles to minimize the incentive for users to duplicate and scatter information. By default, the majority of a patient's medical history remains the same over time, so users should not have to redocument that information. Clinicians on different teams or services mostly share the same medical information, so all data should be collaboratively shared across teams and services (while still allowing for disagreement and nuance). In all cases where a clinician must state that information has remained the same, they should be able to attest to the information without redocumenting it. We designed and built a web-based EMR based on these design principles. RESULTS: We built a medical documentation system that does not use notes and instead treats the chart as a single, dynamically updating, and fully collaborative workspace. All information is organized by clinical topic or problem. Version history functionality is used to enable granular tracking of changes over time. Our system is highly customizable to individual workflows and enables each individual user to decide which data should be structured and which should be unstructured, enabling individuals to leverage the advantages of structured templating and clinical decision support as desired without requiring programming knowledge. The system is designed to facilitate real-time, fully collaborative documentation and communication among multiple clinicians. CONCLUSIONS: We demonstrated the feasibility of building a non-note-based, fully collaborative EMR system. Our attachment to the note as the only possible atomic unit of unstructured medical data should be reevaluated, and alternative models should be considered.
RESUMEN
Smoking perpetuates in cytologically normal airways a molecular "field of injury" that is pertinent to lung cancer and early detection. The evolution of airway field changes prior to lung oncogenesis is poorly understood largely due to the long latency of lung cancer in smokers. Here, we studied airway expression changes prior to lung cancer onset in mice with knockout of the Gprc5a gene (Gprc5a-/-) and tobacco carcinogen (NNK) exposure and that develop the most common type of lung cancer, lung adenocarcinoma, within 6 months following exposure. Airway epithelial brushings were collected from Gprc5a-/- mice before exposure and at multiple times post-NNK until time of lung adenocarcinoma development and then analyzed by RNA sequencing. Temporal airway profiles were identified by linear models and analyzed by comparative genomics in normal airways of human smokers with and without lung cancer. We identified significantly altered profiles (n = 926) in the NNK-exposed mouse normal airways relative to baseline epithelia, a subset of which were concordantly modulated with smoking status in the human airway. Among airway profiles that were significantly modulated following NNK, we found that expression changes (n = 22) occurring as early as 2 months following exposure were significantly associated with lung cancer status when examined in airways of human smokers. Furthermore, a subset of a recently reported human bronchial gene classifier (Percepta; n = 56) was enriched in the temporal mouse airway profiles. We underscore evolutionarily conserved profiles in the normal-appearing airway that develop prior to lung oncogenesis and that comprise viable markers for early lung cancer detection in suspect smokers. Cancer Prev Res; 11(4); 237-48. ©2018 AACR.
Asunto(s)
Adenocarcinoma/patología , Bronquios/metabolismo , Transformación Celular Neoplásica/patología , Regulación Neoplásica de la Expresión Génica , Neoplasias Pulmonares/patología , Receptores Acoplados a Proteínas G/fisiología , Fumar/efectos adversos , Adenocarcinoma/etiología , Animales , Bronquios/patología , Transformación Celular Neoplásica/genética , Femenino , Perfilación de la Expresión Génica , Genoma Humano , Genómica , Humanos , Neoplasias Pulmonares/etiología , Masculino , Ratones Endogámicos C57BL , Ratones Noqueados , Nitrosaminas/toxicidadRESUMEN
Micro-computed tomography (CT) enables three-dimensional (3D) imaging of complex soft tissue structures, but current protocols used to achieve this goal preclude cellular and molecular phenotyping of the tissue. Here we describe a radiolucent cryostage that permits micro-CT imaging of unfixed frozen human lung samples at an isotropic voxel size of (11 µm)3 under conditions where the sample is maintained frozen at -30°C during imaging. The cryostage was tested for thermal stability to maintain samples frozen up to 8 h. This report describes the methods used to choose the materials required for cryostage construction and demonstrates that whole genome mRNA integrity and expression are not compromised by exposure to micro-CT radiation and that the tissue can be used for immunohistochemistry. The new cryostage provides a novel method enabling integration of 3D tissue structure with cellular and molecular analysis to facilitate the identification of molecular determinants of disease. NEW & NOTEWORTHY: The described micro-CT cryostage provides a novel way to study the three-dimensional lung structure preserved without the effects of fixatives while enabling subsequent studies of the cellular matrix composition and gene expression. This approach will, for the first time, enable researchers to study structural changes of lung tissues that occur with disease and correlate them with changes in gene or protein signatures.
Asunto(s)
Pulmón/patología , Microtomografía por Rayos X/métodos , Expresión Génica/fisiología , Genoma/fisiología , Humanos , Imagenología Tridimensional/métodos , Pulmón/metabolismo , ARN Mensajero/metabolismoRESUMEN
Motivational deficits play a central role in disability caused by schizophrenia and constitute a major unmet therapeutic need. Negative symptoms have previously been linked to hypofunction in ventral striatum (VS), a core component of brain motivation circuitry. However, it remains unclear to what extent this relationship holds for specific negative symptoms such as amotivation, and this question has not been addressed with integrated behavioral, clinical, and imaging measures. Here, 41 individuals with schizophrenia and 37 controls performed a brief, computerized progressive ratio task (PRT) that quantifies effort exerted in pursuit of monetary reward. Clinical amotivation was assessed using the recently validated Clinical Assessment Interview for Negative Symptoms (CAINS). VS function was probed during functional magnetic resonance imaging using a monetary guessing paradigm. We found that individuals with schizophrenia had diminished motivation as measured by the PRT, which significantly and selectively related to clinical amotivation as measured by the CAINS. Critically, lower PRT motivation in schizophrenia was also dimensionally related to VS hypofunction. Our results demonstrate robust dimensional associations between behavioral amotivation, clinical amotivation, and VS hypofunction in schizophrenia. Integrating behavioral measures such as the PRT will facilitate translational efforts to identify biomarkers of amotivation and to assess response to novel therapeutic interventions.