RESUMO
Policy Points This study examines the impact of several world-changing events in 2020, such as the pandemic and widespread racism protests, on the US population's comfort with the use of identifiable data for public health. Before the 2020 election, there was no significant difference between Democrats and Republicans. However, African Americans exhibited a decrease in comfort that was different from other subgroups. Our findings suggest that the public remained supportive of public health data activities through the pandemic and the turmoil of 2020 election cycle relative to other data use. However, support among African Americans for public health data use experienced a unique decline compared to other demographic groups. CONTEXT: Recent legislative privacy efforts have not included special provisions for public health data use. Although past studies documented support for public health data use, several global events in 2020 have raised awareness and concern about privacy and data use. This study aims to understand whether the events of 2020 affected US privacy preferences on secondary uses of identifiable data, focusing on public health and research uses. METHODS: We deployed two online surveys-in February and November 2020-on data privacy attitudes and preferences using a choice-based-conjoint analysis. Participants received different data-use scenario pairs-varied by the type of data, user, and purpose-and selected scenarios based on their comfort. A hierarchical Bayes regression model simulated population preferences. FINDINGS: There were 1,373 responses. There was no statistically significant difference in the population's data preferences between February and November, each showing the highest comfort with population health and research data activities and the lowest with profit-driven activities. Most subgroups' data preferences were comparable with the population's preferences, except African Americans who showed significant decreases in comfort with population health and research. CONCLUSIONS: Despite world-changing events, including a pandemic, we found bipartisan public support for using identifiable data for public health and research. The decreasing support among African Americans could relate to the increased awareness of systemic racism, its harms, and persistent disparities. The US population's preferences support including legal provisions that permit public health and research data use in US laws, which are currently lacking specific public health use permissions.
Assuntos
Pandemias , Política , Saúde Pública , Humanos , Estados Unidos , Masculino , Feminino , Adulto , Inquéritos e Questionários , Pessoa de Meia-Idade , COVID-19/epidemiologia , Negro ou Afro-Americano , Opinião Pública , PrivacidadeRESUMO
BACKGROUND: Scientific researchers who wish to reuse health data pertaining to individuals can obtain consent through an opt-in procedure or opt-out procedure. The choice of procedure may have consequences for the consent rate and representativeness of the study sample and the quality of the research, but these consequences are not well known. OBJECTIVE: This review aimed to provide insight into the consequences for the consent rate and consent bias of the study sample of opt-in procedures versus opt-out procedures for the reuse of routinely recorded health data for scientific research purposes. METHODS: A systematic review was performed based on searches in PubMed, Embase, CINAHL, PsycINFO, Web of Science Core Collection, and the Cochrane Library. Two reviewers independently included studies based on predefined eligibility criteria and assessed whether the statistical methods used in the reviewed literature were appropriate for describing the differences between consenters and nonconsenters. Statistical pooling was conducted, and a description of the results was provided. RESULTS: A total of 15 studies were included in this meta-analysis. Of the 15 studies, 13 (87%) implemented an opt-in procedure, 1 (7%) implemented an opt-out procedure, and 1 (7%) implemented both the procedures. The average weighted consent rate was 84% (60,800/72,418 among the studies that used an opt-in procedure and 96.8% (2384/2463) in the single study that used an opt-out procedure. In the single study that described both procedures, the consent rate was 21% in the opt-in group and 95.6% in the opt-out group. Opt-in procedures resulted in more consent bias compared with opt-out procedures. In studies with an opt-in procedure, consenting individuals were more likely to be males, had a higher level of education, higher income, and higher socioeconomic status. CONCLUSIONS: Consent rates are generally lower when using an opt-in procedure compared with using an opt-out procedure. Furthermore, in studies with an opt-in procedure, participants are less representative of the study population. However, both the study populations and the way in which opt-in or opt-out procedures were organized varied widely between the studies, which makes it difficult to draw general conclusions regarding the desired balance between patient control over data and learning from health data. The reuse of routinely recorded health data for scientific research purposes may be hampered by administrative burdens and the risk of bias.
Assuntos
Renda , Consentimento Livre e Esclarecido , Feminino , Humanos , Masculino , Viés , Escolaridade , PubMedRESUMO
BACKGROUND: The SARS-CoV-2 pandemic has highlighted once more the great need for comprehensive access to, and uncomplicated use of, pre-existing patient data for medical research. Enabling secondary research-use of patient-data is a prerequisite for the efficient and sustainable promotion of translation and personalisation in medicine, and for the advancement of public-health. However, balancing the legitimate interests of scientists in broad and unrestricted data-access and the demand for individual autonomy, privacy and social justice is a great challenge for patient-based medical research. METHODS: We therefore conducted two questionnaire-based surveys among North-German outpatients (n = 650) to determine their attitude towards data-donation for medical research, implemented as an opt-out-process. RESULTS: We observed a high level of acceptance (75.0%), the most powerful predictor of a positive attitude towards data-donation was the conviction that every citizen has a duty to contribute to the improvement of medical research (> 80% of participants approving data-donation). Interestingly, patients distinguished sharply between research inside and outside the EU, despite a general awareness that universities and public research institutions cooperate with commercial companies, willingness to allow use of donated data by the latter was very low (7.1% to 29.1%, depending upon location of company). The most popular measures among interviewees to counteract reservations against commercial data-use were regulation by law (61.4%), stipulating in the process that data are not sold or resold (84.6%). A majority requested control of both the use (46.8%) and the protection (41.5%) of the data by independent bodies. CONCLUSIONS: In conclusion, data-donation for medical research, implemented as a combination of legal entitlement and easy-to-exercise-right to opt-out, was found to be widely supported by German patients and therefore warrants further consideration for a transposition into national law.
Assuntos
Pesquisa Biomédica , COVID-19 , Atitude , Humanos , Privacidade , SARS-CoV-2RESUMO
BACKGROUND: Changes in well-being of patients with multiple myeloma (MM) before and after diagnosis have not been quantified. AIMS: Explore the use of secondary data to examine the changes in the well-being of older patients with MM. METHODS: We used the Health and Retirement Study (HRS), linked to Medicare claims to identify older MM patients. We compared patient-reported measures (PRM), including physical impairment, sensory impairment, and patient experience (significant pain, self-rated health, depression) in the interviews before and after MM diagnosis using McNemar's test. We propensity-matched each MM patient to five HRS participants without MM diagnosis based on baseline characteristics. We compared the change in PRM between the MM patients and their matches. RESULTS: We identified 92 HRS patients with MM diagnosis (mean age = 74.6, SD = 8.4). Among the surviving patients, there was a decline in well-being across most measures, including ADL difficulty (23% to 40%, p value = 0.016), poor or fair self-rated health (38% to 61%, p value = 0.004), and depression (15% to 30%, p value = 0.021). Surviving patients reported worse health than participants without MM across most measures, including ADL difficulty (40% vs. 27%, p value = 0.04), significant pain (38% vs. 22%, p value = 0.01), and depression (29% vs. 11%, p value = 0.003). DISCUSSION: Secondary data were used to identify patients with MM diagnosis, and examine changes across multiple measures of well-being. MM diagnosis negatively affects several aspects of patients' well-being, and these declines are larger than those experienced by similar participants without MM. CONCLUSION: The results of this study are valuable addition to understanding the experience of patients with MM, despite several data limitations.
Assuntos
Mieloma Múltiplo , Atividades Cotidianas , Idoso , Idoso de 80 Anos ou mais , Dor do Câncer , Depressão , Feminino , Humanos , Armazenamento e Recuperação da Informação , Masculino , Mieloma Múltiplo/complicações , Medidas de Resultados Relatados pelo PacienteRESUMO
Background: The design of appropriate consent procedures for the secondary use of personal health data is a key concern of current medical research. In Germany, the concept of 'data donation' has recently come into focus, defined as a legal entitlement to the research use of personal medical data without prior consent, combined with an easy-to-exercise right of the data subjects to opt-out. Methods: Standardized online interviews of 3,013 individuals, representative of the German online population, were conducted in August 2022 to determine their attitude towards data donation for medical research. Results: A majority of participants supported a consent-free data donation regulation, both for publicly funded (85.1%) and for private medical research (66.4%). Major predictors of a positive attitude towards data donation included (i) sufficient appreciation of the respective kind of research (i.e. public or private), (ii) a reciprocity attitude that patients who benefit from research have a duty to support research, and (iii) sufficient trust in data protection and data control. Conclusion: People's attitude towards data donation to medical research is generally positive in Germany and depends upon factors that can be curbed by legislation and internal rules of procedure. Worthy of note, designing data donation in the form of an opt-out regulation does not necessarily mean that the paradigm of informedness has to be abandoned. Rather the process of information provision must be shifted towards the creation of basic knowledge in the general population about the risks and benefits of data-intensive medical research ('health data literacy').
RESUMO
Purpose: The primary aim of this work was to convert the Information System for Research in Primary Care (SIDIAP) from Catalonia, Spain, to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). Our second aim was to provide a descriptive analysis of COVID-19-related outcomes among the general population. Patients and Methods: We mapped patient-level data from SIDIAP to the OMOP CDM and we performed more than 3,400 data quality checks to assess its readiness for research. We established a general population cohort as of the 1st March 2020 and identified outpatient COVID-19 diagnoses or tested positive for, hospitalised with, admitted to intensive care units (ICU) with, died with, or vaccinated against COVID-19 up to 30th June 2022. Results: After verifying the high quality of the transformed dataset, we included 5,870,274 individuals in the general population cohort. Of those, 604,472 had either an outpatient COVID-19 diagnosis or positive test result, 58,991 had a hospitalisation, 5,642 had an ICU admission, and 11,233 died with COVID-19. A total of 4,584,515 received a COVID-19 vaccine. People who were hospitalised or died were more commonly older, male, and with more comorbidities. Those admitted to ICU with COVID-19 were generally younger and more often male than those hospitalised and those who died. Conclusion: We successfully transformed SIDIAP to the OMOP CDM. From this dataset, a general population cohort of 5.9 million individuals was identified and their COVID-19-related outcomes over time were described. The transformed SIDIAP database is a valuable resource that can enable distributed network research in COVID-19 and beyond.
RESUMO
BACKGROUND: Considering the soaring health-related costs directed toward a growing, aging, and comorbid population, the health sector needs effective data-driven interventions while managing rising care costs. While health interventions using data mining have become more robust and adopted, they often demand high-quality big data. However, growing privacy concerns have hindered large-scale data sharing. In parallel, recently introduced legal instruments require complex implementations, especially when it comes to biomedical data. New privacy-preserving technologies, such as decentralized learning, make it possible to create health models without mobilizing data sets by using distributed computation principles. Several multinational partnerships, including a recent agreement between the United States and the European Union, are adopting these techniques for next-generation data science. While these approaches are promising, there is no clear and robust evidence synthesis of health care applications. OBJECTIVE: The main aim is to compare the performance among health data models (eg, automated diagnosis and mortality prediction) developed using decentralized learning approaches (eg, federated and blockchain) to those using centralized or local methods. Secondary aims are comparing the privacy compromise and resource use among model architectures. METHODS: We will conduct a systematic review using the first-ever registered research protocol for this topic following a robust search methodology, including several biomedical and computational databases. This work will compare health data models differing in development architecture, grouping them according to their clinical applications. For reporting purposes, a PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) 2020 flow diagram will be presented. CHARMS (Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies)-based forms will be used for data extraction and to assess the risk of bias, alongside PROBAST (Prediction Model Risk of Bias Assessment Tool). All effect measures in the original studies will be reported. RESULTS: The queries and data extractions are expected to start on February 28, 2023, and end by July 31, 2023. The research protocol was registered with PROSPERO, under the number 393126, on February 3, 2023. With this protocol, we detail how we will conduct the systematic review. With that study, we aim to summarize the progress and findings from state-of-the-art decentralized learning models in health care in comparison to their local and centralized counterparts. Results are expected to clarify the consensuses and heterogeneities reported and help guide the research and development of new robust and sustainable applications to address the health data privacy problem, with applicability in real-world settings. CONCLUSIONS: We expect to clearly present the status quo of these privacy-preserving technologies in health care. With this robust synthesis of the currently available scientific evidence, the review will inform health technology assessment and evidence-based decisions, from health professionals, data scientists, and policy makers alike. Importantly, it should also guide the development and application of new tools in service of patients' privacy and future research. TRIAL REGISTRATION: PROSPERO 393126; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=393126. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): PRR1-10.2196/45823.
RESUMO
BACKGROUND: The increasing emphasis to share patient data from clinical research has resulted in substantial investments in data repositories and infrastructure. However, it is unclear how shared data are used and whether anticipated benefits are being realized. OBJECTIVE: The purpose of our study is to examine the current utilization of shared clinical research data sets and assess the effects on both scientific research and public health outcomes. Additionally, the study seeks to identify the factors that hinder or facilitate the ethical and efficient use of existing data based on the perspectives of data users. METHODS: The study will utilize a mixed methods design, incorporating a cross-sectional survey and in-depth interviews. The survey will involve at least 400 clinical researchers, while the in-depth interviews will include 20 to 40 participants who have utilized data from repositories or institutional data access committees. The survey will target a global sample, while the in-depth interviews will focus on individuals who have used data collected from low- and middle-income countries. Quantitative data will be summarized by using descriptive statistics, while multivariable analyses will be used to assess the relationships between variables. Qualitative data will be analyzed through thematic analysis, and the findings will be reported in accordance with the COREQ (Consolidated Criteria for Reporting Qualitative Research) guidelines. The study received ethical approval from the Oxford Tropical Research Ethics Committee in 2020 (reference number: 568-20). RESULTS: The results of the analysis, including both quantitative data and qualitative data, will be available in 2023. CONCLUSIONS: The outcomes of our study will offer crucial understanding into the current status of data reuse in clinical research, serving as a basis for guiding future endeavors to enhance the utilization of shared data for the betterment of public health outcomes and for scientific progress. TRIAL REGISTRATION: Thai Clinical Trials Registry TCTR20210301006; https://tinyurl.com/2p9atzhr. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): DERR1-10.2196/44875.
RESUMO
INTRODUCTION: Sharing of health data for secondary uses such as research and public policy development is common. There are many potential benefits, but also risks if information about an individual's health record can be inferred. Studies show cautious willingness amongst the public to share health data for beneficial purposes, as long as they are confident in their data privacy and security. There has been relatively little research into whether the technical guarantees of privacy-preserving technologies are well understood by people asked to consent to sharing their data. OBJECTIVES: We sought to assess how accurately people understood the effectiveness of techniques for protecting the privacy of shared health data. METHODS: We designed an online survey describing a data-sharing scenario motivated by medical research where data could be shared: raw (including identifiers), de-identified (using k-anonymity), aggregated, and differential privacy applied to aggregated data. Respondents were asked about willingness to share their data, and how likely it was that they could be identified. They were also asked for the meaning of 'de-identified' and whether they would agree to sharing information for 'not solely commercial' purposes, thus mirroring the consent language used by Australia's My Health Record system. RESULTS: Our findings revealed substantial tolerance for researcher use of health data with consistent preference to share data when better privacy-preserving techniques were employed. This was not entirely consistent as slight preference was shown for aggregated data over differential privacy, despite differential privacy being objectively more secure. We conjecture this was because differential privacy and its benefits were not well understood. Similarly, respondents showed no consistent understanding of the term 'de-identified', indicating that this needs to be carefully defined in contexts that seek consent. Finally, many respondents who indicated a willingness to share for purposes that were 'not solely commercial' nevertheless rejected at least some specific scenarios that mixed research and commercial objectives, again indicating a possible gap in their understanding of the terms. CONCLUSIONS: We found overall preference for better privacy protection of data as a precondition for secondary use, but limitations in respondents' understanding of key terminology and the differing privacy guarantees of available techniques. Further effort is needed to word secondary data use consent policies to ensure public understanding of commonly used terms and methods, if genuinely informed consent for data sharing is to be gained.
Assuntos
Compreensão , Privacidade , Austrália , Estudos Transversais , Humanos , Disseminação de InformaçãoRESUMO
BACKGROUND: The use of processed secondary data for health monitoring of fattening pigs has been established in various areas, such as the use of antibiotics or in the context of meat inspection. Standardized scores were calculated based on several sources of production data and can be used to describe animal health in a large collective of pig units. In the present study, the extent to which these scores are related to different farm characteristics and management decisions were investigated. In addition, slaughter scores were compared with the results of a veterinary examination on the farms. RESULTS: The comparison of the results of the uni- and multifactorial analyses revealed that almost all of the examined factors play a role in at least one of the scores when considered individually. However, when various significant influencing factors were taken into account at any one time, most of the variables lost their statistical significance due to confounding effects. In particular, production data such as production costs or daily feed intake remained in the final models of the scores on mortality, average daily gain and external lesions. Regarding the second part of the investigation, a basic technical correlation between the slaughter scores and the on-farm indicators could be established via principal component analysis. The modelling of the slaughter scores by the on-farm indicators showed that the score on external lesions could be represented by equivalent variables recorded on the farm (e.g., lesions caused by tail or ear biting). CONCLUSIONS: It has been demonstrated that the examined health scores are influenced by various farm and management characteristics. However, when several factors are taken into account, confounding occurs in some cases, which must be considered by consultants. Additionally, it was shown that on-farm examination content is related to the scores based on equivalent findings from slaughter pigs.
RESUMO
The OMOP Common Data Model (OMOP CDM) is an option to store patient data and to use these in an international context. Up to now, rare diseases can only be partly described in OMOP CDM. Therefore, it is necessary to investigate which special features in the context of rare diseases (e.g. terminologies) have to be considered, how these can be included in OMOP CDM and how physicians can use the data. An interdisciplinary team developed (1) a Transition Database for Rare Diseases by mapping Orpha Code, Alpha ID, SNOMED, ICD-10-GM, ICD-10-WHO and OMOP-conform concepts; and (2) a Rare Diseases Dashboard for physicians of a German Center of Rare Diseases by using methods of user-centered design. This demonstrated how OMOP CDM can be flexibly extended for different medical issues by using independent tools for mappings and visualization. Thereby, the adaption of OMOP CDM allows for international collaboration, enables (distributed) analysis of patient data and thus it can improve the care of people with rare diseases.
Assuntos
Doenças Raras , Systematized Nomenclature of Medicine , Bases de Dados Factuais , Atenção à Saúde , HumanosRESUMO
BACKGROUND: England operates a National Data Opt-Out (NDOO) for the secondary use of confidential health data for research and planning. We hypothesised that public awareness and support for the secondary use of health data and the NDOO would vary by participant demography and healthcare experience. We explored patient/public awareness and perceptions of secondary data use, grouping potential researchers into National Health Service (NHS), academia or commercial. We assessed awareness of the NDOO system amongst patients, carers, healthcare staff and the public. We co-developed recommendations to consider when sharing unconsented health data for research. METHODS: A patient and public engagement program, co-created and including patient and public workshops, questionnaires and discussion groups regarding anonymised health data use. RESULTS: There were 350 participants in total. Central concerns for health data use included unauthorised data re-use, the potential for discrimination and data sharing without patient benefit. 94% of respondents were happy for their data to be used for NHS research, 85% for academic research and 68% by health companies, but less than 50% for non-healthcare companies and opinions varied with demography and participant group. Questionnaires showed that knowledge of the NDOO was low, with 32% of all respondents, 53% of all NHS staff and 29% of all patients aware of the NDOO. Recommendations to guide unconsented secondary health data use included that health data use should benefit patients; data sharing decisions should involve patients/public. That data should remain in close proximity to health services with the principles of data minimisation applied. Further, that there should be transparency in secondary health data use, including publicly available lists of projects, summaries and benefits. Finally, organisations involved in data access decisions should participate in programmes to increase knowledge of the NDOO, to ensure public members were making informed choices about their own data. CONCLUSION: The majority of participants in this study reported that the use of healthcare data for secondary purposes was acceptable when accessed by NHS. Academic and health-focused companies. However, awareness was limited, including of the NDOO. Further development of publicly-agreed recommendations for secondary health data use may improve both awareness and confidence in secondary health data use.
Health data from routine care can be pseudonymised (with a link remaining to the patient but identifying features removed) or anonymised (with identifying features removed and the link to the patient severed) and used for research and health planning; termed "secondary use". The National Health Service (NHS) is a single publicly-funded health service for the United Kingdom (UK). The NHS supports secondary data use with a National Data opt-out system. The potential benefits of data secondary use are clear but concerns have been raised. Although the Data Opt-Out is publicised, it is unclear how much public awareness there is of this scheme. We report a patient and publicly created and delivered series of activities including > 350 people; with young adults, patients, NHS staff and the public; to assess concerns, knowledge and acceptance of data sharing.Perceptions of and support for secondary health data use varied depending on who was asked (by age, gender) and their experience of health services (Staff member, patient, member of the public). Knowledge of schemes to limit secondary data use (such as the UK National Data Op-Out) was low, even among NHS staff. The main concerns of sharing health data included onward data use, the potential for discrimination and exploitation and commercial gain from data use with no benefit to patients. Despite this, most participants agreed with health data sharing with NHS, academic and commercial health-based entities. Agreed, co-created themes to increase the acceptability of health data secondary use included education about 'Opt-out' schemes, health service oversight of data use (as the most trusted partner), public and patient involvement in data sharing decisions and public transparency.
RESUMO
Human biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard. DUO is a hierarchical vocabulary of human and machine-readable data use terms that consistently and unambiguously represents a dataset's allowable data uses. DUO has been implemented by major international stakeholders such as the Broad and Sanger Institutes and is currently used in annotation of over 200,000 datasets worldwide. Using DUO in data management and access facilitates researchers' discovery and access of relevant datasets. DUO annotations increase the FAIRness of datasets and support data linkages using common data use profiles when integrating the data for secondary analyses. DUO is implemented in the Web Ontology Language (OWL) and, to increase community awareness and engagement, hosted in an open, centralized GitHub repository. DUO, together with the GA4GH Passport standard, offers a new, efficient, and streamlined data authorization and access framework that has enabled increased sharing of biomedical datasets worldwide.
RESUMO
The availability of research and outcomes data is the primary limitation to evidence-based practice. Today, only a fraction of clinical decisions are based upon evidence derived from randomized control trials (RCTs), the gold-standard of knowledge discovery. At the same time, clinical trial complexity has steadily increased as has the effort required at clinical investigational sites. Direct use of electronic health record (EHR) data for clinical trials has the potential to address some of these needs, improving data quality and reducing cost.
Assuntos
Sistemas de Apoio a Decisões Clínicas , Troca de Informação em Saúde , Controle de Custos , Confiabilidade dos Dados , Registros Eletrônicos de Saúde , Troca de Informação em Saúde/normas , Humanos , Ensaios Clínicos Controlados Aleatórios como AssuntoRESUMO
Medical data are being generated in large quantities. However, these data are rarely used beyond their primary purpose. Big data methods to medical data offer the opportunity to transform healthcare. The healthcare industry has been a lot less successful than other industries in applying these new tools. Main reasons are privacy concerns and the fact that medical data in Germany are scattered across institutions. The method of data donation can offer a solution. The first aim of the proposed Medical Data Donation Enabler (MADE) project is to investigate ethical, legal and socio-technical barriers to data donation. The second aim is to develop a concept of a medical data donation process model that addresses the barriers by providing a data donation process model. The process model concept created through MADE could be provided for this purpose.
Assuntos
Big Data , Segurança Computacional , Alemanha , PropriedadeRESUMO
EHR-based, computable phenotypes can be leveraged by healthcare organizations and researchers to improve the cohort identification process. The ability to identify patient cohorts using aspects of care and outcomes based on clinical characteristics or diagnostic conditions and/or risk factors presents opportunities to researchers targeting specific populations for drug development and disease interventions. The objective of this review was to summarize the literature describing the development and use of phenotypes for cohort identification of cancer patients. A survey of the literature indexed in PubMed was performed to identify studies using EHR-based phenotypes for use in cancer studies. Specific search criteria were formulated by leveraging a phenotype identification guideline developed by the Phenotypes, Data Standards, and Data Quality Core of the NIH Health Care Systems Research Collaboratory. The final set of articles was examined further to identify 1) the cancer of interest and 2) the different approaches used for phenotype development, validation and implementation. The articles reviewed were specific to breast cancer, colorectal cancer, ovarian cancer, and lung cancer. The approaches taken for phenotype development and validation varied slightly among the relevant publications. Four studies relied on chart review, three utilized machine learning techniques, one took an ontological approach, and one utilized natural language processing (NLP).
Assuntos
Registros Eletrônicos de Saúde , Processamento de Linguagem Natural , Neoplasias , Fenótipo , Estudos de Coortes , Atenção à Saúde , Humanos , Neoplasias/diagnósticoRESUMO
PURPOSE: We explore the challenges of the secondary use of data in clinical information systems which critical care units in the National Health Service (England) are facing. METHODS: We conducted an online survey of critical care units in England regarding their practices in collecting and using clinical information systems and data. RESULTS: Critical care units use clinical information systems typically independently of hospital information systems and focus mainly on using data for auditing, management reporting and research. Respondents reported that extracting data from their clinical information system was difficult and that they would use stored data more if it were easier to access. Data extraction takes time and who extracts data, the training they receive and the tools they use affect the extraction and use of data. CONCLUSION: A number of key challenges affect the secondary use of data in critical care: a lack of integration of information systems within critical care and across departments; barriers to accessing data; mismatched data tools and user requests. Data are predominantly used for reporting and research with less emphasis on using data to inform clinical practice.
RESUMO
PURPOSE: This study measured the quality of data extracted from a clinical information system widely used for critical care quality improvement and research. MATERIALS AND METHODS: We abstracted data from 30 fields in a random sample of 207 patients admitted to nine adult, medical-surgical intensive care units. We assessed concordance between data collected: (1) manually from the bedside system (eCritical MetaVision) by trained auditors, and (2) electronically from the system data warehouse (eCritical TRACER). Agreement was assessed using Cohen's Kappa for categorical variables and intraclass correlation coefficient (ICC) for continuous variables. RESULTS: Concordance between data sets was excellent. There was perfect agreement for 11/30 variables (35%). The median Kappa score for the 16 categorical variables was 0.99 (IQR 0.92-1.00). APACHE II had an ICC of 0.936 (0.898-0.960). The lowest concordance was observed for SOFA renal and respiratory components (ICC 0.804 and 0.846, respectively). Score translation errors by the manual auditor were the most common source of data discrepancies. CONCLUSIONS: Manual validation processes of electronic data are complex in comparison to validation of traditional clinical documentation. This study represents a straightforward approach to validate the use of data repositories to support reliable and efficient use of high quality secondary use data.
Assuntos
Cuidados Críticos/métodos , Registros Eletrônicos de Saúde/normas , Unidades de Terapia Intensiva , Informática Médica/métodos , Melhoria de Qualidade , APACHE , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Garantia da Qualidade dos Cuidados de Saúde , Projetos de Pesquisa , Estudos RetrospectivosRESUMO
Qualitative data provide rich information on research questions in diverse fields. Recent calls for increased transparency and openness in research emphasize data sharing. However, qualitative data sharing has yet to become the norm internationally and is particularly uncommon in the United States. Guidance for archiving and secondary use of qualitative data is required for progress in this regard. In this study, we review the benefits and concerns associated with qualitative data sharing and then describe the results of a content analysis of guidelines from international repositories that archive qualitative data. A minority of repositories provide qualitative data sharing guidelines. Of the guidelines available, there is substantial variation in whether specific topics are addressed. Some topics, such as removing direct identifiers, are consistently addressed, while others, such as providing an anonymization log, are not. We discuss the implications of our study for education, best practices, and future research.
Assuntos
Ética em Pesquisa , Guias como Assunto , Disseminação de Informação , Armazenamento e Recuperação da Informação , Pesquisa Qualitativa , Pesquisa , Pesquisa Biomédica/ética , Comportamento Cooperativo , Humanos , Internacionalidade , Estados UnidosRESUMO
BACKGROUND: Data that needs to be documented for clinical studies has often been acquired and documented in clinical routine. Usually this data is manually transferred to Case Report Forms (CRF) and/or directly into an electronic data capture (EDC) system. OBJECTIVES: To enhance the documentation process of a large clinical follow-up study targeting patients admitted for acutely decompensated heart failure by accessing the data created during routine and study visits from a hospital information system (HIS) and by transferring it via a data warehouse (DWH) into the study's EDC system. METHODS: This project is based on the clinical DWH developed at the University of Würzburg. The DWH was extended by several new data domains including data created by the study team itself. An R user interface was developed for the DWH that allows to access its source data in all its detail, to transform data as comprehensively as possible by R into study-specific variables and to support the creation of data and catalog tables. RESULTS: A data flow was established that starts with labeling patients as study patients within the HIS and proceeds with updating the DWH with this label and further data domains at a daily rate. Several study-specific variables were defined using the implemented R user interface of the DWH. This system was then used to export these variables as data tables ready for import into our EDC system. The data tables were then used to initialize the first 296 patients within the EDC system by pseudonym, visit and data values. Afterwards, these records were filled with clinical data on heart failure, vital parameters and time spent on selected wards. CONCLUSIONS: This solution focuses on the comprehensive access and transformation of data for a DWH-EDC system linkage. Using this system in a large clinical study has demonstrated the feasibility of this approach for a study with a complex visit schedule.