Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 6 de 6
1.
JAMIA Open ; 6(3): ooad054, 2023 Oct.
Article En | MEDLINE | ID: mdl-37545984

Objective: To describe the infrastructure, tools, and services developed at Stanford Medicine to maintain its data science ecosystem and research patient data repository for clinical and translational research. Materials and Methods: The data science ecosystem, dubbed the Stanford Data Science Resources (SDSR), includes infrastructure and tools to create, search, retrieve, and analyze patient data, as well as services for data deidentification, linkage, and processing to extract high-value information from healthcare IT systems. Data are made available via self-service and concierge access, on HIPAA compliant secure computing infrastructure supported by in-depth user training. Results: The Stanford Medicine Research Data Repository (STARR) functions as the SDSR data integration point, and includes electronic medical records, clinical images, text, bedside monitoring data and HL7 messages. SDSR tools include tools for electronic phenotyping, cohort building, and a search engine for patient timelines. The SDSR supports patient data collection, reproducible research, and teaching using healthcare data, and facilitates industry collaborations and large-scale observational studies. Discussion: Research patient data repositories and their underlying data science infrastructure are essential to realizing a learning health system and advancing the mission of academic medical centers. Challenges to maintaining the SDSR include ensuring sufficient financial support while providing researchers and clinicians with maximal access to data and digital infrastructure, balancing tool development with user training, and supporting the diverse needs of users. Conclusion: Our experience maintaining the SDSR offers a case study for academic medical centers developing data science and research informatics infrastructure.

2.
Stud Health Technol Inform ; 216: 1084, 2015.
Article En | MEDLINE | ID: mdl-26262383

Self-service cohort discovery tools strive to provide intuitive interfaces to large Clinical Data Warehouses that contain extensive historic information. In those tools, controlled vocabulary (e.g., ICD-9-CM, CPT) coded clinical information is often the main search criteria used because of its ubiquity in billing processes. These tools generally require a researcher to pick specific terms from the controlled vocabulary. However, controlled vocabularies evolve over time as medical knowledge changes and can even be replaced with new versions (e.g., ICD-9 to ICD-10). These tools generally only display the current version of the controlled vocabulary. Researchers should not be expected to understand the underlying controlled vocabulary versioning issues. We propose a computable controlled vocabulary versioning system that allows cohort discovery tools to automatically expand queries to account for terminology changes.


Data Accuracy , Data Mining/methods , Database Management Systems/organization & administration , Electronic Health Records/organization & administration , International Classification of Diseases , Natural Language Processing
3.
AMIA Annu Symp Proc ; 2009: 391-5, 2009 Nov 14.
Article En | MEDLINE | ID: mdl-20351886

STRIDE (Stanford Translational Research Integrated Database Environment) is a research and development project at Stanford University to create a standards-based informatics platform supporting clinical and translational research. STRIDE consists of three integrated components: a clinical data warehouse, based on the HL7 Reference Information Model (RIM), containing clinical information on over 1.3 million pediatric and adult patients cared for at Stanford University Medical Center since 1995; an application development framework for building research data management applications on the STRIDE platform and a biospecimen data management system. STRIDE's semantic model uses standardized terminologies, such as SNOMED, RxNorm, ICD and CPT, to represent important biomedical concepts and their relationships. The system is in daily use at Stanford and is an important component of Stanford University's CTSA (Clinical and Translational Science Award) Informatics Program.


Database Management Systems , Medical Records Systems, Computerized/organization & administration , Translational Research, Biomedical , Academic Medical Centers , Biomedical Research , California , Computer Communication Networks , Humans , Programming Languages
4.
Pac Symp Biocomput ; : 243-54, 2008.
Article En | MEDLINE | ID: mdl-18229690

Traditionally, the elucidation of genes involved in maturation and aging has been studied in a temporal fashion by examining gene expression at different time points in an organism's life as well as by knocking out, knocking in, and mutating genes thought to be involved. Here, we propose an in silico method to combine clinical electronic medical record (EMR) data and gene expression measurements in the context of disease to identify genes that may be involved in the process of human maturation and aging. First we show that absolute lymphocyte count may serve as a biomarker for maturation by using statistical methods to compare trends among different clinical laboratory tests in response to an increase in age. We then propose using the rate of decay for absolute lymphocyte count across 12 diseases as a proxy for differences in aging. We correlate the differing rates with gene expression across the same diseases to find maturation/aging related genes. Among the 53 genes with strongest correlations between expression profile and change in rate of decay, we found genes previously implicated in the process of aging, including MGMT (DNA repair), TERF2 (telomere stability), POLD1 (DNA replication and repair), and POLG (mtDNA replication).


Aging/genetics , Gene Expression Profiling/statistics & numerical data , Genetic Markers , Hospital Records , Medical Records Systems, Computerized , Adolescent , Aging/blood , Analysis of Variance , Child , Child, Preschool , Computational Biology , Humans , Infant , Infant, Newborn , Lymphocyte Count
5.
AMIA Annu Symp Proc ; : 115-9, 2007 Oct 11.
Article En | MEDLINE | ID: mdl-18693809

The severity of diseases has often been assigned by direct observation of a patient and by pathological examination after symptoms have appeared. As we move into the genomic era, the ability to predict disease severity prior to manifestation has improved dramatically due to genomic sequencing and analysis of gene expression microarrays. However, as the severity of diseases can be exacerbated by non genetic factors, the ability to predict disease severity by examining gene expression alone may be inadequate. We propose the creation of a "clinarray" to examine phenotypic expression in the form of clinical laboratory measurements. We demonstrate that the clinarray can be used to distinguish between the severities of patients with cystic fibrosis and those with Crohn's disease by applying unsupervised clustering methods that have been previously applied to microarrays.


Clinical Laboratory Techniques , Crohn Disease/classification , Cystic Fibrosis/classification , Phenotype , Severity of Illness Index , Crohn Disease/diagnosis , Crohn Disease/genetics , Cystic Fibrosis/diagnosis , Cystic Fibrosis/genetics , Down Syndrome/classification , Down Syndrome/diagnosis , Down Syndrome/genetics , Humans , Medical Records Systems, Computerized , Prognosis
6.
Proc AMIA Symp ; : 245-9, 2002.
Article En | MEDLINE | ID: mdl-12463824

Access to clinical data is of increasing importance to biomedical research. The pending HIPAA privacy regulations provide specific requirements for the release of protected health information. Under the regulations, biomedical researchers may utilize anonymized data, or adhere to HIPAA requirements regarding protected health information. In order to provide researchers with anonymized data from a clinical research database, we reviewed several published strategies for de-identification of protected health information. Critical analysis with respect to this project suggests that de-identification alone is problematic when applied to clinical research databases. We propose a hybrid system; utilizing secure key escrow, de-identification, and role-based access for IRB approved researchers.


Biomedical Research , Computer Security , Confidentiality , Medical Records Systems, Computerized , Databases as Topic , Humans
...