Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 151(7): 1457-73, 2012 Dec 21.
Artigo em Inglês | MEDLINE | ID: mdl-23245941

RESUMO

Wnt/ß-catenin signaling plays a key role in the pathogenesis of colon and other cancers; emerging evidence indicates that oncogenic ß-catenin regulates several biological processes essential for cancer initiation and progression. To decipher the role of ß-catenin in transformation, we classified ß-catenin activity in 85 cancer cell lines in which we performed genome-scale loss-of-function screens and found that ß-catenin active cancers are dependent on a signaling pathway involving the transcriptional regulator YAP1. Specifically, we found that YAP1 and the transcription factor TBX5 form a complex with ß-catenin. Phosphorylation of YAP1 by the tyrosine kinase YES1 leads to localization of this complex to the promoters of antiapoptotic genes, including BCL2L1 and BIRC5. A small-molecule inhibitor of YES1 impeded the proliferation of ß-catenin-dependent cancers in both cell lines and animal models. These observations define a ß-catenin-YAP1-TBX5 complex essential to the transformation and survival of ß-catenin-driven cancers.


Assuntos
Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Transformação Celular Neoplásica , Neoplasias do Colo/metabolismo , Fosfoproteínas/metabolismo , Proteínas com Domínio T/metabolismo , beta Catenina/metabolismo , Animais , Linhagem Celular Tumoral , Colo/embriologia , Colo/metabolismo , Neoplasias do Colo/patologia , Humanos , Proteínas Inibidoras de Apoptose/genética , Camundongos , Camundongos Nus , Proteínas Proto-Oncogênicas c-yes/antagonistas & inibidores , Proteínas Proto-Oncogênicas c-yes/metabolismo , Survivina , Fatores de Transcrição , Transcrição Gênica , Proteínas de Sinalização YAP , Peixe-Zebra/embriologia , Proteína bcl-X/genética , Quinases da Família src/antagonistas & inibidores
2.
Cell ; 150(4): 842-54, 2012 Aug 17.
Artigo em Inglês | MEDLINE | ID: mdl-22901813

RESUMO

Due to genome instability, most cancers exhibit loss of regions containing tumor suppressor genes and collateral loss of other genes. To identify cancer-specific vulnerabilities that are the result of copy number losses, we performed integrated analyses of genome-wide copy number and RNAi profiles and identified 56 genes for which gene suppression specifically inhibited the proliferation of cells harboring partial copy number loss of that gene. These CYCLOPS (copy number alterations yielding cancer liabilities owing to partial loss) genes are enriched for spliceosome, proteasome, and ribosome components. One CYCLOPS gene, PSMC2, encodes an essential member of the 19S proteasome. Normal cells express excess PSMC2, which resides in a complex with PSMC1, PSMD2, and PSMD5 and acts as a reservoir protecting cells from PSMC2 suppression. Cells harboring partial PSMC2 copy number loss lack this complex and die after PSMC2 suppression. These observations define a distinct class of cancer-specific liabilities resulting from genome instability.


Assuntos
Genes Essenciais , Instabilidade Genômica , Neoplasias/genética , ATPases Associadas a Diversas Atividades Celulares , Animais , Linhagem Celular Tumoral , Deleção Cromossômica , Dosagem de Genes , Genes Supressores de Tumor , Humanos , Camundongos , Camundongos Nus , Transplante de Neoplasias , Neoplasias/metabolismo , Complexo de Endopeptidases do Proteassoma/genética , Complexo de Endopeptidases do Proteassoma/metabolismo , Transplante Heterólogo
3.
J Gen Intern Med ; 38(1): 5-11, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36071325

RESUMO

IMPORTANCE: Case reports that externalize expert diagnostic reasoning are utilized for clinical reasoning instruction but are difficult to search based on symptoms, final diagnosis, or differential diagnosis construction. Computational approaches that uncover how experienced diagnosticians analyze the medical information in a case as they formulate a differential diagnosis can guide educational uses of case reports. OBJECTIVE: To develop a "reasoning-encoded" case database for advanced clinical reasoning instruction by applying natural language processing (NLP), a sub-field of artificial intelligence, to a large case report library. DESIGN: We collected 2525 cases from the New England Journal of Medicine (NEJM) Clinical Pathological Conference (CPC) from 1965 to 2020 and used NLP to analyze the medical terminology in each case to derive unbiased (not prespecified) categories of analysis used by the clinical discussant. We then analyzed and mapped the degree of category overlap between cases. RESULTS: Our NLP algorithms identified clinically relevant categories that reflected the relationships between medical terms (which included symptoms, signs, test results, pathophysiology, and diagnoses). NLP extracted 43,291 symptoms across 2525 cases and physician-annotated 6532 diagnoses (both primary and related diagnoses). Our unsupervised learning computational approach identified 12 categories of medical terms that characterized the differential diagnosis discussions within individual cases. We used these categories to derive a measure of differential diagnosis similarity between cases and developed a website ( universeofcpc.com ) to allow visualization and exploration of 55 years of NEJM CPC case series. CONCLUSIONS: Applying NLP to curated instances of diagnostic reasoning can provide insight into how expert clinicians correlate and coordinate disease categories and processes when creating a differential diagnosis. Our reasoning-encoded CPC case database can be used by clinician-educators to design a case-based curriculum and by physicians to direct their lifelong learning efforts.


Assuntos
Inteligência Artificial , Processamento de Linguagem Natural , Humanos , Currículo , Algoritmos
4.
Nature ; 540(7631): 114-118, 2016 11 30.
Artigo em Inglês | MEDLINE | ID: mdl-27905446

RESUMO

Germ-cell tumours (GCTs) are derived from germ cells and occur most frequently in the testes. GCTs are histologically heterogeneous and distinctly curable with chemotherapy. Gains of chromosome arm 12p and aneuploidy are nearly universal in GCTs, but specific somatic genomic features driving tumour initiation, chemosensitivity and progression are incompletely characterized. Here, using clinical whole-exome and transcriptome sequencing of precursor, primary (testicular and mediastinal) and chemoresistant metastatic human GCTs, we show that the primary somatic feature of GCTs is highly recurrent chromosome arm level amplifications and reciprocal deletions (reciprocal loss of heterozygosity), variations that are significantly enriched in GCTs compared to 19 other cancer types. These tumours also acquire KRAS mutations during the development from precursor to primary disease, and primary testicular GCTs (TGCTs) are uniformly wild type for TP53. In addition, by functional measurement of apoptotic signalling (BH3 profiling) of fresh tumour and adjacent tissue, we find that primary TGCTs have high mitochondrial priming that facilitates chemotherapy-induced apoptosis. Finally, by phylogenetic analysis of serial TGCTs that emerge with chemotherapy resistance, we show how TGCTs gain additional reciprocal loss of heterozygosity and that this is associated with loss of pluripotency markers (NANOG and POU5F1) in chemoresistant teratomas or transformed carcinomas. Our results demonstrate the distinct genomic features underlying the origins of this disease and associated with the chemosensitivity phenotype, as well as the rare progression to chemoresistance. These results identify the convergence of cancer genomics, mitochondrial priming and GCT evolution, and may provide insights into chemosensitivity and resistance in other cancers.


Assuntos
Resistencia a Medicamentos Antineoplásicos , Genoma Humano/genética , Neoplasias Embrionárias de Células Germinativas/tratamento farmacológico , Neoplasias Embrionárias de Células Germinativas/genética , Apoptose , Progressão da Doença , Evolução Molecular , Exoma/genética , Genômica , Humanos , Perda de Heterozigosidade , Masculino , Mitocôndrias/metabolismo , Mutação , Proteína Homeobox Nanog/deficiência , Metástase Neoplásica/genética , Metástase Neoplásica/patologia , Neoplasias Embrionárias de Células Germinativas/metabolismo , Neoplasias Embrionárias de Células Germinativas/patologia , Fator 3 de Transcrição de Octâmero/deficiência , Filogenia , Proteínas Proto-Oncogênicas p21(ras)/genética , Teratoma/genética , Neoplasias Testiculares/tratamento farmacológico , Neoplasias Testiculares/genética , Neoplasias Testiculares/metabolismo , Neoplasias Testiculares/patologia , Transcriptoma/genética , Proteína Supressora de Tumor p53/genética
5.
Genes Dev ; 28(17): 1957-75, 2014 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-25184681

RESUMO

BRCA1 is a breast and ovarian tumor suppressor. Given its numerous incompletely understood functions and the possibility that more exist, we performed complementary systematic screens in search of new BRCA1 protein-interacting partners. New BRCA1 functions and/or a better understanding of existing ones were sought. Among the new interacting proteins identified, genetic interactions were detected between BRCA1 and four of the interactors: TONSL, SETX, TCEANC, and TCEA2. Genetic interactions were also detected between BRCA1 and certain interactors of TONSL, including both members of the FACT complex. From these results, a new BRCA1 function in the response to transcription-associated DNA damage was detected. Specifically, new roles for BRCA1 in the restart of transcription after UV damage and in preventing or repairing damage caused by stabilized R loops were identified. These roles are likely carried out together with some of the newly identified interactors. This new function may be important in BRCA1 tumor suppression, since the expression of several interactors, including some of the above-noted transcription proteins, is repeatedly aberrant in both breast and ovarian cancers.


Assuntos
Proteína BRCA1/metabolismo , Dano ao DNA/genética , Reparo do DNA/genética , Transcrição Gênica/genética , Proteína BRCA1/genética , Linhagem Celular Tumoral , Células HeLa , Humanos , NF-kappa B/genética , NF-kappa B/metabolismo , Ligação Proteica , Mapeamento de Interação de Proteínas , Raios Ultravioleta
6.
Genome Res ; 23(4): 665-78, 2013 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-23269662

RESUMO

Genome-scale RNAi libraries enable the systematic interrogation of gene function. However, the interpretation of RNAi screens is complicated by the observation that RNAi reagents designed to suppress the mRNA transcripts of the same gene often produce a spectrum of phenotypic outcomes due to differential on-target gene suppression or perturbation of off-target transcripts. Here we present a computational method, Analytic Technique for Assessment of RNAi by Similarity (ATARiS), that takes advantage of patterns in RNAi data across multiple samples in order to enrich for RNAi reagents whose phenotypic effects relate to suppression of their intended targets. By summarizing only such reagent effects for each gene, ATARiS produces quantitative, gene-level phenotype values, which provide an intuitive measure of the effect of gene suppression in each sample. This method is robust for data sets that contain as few as 10 samples and can be used to analyze screens of any number of targeted genes. We used this analytic approach to interrogate RNAi data derived from screening more than 100 human cancer cell lines and identified HNF1B as a transforming oncogene required for the survival of cancer cells that harbor HNF1B amplifications. ATARiS is publicly available at http://broadinstitute.org/ataris.


Assuntos
Regulação Neoplásica da Expressão Gênica , Genômica , Interferência de RNA , RNA Interferente Pequeno/genética , Software , Animais , Transformação Celular Neoplásica/genética , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Genômica/métodos , Fator 1-beta Nuclear de Hepatócito/genética , Humanos , Internet , Camundongos , Neoplasias/genética , Fenótipo , Reprodutibilidade dos Testes
7.
Stud Health Technol Inform ; 310: 464-468, 2024 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-38269846

RESUMO

Treatment patterns in systemic anticancer therapy (SACT) are extremely varied and complex. While professional society guidelines exist that suggest recommended treatment strategies, these guidelines are produced through an extremely laborious and sometimes opaque manual process, making it impossible for such guidelines to cover all relevant treatment scenarios. To complement these manually curated guidelines, we leveraged a database of 5818 clinical trials and 7012 supporting references from 1943-present to calculate a quantifiable "relevance score". In a pilot evaluation, this score was strongly associated with professional society guideline recommendations, while also providing relevance information on thousands of additional therapies. We show that this score also accurately illustrates trends in SACT adoption over time. We foresee that this score, which comprehensively evaluates the relevance of SACT overall and by cancer subtype, will have utility for clinical practitioners as well as researchers in real-world data.


Assuntos
Trabalho de Parto , Gravidez , Feminino , Humanos , Bases de Dados Factuais , Pesquisadores
8.
Obstet Gynecol ; 2024 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-39361958

RESUMO

Iron deficiency in pregnancy remains underdiagnosed despite professional society recommendations for first-trimester complete blood count (CBC) screening. To determine the effectiveness of the CBC hemoglobin and mean corpuscular volume (MCV) to identify iron deficiency in pregnancy, we conducted a retrospective analysis of 20,550 pregnancies from 2009 to 2022 at the University of California, San Francisco, obstetrics clinics. A total of 16,547 (80.5%) pregnant individuals had first-trimester screening CBC; 345 (2.1%) had a coincident ferritin test. Hemoglobin level less than 11 g/dL and MCV level less than 80 fL each had sensitivity of only 30% (95% CI, 20-41%) to detect first-trimester iron deficiency (ferritin level less than 30), corresponding to a negative likelihood ratio of 0.90 (95% CI, 0.77-1.05) and 0.85 (95% CI, 0.73-0.99), respectively. More than 50% of the 1,749 women with documented iron deficiency anytime during pregnancy were neither anemic nor microcytic at the time of diagnosis.

9.
JAMIA Open ; 7(4): ooae073, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-39399272

RESUMO

Objective: We aimed to investigate the impact of social circumstances on cancer therapy selection using natural language processing to derive insights from social worker documentation. Materials and Methods: We developed and employed a Bidirectional Encoder Representations from Transformers (BERT) based approach, using a hierarchical multi-step BERT model (BERT-MS), to predict the prescription of targeted cancer therapy to patients based solely on documentation by clinical social workers. Our corpus included free-text clinical social work notes, combined with medication prescription information, for all patients treated for breast cancer at UCSF between 2012 and 2021. We conducted a feature importance analysis to identify the specific social circumstances that impact cancer therapy regimen. Results: Using only social work notes, we consistently predicted the administration of targeted therapies, suggesting systematic differences in treatment selection exist due to non-clinical factors. The findings were confirmed by several language models, with GatorTron achieving the best performance with an area under the receiver operating characteristic curve (AUROC) of 0.721 and a Macro F1 score of 0.616. The UCSF BERT-MS model, capable of leveraging multiple pieces of notes, surpassed the UCSF-BERT model in both AUROC and Macro-F1. Our feature importance analysis identified several clinically intuitive social determinants of health that potentially contribute to disparities in treatment. Discussion: Leveraging social work notes can be instrumental in identifying disparities in clinical decision-making. Hypotheses generated in an automated way could be used to guide patient-specific quality improvement interventions. Further validation with diverse clinical outcomes and prospective studies is essential. Conclusions: Our findings indicate that significant disparities exist among breast cancer patients receiving different types of therapies based on social determinants of health. Social work reports play a crucial role in understanding these disparities in clinical decision-making.

10.
JAMIA Open ; 7(1): ooad112, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38223407

RESUMO

Objective: Existing research on social determinants of health (SDoH) predominantly focuses on physician notes and structured data within electronic medical records. This study posits that social work notes are an untapped, potentially rich source for SDoH information. We hypothesize that clinical notes recorded by social workers, whose role is to ameliorate social and economic factors, might provide a complementary information source of data on SDoH compared to physician notes, which primarily concentrate on medical diagnoses and treatments. We aimed to use word frequency analysis and topic modeling to identify prevalent terms and robust topics of discussion within a large cohort of social work notes including both outpatient and in-patient consultations. Materials and methods: We retrieved a diverse, deidentified corpus of 0.95 million clinical social work notes from 181 644 patients at the University of California, San Francisco. We conducted word frequency analysis related to ICD-10 chapters to identify prevalent terms within the notes. We then applied Latent Dirichlet Allocation (LDA) topic modeling analysis to characterize this corpus and identify potential topics of discussion, which was further stratified by note types and disease groups. Results: Word frequency analysis primarily identified medical-related terms associated with specific ICD10 chapters, though it also detected some subtle SDoH terms. In contrast, the LDA topic modeling analysis extracted 11 topics explicitly related to social determinants of health risk factors, such as financial status, abuse history, social support, risk of death, and mental health. The topic modeling approach effectively demonstrated variations between different types of social work notes and across patients with different types of diseases or conditions. Discussion: Our findings highlight LDA topic modeling's effectiveness in extracting SDoH-related themes and capturing variations in social work notes, demonstrating its potential for informing targeted interventions for at-risk populations. Conclusion: Social work notes offer a wealth of unique and valuable information on an individual's SDoH. These notes present consistent and meaningful topics of discussion that can be effectively analyzed and utilized to improve patient care and inform targeted interventions for at-risk populations.

11.
Clin Pharmacol Ther ; 115(4): 847-859, 2024 04.
Artigo em Inglês | MEDLINE | ID: mdl-38345264

RESUMO

Electronic health records (EHRs) provide meaningful knowledge of drug-related adverse events (AEs) that are not captured in standard drug development and postmarketing surveillance. Using variables obtained from EHR data in the University of California San Francisco de-identified Clinical Data Warehouse, we aimed to evaluate the potential of machine learning to predict two hematological AEs, thrombocytopenia and anemia, in a cohort of patients treated with linezolid for 3 or more days. Features for model input were extracted at linezolid initiation (index), and outcomes were characterized from index to 14 days post-treatment. Random forest classification (RFC) was used for AE prediction, and reduced feature models were evaluated using cumulative importance (cImp) for feature selection. Grade 3+ thrombocytopenia and anemia occurred in 31% of 2,171 and 56% of 2,170 evaluable patients, respectively. Of the total 53 features, as few as 7 contributed at least 50% cImp, resulting in prediction accuracies of 70% or higher and area under the receiver operating characteristic curves of 0.886 for grade 3+ thrombocytopenia and 0.759 for grade 3+ anemia. Sensitivity analyses in strictly defined patient subgroups revealed similarly high predictive performance in full and reduced feature models. A logistic regression model with the same 50% cImp features showed similar predictive performance as RFC and good concordance with RFC probability predictions after isotonic calibration, adding interpretability. Collectively, this work demonstrates potential for machine learning prediction of AE risk in real-world patients using few variables regularly available in EHRs, which may aid in clinical decision making and/or monitoring.


Assuntos
Anemia , Trombocitopenia , Humanos , Linezolida/efeitos adversos , Anemia/induzido quimicamente , Anemia/epidemiologia , Trombocitopenia/induzido quimicamente , Trombocitopenia/diagnóstico , Trombocitopenia/epidemiologia , Modelos Logísticos , São Francisco
12.
J Am Med Inform Assoc ; 31(10): 2315-2327, 2024 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-38900207

RESUMO

OBJECTIVE: Although supervised machine learning is popular for information extraction from clinical notes, creating large annotated datasets requires extensive domain expertise and is time-consuming. Meanwhile, large language models (LLMs) have demonstrated promising transfer learning capability. In this study, we explored whether recent LLMs could reduce the need for large-scale data annotations. MATERIALS AND METHODS: We curated a dataset of 769 breast cancer pathology reports, manually labeled with 12 categories, to compare zero-shot classification capability of the following LLMs: GPT-4, GPT-3.5, Starling, and ClinicalCamel, with task-specific supervised classification performance of 3 models: random forests, long short-term memory networks with attention (LSTM-Att), and the UCSF-BERT model. RESULTS: Across all 12 tasks, the GPT-4 model performed either significantly better than or as well as the best supervised model, LSTM-Att (average macro F1-score of 0.86 vs 0.75), with advantage on tasks with high label imbalance. Other LLMs demonstrated poor performance. Frequent GPT-4 error categories included incorrect inferences from multiple samples and from history, and complex task design, and several LSTM-Att errors were related to poor generalization to the test set. DISCUSSION: On tasks where large annotated datasets cannot be easily collected, LLMs can reduce the burden of data labeling. However, if the use of LLMs is prohibitive, the use of simpler models with large annotated datasets can provide comparable results. CONCLUSIONS: GPT-4 demonstrated the potential to speed up the execution of clinical NLP studies by reducing the need for large annotated datasets. This may increase the utilization of NLP-based variables and outcomes in clinical studies.


Assuntos
Neoplasias da Mama , Aprendizado de Máquina Supervisionado , Humanos , Neoplasias da Mama/patologia , Neoplasias da Mama/classificação , Feminino , Processamento de Linguagem Natural , Conjuntos de Dados como Assunto , Registros Eletrônicos de Saúde , Mineração de Dados/métodos
13.
JAMA Netw Open ; 7(5): e248895, 2024 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-38713466

RESUMO

Importance: The introduction of large language models (LLMs), such as Generative Pre-trained Transformer 4 (GPT-4; OpenAI), has generated significant interest in health care, yet studies evaluating their performance in a clinical setting are lacking. Determination of clinical acuity, a measure of a patient's illness severity and level of required medical attention, is one of the foundational elements of medical reasoning in emergency medicine. Objective: To determine whether an LLM can accurately assess clinical acuity in the emergency department (ED). Design, Setting, and Participants: This cross-sectional study identified all adult ED visits from January 1, 2012, to January 17, 2023, at the University of California, San Francisco, with a documented Emergency Severity Index (ESI) acuity level (immediate, emergent, urgent, less urgent, or nonurgent) and with a corresponding ED physician note. A sample of 10 000 pairs of ED visits with nonequivalent ESI scores, balanced for each of the 10 possible pairs of 5 ESI scores, was selected at random. Exposure: The potential of the LLM to classify acuity levels of patients in the ED based on the ESI across 10 000 patient pairs. Using deidentified clinical text, the LLM was queried to identify the patient with a higher-acuity presentation within each pair based on the patients' clinical history. An earlier LLM was queried to allow comparison with this model. Main Outcomes and Measures: Accuracy score was calculated to evaluate the performance of both LLMs across the 10 000-pair sample. A 500-pair subsample was manually classified by a physician reviewer to compare performance between the LLMs and human classification. Results: From a total of 251 401 adult ED visits, a balanced sample of 10 000 patient pairs was created wherein each pair comprised patients with disparate ESI acuity scores. Across this sample, the LLM correctly inferred the patient with higher acuity for 8940 of 10 000 pairs (accuracy, 0.89 [95% CI, 0.89-0.90]). Performance of the comparator LLM (accuracy, 0.84 [95% CI, 0.83-0.84]) was below that of its successor. Among the 500-pair subsample that was also manually classified, LLM performance (accuracy, 0.88 [95% CI, 0.86-0.91]) was comparable with that of the physician reviewer (accuracy, 0.86 [95% CI, 0.83-0.89]). Conclusions and Relevance: In this cross-sectional study of 10 000 pairs of ED visits, the LLM accurately identified the patient with higher acuity when given pairs of presenting histories extracted from patients' first ED documentation. These findings suggest that the integration of an LLM into ED workflows could enhance triage processes while maintaining triage quality and warrants further investigation.


Assuntos
Serviço Hospitalar de Emergência , Gravidade do Paciente , Humanos , Serviço Hospitalar de Emergência/estatística & dados numéricos , Estudos Transversais , Adulto , Masculino , Feminino , Pessoa de Meia-Idade , Índice de Gravidade de Doença , São Francisco
14.
Res Sq ; 2024 Feb 06.
Artigo em Inglês | MEDLINE | ID: mdl-38405831

RESUMO

Although supervised machine learning is popular for information extraction from clinical notes, creating large, annotated datasets requires extensive domain expertise and is time-consuming. Meanwhile, large language models (LLMs) have demonstrated promising transfer learning capability. In this study, we explored whether recent LLMs can reduce the need for large-scale data annotations. We curated a manually labeled dataset of 769 breast cancer pathology reports, labeled with 13 categories, to compare zero-shot classification capability of the GPT-4 model and the GPT-3.5 model with supervised classification performance of three model architectures: random forests classifier, long short-term memory networks with attention (LSTM-Att), and the UCSF-BERT model. Across all 13 tasks, the GPT-4 model performed either significantly better than or as well as the best supervised model, the LSTM-Att model (average macro F1 score of 0.83 vs. 0.75). On tasks with a high imbalance between labels, the differences were more prominent. Frequent sources of GPT-4 errors included inferences from multiple samples and complex task design. On complex tasks where large annotated datasets cannot be easily collected, LLMs can reduce the burden of large-scale data labeling. However, if the use of LLMs is prohibitive, the use of simpler supervised models with large annotated datasets can provide comparable results. LLMs demonstrated the potential to speed up the execution of clinical NLP studies by reducing the need for curating large annotated datasets. This may increase the utilization of NLP-based variables and outcomes in observational clinical studies.

15.
Lancet Digit Health ; 6(1): e12-e22, 2024 01.
Artigo em Inglês | MEDLINE | ID: mdl-38123252

RESUMO

BACKGROUND: Large language models (LLMs) such as GPT-4 hold great promise as transformative tools in health care, ranging from automating administrative tasks to augmenting clinical decision making. However, these models also pose a danger of perpetuating biases and delivering incorrect medical diagnoses, which can have a direct, harmful impact on medical care. We aimed to assess whether GPT-4 encodes racial and gender biases that impact its use in health care. METHODS: Using the Azure OpenAI application interface, this model evaluation study tested whether GPT-4 encodes racial and gender biases and examined the impact of such biases on four potential applications of LLMs in the clinical domain-namely, medical education, diagnostic reasoning, clinical plan generation, and subjective patient assessment. We conducted experiments with prompts designed to resemble typical use of GPT-4 within clinical and medical education applications. We used clinical vignettes from NEJM Healer and from published research on implicit bias in health care. GPT-4 estimates of the demographic distribution of medical conditions were compared with true US prevalence estimates. Differential diagnosis and treatment planning were evaluated across demographic groups using standard statistical tests for significance between groups. FINDINGS: We found that GPT-4 did not appropriately model the demographic diversity of medical conditions, consistently producing clinical vignettes that stereotype demographic presentations. The differential diagnoses created by GPT-4 for standardised clinical vignettes were more likely to include diagnoses that stereotype certain races, ethnicities, and genders. Assessment and plans created by the model showed significant association between demographic attributes and recommendations for more expensive procedures as well as differences in patient perception. INTERPRETATION: Our findings highlight the urgent need for comprehensive and transparent bias assessments of LLM tools such as GPT-4 for intended use cases before they are integrated into clinical care. We discuss the potential sources of these biases and potential mitigation strategies before clinical implementation. FUNDING: Priscilla Chan and Mark Zuckerberg.


Assuntos
Educação Médica , Instalações de Saúde , Feminino , Humanos , Masculino , Tomada de Decisão Clínica , Diagnóstico Diferencial , Atenção à Saúde
16.
NPJ Precis Oncol ; 7(1): 29, 2023 Mar 23.
Artigo em Inglês | MEDLINE | ID: mdl-36959495

RESUMO

The incidence and biochemical consequences of rare tumor subtypes are often hard to study. Fibrolamellar liver cancer (FLC) is a rare malignancy affecting adolescents and young adults. To better characterize the incidence and biochemical consequences of this disease, we combined a comprehensive analysis of the electronic medical record and national payer data and found that FLC incidence is likely five to eight times higher than previous estimates. By employing unsupervised learning on clinical laboratory data from patients with hyperammonemia, we find that FLC-associated hyperammonemia mirrors metabolic dysregulation in urea cycle disorders. Our findings demonstrate that advanced computational analysis of rich clinical datasets can provide key clinical and biochemical insights into rare cancers.

17.
Nat Commun ; 13(1): 604, 2022 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-35105861

RESUMO

The role of PPM1D mutations in de novo gliomagenesis has not been systematically explored. Here we analyze whole genome sequences of 170 pediatric high-grade gliomas and find that truncating mutations in PPM1D that increase the stability of its phosphatase are clonal driver events in 11% of Diffuse Midline Gliomas (DMGs) and are enriched in primary pontine tumors. Through the development of DMG mouse models, we show that PPM1D mutations potentiate gliomagenesis and that PPM1D phosphatase activity is required for in vivo oncogenesis. Finally, we apply integrative phosphoproteomic and functional genomics assays and find that oncogenic effects of PPM1D truncation converge on regulators of cell cycle, DNA damage response, and p53 pathways, revealing therapeutic vulnerabilities including MDM2 inhibition.


Assuntos
Glioma/genética , Mutação , Oncogenes/genética , Proteína Fosfatase 2C/genética , Adolescente , Adulto , Animais , Neoplasias do Tronco Encefálico/genética , Carcinogênese/genética , Ciclo Celular , Criança , Pré-Escolar , Dano ao DNA , Modelos Animais de Doenças , Feminino , Células HEK293 , Humanos , Lactente , Masculino , Camundongos , Proteínas Proto-Oncogênicas c-mdm2 , Transcriptoma , Proteína Supressora de Tumor p53/genética , Adulto Jovem
18.
Nat Cancer ; 3(8): 994-1011, 2022 08.
Artigo em Inglês | MEDLINE | ID: mdl-35788723

RESUMO

We analyzed the contributions of structural variants (SVs) to gliomagenesis across 179 pediatric high-grade gliomas (pHGGs). The most recurrent SVs targeted MYC isoforms and receptor tyrosine kinases (RTKs), including an SV amplifying a MYC enhancer in 12% of diffuse midline gliomas (DMG), indicating an underappreciated role for MYC in pHGG. SV signature analysis revealed that tumors with simple signatures were TP53 wild type (TP53WT) but showed alterations in TP53 pathway members PPM1D and MDM4. Complex signatures were associated with direct aberrations in TP53, CDKN2A and RB1 early in tumor evolution and with later-occurring extrachromosomal amplicons. All pHGGs exhibited at least one simple-SV signature, but complex-SV signatures were primarily restricted to subsets of H3.3K27M DMGs and hemispheric pHGGs. Importantly, DMGs with complex-SV signatures were associated with shorter overall survival independent of histone mutation and TP53 status. These data provide insight into the impact of SVs on gliomagenesis and the mechanisms that shape them.


Assuntos
Neoplasias Encefálicas , Glioma , Neoplasias Encefálicas/genética , Proteínas de Ciclo Celular/genética , Criança , Glioma/genética , Histonas/genética , Humanos , Mutação , Proteínas Proto-Oncogênicas/genética
19.
medRxiv ; 2020 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-32511606

RESUMO

Management of the COVID-19 pandemic has proven to be a significant challenge to policy makers. This is in large part due to uneven reporting and the absence of open-access visualization tools to present local trends and infer healthcare needs. Here we report the development of CovidCounties.org, an interactive web application that depicts daily disease trends at the level of US counties using time series plots and maps. This application is accompanied by a manually curated dataset that catalogs all major public policy actions made at the state-level, as well as technical validation of the primary data. Finally, the underlying code for the site is also provided as open source, enabling others to validate and learn from this work.

20.
Sci Data ; 7(1): 405, 2020 11 16.
Artigo em Inglês | MEDLINE | ID: mdl-33199721

RESUMO

Management of the COVID-19 pandemic has proven to be a significant challenge to policy makers. This is in large part due to uneven reporting and the absence of open-access visualization tools to present local trends and infer healthcare needs. Here we report the development of CovidCounties.org, an interactive web application that depicts daily disease trends at the level of US counties using time series plots and maps. This application is accompanied by a manually curated dataset that catalogs all major public policy actions made at the state-level, as well as technical validation of the primary data. Finally, the underlying code for the site is also provided as open source, enabling others to validate and learn from this work.


Assuntos
Infecções por Coronavirus/epidemiologia , Pneumonia Viral/epidemiologia , Software , Betacoronavirus , COVID-19 , Curadoria de Dados/métodos , Conjuntos de Dados como Assunto , Humanos , Internet , Pandemias , SARS-CoV-2 , Estados Unidos/epidemiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA