RESUMO
Research suggests that sleep plays a vital role in memory. We tested the impact of total sleep deprivation on adults' memory for a newly learned writing system and on their ability to generalise this knowledge to read untrained novel words. We trained participants to read fictitious words printed in a novel artificial orthography, while depriving them of sleep the night after learning (Experiment 1) or the night before learning (Experiment 2). Following two nights of recovery sleep, and again 10 days later, participants were tested on trained words and untrained words, and performance was compared to control groups who had not undergone sleep deprivation. Participants showed a high degree of accuracy in learning the trained words and in generalising their knowledge to untrained words. There was little evidence of impact of sleep deprivation on memory or generalisation. These data support emerging theories which suggest sleep-associated memory consolidation can be accelerated or entirely bypassed under certain conditions, and that such conditions also facilitate generalisation.
Assuntos
Generalização Psicológica/fisiologia , Idioma , Aprendizagem/fisiologia , Privação do Sono/psicologia , Adolescente , Adulto , Feminino , Humanos , Masculino , Consolidação da Memória/fisiologia , Sono/fisiologia , Adulto JovemRESUMO
In this study, we examined the effect of two metacognitive scaffolds on the accuracy of confidence judgments made while diagnosing dermatopathology slides in SlideTutor. Thirty-one (N = 31) first- to fourth-year pathology and dermatology residents were randomly assigned to one of the two scaffolding conditions. The cases used in this study were selected from the domain of Nodular and Diffuse Dermatitides. Both groups worked with a version of SlideTutor that provided immediate feedback on their actions for two hours before proceeding to solve cases in either the Considering Alternatives or Playback condition. No immediate feedback was provided on actions performed by participants in the scaffolding mode. Measurements included learning gains (pre-test and post-test), as well as metacognitive performance, including Goodman-Kruskal Gamma correlation, bias, and discrimination. Results showed that participants in both conditions improved significantly in terms of their diagnostic scores from pre-test to post-test. More importantly, participants in the Considering Alternatives condition outperformed those in the Playback condition in the accuracy of their confidence judgments and the discrimination of the correctness of their assertions while solving cases. The results suggested that presenting participants with their diagnostic decision paths and highlighting correct and incorrect paths helps them to become more metacognitively accurate in their confidence judgments.
RESUMO
Modern life causes a quarter of adults and half of teenagers to sleep for less than is recommended (Kocevska et al., 2021). Given well-documented benefits of sleep on memory, we must understand the cognitive costs of short sleep. We analysed 125 sleep restriction effect sizes from 39 reports involving 1234 participants. Restricting sleep (3-6.5â¯hours) compared to normal sleep (7-11â¯hours) negatively affects memory formation with a small effect size (Hedges' g = 0.29, 95â¯% CI = [0.13, 0.44]). We detected no evidence for publication bias. When sleep restriction effect sizes were compared with 185 sleep deprivation effect sizes (Newbury et al., 2021) no statistically significant difference was found, suggesting that missing some sleep has similar consequences for memory as not sleeping at all. When the analysis was restricted to post-encoding, rather than pre-encoding, sleep loss, sleep deprivation was associated with larger memory impairment than restriction. Our findings are best accounted for by the sequential hypothesis which emphasises complementary roles of slow-wave sleep and REM sleep for memory.
RESUMO
The purpose of this study is threefold: (1) to develop an automated, computer-based method to detect heuristics and biases as pathologists examine virtual slide cases, (2) to measure the frequency and distribution of heuristics and errors across three levels of training, and (3) to examine relationships of heuristics to biases, and biases to diagnostic errors. The authors conducted the study using a computer-based system to view and diagnose virtual slide cases. The software recorded participant responses throughout the diagnostic process, and automatically classified participant actions based on definitions of eight common heuristics and/or biases. The authors measured frequency of heuristic use and bias across three levels of training. Biases studied were detected at varying frequencies, with availability and search satisficing observed most frequently. There were few significant differences by level of training. For representativeness and anchoring, the heuristic was used appropriately as often or more often than it was used in biased judgment. Approximately half of the diagnostic errors were associated with one or more biases. We conclude that heuristic use and biases were observed among physicians at all levels of training using the virtual slide system, although their frequencies varied. The system can be employed to detect heuristic use and to test methods for decreasing diagnostic errors resulting from cognitive biases.
Assuntos
Diagnóstico por Computador/psicologia , Patologia/normas , Competência Clínica/normas , Diagnóstico por Computador/normas , Erros de Diagnóstico/psicologia , Humanos , Julgamento , Variações Dependentes do Observador , Patologia/métodosRESUMO
OBJECTIVES: We analyzed the extent to which comparative effectiveness research (CER) organizations share terms for designs, analyzed coverage of CER designs in Medical Subject Headings (MeSH) and Emtree, and explored whether scientists use CER design terms. METHODS: We developed local terminologies (LTs) and a CER design terminology by extracting terms in documents from five organizations. We defined coverage as the distribution over match type in MeSH and Emtree. We created a crosswalk by recording terms to which design terms mapped in both controlled vocabularies. We analyzed the hits for queries restricted to titles and abstracts to explore scientists' language. RESULTS: Pairwise LT overlap ranged from 22.64% (12/53) to 75.61% (31/41). The CER design terminology (nâ=â78 terms) consisted of terms for primary study designs and a few terms useful for evaluating evidence, such as opinion paper and systematic review. Patterns of coverage were similar in MeSH and Emtree (gammaâ=â0.581, Pâ=â0.002). CONCLUSIONS: Stakeholder terminologies vary, and terms are inconsistently covered in MeSH and Emtree. The CER design terminology and crosswalk may be useful for expert searchers. For partially mapped terms, queries could consist of free text for modifiers such as nonrandomized or interrupted added to broad or related controlled terms.
Assuntos
Pesquisa Comparativa da Efetividade/métodos , Bases de Dados Bibliográficas , Armazenamento e Recuperação da Informação/métodos , MEDLINE/organização & administração , Medical Subject Headings , Terminologia como Assunto , Humanos , National Library of Medicine (U.S.) , Estados UnidosRESUMO
Modern society is structured around early routines which cause evening types to suffer from health and performance detriments associated with sleep times being misaligned with biological needs (circadian preference). Given that COVID-19 lockdowns were less constrained by social schedules, the current study explores whether temporal behaviours became better aligned with biological needs, and whether these changes benefited work engagement. 406 UK participants reported circadian preference and pre-lockdown and lockdown sleep times, work times, and work engagement. Results found that sleep health improved under lockdown measures in terms of increased sleep duration and reduced social jetlag, and sleep and work times became better aligned with circadian preferences. The most circadian-misaligned participants - students and young adults - exhibited the largest changes to sleep and work habits. However, work engagement decreased more in participants with improved social jetlag and delayed work habits, which is surprising given that these temporal changes reflect improved circadian alignment. We discuss potential moderators including poor sleep quality, non-engaging work-from-home environments, and mental health. These findings have implications for encouraging flexible educational and employment schedules post-COVID-19 to satisfy the common drive to improve circadian alignment, but future work must determine the moderating factors that impair work engagement during remote work.
Assuntos
COVID-19 , Ritmo Circadiano , Adulto Jovem , Humanos , Engajamento no Trabalho , Controle de Doenças Transmissíveis , Sono , Síndrome do Jet Lag/complicações , Inquéritos e QuestionáriosRESUMO
MOTIVATION: Expressions that refer to a real-world entity already mentioned in a narrative are often considered anaphoric. For example, in the sentence "The pain comes and goes," the expression "the pain" is probably referring to a previous mention of pain. Interpretation of meaning involves resolving the anaphoric reference: deciding which expression in the text is the correct antecedent of the referring expression, also called an anaphor. We annotated a set of 180 clinical reports (surgical pathology, radiology, discharge summaries, and emergency department) from two institutions to indicate all anaphor-antecedent pairs. OBJECTIVE: The objective of this study is to describe the characteristics of the corpus in terms of the frequency of anaphoric relations, the syntactic and semantic nature of the members of the pairs, and the types of anaphoric relations that occur. Understanding how anaphoric reference is exhibited in clinical reports is critical to developing reference resolution algorithms and to identifying peculiarities of clinical text that may alter the features and methodologies that will be successful for automated anaphora resolution. RESULTS: We found that anaphoric reference is prevalent in all types of clinical reports, that annotations of noun phrases, semantic type, and section headings may be especially important for automated resolution of anaphoric reference, and that separate modules for reference resolution may be required for different report types, different institutions, and different types of anaphors. Accurate resolution will probably require extensive domain knowledge-especially for pathology and radiology reports with more part/whole and set/subset relations. CONCLUSION: We hope researchers will leverage the annotations in this corpus to develop automated algorithms and will add to the annotations to generate a more extensive corpus.
Assuntos
Registros Eletrônicos de Saúde/normas , Semântica , Algoritmos , Mineração de Dados/métodos , HumanosRESUMO
BACKGROUND: Stable and pain-free thumb function and pinch is vital for personal care, work, domestic and leisure activities. OBJECTIVE: The aim of this article is to increase awareness of timely referral to hand therapists for three common thumb conditions. This article covers three thumb conditions: acute trauma (ulnar collateral ligament), repetitive trauma (De Quervain's tendinopathy) and chronic progression (basal thumb osteoarthritis). DISCUSSION: The Australian Hand Therapy Association awards accreditation to occupational therapists and physiotherapists with specialist knowledge of the upper limb. Understanding the role of hand therapy in clinical testing, custom orthoses and exercise prescription benefits general practitioners, as timely referrals yield optimal functional outcomes for patients.
Assuntos
Osteoartrite , Médicos , Austrália , Terapia por Exercício , Humanos , Osteoartrite/terapia , PolegarRESUMO
While the biomedical informatics community widely acknowledges the utility of domain ontologies, there remain many barriers to their effective use. One important requirement of domain ontologies is that they must achieve a high degree of coverage of the domain concepts and concept relationships. However, the development of these ontologies is typically a manual, time-consuming, and often error-prone process. Limited resources result in missing concepts and relationships as well as difficulty in updating the ontology as knowledge changes. Methodologies developed in the fields of Natural Language Processing, information extraction, information retrieval and machine learning provide techniques for automating the enrichment of an ontology from free-text documents. In this article, we review existing methodologies and developed systems, and discuss how existing methods can benefit the development of biomedical ontologies.
Assuntos
Pesquisa Biomédica , Biologia Computacional , Mineração de Dados/métodos , Processamento de Linguagem Natural , Vocabulário ControladoRESUMO
Implementation of electronic health records (EHR), particularly computerized physician/provider order entry systems (CPOE), is often met with resistance. Influence presented at the right time, in the right manner, may minimize resistance or at least limit the risk of complete system failure. Combining established theories on power, influence tactics, and resistance, we developed the Ranked Levels of Influence model. Applying it to documented examples of EHR/CPOE failures at Cedars-Sinai and Kaiser Permanente in Hawaii, we evaluated the influence applied, the resistance encountered, and the resulting risk to the system implementation. Using the Ranked Levels of Influence model as a guideline, we demonstrate that these system failures were associated with the use of hard influence tactics that resulted in higher levels of resistance. We suggest that when influence tactics remain at the soft tactics level, the level of resistance stabilizes or de-escalates and the system can be saved.
Assuntos
Registros Eletrônicos de Saúde , Informática Médica , Havaí , Humanos , Sistemas de Registro de Ordens Médicas , Modelos Teóricos , MédicosRESUMO
Coreference resolution is the task of determining linguistic expressions that refer to the same real-world entity in natural language. Research on coreference resolution in the general English domain dates back to 1960s and 1970s. However, research on coreference resolution in the clinical free text has not seen major development. The recent US government initiatives that promote the use of electronic health records (EHRs) provide opportunities to mine patient notes as more and more health care institutions adopt EHR. Our goal was to review recent advances in general purpose coreference resolution to lay the foundation for methodologies in the clinical domain, facilitated by the availability of a shared lexical resource of gold standard coreference annotations, the Ontology Development and Information Extraction (ODIE) corpus.
Assuntos
Informática Médica/métodos , Processamento de Linguagem Natural , Registros Eletrônicos de Saúde , Humanos , Armazenamento e Recuperação da Informação , LinguísticaRESUMO
Research suggests that sleep deprivation both before and after encoding has a detrimental effect on memory for newly learned material. However, there is as yet no quantitative analyses of the size of these effects. We conducted two meta-analyses of studies published between 1970 and 2020 that investigated effects of total, acute sleep deprivation on memory (i.e., at least one full night of sleep deprivation): one for deprivation occurring before learning and one for deprivation occurring after learning. The impact of sleep deprivation after learning on memory was associated with Hedges' g = 0.277, 95% CI [0.177, 0.377]. Whether testing took place immediately after deprivation or after recovery sleep moderated the effect, with significantly larger effects observed in immediate tests. Procedural memory tasks also showed significantly larger effects than declarative memory tasks. The impact of sleep deprivation before learning was associated with Hedges' g = 0.621, 95% CI [0.473, 0.769]. Egger's tests for funnel plot asymmetry suggested significant publication bias in both meta-analyses. Statistical power was very low in most of the analyzed studies. Highly powered, preregistered replications are needed to estimate the underlying effect sizes more precisely. (PsycInfo Database Record (c) 2022 APA, all rights reserved).
Assuntos
Aprendizagem , Privação do Sono , Humanos , Viés de Publicação , SonoRESUMO
Previous studies in our laboratory have shown the benefits of immediate feedback on cognitive performance for pathology residents using an intelligent tutoring system (ITS) in pathology. In this study, we examined the effect of immediate feedback on metacognitive performance, and investigated whether other metacognitive scaffolds will support metacognitive gains when immediate feedback is faded. Twenty-three participants were randomized into intervention and control groups. For both groups, periods working with the ITS under varying conditions were alternated with independent computer-based assessments. On day 1, a within-subjects design was used to evaluate the effect of immediate feedback on cognitive and metacognitive performance. On day 2, a between-subjects design was used to compare the use of other metacognitive scaffolds (intervention group) against no metacognitive scaffolds (control group) on cognitive and metacognitive performance, as immediate feedback was faded. Measurements included learning gains (a measure of cognitive performance), as well as several measures of metacognitive performance, including Goodman-Kruskal gamma correlation (G), bias, and discrimination. For the intervention group, we also computed metacognitive measures during tutoring sessions. Results showed that immediate feedback in an intelligent tutoring system had a statistically significant positive effect on learning gains, G and discrimination. Removal of immediate feedback was associated with decreasing metacognitive performance, and this decline was not prevented when students used a version of the tutoring system that provided other metacognitive scaffolds. Results obtained directly from the ITS suggest that other metacognitive scaffolds do have a positive effect on G and discrimination, as immediate feedback is faded. We conclude that immediate feedback had a positive effect on both metacognitive and cognitive gains in a medical tutoring system. Other metacognitive scaffolds were not sufficient to replace immediate feedback in this study. However, results obtained directly from the tutoring system are not consistent with results obtained from assessments. In order to facilitate transfer to real-world tasks, further research will be needed to determine the optimum methods for supporting metacognition as immediate feedback is faded.
Assuntos
Instrução por Computador/instrumentação , Educação de Pós-Graduação em Medicina/métodos , Retroalimentação Psicológica , Intuição , Patologia , Adulto , Competência Clínica , Cognição , Avaliação Educacional , Feminino , Humanos , Masculino , Aprendizagem Baseada em Problemas , Reprodutibilidade dos Testes , AutoeficáciaRESUMO
The objective of this study was to determine the relative efficiency of novices compared to a prediction of skilled use when performing tasks using the touchscreen interface of an EMR developed in Malawi. We observed novice users performing touchscreen tasks and recorded timestamp data from their performances. Using a predictive human performance modeling tool, the authors predicted the skilled task performance time for each task. Efficiency and rates of error were evaluated with respect to user interface design. Nineteen participants performed 31 EMR tasks seven times for a total of 4,123 observed performances. We analyzed twelve representative tasks leaving 1,596 performances featuring six user interface designs. Mean novice performance time was significantly slower than mean predicted skilled performance time (p<0.001). However, novices performed faster than the predicted skilled level in 208 (13%) of successful task performances. These findings suggest the user interface design supports a primary design goal of the EMR--to allow novice users to perform tasks efficiently and effectively.
Assuntos
Terminais de Computador/estatística & dados numéricos , Registros Eletrônicos de Saúde/instrumentação , Registros Eletrônicos de Saúde/estatística & dados numéricos , Sistemas Homem-Máquina , Sistemas Automatizados de Assistência Junto ao Leito/estatística & dados numéricos , Análise e Desempenho de Tarefas , Interface Usuário-Computador , Humanos , Malaui , Competência Profissional/estatística & dados numéricos , TatoRESUMO
BACKGROUND: Data protection is important for all information systems that deal with human-subjects data. Grid-based systems--such as the cancer Biomedical Informatics Grid (caBIG)--seek to develop new mechanisms to facilitate real-time federation of cancer-relevant data sources, including sources protected under a variety of regulatory laws, such as HIPAA and 21CFR11. These systems embody new models for data sharing, and hence pose new challenges to the regulatory community, and to those who would develop or adopt them. These challenges must be understood by both systems developers and system adopters. In this paper, we describe our work collecting policy statements, expectations, and requirements from regulatory decision makers at academic cancer centers in the United States. We use these statements to examine fundamental assumptions regarding data sharing using data federations and grid computing. METHODS: An interview-based study of key stakeholders from a sample of US cancer centers. Interviews were structured, and used an instrument that was developed for the purpose of this study. The instrument included a set of problem scenarios--difficult policy situations that were derived during a full-day discussion of potentially problematic issues by a set of project participants with diverse expertise. Each problem scenario included a set of open-ended questions that were designed to elucidate stakeholder opinions and concerns. Interviews were transcribed verbatim and used for both qualitative and quantitative analysis. For quantitative analysis, data was aggregated at the individual or institutional unit of analysis, depending on the specific interview question. RESULTS: Thirty-one (31) individuals at six cancer centers were contacted to participate. Twenty-four out of thirty-one (24/31) individuals responded to our request- yielding a total response rate of 77%. Respondents included IRB directors and policy-makers, privacy and security officers, directors of offices of research, information security officers and university legal counsel. Nineteen total interviews were conducted over a period of 16 weeks. Respondents provided answers for all four scenarios (a total of 87 questions). Results were grouped by broad themes, including among others: governance, legal and financial issues, partnership agreements, de-identification, institutional technical infrastructure for security and privacy protection, training, risk management, auditing, IRB issues, and patient/subject consent. CONCLUSION: The findings suggest that with additional work, large scale federated sharing of data within a regulated environment is possible. A key challenge is developing suitable models for authentication and authorization practices within a federated environment. Authentication--the recognition and validation of a person's identity--is in fact a global property of such systems, while authorization - the permission to access data or resources--mimics data sharing agreements in being best served at a local level. Nine specific recommendations result from the work and are discussed in detail. These include: (1) the necessity to construct separate legal or corporate entities for governance of federated sharing initiatives on this scale; (2) consensus on the treatment of foreign and commercial partnerships; (3) the development of risk models and risk management processes; (4) development of technical infrastructure to support the credentialing process associated with research including human subjects; (5) exploring the feasibility of developing large-scale, federated honest broker approaches; (6) the development of suitable, federated identity provisioning processes to support federated authentication and authorization; (7) community development of requisite HIPAA and research ethics training modules by federation members; (8) the recognition of the need for central auditing requirements and authority, and; (9) use of two-protocol data exchange models where possible in the federation.
Assuntos
Pesquisa Biomédica , Segurança Computacional/normas , Confidencialidade/normas , Oncologia , Redes de Comunicação de Computadores , Segurança Computacional/legislação & jurisprudência , Confidencialidade/legislação & jurisprudência , Bases de Dados Factuais/legislação & jurisprudência , Bases de Dados Factuais/normas , Tomada de Decisões Gerenciais , Conselho Diretor , Regulamentação Governamental , Health Insurance Portability and Accountability Act , Humanos , Propriedade Intelectual , Entrevistas como Assunto , Política Organizacional , Estados UnidosRESUMO
How is the strength of a memory determined? This review discusses three main factors that contribute to memory enhancement - 1) emotion, 2) targeted memory reactivation, and 3) neural reinstatement. Whilst the mechanisms through which memories become enhanced vary, this review demonstrates that activation of the basolateral amygdala and hippocampal formation are crucial for facilitating encoding, consolidation, and retrieval. Here we suggest methodological factors to consider in future studies, and discuss several unanswered questions that should be pursued in order to clarify selective memory enhancement.
Assuntos
Associação , Complexo Nuclear Basolateral da Amígdala/fisiologia , Ondas Encefálicas/fisiologia , Emoções/fisiologia , Hipocampo/fisiologia , Consolidação da Memória/fisiologia , Rememoração Mental/fisiologia , Adulto , Animais , Complexo Nuclear Basolateral da Amígdala/metabolismo , Hipocampo/metabolismo , HumanosRESUMO
We report on the development of an instrument to measure clinicians' perceptions of their personal power in the workplace in relation to resistance to computerized physician order entry (CPOE). The instrument is based on French and Raven's six bases of social power and uses a semantic differential methodology. A measurement study was conducted to determine the reliability and validity of the survey. The survey was administered online and distributed via a URL by email to 19 physicians, nurses, and health unit coordinators from a university hospital. Acceptable reliability was achieved by removing or moving some semantic differential word pairs used to represent the six power bases (alpha range from 0.76 to 0.89). The Semantic Differential Power Perception (SDPP) survey validity was tested against an already validated instrument and found to be acceptable (correlation range from 0.51 to 0.81). The SDPP survey instrument was determined to be both reliable and valid.
Assuntos
Médicos/psicologia , Poder Psicológico , Local de Trabalho , Humanos , Reprodutibilidade dos TestesRESUMO
INTRODUCTION: We developed and evaluated a Natural Language Interface (NLI) for an Intelligent Tutoring System (ITS) in Diagnostic Pathology. The system teaches residents to examine pathologic slides and write accurate pathology reports while providing immediate feedback on errors they make in their slide review and diagnostic reports. Residents can ask for help at any point in the case, and will receive context-specific feedback. RESEARCH QUESTIONS: We evaluated (1) the performance of our natural language system, (2) the effect of the system on learning (3) the effect of feedback timing on learning gains and (4) the effect of ReportTutor on performance to self-assessment correlations. METHODS: The study uses a crossover 2 x 2 factorial design. We recruited 20 subjects from 4 academic programs. Subjects were randomly assigned to one of the four conditions--two conditions for the immediate interface, and two for the delayed interface. An expert dermatopathologist created a reference standard and 2 board certified AP/CP pathology fellows manually coded the residents' assessment reports. Subjects were given the opportunity to self grade their performance and we used a survey to determine student response to both interfaces. RESULTS: Our results show a highly significant improvement in report writing after one tutoring session with 4-fold increase in the learning gains with both interfaces but no effect of feedback timing on performance gains. Residents who used the immediate feedback interface first experienced a feature learning gain that is correlated with the number of cases they viewed. There was no correlation between performance and self-assessment in either condition.
Assuntos
Instrução por Computador/métodos , Processamento de Linguagem Natural , Patologia/educação , Instrução por Computador/normas , Retroalimentação Psicológica , Humanos , Internato e Residência , Modelos Educacionais , Aprendizagem Baseada em Problemas/métodos , Avaliação de Programas e Projetos de Saúde , Autoavaliação (Psicologia) , Interface Usuário-Computador , Redação/normasRESUMO
This report presents an overview for pathologists of the development and potential applications of a novel Web enabled system allowing indexing and retrieval of pathology specimens across multiple institutions. The system was developed through the National Cancer Institute's Shared Pathology Informatics Network program with the goal of creating a prototype system to find existing pathology specimens derived from routine surgical and autopsy procedures ("paraffin blocks") that may be relevant to cancer research. To reach this goal, a number of challenges needed to be met. A central aspect was the development of an informatics system that supported Web-based searching while retaining local control of data. Additional aspects included the development of an eXtensible Markup Language schema, representation of tissue specimen annotation, methods for deidentifying pathology reports, tools for autocoding critical data from these reports using the Unified Medical Language System, and hierarchies of confidentiality and consent that met or exceeded federal requirements. The prototype system supported Web-based querying of millions of pathology reports from 6 participating institutions across the country in a matter of seconds to minutes and the ability of bona fide researchers to identify and potentially to request specific paraffin blocks from the participating institutions. With the addition of associated clinical and outcome information, this system could vastly expand the pool of annotated tissues available for cancer research as well as other diseases.
Assuntos
Informática Médica/organização & administração , Patologia Cirúrgica/organização & administração , Manejo de Espécimes/métodos , Bancos de Tecidos , Humanos , Estados UnidosRESUMO
Part-of-speech tagging represents an important first step for most medical natural language processing (NLP) systems. The majority of current statistically-based POS taggers are trained using a general English corpus. Consequently, these systems perform poorly on medical text. Annotated medical corpora are difficult to develop because of the time and labor required. We investigated a heuristic-based sample selection method to minimize annotated corpus size for retraining a Maximum Entropy (ME) POS tagger. We developed a manually annotated domain specific corpus (DSC) of surgical pathology reports and a domain specific lexicon (DL). We sampled the DSC using two heuristics to produce smaller training sets and compared the retrained performance against (1) the original ME modeled tagger trained on general English, (2) the ME tagger retrained on the DL, and (3) the MedPost tagger trained on MEDLINE abstracts. RESULTS showed that the ME tagger retrained with a DSC was superior to the tagger retrained with the DL, and also superior to MedPost. Heuristic methods for sample selection produced performance equivalent to use of the entire training set, but with many fewer sentences. Learning curve analysis showed that sample selection would enable an 84% decrease in the size of the training set without a decrement in performance. We conclude that heuristic sample selection can be used to markedly reduce human annotation requirements for training of medical NLP systems.