RESUMO
Computer simulations of emergency departments (EDs) are tools that can support managing and optimising ED operations. A core component of ED simulation models is patient trajectories, defined as the series of activities patients undergo in the ED from arrival to discharge. The combined duration of these activities, and waiting times between them, determines a patient's length of stay (LOS). Patient trajectories are often calibrated and validated solely based on the estimated acuity of patients assigned upon arrival. However, acuity is a prospective patient indicator that inconsistently reflects patients' actual urgency and resource usage as seen retrospectively upon discharge. Here, we propose a data-driven ED simulation model in which patient trajectories are modelled based on both acuity and retrospective patient indicators. We show that including retrospective patient indicators recovers the observed LOS distributions more accurately than when using acuity alone. We also demonstrate how the use of retrospective patient indicators leads to more plausible estimates of the impact of increased stress in the ED on patients' LOS. Our work exemplifies how we can better utilise ED data to make the development and evaluation of ED simulation models more accurate and robust, enabling them to provide more reliable and useful operational insights.
RESUMO
BACKGROUND: Residential aged-care facilities (RACFs, also called long-term care facilities, aged care homes, or nursing homes) have elevated risks of respiratory infection outbreaks and associated disease burden. During the COVID-19 pandemic, social isolation policies were commonly used in these facilities to prevent and mitigate outbreaks. We refer specifically to general isolation policies that were intended to reduce contact between residents, without regard to confirmed infection status. Such policies are controversial because of their association with adverse mental and physical health indicators and there is a lack of modelling that assesses their effectiveness. METHODS: In consultation with the Australian Government Department of Health and Aged Care, we developed an agent-based model of COVID-19 transmission in a structured population, intended to represent the salient characteristics of a residential care environment. Using our model, we generated stochastic ensembles of simulated outbreaks and compared summary statistics of outbreaks simulated under different mitigation conditions. Our study focuses on the marginal impact of general isolation (reducing social contact between residents), regardless of confirmed infection. For a realistic assessment, our model included other generic interventions consistent with the Australian Government's recommendations released during the COVID-19 pandemic: isolation of confirmed resident cases, furlough (mandatory paid leave) of staff members with confirmed infection, and deployment of personal protective equipment (PPE) after outbreak declaration. RESULTS: In the absence of any asymptomatic screening, general isolation of residents to their rooms reduced median cumulative cases by approximately 27%. However, when conducted concurrently with asymptomatic screening and isolation of confirmed cases, general isolation reduced the median number of cumulative infections by only 12% in our simulations. CONCLUSIONS: Under realistic sets of assumptions, our simulations showed that general isolation of residents did not provide substantial benefits beyond those achieved through screening, isolation of confirmed cases, and deployment of PPE. Our results also highlight the importance of effective case isolation, and indicate that asymptomatic screening of residents and staff may be warranted, especially if importation risk from the outside community is high. Our conclusions are sensitive to assumptions about the proportion of total contacts in a facility accounted for by casual interactions between residents.
Assuntos
COVID-19 , Surtos de Doenças , SARS-CoV-2 , Isolamento Social , Humanos , COVID-19/epidemiologia , COVID-19/prevenção & controle , COVID-19/transmissão , Austrália/epidemiologia , Isolamento Social/psicologia , Surtos de Doenças/prevenção & controle , SARS-CoV-2/isolamento & purificação , Casas de Saúde , Instituição de Longa Permanência para Idosos , Idoso , Instituições ResidenciaisRESUMO
BACKGROUND: Hospitals in any given region can be considered as part of a network, where facilities are connected to one another - and hospital pathogens potentially spread - through the movement of patients between them. We sought to describe the hospital admission patterns of patients known to be colonised with carbapenemase-producing Enterobacterales (CPE), and compare them with CPE-negative patient cohorts, matched on comorbidity information. METHODS: We performed a linkage study in Victoria, Australia, including datasets with notifiable diseases (CPE notifications) and hospital admissions (admission dates and diagnostic codes) for the period 2011 to 2020. Where the CPE notification date occurred during a hospital admission for the same patient, we identified this as the 'index admission'. We determined the number of distinct health services each patient was admitted to, and time to first admission to a different health service. We compared CPE-positive patients with four cohorts of CPE-negative patients, sampled based on different matching criteria. RESULTS: Of 528 unique patients who had CPE detected during a hospital admission, 222 (42%) were subsequently admitted to a different health service during the study period. Among these patients, CPE diagnosis tended to occur during admission to a metropolitan public hospital (86%, 190/222), whereas there was a greater number of metropolitan private (23%, 52/222) and rural public (18%, 39/222) hospitals for the subsequent admission. Median time to next admission was 4 days (IQR, 0-75 days). Admission patterns for CPE-positive patients was similar to the cohort of CPE-negative patients matched on index admission, time period, and age-adjusted Charlson comorbidity index. CONCLUSIONS: Movement of CPE-positive patients between health services is not a rare event. While the most common movement is from one public metropolitan health service to another, there is also a trend for movement from metropolitan public hospitals into private and rural hospitals. After accounting for clinical comorbidities, CPE colonisation status does not appear to impact on hospital admission frequency or timing. These findings support the potential utility of a centralised notification and outbreak management system for CPE positive patients.
Assuntos
Proteínas de Bactérias , Infecções por Enterobacteriaceae , beta-Lactamases , Humanos , Infecções por Enterobacteriaceae/epidemiologia , Infecções por Enterobacteriaceae/microbiologia , Masculino , Feminino , Pessoa de Meia-Idade , Vitória/epidemiologia , Idoso , beta-Lactamases/metabolismo , Proteínas de Bactérias/metabolismo , Hospitalização , Adulto , Enterobacteriáceas Resistentes a Carbapenêmicos , Admissão do Paciente , Enterobacteriaceae , Infecção Hospitalar/microbiologia , Infecção Hospitalar/epidemiologia , Idoso de 80 Anos ou mais , Adulto Jovem , Portador Sadio/epidemiologia , Portador Sadio/microbiologiaRESUMO
MOTIVATION: Biological background knowledge plays an important role in the manual quality assurance (QA) of biological database records. One such QA task is the detection of inconsistencies in literature-based Gene Ontology Annotation (GOA). This manual verification ensures the accuracy of the GO annotations based on a comprehensive review of the literature used as evidence, Gene Ontology (GO) terms, and annotated genes in GOA records. While automatic approaches for the detection of semantic inconsistencies in GOA have been developed, they operate within predetermined contexts, lacking the ability to leverage broader evidence, especially relevant domain-specific background knowledge. This paper investigates various types of background knowledge that could improve the detection of prevalent inconsistencies in GOA. In addition, the paper proposes several approaches to integrate background knowledge into the automatic GOA inconsistency detection process. RESULTS: We have extended a previously developed GOA inconsistency dataset with several kinds of GOA-related background knowledge, including GeneRIF statements, biological concepts mentioned within evidence texts, GO hierarchy and existing GO annotations of the specific gene. We have proposed several effective approaches to integrate background knowledge as part of the automatic GOA inconsistency detection process. The proposed approaches can improve automatic detection of self-consistency and several of the most prevalent types of inconsistencies.This is the first study to explore the advantages of utilizing background knowledge and to propose a practical approach to incorporate knowledge in automatic GOA inconsistency detection. We establish a new benchmark for performance on this task. Our methods may be applicable to various tasks that involve incorporating biological background knowledge. AVAILABILITY AND IMPLEMENTATION: https://github.com/jiyuc/de-inconsistency.
Assuntos
Ontologia Genética , Anotação de Sequência Molecular , Anotação de Sequência Molecular/métodos , Bases de Dados Genéticas , Biologia Computacional/métodos , Semântica , HumanosRESUMO
This perspective is part of an international effort to improve epidemiological models with the goal of reducing the unintended consequences of infectious disease interventions. The scenarios in which models are applied often involve difficult trade-offs that are well recognised in public health ethics. Unless these trade-offs are explicitly accounted for, models risk overlooking contested ethical choices and values, leading to an increased risk of unintended consequences. We argue that such risks could be reduced if modellers were more aware of ethical frameworks and had the capacity to explicitly account for the relevant values in their models. We propose that public health ethics can provide a conceptual foundation for developing this capacity. After reviewing relevant concepts in public health and clinical ethics, we discuss examples from the COVID-19 pandemic to illustrate the current separation between public health ethics and infectious disease modelling. We conclude by describing practical steps to build the capacity for ethically aware modelling. Developing this capacity constitutes a critical step towards ethical practice in computational modelling of public health interventions, which will require collaboration with experts on public health ethics, decision support, behavioural interventions, and social determinants of health, as well as direct consultation with communities and policy makers.
Assuntos
Doenças Transmissíveis , Pandemias , Humanos , Pandemias/prevenção & controle , Saúde Pública , Doenças Transmissíveis/epidemiologia , Simulação por ComputadorRESUMO
Early case detection is critical to preventing onward transmission of COVID-19 by enabling prompt isolation of index infections, and identification and quarantining of contacts. Timeliness and completeness of ascertainment depend on the surveillance strategy employed. This paper presents modelling used to inform workplace testing strategies for the Australian government in early 2021. We use rapid prototype modelling to quickly investigate the effectiveness of testing strategies to aid decision making. Models are developed with a focus on providing relevant results to policy makers, and these models are continually updated and improved as new questions are posed. Developed to support the implementation of testing strategies in high risk workplace settings in Australia, our modelling explores the effects of test frequency and sensitivity on outbreak detection. We start with an exponential growth model, which demonstrates how outbreak detection changes depending on growth rate, test frequency and sensitivity. From the exponential model, we learn that low sensitivity tests can produce high probabilities of detection when testing occurs frequently. We then develop a more complex Agent Based Model, which was used to test the robustness of the results from the exponential model, and extend it to include intermittent workplace scheduling. These models help our fundamental understanding of disease detectability through routine surveillance in workplaces and evaluate the impact of testing strategies and workplace characteristics on the effectiveness of surveillance. This analysis highlights the risks of particular work patterns while also identifying key testing strategies to best improve outbreak detection in high risk workplaces.
Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , SARS-CoV-2 , Austrália/epidemiologia , Surtos de Doenças/prevenção & controle , Local de TrabalhoRESUMO
The ability for vaccines to protect against infectious diseases varies among individuals, but computational models employed to inform policy typically do not account for this variation. Here we examine this issue: we implement a model of vaccine efficacy developed in the context of SARS-CoV-2 in order to evaluate the general implications of modelling correlates of protection on the individual level. Due to high levels of variation in immune response, the distributions of individual-level protection emerging from this model tend to be highly dispersed, and are often bimodal. We describe the specification of the model, provide an intuitive parameterisation, and comment on its general robustness. We show that the model can be viewed as an intermediate between the typical approaches that consider the mode of vaccine action to be either "all-or-nothing" or "leaky". Our view based on this analysis is that individual variation in correlates of protection is an important consideration that may be crucial to designing and implementing models for estimating population-level impacts of vaccination programs.
Assuntos
COVID-19 , Doenças Transmissíveis , Vacinas , Humanos , COVID-19/prevenção & controle , SARS-CoV-2 , ImunidadeRESUMO
Scabies is a parasitic infestation with high global burden. Mass drug administrations (MDAs) are recommended for communities with a scabies prevalence of >10%. Quantitative analyses are needed to demonstrate the likely effectiveness of MDA recommendations. In this study, we developed an agent-based model of scabies transmission calibrated to demographic and epidemiological data from Monrovia. We used this model to compare the effectiveness of MDA scenarios for achieving scabies elimination and reducing scabies burden, as measured by time until recrudescence following delivery of an MDA and disability-adjusted-life-years (DALYs) averted. Our model showed that three rounds of MDA delivered at six-month intervals and reaching 80% of the population could reduce prevalence below 2% for three years following the final round, before recrudescence. When MDAs were followed by increased treatment uptake, prevalence was maintained below 2% indefinitely. Increasing the number of and coverage of MDA rounds increased the probability of achieving elimination and the number of DALYs averted. Our results suggest that acute reduction of scabies prevalence by MDA can support a transition to improved treatment access. This study demonstrates how modelling can be used to estimate the expected impact of MDAs by projecting future epidemiological dynamics and health gains under alternative scenarios.
Assuntos
Escabiose , Humanos , Libéria/epidemiologia , Escabiose/tratamento farmacológico , Escabiose/epidemiologia , Escabiose/prevenção & controle , Administração Massiva de Medicamentos , PrevalênciaRESUMO
Since the emergence of SARS-CoV-2 in 2019 through to mid-2021, much of the Australian population lived in a COVID-19-free environment. This followed the broadly successful implementation of a strong suppression strategy, including international border closures. With the availability of COVID-19 vaccines in early 2021, the national government sought to transition from a state of minimal incidence and strong suppression activities to one of high vaccine coverage and reduced restrictions but with still-manageable transmission. This transition is articulated in the national 're-opening' plan released in July 2021. Here, we report on the dynamic modelling study that directly informed policies within the national re-opening plan including the identification of priority age groups for vaccination, target vaccine coverage thresholds and the anticipated requirements for continued public health measures-assuming circulation of the Delta SARS-CoV-2 variant. Our findings demonstrated that adult vaccine coverage needed to be at least 60% to minimize public health and clinical impacts following the establishment of community transmission. They also supported the need for continued application of test-trace-isolate-quarantine and social measures during the vaccine roll-out phase and beyond.
Assuntos
Vacinas contra COVID-19 , COVID-19 , Adulto , Humanos , SARS-CoV-2 , Incidência , COVID-19/epidemiologia , COVID-19/prevenção & controle , Austrália/epidemiologiaRESUMO
Disease surveillance is used to monitor ongoing control activities, detect early outbreaks, and inform intervention priorities and policies. However, data from disease surveillance that could be used to support real-time decisionmaking remain largely underutilised. Using the Brazilian Amazon malaria surveillance dataset as a case study, in this paper we explore the potential for unsupervised anomaly detection machine learning techniques to discover signals of epidemiological interest. We found that our models were able to provide an early indication of outbreak onset, outbreak peaks, and change points in the proportion of positive malaria cases. Specifically, the sustained rise in malaria in the Brazilian Amazon in 2016 was flagged by several models. We found that no single model detected all anomalies across all health regions. Because of this, we provide the minimum number of machine learning models top-k models) to maximise the number of anomalies detected across different health regions. We discovered that the top three models that maximise the coverage of the number and types of anomalies detected across the thirteen health regions are principal component analysis, stochastic outlier selection, and the minimum covariance determinant. Anomaly detection is a potentially valuable approach to discovering patterns of epidemiological importance when confronted with a large volume of data across space and time. Our exploratory approach can be replicated for other diseases and locations to inform monitoring, timely interventions, and actions towards the goal of controlling endemic disease.
RESUMO
Early estimates of the transmission properties of a newly emerged pathogen are critical to an effective public health response, and are often based on limited outbreak data. Here, we use simulations to investigate how correlations between the viral load of cases in transmission chains can affect estimates of these fundamental transmission properties. Our computational model simulates a disease transmission mechanism in which the viral load of the infector at the time of transmission influences the infectiousness of the infectee. These correlations in transmission pairs produce a population-level convergence process during which the distributions of initial viral loads in each subsequent generation converge to a steady state. We find that outbreaks arising from index cases with low initial viral loads give rise to early estimates of transmission properties that could be misleading. These findings demonstrate the potential for transmission mechanisms to affect estimates of the transmission properties of newly emerged viruses in ways that could be operationally significant to a public health response.
Assuntos
Surtos de Doenças , SARS-CoV-2 , Carga Viral , Número Básico de ReproduçãoRESUMO
BACKGROUND: Streptococcus pyogenes, or group A Streptococcus (GAS), infections contribute to a high burden of disease in Aboriginal Australians, causing skin infections and immune sequelae such as rheumatic heart disease. Controlling skin infections in these populations has proven difficult, with transmission dynamics being poorly understood. We aimed to identify the relative contributions of impetigo and asymptomatic throat carriage to GAS transmission. METHODS: In this genomic analysis, we retrospectively applied whole genome sequencing to GAS isolates that were collected as part of an impetigo surveillance longitudinal household survey conducted in three remote Aboriginal communities in the Northern Territory of Australia between Aug 6, 2003, and June 22, 2005. We included GAS isolates from all throats and impetigo lesions of people living in two of the previously studied communities. We classified isolates into genomic lineages based on pairwise shared core genomes of more than 99% with five or fewer single nucleotide polymorphisms. We used a household network analysis of epidemiologically and genomically linked lineages to quantify the transmission of GAS within and between households. FINDINGS: We included 320 GAS isolates in our analysis: 203 (63%) from asymptomatic throat swabs and 117 (37%) from impetigo lesions. Among 64 genomic lineages (encompassing 39 emm types) we identified 264 transmission links (involving 93% of isolates), for which the probable source was asymptomatic throat carriage in 166 (63%) and impetigo lesions in 98 (37%). Links originating from impetigo cases were more frequent between households than within households. Households were infected with GAS for a mean of 57 days (SD 39 days), and once cleared, reinfected 62 days (SD 40 days) later. Increased household size and community presence of GAS and scabies were associated with slower clearance of GAS. INTERPRETATION: In communities with high prevalence of endemic GAS-associated skin infection, asymptomatic throat carriage is a GAS reservoir. Public health interventions such as vaccination or community infection control programmes aimed at interrupting transmission of GAS might need to include consideration of asymptomatic throat carriage. FUNDING: Australian National Health and Medical Research Council.
Assuntos
Impetigo , Dermatopatias Infecciosas , Infecções Estreptocócicas , Humanos , Impetigo/epidemiologia , Streptococcus pyogenes/genética , Estudos Retrospectivos , Faringe , Northern Territory/epidemiologia , Infecções Estreptocócicas/epidemiologia , GenômicaRESUMO
BACKGROUND: During the early stages of the COVID-19 pandemic, there was considerable uncertainty surrounding epidemiological and clinical aspects of SARS-CoV-2. Governments around the world, starting from varying levels of pandemic preparedness, needed to make decisions about how to respond to SARS-CoV-2 with only limited information about transmission rates, disease severity and the likely effectiveness of public health interventions. In the face of such uncertainties, formal approaches to quantifying the value of information can help decision makers to prioritise research efforts. METHODS: In this study we use Value of Information (VoI) analysis to quantify the likely benefit associated with reducing three key uncertainties present in the early stages of the COVID-19 pandemic: the basic reproduction number ([Formula: see text]), case severity (CS), and the relative infectiousness of children compared to adults (CI). The specific decision problem we consider is the optimal level of investment in intensive care unit (ICU) beds. Our analysis incorporates mathematical models of disease transmission and clinical pathways in order to estimate ICU demand and disease outcomes across a range of scenarios. RESULTS: We found that VoI analysis enabled us to estimate the relative benefit of resolving different uncertainties about epidemiological and clinical aspects of SARS-CoV-2. Given the initial beliefs of an expert, obtaining more information about case severity had the highest parameter value of information, followed by the basic reproduction number [Formula: see text]. Resolving uncertainty about the relative infectiousness of children did not affect the decision about the number of ICU beds to be purchased for any COVID-19 outbreak scenarios defined by these three parameters. CONCLUSION: For the scenarios where the value of information was high enough to justify monitoring, if CS and [Formula: see text] are known, management actions will not change when we learn about child infectiousness. VoI is an important tool for understanding the importance of each disease factor during outbreak preparedness and can help to prioritise the allocation of resources for relevant information.
Assuntos
COVID-19 , Adulto , Criança , Humanos , COVID-19/epidemiologia , SARS-CoV-2 , Pandemias/prevenção & controle , Unidades de Terapia Intensiva , Modelos TeóricosRESUMO
Rotavirus infection is a common cause of severe diarrheal disease and a major cause of deaths and hospitalizations among young children. Incidence of rotavirus has declined globally with increasing vaccine coverage. However, it remains a significant cause of morbidity and mortality in low-income countries where vaccine access is limited and efficacy is lower. The oral human neonatal vaccine RV3-BB can be safely administered earlier than other vaccines, and recent trials in Indonesia have demonstrated high efficacy. In this study, we use a stochastic individual-based model of rotavirus transmission and disease to estimate the anticipated population-level impact of RV3-BB following delivery according to either an infant (2, 4, 6 months) and neonatal (0, 2, 4 months) schedule. Using our model, which incorporated an age- and household-structured population and estimates of vaccine efficacy derived from trial data, we found both delivery schedules to be effective at reducing infection and disease. We estimated 95-96% reductions in infection and disease in children under 12 months of age when vaccine coverage is 85%. We also estimate high levels of indirect protection from vaccination, including 78% reductions in infection in adults over 17 years of age. Even for lower vaccine coverage of 55%, we estimate reductions of 84% in infection and disease in children under 12 months of age. While open questions remain about the drivers of observed lower efficacy in low-income settings, our model suggests RV3-BB could be effective at reducing infection and preventing disease in young infants at the population level.
Assuntos
Infecções por Rotavirus , Vacinas contra Rotavirus , Rotavirus , Recém-Nascido , Criança , Adulto , Humanos , Lactente , Pré-Escolar , Infecções por Rotavirus/epidemiologia , Infecções por Rotavirus/prevenção & controle , Vacinas Atenuadas , DiarreiaRESUMO
Nucleotide and protein sequences stored in public databases are the cornerstone of many bioinformatics analyses. The records containing these sequences are prone to a wide range of errors, including incorrect functional annotation, sequence contamination and taxonomic misclassification. One source of information that can help to detect errors are the strong interdependency between records. Novel sequences in one database draw their annotations from existing records, may generate new records in multiple other locations and will have varying degrees of similarity with existing records across a range of attributes. A network perspective of these relationships between sequence records, within and across databases, offers new opportunities to detect-or even correct-erroneous entries and more broadly to make inferences about record quality. Here, we describe this novel perspective of sequence database records as a rich network, which we call the sequence database network, and illustrate the opportunities this perspective offers for quantification of database quality and detection of spurious entries. We provide an overview of the relevant databases and describe how the interdependencies between sequence records across these databases can be exploited by network analyses. We review the process of sequence annotation and provide a classification of sources of error, highlighting propagation as a major source. We illustrate the value of a network perspective through three case studies that use network analysis to detect errors, and explore the quality and quantity of critical relationships that would inform such network analyses. This systematic description of a network perspective of sequence database records provides a novel direction to combat the proliferation of errors within these critical bioinformatics resources.
Assuntos
Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Sequência de AminoácidosRESUMO
Cultural practices and development level can influence a population's household structures and mixing patterns. Within some populations, households can be organized across multiple dwellings. This likely affects the spread of infectious disease through these communities; however, current demographic data collection tools do not record these data. METHODS: Between June and October 2018, the Contact And Mobility Patterns in remote Aboriginal Australian communities (CAMP-remote) pilot study recruited Aboriginal mothers with infants in a remote northern Australian community to complete a monthly iPad-based contact survey. RESULTS: Thirteen mother-infant pairs (participants) completed 69 study visits between recruitment and the end of May 2019. Participants reported they and their other children slept in 28 dwellings during the study. The median dwelling occupancy, defined as people sleeping in the same dwelling on the previous night, was ten (range: 3.5-25). Participants who completed at least three responses (n = 8) slept in a median of three dwellings (range: 2-9). Each month, a median of 28% (range: 0-63%) of the participants travelled out of the community. Including these data in disease transmission models amplified estimates of infectious disease spread in the study community, compared to models parameterized using census data. CONCLUSIONS: The lack of data on mixing patterns in populations where households can be organized across dwellings may impact the accuracy of infectious disease models for these communities and the efficacy of public health actions they inform.
Assuntos
Características da Família , Havaiano Nativo ou Outro Ilhéu do Pacífico , Austrália/epidemiologia , Criança , Feminino , Humanos , Povos Indígenas , Lactente , Projetos PilotoRESUMO
MOTIVATION: Survival risk prediction using gene expression data is important in making treatment decisions in cancer. Standard neural network (NN) survival analysis models are black boxes with a lack of interpretability. More interpretable visible neural network architectures are designed using biological pathway knowledge. But they do not model how pathway structures can change for particular cancer types. RESULTS: We propose a novel Mutated Pathway Visible Neural Network (MPVNN) architecture, designed using prior signaling pathway knowledge and random replacement of known pathway edges using gene mutation data simulating signal flow disruption. As a case study, we use the PI3K-Akt pathway and demonstrate overall improved cancer-specific survival risk prediction of MPVNN over other similar-sized NN and standard survival analysis methods. We show that trained MPVNN architecture interpretation, which points to smaller sets of genes connected by signal flow within the PI3K-Akt pathway that is important in risk prediction for particular cancer types, is reliable. AVAILABILITY AND IMPLEMENTATION: The data and code are available at https://github.com/gourabghoshroy/MPVNN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Neoplasias , Fosfatidilinositol 3-Quinases , Humanos , Fosfatidilinositol 3-Quinases/genética , Proteínas Proto-Oncogênicas c-akt , Redes Neurais de Computação , Neoplasias/genética , MutaçãoRESUMO
MOTIVATION: Literature-based gene ontology annotations (GOA) are biological database records that use controlled vocabulary to uniformly represent gene function information that is described in the primary literature. Assurance of the quality of GOA is crucial for supporting biological research. However, a range of different kinds of inconsistencies in between literature as evidence and annotated GO terms can be identified; these have not been systematically studied at record level. The existing manual-curation approach to GOA consistency assurance is inefficient and is unable to keep pace with the rate of updates to gene function knowledge. Automatic tools are therefore needed to assist with GOA consistency assurance. This article presents an exploration of different GOA inconsistencies and an early feasibility study of automatic inconsistency detection. RESULTS: We have created a reliable synthetic dataset to simulate four realistic types of GOA inconsistency in biological databases. Three automatic approaches are proposed. They provide reasonable performance on the task of distinguishing the four types of inconsistency and are directly applicable to detect inconsistencies in real-world GOA database records. Major challenges resulting from such inconsistencies in the context of several specific application settings are reported. This is the first study to introduce automatic approaches that are designed to address the challenges in current GOA quality assurance workflows. The data underlying this article are available in Github at https://github.com/jiyuc/AutoGOAConsistency.
Assuntos
Publicações , Ontologia Genética , Anotação de Sequência MolecularRESUMO
BACKGROUND: Estimating community level scabies prevalence is crucial for targeting interventions to areas of greatest need. The World Health Organisation recommends sampling at the unit of households or schools, but there is presently no standardised approach to scabies prevalence assessment. Consequently, a wide range of sampling sizes and methods have been used. As both prevalence and drivers of transmission vary across populations, there is a need to understand how sampling strategies for estimating scabies prevalence interact with local epidemiology to affect the accuracy of prevalence estimates. METHODS: We used a simulation-based approach to compare the efficacy of different scabies sampling strategies. First, we generated synthetic populations broadly representative of remote Australian Indigenous communities and assigned a scabies status to individuals to achieve a specified prevalence using different assumptions about scabies epidemiology. Second, we calculated an observed prevalence for different sampling methods and sizes. RESULTS: The distribution of prevalence in subpopulation groups can vary substantially when the underlying scabies assignment method changes. Across all of the scabies assignment methods combined, the simple random sampling method produces the narrowest 95% confidence interval for all sample sizes. The household sampling method introduces higher variance compared to simple random sampling when the assignment of scabies includes a household-specific component. The school sampling method overestimates community prevalence when the assignment of scabies includes an age-specific component. DISCUSSION: Our results indicate that there are interactions between transmission assumptions and surveillance strategies, emphasizing the need for understanding scabies transmission dynamics. We suggest using the simple random sampling method for estimating scabies prevalence. Our approach can be adapted to various populations and diseases.
Assuntos
Escabiose , Austrália/epidemiologia , Simulação por Computador , Humanos , Prevalência , Escabiose/epidemiologia , Instituições AcadêmicasRESUMO
Multi-strain pathogens such as Group A Streptococcus, Streptococcus pneumoniae, and Staphylococcus aureus cause millions of infections each year with a substantial health burden. Control of multi-strain pathogens can be complicated by the high strain diversity often observed in endemic settings. It is not well understood how high strain diversity is maintained in populations, given that they compete with each other both directly (within an individual host) and indirectly (via host immunity). Previous modelling studies have investigated how indirect competition affects the prevalence and diversity of strains. However, these studies often make simplifying assumptions about the direct competition that occurs within hosts. Currently, little data is available to validate these assumptions, hence there is a need to clarify how sensitive model outputs are to these assumptions. In this study, we compare the dynamics of multi-strain pathogens under different assumptions about direct competition between strains using an agent-based model. We find that the assumptions made about direct competition can affect the epidemiological dynamics, particularly when there is no long-term immunity following infections and a low rate of importation of non-circulating strains. Our results suggest that while direct and indirect competition can each decrease strain diversity when they act in isolation, they may increase strain diversity when they act together. This finding highlights the importance of examining sensitivity to assumptions about strain competition. In particular, omitting consideration of direct competition can lead to inaccurate estimates of the likely effectiveness of control strategies as changes in strain diversity shift the level of direct strain competition.