Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 47
Filtrar
1.
Nat Microbiol ; 8(12): 2365-2377, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37996707

RESUMO

Malaria results in over 600,000 deaths annually, with the highest burden of deaths in young children living in sub-Saharan Africa. Molecular surveillance can provide important information for malaria control policies, including detection of antimalarial drug resistance. However, genome sequencing capacity in malaria-endemic countries is limited. We designed and implemented an end-to-end workflow to detect Plasmodium falciparum antimalarial resistance markers and diversity in the vaccine target circumsporozoite protein (csp) using nanopore sequencing in Ghana. We analysed 196 clinical samples and showed that our method is rapid, robust, accurate and straightforward to implement. Importantly, our method could be applied to dried blood spot samples, which are readily collected in endemic settings. We report that P. falciparum parasites in Ghana are mostly susceptible to chloroquine, with persistent sulfadoxine-pyrimethamine resistance and no evidence of artemisinin resistance. Multiple single nucleotide polymorphisms were identified in csp, but their significance is uncertain. Our study demonstrates the feasibility of nanopore sequencing for malaria genomic surveillance in endemic countries.


Assuntos
Antimaláricos , Malária Falciparum , Malária , Sequenciamento por Nanoporos , Criança , Humanos , Pré-Escolar , Plasmodium falciparum/genética , Gana/epidemiologia , Antimaláricos/farmacologia , Malária/epidemiologia , Malária Falciparum/epidemiologia , Malária Falciparum/prevenção & controle , Malária Falciparum/tratamento farmacológico , Resistência a Medicamentos/genética
2.
Microb Genom ; 9(10)2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37902454

RESUMO

Escherichia coli is a ubiquitous component of the human gut microbiome, but is also a common pathogen, causing around 40, 000 bloodstream infections (BSI) in the United Kingdom (UK) annually. The number of E. coli BSI has increased over the last decade in the UK, and emerging antimicrobial resistance (AMR) profiles threaten treatment options. Here, we combined clinical, epidemiological, and whole genome sequencing data with high content imaging to characterise over 300 E. coli isolates associated with BSI in a large teaching hospital in the East of England. Overall, only a limited number of sequence types (ST) were responsible for the majority of organisms causing invasive disease. The most abundant (20 % of all isolates) was ST131, of which around 90 % comprised the pandemic O25b:H4 group. ST131-O25b:H4 isolates were frequently multi-drug resistant (MDR), with a high prevalence of extended spectrum ß-lactamases (ESBL) and fluoroquinolone resistance. There was no association between AMR phenotypes and the source of E. coli bacteraemia or whether the infection was healthcare-associated. Several clusters of ST131 were genetically similar, potentially suggesting a shared transmission network. However, there was no clear epidemiological associations between these cases, and they included organisms from both healthcare-associated and non-healthcare-associated origins. The majority of ST131 isolates exhibited strong binding with an anti-O25b antibody, raising the possibility of developing rapid diagnostics targeting this pathogen. In summary, our data suggest that a restricted set of MDR E. coli populations can be maintained and spread across both community and healthcare settings in this location, contributing disproportionately to invasive disease and AMR.


Assuntos
Infecções por Escherichia coli , Sepse , Humanos , Escherichia coli/genética , Hospitais de Ensino , Reino Unido/epidemiologia , Inglaterra , Infecções por Escherichia coli/epidemiologia , Genômica
3.
Trends Parasitol ; 39(12): 996-1000, 2023 12.
Artigo em Inglês | MEDLINE | ID: mdl-37865609

RESUMO

Nanopore-based sequencing platforms offer the potential for affordable malaria molecular surveillance (MMS) in resource-limited settings to track and ultimately counteract emerging threats, such as drug resistance and diagnostic escape. Here, we discuss opportunities and challenges to implementing MMS using nanopore sequencing, highlighting priority areas for technical development and innovation.


Assuntos
Malária , Sequenciamento por Nanoporos , Humanos , Malária/diagnóstico , Malária/epidemiologia , Malária/prevenção & controle , Resistência a Medicamentos , Região de Recursos Limitados
4.
Brief Bioinform ; 23(6)2022 11 19.
Artigo em Inglês | MEDLINE | ID: mdl-36151740

RESUMO

Drug discovery and development is a complex and costly process. Machine learning approaches are being investigated to help improve the effectiveness and speed of multiple stages of the drug discovery pipeline. Of these, those that use Knowledge Graphs (KG) have promise in many tasks, including drug repurposing, drug toxicity prediction and target gene-disease prioritization. In a drug discovery KG, crucial elements including genes, diseases and drugs are represented as entities, while relationships between them indicate an interaction. However, to construct high-quality KGs, suitable data are required. In this review, we detail publicly available sources suitable for use in constructing drug discovery focused KGs. We aim to help guide machine learning and KG practitioners who are interested in applying new techniques to the drug discovery field, but who may be unfamiliar with the relevant data sources. The datasets are selected via strict criteria, categorized according to the primary type of information contained within and are considered based upon what information could be extracted to build a KG. We then present a comparative analysis of existing public drug discovery KGs and an evaluation of selected motivating case studies from the literature. Additionally, we raise numerous and unique challenges and issues associated with the domain and its datasets, while also highlighting key future research directions. We hope this review will motivate KGs use in solving key and emerging questions in the drug discovery domain.


Assuntos
Aprendizado de Máquina , Reconhecimento Automatizado de Padrão , Descoberta de Drogas , Conhecimento , Armazenamento e Recuperação da Informação
5.
Sci Rep ; 12(1): 10492, 2022 06 21.
Artigo em Inglês | MEDLINE | ID: mdl-35729228

RESUMO

Breakthrough infections with SARS-CoV-2 Delta variant have been reported in doubly-vaccinated recipients and as re-infections. Studies of viral spread within hospital settings have highlighted the potential for transmission between doubly-vaccinated patients and health care workers and have highlighted the benefits of high-grade respiratory protection for health care workers. However the extent to which vaccination is preventative of viral spread in health care settings is less well studied. Here, we analysed data from 118 vaccinated health care workers (HCW) across two hospitals in India, constructing two probable transmission networks involving six HCWs in Hospital A and eight HCWs in Hospital B from epidemiological and virus genome sequence data, using a suite of computational approaches. A maximum likelihood reconstruction of transmission involving known cases of infection suggests a high probability that doubly vaccinated HCWs transmitted SARS-CoV-2 between each other and highlights potential cases of virus transmission between individuals who had received two doses of vaccine. Our findings show firstly that vaccination may reduce rates of transmission, supporting the need for ongoing infection control measures even in highly vaccinated populations, and secondly we have described a novel approach to identifying transmissions that is scalable and rapid, without the need for an infection control infrastructure.


Assuntos
COVID-19 , SARS-CoV-2 , COVID-19/epidemiologia , COVID-19/prevenção & controle , Pessoal de Saúde , Humanos , Controle de Infecções , SARS-CoV-2/genética , Vacinação
6.
Nat Commun ; 13(1): 751, 2022 02 08.
Artigo em Inglês | MEDLINE | ID: mdl-35136068

RESUMO

Understanding SARS-CoV-2 transmission in higher education settings is important to limit spread between students, and into at-risk populations. In this study, we sequenced 482 SARS-CoV-2 isolates from the University of Cambridge from 5 October to 6 December 2020. We perform a detailed phylogenetic comparison with 972 isolates from the surrounding community, complemented with epidemiological and contact tracing data, to determine transmission dynamics. We observe limited viral introductions into the university; the majority of student cases were linked to a single genetic cluster, likely following social gatherings at a venue outside the university. We identify considerable onward transmission associated with student accommodation and courses; this was effectively contained using local infection control measures and following a national lockdown. Transmission clusters were largely segregated within the university or the community. Our study highlights key determinants of SARS-CoV-2 transmission and effective interventions in a higher education setting that will inform public health policy during pandemics.


Assuntos
COVID-19/epidemiologia , COVID-19/transmissão , SARS-CoV-2/genética , Universidades , COVID-19/prevenção & controle , COVID-19/virologia , Busca de Comunicante , Genoma Viral/genética , Genômica , Humanos , Filogenia , RNA Viral/genética , Fatores de Risco , SARS-CoV-2/classificação , SARS-CoV-2/isolamento & purificação , Estudantes , Reino Unido/epidemiologia , Universidades/estatística & dados numéricos
7.
Mol Biol Evol ; 39(3)2022 03 02.
Artigo em Inglês | MEDLINE | ID: mdl-35106603

RESUMO

Identifying linked cases of infection is a critical component of the public health response to viral infectious diseases. In a clinical context, there is a need to make rapid assessments of whether cases of infection have arrived independently onto a ward, or are potentially linked via direct transmission. Viral genome sequence data are of great value in making these assessments, but are often not the only form of data available. Here, we describe A2B-COVID, a method for the rapid identification of potentially linked cases of COVID-19 infection designed for clinical settings. Our method combines knowledge about infection dynamics, data describing the movements of individuals, and evolutionary analysis of genome sequences to assess whether data collected from cases of infection are consistent or inconsistent with linkage via direct transmission. A retrospective analysis of data from two wards at Cambridge University Hospitals NHS Foundation Trust during the first wave of the pandemic showed qualitatively different patterns of linkage between cases on designated COVID-19 and non-COVID-19 wards. The subsequent real-time application of our method to data from the second epidemic wave highlights its value for monitoring cases of infection in a clinical context.


Assuntos
COVID-19 , SARS-CoV-2 , Hospitais , Humanos , Pandemias , Estudos Retrospectivos , SARS-CoV-2/genética
8.
Lancet Microbe ; 3(2): e151-e158, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34608459

RESUMO

We reviewed all genomic epidemiology studies on COVID-19 in long-term care facilities (LTCFs) that had been published to date. We found that staff and residents were usually infected with identical, or near identical, SARS-CoV-2 genomes. Outbreaks usually involved one predominant cluster, and the same lineages persisted in LTCFs despite infection control measures. Outbreaks were most commonly due to single or few introductions followed by a spread rather than a series of seeding events from the community into LTCFs. The sequencing of samples taken consecutively from the same individuals at the same facilities showed the persistence of the same genome sequence, indicating that the sequencing technique was robust over time. When combined with local epidemiology, genomics allowed probable transmission sources to be better characterised. The transmission between LTCFs was detected in multiple studies. The mortality rate among residents was high in all facilities, regardless of the lineage. Bioinformatics methods were inadequate in a third of the studies reviewed, and reproducing the analyses was difficult because sequencing data were not available in many facilities.


Assuntos
COVID-19 , COVID-19/epidemiologia , Surtos de Doenças , Genômica , Humanos , Assistência de Longa Duração , SARS-CoV-2/genética
9.
Bioinformatics ; 38(4): 970-976, 2022 01 27.
Artigo em Inglês | MEDLINE | ID: mdl-34791045

RESUMO

MOTIVATION: RNA 3D motifs are recurrent substructures, modeled as networks of base pair interactions, which are crucial for understanding structure-function relationships. The task of automatically identifying such motifs is computationally hard, and remains a key challenge in the field of RNA structural biology and network analysis. State-of-the-art methods solve special cases of the motif problem by constraining the structural variability in occurrences of a motif, and narrowing the substructure search space. RESULTS: Here, we relax these constraints by posing the motif finding problem as a graph representation learning and clustering task. This framing takes advantage of the continuous nature of graph representations to model the flexibility and variability of RNA motifs in an efficient manner. We propose a set of node similarity functions, clustering methods and motif construction algorithms to recover flexible RNA motifs. Our tool, Vernal can be easily customized by users to desired levels of motif flexibility, abundance and size. We show that Vernal is able to retrieve and expand known classes of motifs, as well as to propose novel motifs. AVAILABILITY AND IMPLEMENTATION: The source code, data and a webserver are available at vernal.cs.mcgill.ca. We also provide a flexible interface and a user-friendly webserver to browse and download our results. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , RNA , RNA/química , Motivos de Nucleotídeos , Software , Pareamento de Bases , Biologia Computacional
10.
Bioinformatics ; 38(5): 1458-1459, 2022 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-34908108

RESUMO

SUMMARY: RNA 3D architectures are stabilized by sophisticated networks of (non-canonical) base pair interactions, which can be conveniently encoded as multi-relational graphs and efficiently exploited by graph theoretical approaches and recent progresses in machine learning techniques. RNAglib is a library that eases the use of this representation, by providing clean data, methods to load it in machine learning pipelines and graph-based deep learning models suited for this representation. RNAglib also offers other utilities to model RNA with 2.5 D graphs, such as drawing tools, comparison functions or baseline performances on RNA applications. AVAILABILITY AND IMPLEMENTATION: The method is distributed as a pip package, RNAglib. Data are available in a repository and can be accessed on rnaglib's web page. The source code, data and documentation are available at https://rnaglib.cs.mcgill.ca. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bibliotecas , Software , Aprendizado de Máquina , Documentação , Biblioteca Gênica
11.
Nature ; 599(7883): 114-119, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34488225

RESUMO

The B.1.617.2 (Delta) variant of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was first identified in the state of Maharashtra in late 2020 and spread throughout India, outcompeting pre-existing lineages including B.1.617.1 (Kappa) and B.1.1.7 (Alpha)1. In vitro, B.1.617.2 is sixfold less sensitive to serum neutralizing antibodies from recovered individuals, and eightfold less sensitive to vaccine-elicited antibodies, compared with wild-type Wuhan-1 bearing D614G. Serum neutralizing titres against B.1.617.2 were lower in ChAdOx1 vaccinees than in BNT162b2 vaccinees. B.1.617.2 spike pseudotyped viruses exhibited compromised sensitivity to monoclonal antibodies to the receptor-binding domain and the amino-terminal domain. B.1.617.2 demonstrated higher replication efficiency than B.1.1.7 in both airway organoid and human airway epithelial systems, associated with B.1.617.2 spike being in a predominantly cleaved state compared with B.1.1.7 spike. The B.1.617.2 spike protein was able to mediate highly efficient syncytium formation that was less sensitive to inhibition by neutralizing antibody, compared with that of wild-type spike. We also observed that B.1.617.2 had higher replication and spike-mediated entry than B.1.617.1, potentially explaining the B.1.617.2 dominance. In an analysis of more than 130 SARS-CoV-2-infected health care workers across three centres in India during a period of mixed lineage circulation, we observed reduced ChAdOx1 vaccine effectiveness against B.1.617.2 relative to non-B.1.617.2, with the caveat of possible residual confounding. Compromised vaccine efficacy against the highly fit and immune-evasive B.1.617.2 Delta variant warrants continued infection control measures in the post-vaccination era.


Assuntos
Evasão da Resposta Imune , SARS-CoV-2/crescimento & desenvolvimento , SARS-CoV-2/imunologia , Replicação Viral/imunologia , Anticorpos Neutralizantes/imunologia , Vacinas contra COVID-19/imunologia , Fusão Celular , Linhagem Celular , Feminino , Pessoal de Saúde , Humanos , Índia , Cinética , Masculino , Glicoproteína da Espícula de Coronavírus/metabolismo , Vacinação
12.
Elife ; 102021 08 13.
Artigo em Inglês | MEDLINE | ID: mdl-34387545

RESUMO

Monitoring the spread of SARS-CoV-2 and reconstructing transmission chains has become a major public health focus for many governments around the world. The modest mutation rate and rapid transmission of SARS-CoV-2 prevents the reconstruction of transmission chains from consensus genome sequences, but within-host genetic diversity could theoretically help identify close contacts. Here we describe the patterns of within-host diversity in 1181 SARS-CoV-2 samples sequenced to high depth in duplicate. 95.1% of samples show within-host mutations at detectable allele frequencies. Analyses of the mutational spectra revealed strong strand asymmetries suggestive of damage or RNA editing of the plus strand, rather than replication errors, dominating the accumulation of mutations during the SARS-CoV-2 pandemic. Within- and between-host diversity show strong purifying selection, particularly against nonsense mutations. Recurrent within-host mutations, many of which coincide with known phylogenetic homoplasies, display a spectrum and patterns of purifying selection more suggestive of mutational hotspots than recombination or convergent evolution. While allele frequencies suggest that most samples result from infection by a single lineage, we identify multiple putative examples of co-infection. Integrating these results into an epidemiological inference framework, we find that while sharing of within-host variants between samples could help the reconstruction of transmission chains, mutational hotspots and rare cases of superinfection can confound these analyses.


The COVID-19 pandemic has had major health impacts across the globe. The scientific community has focused much attention on finding ways to monitor how the virus responsible for the pandemic, SARS-CoV-2, spreads. One option is to perform genetic tests, known as sequencing, on SARS-CoV-2 samples to determine the genetic code of the virus and to find any differences or mutations in the genes between the viral samples. Viruses mutate within their hosts and can develop into variants that are able to more easily transmit between hosts. Genetic sequencing can reveal how genetically similar two SARS-CoV-2 samples are. But tracking how SARS-CoV-2 moves from one person to the next through sequencing can be tricky. Even a sample of SARS-CoV-2 viruses from the same individual can display differences in their genetic material or within-host variants. Could genetic testing of within-host variants shed light on factors driving SARS-CoV-2 to evolve in humans? To get to the bottom of this, Tonkin-Hill, Martincorena et al. probed the genetics of SARS-CoV-2 within-host variants using 1,181 samples. The analyses revealed that 95.1% of samples contained within-host variants. A number of variants occurred frequently in many samples, which were consistent with mutational hotspots in the SARS-CoV-2 genome. In addition, within-host variants displayed mutation patterns that were similar to patterns found between infected individuals. The shared within-host variants between samples can help to reconstruct transmission chains. However, the observed mutational hotspots and the detection of multiple strains within an individual can make this challenging. These findings could be used to help predict how SARS-CoV-2 evolves in response to interventions such as vaccines. They also suggest that caution is needed when using information on within-host variants to determine transmission between individuals.


Assuntos
COVID-19/genética , COVID-19/fisiopatologia , Variação Genética , Genoma Viral , Interações Hospedeiro-Patógeno/genética , Mutação , SARS-CoV-2/genética , Sequência de Bases , Humanos , Pandemias , Filogenia
13.
Elife ; 102021 08 24.
Artigo em Inglês | MEDLINE | ID: mdl-34425938

RESUMO

SARS-CoV-2 is notable both for its rapid spread, and for the heterogeneity of its patterns of transmission, with multiple published incidences of superspreading behaviour. Here, we applied a novel network reconstruction algorithm to infer patterns of viral transmission occurring between patients and health care workers (HCWs) in the largest clusters of COVID-19 infection identified during the first wave of the epidemic at Cambridge University Hospitals NHS Foundation Trust, UK. Based upon dates of individuals reporting symptoms, recorded individual locations, and viral genome sequence data, we show an uneven pattern of transmission between individuals, with patients being much more likely to be infected by other patients than by HCWs. Further, the data were consistent with a pattern of superspreading, whereby 21% of individuals caused 80% of transmission events. Our study provides a detailed retrospective analysis of nosocomial SARS-CoV-2 transmission, and sheds light on the need for intensive and pervasive infection control procedures.


The COVID-19 pandemic, caused by the SARS-CoV-2 virus, presents a global public health challenge. Hospitals have been at the forefront of this battle, treating large numbers of sick patients over several waves of infection. Finding ways to manage the spread of the virus in hospitals is key to protecting vulnerable patients and workers, while keeping hospitals running, but to generate effective infection control, researchers must understand how SARS-CoV-2 spreads. A range of factors make studying the transmission of SARS-CoV-2 in hospitals tricky. For instance, some people do not present any symptoms, and, amongst those who do, it can be difficult to determine whether they caught the virus in the hospital or somewhere else. However, comparing the genetic information of the SARS-CoV-2 virus from different people in a hospital could allow scientists to understand how it spreads. Samples of the genetic material of SARS-CoV-2 can be obtained by swabbing infected individuals. If the genetic sequences of two samples are very different, it is unlikely that the individuals who provided the samples transmitted the virus to one another. Illingworth, Hamilton et al. used this information, along with other data about how SARS-CoV-2 is transmitted, to develop an algorithm that can determine how the virus spreads from person to person in different hospital wards. To build their algorithm, Illingworth, Hamilton et al. collected SARS-CoV-2 genetic data from patients and staff in a hospital, and combined it with information about how SARS-CoV-2 spreads and how these people moved in the hospital . The algorithm showed that, for the most part, patients were infected by other patients (20 out of 22 cases), while staff were infected equally by patients and staff. By further probing these data, Illingworth, Hamilton et al. revealed that 80% of hospital-acquired infections were caused by a group of just 21% of individuals in the study, identifying a 'superspreader' pattern. These findings may help to inform SARS-CoV-2 infection control measures to reduce spread within hospitals, and could potentially be used to improve infection control in other contexts.


Assuntos
COVID-19/epidemiologia , COVID-19/transmissão , Surtos de Doenças/estatística & dados numéricos , Hospitais/estatística & dados numéricos , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Retrospectivos
15.
Elife ; 102021 03 02.
Artigo em Inglês | MEDLINE | ID: mdl-33650490

RESUMO

COVID-19 poses a major challenge to care homes, as SARS-CoV-2 is readily transmitted and causes disproportionately severe disease in older people. Here, 1167 residents from 337 care homes were identified from a dataset of 6600 COVID-19 cases from the East of England. Older age and being a care home resident were associated with increased mortality. SARS-CoV-2 genomes were available for 700 residents from 292 care homes. By integrating genomic and temporal data, 409 viral clusters within the 292 homes were identified, indicating two different patterns - outbreaks among care home residents and independent introductions with limited onward transmission. Approximately 70% of residents in the genomic analysis were admitted to hospital during the study, providing extensive opportunities for transmission between care homes and hospitals. Limiting viral transmission within care homes should be a key target for infection control to reduce COVID-19 mortality in this population.


Assuntos
COVID-19/epidemiologia , COVID-19/transmissão , Casas de Saúde , SARS-CoV-2/genética , Idoso de 80 Anos ou mais , COVID-19/virologia , Surtos de Doenças , Inglaterra/epidemiologia , Feminino , Humanos , Transmissão de Doença Infecciosa do Paciente para o Profissional , Transmissão de Doença Infecciosa do Profissional para o Paciente , Masculino , Polimorfismo de Nucleotídeo Único , Análise de Sequência , Fatores de Tempo
16.
Nat Commun ; 11(1): 6385, 2020 12 14.
Artigo em Inglês | MEDLINE | ID: mdl-33318491

RESUMO

The response to the coronavirus disease 2019 (COVID-19) pandemic has been hampered by lack of an effective severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) antiviral therapy. Here we report the use of remdesivir in a patient with COVID-19 and the prototypic genetic antibody deficiency X-linked agammaglobulinaemia (XLA). Despite evidence of complement activation and a robust T cell response, the patient developed persistent SARS-CoV-2 pneumonitis, without progressing to multi-organ involvement. This unusual clinical course is consistent with a contribution of antibodies to both viral clearance and progression to severe disease. In the absence of these confounders, we take an experimental medicine approach to examine the in vivo utility of remdesivir. Over two independent courses of treatment, we observe a temporally correlated clinical and virological response, leading to clinical resolution and viral clearance, with no evidence of acquired drug resistance. We therefore provide evidence for the antiviral efficacy of remdesivir in vivo, and its potential benefit in selected patients.


Assuntos
Monofosfato de Adenosina/análogos & derivados , Alanina/análogos & derivados , Imunidade Humoral/efeitos dos fármacos , SARS-CoV-2/efeitos dos fármacos , Monofosfato de Adenosina/uso terapêutico , Adulto , Alanina/uso terapêutico , Antivirais/uso terapêutico , COVID-19/virologia , Febre/prevenção & controle , Humanos , Imunidade Humoral/imunologia , Contagem de Linfócitos , Masculino , SARS-CoV-2/imunologia , SARS-CoV-2/fisiologia , Resultado do Tratamento
17.
Bioinformatics ; 36(Suppl_1): i276-i284, 2020 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-32657407

RESUMO

MOTIVATION: RNA-protein interactions are key effectors of post-transcriptional regulation. Significant experimental and bioinformatics efforts have been expended on characterizing protein binding mechanisms on the molecular level, and on highlighting the sequence and structural traits of RNA that impact the binding specificity for different proteins. Yet our ability to predict these interactions in silico remains relatively poor. RESULTS: In this study, we introduce RPI-Net, a graph neural network approach for RNA-protein interaction prediction. RPI-Net learns and exploits a graph representation of RNA molecules, yielding significant performance gains over existing state-of-the-art approaches. We also introduce an approach to rectify an important type of sequence bias caused by the RNase T1 enzyme used in many CLIP-Seq experiments, and we show that correcting this bias is essential in order to learn meaningful predictors and properly evaluate their accuracy. Finally, we provide new approaches to interpret the trained models and extract simple, biologically interpretable representations of the learned sequence and structural motifs. AVAILABILITY AND IMPLEMENTATION: Source code can be accessed at https://www.github.com/HarveyYan/RNAonGraph. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Redes Neurais de Computação , RNA , Ligação Proteica , Estrutura Secundária de Proteína , RNA/metabolismo , Software
18.
Nucleic Acids Res ; 48(14): 7690-7699, 2020 08 20.
Artigo em Inglês | MEDLINE | ID: mdl-32652015

RESUMO

RNA-small molecule binding is a key regulatory mechanism which can stabilize 3D structures and activate molecular functions. The discovery of RNA-targeting compounds is thus a current topic of interest for novel therapies. Our work is a first attempt at bringing the scalability and generalization abilities of machine learning methods to the problem of RNA drug discovery, as well as a step towards understanding the interactions which drive binding specificity. Our tool, RNAmigos, builds and encodes a network representation of RNA structures to predict likely ligands for novel binding sites. We subject ligand predictions to virtual screening and show that we are able to place the true ligand in the 71st-73rd percentile in two decoy libraries, showing a significant improvement over several baselines, and a state of the art method. Furthermore, we observe that augmenting structural networks with non-canonical base pairing data is the only representation able to uncover a significant signal, suggesting that such interactions are a necessary source of binding specificity. We also find that pre-training with an auxiliary graph representation learning task significantly boosts performance of ligand prediction. This finding can serve as a general principle for RNA structure-function prediction when data is scarce. RNAmigos shows that RNA binding data contains structural patterns with potential for drug discovery, and provides methodological insights for possible applications to other structure-function learning tasks. The source code, data and a Web server are freely available at http://rnamigos.cs.mcgill.ca.


Assuntos
RNA/química , Software , Pareamento de Bases , Sítios de Ligação , Ligantes , Conformação de Ácido Nucleico
19.
Lancet Infect Dis ; 20(11): 1263-1272, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-32679081

RESUMO

BACKGROUND: The burden and influence of health-care associated severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections is unknown. We aimed to examine the use of rapid SARS-CoV-2 sequencing combined with detailed epidemiological analysis to investigate health-care associated SARS-CoV-2 infections and inform infection control measures. METHODS: In this prospective surveillance study, we set up rapid SARS-CoV-2 nanopore sequencing from PCR-positive diagnostic samples collected from our hospital (Cambridge, UK) and a random selection from hospitals in the East of England, enabling sample-to-sequence in less than 24 h. We established a weekly review and reporting system with integration of genomic and epidemiological data to investigate suspected health-care associated COVID-19 cases. FINDINGS: Between March 13 and April 24, 2020, we collected clinical data and samples from 5613 patients with COVID-19 from across the East of England. We sequenced 1000 samples producing 747 high-quality genomes. We combined epidemiological and genomic analysis of the 299 patients from our hospital and identified 35 clusters of identical viruses involving 159 patients. 92 (58%) of 159 patients had strong epidemiological links and 32 (20%) patients had plausible epidemiological links. These results were fed back to clinical, infection control, and hospital management teams, leading to infection-control interventions and informing patient safety reporting. INTERPRETATION: We established real-time genomic surveillance of SARS-CoV-2 in a UK hospital and showed the benefit of combined genomic and epidemiological analysis for the investigation of health-care associated COVID-19. This approach enabled us to detect cryptic transmission events and identify opportunities to target infection-control interventions to further reduce health-care associated infections. Our findings have important implications for national public health policy as they enable rapid tracking and investigation of infections in hospital and community settings. FUNDING: COVID-19 Genomics UK funded by the Department of Health and Social Care, UK Research and Innovation, and the Wellcome Sanger Institute.


Assuntos
Betacoronavirus/genética , Infecções por Coronavirus/epidemiologia , Infecções por Coronavirus/prevenção & controle , Infecção Hospitalar/epidemiologia , Infecção Hospitalar/prevenção & controle , Controle de Infecções/métodos , Pandemias/prevenção & controle , Pneumonia Viral/epidemiologia , Pneumonia Viral/prevenção & controle , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , COVID-19 , Criança , Pré-Escolar , Infecções por Coronavirus/virologia , Infecção Hospitalar/virologia , Inglaterra/epidemiologia , Feminino , Genoma Viral/genética , Hospitais Universitários , Humanos , Lactente , Recém-Nascido , Masculino , Pessoa de Meia-Idade , Segurança do Paciente , Filogenia , Pneumonia Viral/virologia , Reação em Cadeia da Polimerase/métodos , Polimorfismo de Nucleotídeo Único , Estudos Prospectivos , SARS-CoV-2 , Sequenciamento Completo do Genoma/métodos , Adulto Jovem
20.
Elife ; 92020 05 11.
Artigo em Inglês | MEDLINE | ID: mdl-32392129

RESUMO

Significant differences exist in the availability of healthcare worker (HCW) SARS-CoV-2 testing between countries, and existing programmes focus on screening symptomatic rather than asymptomatic staff. Over a 3 week period (April 2020), 1032 asymptomatic HCWs were screened for SARS-CoV-2 in a large UK teaching hospital. Symptomatic staff and symptomatic household contacts were additionally tested. Real-time RT-PCR was used to detect viral RNA from a throat+nose self-swab. 3% of HCWs in the asymptomatic screening group tested positive for SARS-CoV-2. 17/30 (57%) were truly asymptomatic/pauci-symptomatic. 12/30 (40%) had experienced symptoms compatible with coronavirus disease 2019 (COVID-19)>7 days prior to testing, most self-isolating, returning well. Clusters of HCW infection were discovered on two independent wards. Viral genome sequencing showed that the majority of HCWs had the dominant lineage B∙1. Our data demonstrates the utility of comprehensive screening of HCWs with minimal or no symptoms. This approach will be critical for protecting patients and hospital staff.


Patients admitted to NHS hospitals are now routinely screened for SARS-CoV-2 (the virus that causes COVID-19), and isolated from other patients if necessary. Yet healthcare workers, including frontline patient-facing staff such as doctors, nurses and physiotherapists, are only tested and excluded from work if they develop symptoms of the illness. However, there is emerging evidence that many people infected with SARS-CoV-2 never develop significant symptoms: these people will therefore be missed by 'symptomatic-only' testing. There is also important data showing that around half of all transmissions of SARS-CoV-2 happen before the infected individual even develops symptoms. This means that much broader testing programs are required to spot people when they are most infectious. Rivett, Sridhar, Sparkes, Routledge et al. set out to determine what proportion of healthcare workers was infected with SARS-CoV-2 while also feeling generally healthy at the time of testing. Over 1,000 staff members at a large UK hospital who felt they were well enough to work, and did not fit the government criteria for COVID-19 infection, were tested. Amongst these, 3% were positive for SARS-CoV-2. On closer questioning, around one in five reported no symptoms, two in five very mild symptoms that they had dismissed as inconsequential, and a further two in five reported COVID-19 symptoms that had stopped more than a week previously. In parallel, healthcare workers with symptoms of COVID-19 (and their household contacts) who were self-isolating were also tested, in order to allow those without the virus to quickly return to work and bolster a stretched workforce. Finally, the rates of infection were examined to probe how the virus could have spread through the hospital and among staff ­ and in particular, to understand whether rates of infection were greater among staff working in areas devoted to COVID-19 patients. Despite wearing appropriate personal protective equipment, healthcare workers in these areas were almost three times more likely to test positive than those working in areas without COVID-19 patients. However, it is not clear whether this genuinely reflects greater rates of patients passing the infection to staff. Staff may give the virus to each other, or even acquire it at home. Overall, this work implies that hospitals need to be vigilant and introduce broad screening programmes across their workforces. It will be vital to establish such approaches before 'lockdown' is fully lifted, so healthcare institutions are prepared for any second peak of infections.


Assuntos
Infecções Assintomáticas , Técnicas de Laboratório Clínico , Pessoal de Saúde , Betacoronavirus/fisiologia , COVID-19 , Teste para COVID-19 , Vacinas contra COVID-19 , Infecções por Coronavirus/diagnóstico , Infecções por Coronavirus/epidemiologia , Infecções por Coronavirus/transmissão , Feminino , Humanos , Controle de Infecções , Masculino , Pandemias , Pneumonia Viral/diagnóstico , Pneumonia Viral/epidemiologia , Pneumonia Viral/transmissão , Reação em Cadeia da Polimerase em Tempo Real , SARS-CoV-2 , Reino Unido/epidemiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA