RESUMO
BACKGROUND: Respiratory illness affects individuals across all age demographics on a global scale, often precipitated by viral infections. The symptomatic manifestations of these diseases bear clinical resemblance, complicating the accurate determination of their etiological origins. Furthermore, the diagnostic panels for respiratory pathogens used within local medical practices, may not encompass the full spectrum of viral agents responsible for such ailments. Consequently, a significant number of clinically important viral pathogens may remain undetected. METHODS AND FINDINGS: In the light of this, we conducted a metagenomic examination of 66 nasopharyngeal swab specimens, obtained from patients presenting with acute respiratory conditions yet tested negative by the standard diagnostic panels available locally. These specimens were obtained from the Public Health Laboratory, Maceio, State of Alagoas. Our findings indicate a predominant diagnostic escape of rhinoviruses and notably enterovirus D68. Moreover, our study identified a substantial quantity of sequence reads attributed to human respirovirus 3 (human parainfluenza 3) along with various herpresviruses including human herpesvirus-1, Epstein-Barr virus (Human herpesvirus-4), Human herpesviruses 6 and 7 and human parvovirus B19 (B19V). Notably, the metagenomic analysis uncovered a widespread presence of the emerging human vientovirus FB in most of sample pools, though its clinical importance remains to be elucidated. CONCLUSIONS: The obtained results in this study underscore the invaluable role of viral metagenomics in the identification of underrecognized viruses bearing clinical relevance. Furthermore, it offers insights into the dissemination of these pathogens within the studied area, thereby informing public health strategies aimed at enhancing diagnostic accuracy and improving patient care.
Assuntos
Metagenômica , Nasofaringe , Infecções Respiratórias , Humanos , Brasil/epidemiologia , Metagenômica/métodos , Feminino , Adulto , Adolescente , Masculino , Criança , Pré-Escolar , Adulto Jovem , Pessoa de Meia-Idade , Infecções Respiratórias/virologia , Infecções Respiratórias/epidemiologia , Nasofaringe/virologia , Viroses/virologia , Viroses/epidemiologia , Vírus/genética , Vírus/classificação , Vírus/isolamento & purificação , Lactente , Idoso , Doença AgudaRESUMO
The COVID-19 pandemic underscored the significance of omics technology and Wastewater-Based Epidemiology for epidemic preparedness. This study investigates the virosphere in wastewater samples from Natal (Brazil), aiming to understand its structure, relationships, and potential. Metaviromic analysis was used on DNA and RNA from weekly samples collected over a year (June/2021 to May/2022) from three wastewater treatment plants. The virosphere showed stability, particularly in viruses infecting microorganisms and plants. However, an alternation of representatives of viruses that infect animals has been observed. Among the most abundant viruses infecting microorganisms are genera associated with the bacterial genera Escherichia, Pseudomonas, and Caulobacte. Regarding the viruses infecting plants, Sobemovirus and Tobamovirus are the most abundant genera. Odontoglossum ringspot virus was identified as a possible RNA virus biomarker. Among DNA viruses infecting animals, genera Bocaparvovirus and Mastadenovirus are the most prevalent. Intriguingly, some Poxviridae family members were observed in the samples. Co-occurrence network analysis identified potential biomarkers like Volepox virus, Anatid herpesvirus 1, and Caviid herpesvirus 2. Among RNA viruses affecting animals, Mamastrovirus, Rotavirus, and Norovirus genera were the most abundant pathogens. Furthermore, members of the Coronaviridae family exhibited a high degree of centrality values in the co-occurrence network, even connecting with unclassified viruses. The study emphasizes the importance of research in understanding the roles of unclassified viruses. In addition, we observed an association between Coronaviridae reads, rainfall, and the number of reported COVID-19 cases. Our study highlights the diversity and complexity of the viral community in wastewater and the need for research to understand better the ecological roles unclassified viruses play. Such advances will significantly contribute to our preparedness and response to future viral threats. Furthermore, our study contributes to knowledge of virosphere dynamics, offering insights that can contribute to the direction of future public health policies and interventions.
Assuntos
Águas Residuárias , Brasil , Águas Residuárias/virologia , Vírus/genética , Vírus/isolamento & purificação , Vírus de RNA/genética , Viroma , COVID-19/virologiaRESUMO
Theobroma cacao plantations are of significant economic importance worldwide, primarily for chocolate production. During the harvest and processing of cocoa beans, they are subjected to fermentation either by microorganisms present in the environment (spontaneous fermentation) or the addition of starter cultures, with different strains directly contributing distinct flavor and color characteristics to the beans. In addition to fungi and bacteria, viruses are ubiquitous and can affect the quality of the fermentation process by infecting fermenting organisms, destabilizing microbial diversity, and consequently affecting fermentation quality. Therefore, in this study, we explored publicly available metatranscriptomic libraries of cocoa bean fermentation in Limon Province, Costa Rica, looking for viruses associated with fermenting microorganisms. Libraries were derived from the same sample at different time points: 7, 20, and 68 h of fermentation, corresponding to yeast- and lactic acid bacteria-driven phases. Using a comprehensive pipeline, we identified 68 viral sequences that could be assigned to 62 new viral species and 6 known viruses distributed among at least nine families, with particular abundance of elements from the Lenarviricota phylum. Interestingly, 44 of these sequences were specifically associated with ssRNA phages (Fiersviridae) and mostly fungi-infecting viral families (Botourmiaviridae, Narnaviridae, and Mitoviridae). Of note, viruses from those families show a complex evolutionary relationship, transitioning from infecting bacteria to infecting fungi. We also identified 10 and 3 viruses classified within the Totiviridae and Nodaviridae families, respectively. The quantification of the virus-derived RNAs shows a general pattern of decline, similar to the dynamic profile of some microorganism genera during the fermentation process. Unexpectedly, we identified narnavirus-related elements that showed similarity to segmented viral species. By exploring the molecular characteristics of these viral sequences and applying Hidden Markov Models, we were capable of associating these additional segments with a specific taxon. In summary, our study elucidates the complex virome associated with the microbial consortia engaged in cocoa bean fermentation that could contribute to organism/strain selection, altering metabolite production and, consequently, affecting the sensory characteristics of cocoa beans.
Assuntos
Cacau , Fermentação , Viroma , Cacau/virologia , Cacau/microbiologia , Vírus/genética , Vírus/classificação , Vírus/isolamento & purificação , Fungos/virologia , Fungos/genética , Fungos/classificação , Filogenia , Bacteriófagos/genética , Bacteriófagos/classificação , Bacteriófagos/isolamento & purificação , Costa Rica , Bactérias/genética , Bactérias/classificação , Bactérias/virologia , Metagenômica , Genoma ViralRESUMO
Waste pickers constitute a marginalized demographic engaged in the collection of refuse, facing considerable occupational hazards that heighten their susceptibility to contract infectious diseases. Moreover, waste pickers contend with societal stigmatization and encounter barriers to accessing healthcare services. To explore the viral profile of waste pickers potentially linked to their occupational environment, we conducted a metagenomic analysis on 120 plasma specimens sampled from individuals employed at the Cidade Estrutural dumpsite in Brasilia city, Brazil. In total, 60 blood donors served as a comparative control group. Specimens were pooled and subjected to Illumina NextSeq 2000 sequencing. Viral abundance among waste pickers revealed the presence of significant pathogens, including HIV, HCV, and Chikungunya, which were not detected in the control group. Additionally, elevated levels of anelloviruses and Human pegivirus-1 were noted, with a comparable incidence in the control group. These findings underscore the utility of metagenomics in identifying clinically relevant viral agents within underserved populations. The implications of this study extend to informing public health policies aimed at surveilling infectious diseases among individuals facing socioeconomic disparities and limited access to healthcare resources.
Assuntos
Metagenômica , Humanos , Brasil , Masculino , Adulto , Feminino , Vírus/genética , Vírus/classificação , Vírus/isolamento & purificação , Exposição Ocupacional , Estudos de Casos e Controles , Pessoa de Meia-Idade , Eliminação de ResíduosRESUMO
Viral gastroenteritis is commonly reported in dogs and involves a great diversity of enteric viruses. In this research, viral diversity was investigated in dogs with diarrhea in Northern Brazil using shotgun metagenomics. Furthermore, the presence of norovirus (NoV) was investigated in 282 stool/rectal swabs of young/adult dogs with or without diarrhea from two public kennels, based on one-step reverse transcription polymerase chain reaction (RT-PCR) for genogroup VI and VII (GVI and GVII) and real-time RT-PCR for GI, GII, and GIV. Thirty-one viral families were identified, including bacteriophages. Phylogenetic analyses showed twelve complete or nearly complete genomes belonging to the species of Protoparvovirus carnivoran1, Mamastrovirus 5, Aichivirus A2, Alphacoronavirus 1, and Chipapillomavirus 1. This is the first description of the intestinal virome of dogs in Northern Brazil and the first detection of canine norovirus GVII in the country. These results are important for helping to understand the viral groups that circulate in the canine population.
Assuntos
Doenças do Cão , Gastroenterite , Filogenia , Cães , Animais , Gastroenterite/virologia , Gastroenterite/veterinária , Gastroenterite/epidemiologia , Brasil/epidemiologia , Doenças do Cão/virologia , Doenças do Cão/epidemiologia , Norovirus/genética , Norovirus/classificação , Norovirus/isolamento & purificação , Fezes/virologia , Genoma Viral , Variação Genética , Metagenômica/métodos , Vírus/classificação , Vírus/genética , Vírus/isolamento & purificação , ViromaRESUMO
South American ecosystems host astonishing biodiversity, with potentially great richness in viruses. However, these ecosystems have not yet been the source of any widespread, epidemic viruses. Here we explore a set of putative causes that may explain this apparent paradox. We discuss that human presence in South America is recent, beginning around 14,000 years ago; that few domestications of native species have occurred; and that successive immigration events associated with Old World virus introductions reduced the likelihood of spillovers and adaptation of local viruses into humans. Also, the diversity and ecological characteristics of vertebrate hosts might serve as protective factors. Moreover, although forest areas remained well preserved until recently, current brutal, sudden, and large-scale clear cuts through the forest have resulted in nearly no ecotones, which are essential for creating an adaptive gradient of microbes, hosts, and vectors. This may be temporarily preventing virus emergence. Nevertheless, the mid-term effect of such drastic changes in habitats and landscapes, coupled with explosive urbanization and climate changes, must not be overlooked by health authorities.
Assuntos
Ecossistema , Humanos , América do Sul/epidemiologia , Animais , Viroses/epidemiologia , Viroses/transmissão , Viroses/virologia , Biodiversidade , Vírus/genética , Vírus/classificação , Doenças Transmissíveis Emergentes/virologia , Doenças Transmissíveis Emergentes/epidemiologia , Doenças Transmissíveis Emergentes/transmissão , Mudança ClimáticaRESUMO
Cattle have a significant impact on human societies in terms of both economics and health. Viral infections pose a relevant problem as they directly or indirectly disrupt the balance within cattle populations. This has negative consequences at the economic level for producers and territories, and also jeopardizes human health through the transmission of zoonotic diseases that can escalate into outbreaks or pandemics. To establish prevention strategies and control measures at various levels (animal, farm, region, or global), it is crucial to identify the viral agents present in animals. Various techniques, including virus isolation, serological tests, and molecular techniques like PCR, are typically employed for this purpose. However, these techniques have two major drawbacks: they are ineffective for non-culturable viruses, and they only detect a small fraction of the viruses present. In contrast, metagenomics offers a promising approach by providing a comprehensive and unbiased analysis for detecting all viruses in a given sample. It has the potential to identify rare or novel infectious agents promptly and establish a baseline of healthy animals. Nevertheless, the routine application of viral metagenomics for epidemiological surveillance and diagnostics faces challenges related to socioeconomic variables, such as resource availability and space dedicated to metagenomics, as well as the lack of standardized protocols and resulting heterogeneity in presenting results. This review aims to provide an overview of the current knowledge and prospects for using viral metagenomics to detect and identify viruses in cattle raised for livestock, while discussing the epidemiological and clinical implications.
Assuntos
Doenças dos Bovinos , Metagenômica , Animais , Bovinos , Doenças dos Bovinos/virologia , Doenças dos Bovinos/epidemiologia , Viroses/veterinária , Viroses/virologia , Viroses/epidemiologia , Vírus/isolamento & purificação , Vírus/genética , Vírus/classificação , Gado/virologia , Doenças Transmissíveis Emergentes/veterinária , Doenças Transmissíveis Emergentes/virologia , Doenças Transmissíveis Emergentes/epidemiologiaRESUMO
Marseilleviruses (MsV) are a group of viruses that compose the Marseilleviridae family within the Nucleocytoviricota phylum. They have been found in different samples, mainly in freshwater. MsV are classically organized into five phylogenetic lineages (A/B/C/D/E), but the current taxonomy does not fully represent all the diversity of the MsV lineages. Here, we describe a novel strain isolated from a Brazilian saltwater sample named Marseillevirus cajuinensis. Based on genomics and phylogenetic analyses, M. cajuinensis exhibits a 380,653-bp genome that encodes 515 open reading frames. Additionally, M. cajuinensis encodes a transfer RNA, a feature that is rarely described for Marseilleviridae. Phylogeny suggests that M. cajuinensis forms a divergent branch within the MsV lineage A. Furthermore, our analysis suggests that the common ancestor for the five classical lineages of MsV diversified into three major groups. The organization of MsV into three main groups is reinforced by a comprehensive analysis of clusters of orthologous groups, sequence identities, and evolutionary distances considering several MsV isolates. Taken together, our results highlight the importance of discovering new viruses to expand the knowledge about known viruses that belong to the same lineages or families. This work proposes a new perspective on the Marseilleviridae lineages organization that could be helpful to a future update in the taxonomy of the Marseilleviridae family. IMPORTANCE: Marseilleviridae is a family of viruses whose members were mostly isolated from freshwater samples. In this work, we describe the first Marseillevirus isolated from saltwater samples, which we called Marseillevirus cajuinensis. Most of M. cajuinensis genomic features are comparable to other Marseilleviridae members, such as its high number of unknown proteins. On the other hand, M. cajuinensis encodes a transfer RNA, which is a gene category involved in protein translation that is rarely described in this viral family. Additionally, our phylogenetic analyses suggested the existence of, at least, three major Marseilleviridae groups. These observations provide a new perspective on Marseilleviridae lineages organization, which will be valuable in future updates to the taxonomy of the family since the current official classification does not capture all the Marseilleviridae known diversity.
Assuntos
Genoma Viral , Vírus , Brasil , Evolução Molecular , Genômica/métodos , Fases de Leitura Aberta , Filogenia , RNA Viral/genética , Vírus/classificação , Vírus/genéticaRESUMO
The field of viral genomic studies has experienced an unprecedented increase in data volume. New strains of known viruses are constantly being added to the GenBank database and so are completely new species with little or no resemblance to our databases of sequences. In addition to this, metagenomic techniques have the potential to further increase the number and rate of sequenced genomes. Besides, it is important to consider that viruses have a set of unique features that often break down molecular biology dogmas, e.g., the flux of information from RNA to DNA in retroviruses and the use of RNA molecules as genomes. As a result, extracting meaningful information from viral genomes remains a challenge and standard methods for comparing the unknown and our databases of characterized sequences may need adaptations. Thus, several bioinformatic approaches and tools have been created to address the challenge of analyzing viral data. This chapter offers descriptions and protocols of some of the most important bioinformatic techniques for comparative analysis of viruses. The authors also provide comments and discussion on how viruses' unique features can affect standard analyses and how to overcome some of the major sources of problems. Protocols and topics emphasize online tools (which are more accessible to users) and give the real experience of what most bioinformaticians do in day-by-day work with command-line pipelines. The topics discussed include (1) clustering related genomes, (2) whole genome multiple sequence alignments for small RNA viruses, (3) protein alignment for marker genes and species affiliation, (4) variant calling and annotation, and (5) virome analyses and pathogen identification.
Assuntos
Biologia Computacional , Genoma Viral , Vírus , Biologia Computacional/métodos , Vírus/genética , Vírus/classificação , Software , Bases de Dados GenéticasRESUMO
AIMS: The present study aimed to use a conventional and metagenomic approach to investigate the microbiological diversity of water bodies in a network of drainage channels and rivers located in the central area of the city of Belém, northern Brazil, which is considered one of the largest cities in the Brazilian Amazon. METHODS AND RESULTS: In eight of the analyzed points, both bacterial and viral microbiological indicators of environmental contamination-physical-chemical and metals-were assessed. The bacterial resistance genes, drug resistance mechanisms, and viral viability in the environment were also assessed. A total of 473 families of bacteria and 83 families of viruses were identified. Based on the analysis of metals, the levels of three metals (Cd, Fe, and Mn) were found to be above the recommended acceptable level by local legislation. The levels of the following three physicochemical parameters were also higher than recommended: biochemical oxygen demand, dissolved oxygen, and turbidity. Sixty-three bacterial resistance genes that conferred resistance to 13 different classes of antimicrobials were identified. Further, five mechanisms of antimicrobial resistance were identified and viral viability in the environment was confirmed. CONCLUSIONS: Intense human actions combined with a lack of public policies and poor environmental education of the population cause environmental degradation, especially in water bodies. Thus, urgent interventions are warranted to restore the quality of this precious and scarce asset worldwide.
Assuntos
Bactérias , Metagenômica , Microbiologia da Água , Brasil , Bactérias/genética , Bactérias/isolamento & purificação , Bactérias/classificação , Bactérias/efeitos dos fármacos , Saúde Ambiental , Rios/microbiologia , Rios/virologia , Vírus/genética , Vírus/isolamento & purificação , Monitoramento Ambiental , Farmacorresistência Bacteriana/genética , Humanos , Cidades , Metais/farmacologiaRESUMO
In this study, we analyzed the potential of viral infections in the species Homo sapiens as environmental causes of orofacial clefts (OFCs). A scoring system was adapted for qualitatively assessing the potential of viruses to cause cleft lip and/or palate (CL/P). This assessment considered factors such as information from the literature, nucleotide and amino acid similarities, and the presence of Endogenous Viral Elements (EVEs). The analysis involved various algorithm packages within Basic Local Alignment Search Tool 2.13.0 software and databases from the National Center for Biotechnology Information and the International Committee on Taxonomy of Viruses. Twenty significant viral species using different biosynthesis strategies were identified: Human coronavirus NL63, Rio Negro virus, Alphatorquevirus homin9, Brisavirus, Cosavirus B, Torque teno mini virus 4, Bocaparvovirus primate2, Human coronavirus HKU1, Monkeypox virus, Mammarenavirus machupoense, Volepox virus, Souris mammarenavirus, Gammapapillomavirus 7, Betainfluenzavirus influenzae, Lymphocytic choriomeningitis mammarenavirus, Ledantevirus kern, Gammainfluenzavirus influenzae, Betapolyomavirus hominis, Vesiculovirus perinet, and Cytomegalovirus humanbeta5. The evident viral etiological potential in relation to CL/P varies depending on the Baltimore class to which the viral species belongs. Given the multifactorial nature of CL/P, this relationship appears to be dynamic.
Assuntos
Fenda Labial , Fissura Palatina , Vírus , Fenda Labial/virologia , Humanos , Fissura Palatina/virologia , Vírus/genética , Vírus/classificação , Vírus/isolamento & purificação , Viroses/virologia , AnimaisRESUMO
The frequency of respiratory viruses in people living with HIV (PLHIV) and their impact on lung function remain unclear. We aimed to determine the frequency of respiratory viruses in bronchoalveolar lavage and induced sputum samples in PLHIV and correlate their presence with lung function. A prospective cohort of adults hospitalized in Medellín between September 2016 and December 2018 included three groups: group 1 = people diagnosed with HIV and a diagnosis of community-acquired pneumonia (CAP), group 2 = HIV, and group 3 = CAP. People were followed up with at months 1, 6, and 12. Clinical, microbiological, and spirometric data were collected. Respiratory viruses were detected by multiplex RT-PCR. Sixty-five patients were included. At least 1 respiratory virus was identified in 51.9%, 45.1%, and 57.1% of groups 1, 2 and 3, respectively. Among these, 89% of respiratory viruses were detected with another pathogen, mainly Mycobacterium tuberculosis (40.7%) and Pneumocystis jirovecii (22.2%). The most frequent respiratory virus was rhinovirus (24/65, 37%). On admission, 30.4% of group 1, 16.6% of group 2, and 50% of group 3 had airflow limitation, with alteration in forced expiratory volume at first second in both groups with pneumonia compared to HIV. Respiratory viruses are frequent in people diagnosed with HIV, generally coexisting with other pathogens. Pulmonary function on admission was affected in patients with pneumonia, improving significantly in the 1st, 6th, and 12th months after CAP onset.
Assuntos
Infecções por HIV , Pneumonia , Vírus , Adulto , Humanos , Estudos Prospectivos , Seguimentos , Pneumonia/epidemiologia , Vírus/genética , Pulmão , Infecções por HIV/complicaçõesRESUMO
Due to the enormous diversity of non-culturable viruses, new viruses must be characterized using culture-independent techniques. The associated host is an important phenotypic feature that can be inferred from metagenomic viral contigs thanks to the development of several bioinformatic tools. Here, we compare the performance of recently developed virus-host prediction tools on a dataset of 1,046 virus-host pairs and then apply the best-performing tools to a metagenomic dataset derived from a highly diverse transiently hypersaline site known as the Archaean Domes (AD) within the Cuatro Ciénegas Basin, Coahuila, Mexico. Among host-dependent methods, alignment-based approaches had a precision of 66.07% and a sensitivity of 24.76%, while alignment-free methods had an average precision of 75.7% and a sensitivity of 57.5%. RaFAH, a virus-dependent alignment-based tool, had the best overall performance (F1_score = 95.7%). However, when predicting the host of AD viruses, methods based on public reference databases (such as RaFAH) showed lower inter-method agreement than host-dependent methods run against custom databases constructed from prokaryotes inhabiting AD. Methods based on custom databases also showed the greatest agreement between the source environment and the predicted host taxonomy, habitat, lifestyle, or metabolism. This highlights the value of including custom data when predicting hosts on a highly diverse metagenomic dataset, and suggests that using a combination of methods and qualitative validations related to the source environment and predicted host biology can increase the number of correct predictions. Finally, these predictions suggest that AD viruses infect halophilic archaea as well as a variety of bacteria that may be halophilic, halotolerant, alkaliphilic, thermophilic, oligotrophic, sulfate-reducing, or marine, which is consistent with the specific environment and the known geological and biological evolution of the Cuatro Ciénegas Basin and its microorganisms.
Assuntos
Vírus , México , Filogenia , Vírus/genética , Bactérias , EcossistemaRESUMO
Bats are important reservoirs for many zoonotic viruses. To explore and monitor potential novel viruses carried by bats, 21 liver samples of bats (Hipposideros armiger) were collected from Yunnan Province in southern China. Only one (4.8%) of all models was detected with adenovirus. The whole genome strain obtained by the viral metagenomics method combined with PCR was temporarily named YN01. The complete genome of YN01 was 37,676 bp, with a G + C content of 55.20% and 28 open reading frames. Phylogenetic analysis indicated that the strain YN01 can be classified as genus Mastadenovirus and was the most similar to the adenovirus isolated from Rhinolophus sinicus in China in 2016. The analysis is needed to verify the possibility of cross-species transmission. This virological investigation has increased our understanding of the ecology of bat-borne viruses in this area and provided a reference for possible future infectious diseases.
Assuntos
Infecções por Adenoviridae , Quirópteros , Vírus , Animais , Adenoviridae/genética , Filogenia , China , Infecções por Adenoviridae/veterinária , Vírus/genética , Fígado , Genoma ViralRESUMO
BACKGROUND: Bats are renowned for harboring a high viral diversity, their characteristics contribute to emerging infectious diseases. However, environmental and anthropic factors also play a significant role in the emergence of zoonotic viruses. Metagenomic is an important tool for investigating the virome of bats and discovering new viruses. RESULTS: Twenty-four families of virus were detected in lung samples by sequencing and bioinfomatic analysis, the largest amount of reads was focused on the Retroviridae and contigs assembled to Desmodus rotundus endogenous retrovirus, which was feasible to acquire complete sequences. The reads were also abundant for phages. CONCLUSION: This lung virome of D. rotundus contributes valuable information regarding the viral diversity found in bats, which is useful for understanding the drivers of viral cycles and their ecology in this species. The identification and taxonomic categorization of viruses hosted by bats carry epidemiological significance due to the potential for viral adaptation to other animals and humans, which can have severe repercussions for public health. Furthermore, the characterization of endogenized viruses helps to understanding the host genome and the evolution of the species.
Assuntos
Bacteriófagos , Quirópteros , Vírus , Animais , Quirópteros/virologia , Ecologia , Filogenia , Viroma/genética , Vírus/genéticaRESUMO
Throughout evolution, pathogenic viruses have developed different strategies to evade the response of the adaptive immune system. To carry out successful replication, some pathogenic viruses encode different proteins that manipulate the molecular mechanisms of host cells. Currently, there are different bioinformatics tools for virus research; however, none of them focus on predicting viral proteins that evade the adaptive system. In this work, we have developed a novel tool based on machine and deep learning for predicting this type of viral protein named VirusHound-I. This tool is based on a model developed with the multilayer perceptron algorithm using the dipeptide composition molecular descriptor. In this study, we have also demonstrated the robustness of our strategy for data augmentation of the positive dataset based on generative antagonistic networks. During the 10-fold cross-validation step in the training dataset, the predictive model showed 0.947 accuracy, 0.994 precision, 0.943 F1 score, 0.995 specificity, 0.896 sensitivity, 0.894 kappa, 0.898 Matthew's correlation coefficient and 0.989 AUC. On the other hand, during the testing step, the model showed 0.964 accuracy, 1.0 precision, 0.967 F1 score, 1.0 specificity, 0.936 sensitivity, 0.929 kappa, 0.931 Matthew's correlation coefficient and 1.0 AUC. Taking this model into account, we have developed a tool called VirusHound-I that makes it possible to predict viral proteins that evade the host's adaptive immune system. We believe that VirusHound-I can be very useful in accelerating studies on the molecular mechanisms of evasion of pathogenic viruses, as well as in the discovery of therapeutic targets.
Assuntos
Proteínas Virais , Vírus , Proteínas Virais/genética , Proteínas Virais/química , Algoritmo Florestas Aleatórias , Redes Neurais de Computação , Algoritmos , Vírus/genéticaRESUMO
The Special Issue "Emerging Viruses: Surveillance, Prevention, Evolution and Control" has been published annually by Viruses, since 2019, highlighting the increasing effort of the scientific community for the surveillance and further research of new emerging or re-emerging viruses [...].
Assuntos
Doenças Transmissíveis Emergentes , Vírus , Vírus/genética , Doenças Transmissíveis Emergentes/virologiaRESUMO
This study reports the virome investigation of pollinator species and other floral visitors associated with plants from the south of Bahia: Aphis aurantii, Atrichopogon sp., Dasyhelea sp., Forcipomyia taiwana, and Trigona ventralis hoozana. Studying viruses in insects associated with economically important crops is vital to understand transmission dynamics and manage viral diseases that pose as threats for global food security. Using literature mining and public RNA next-generation sequencing data deposited in the NCBI SRA database, we identified potential vectors associated with Malvaceae plant species and characterized the microbial communities resident in these insects. Bacteria and Eukarya dominated the metagenomic analyses of all taxon groups. We also found sequences showing similarity to elements from several viral families, including Bunyavirales, Chuviridae, Iflaviridae, Narnaviridae, Orthomyxoviridae, Rhabdoviridae, Totiviridae, and Xinmoviridae. Phylogenetic analyses indicated the existence of at least 16 new viruses distributed among A. aurantii (3), Atrichopogon sp. (4), Dasyhelea sp. (3), and F. taiwana (6). No novel viruses were found for T. ventralis hoozana. For F. taiwana, the available libraries also allowed us to suggest possible vertical transmission, while for A. aurantii we followed the infection profile along the insect development. Our results highlight the importance of studying the virome of insect species associated with crop pollination, as they may play a crucial role in the transmission of viruses to economically important plants, such as those of the genus Theobroma, or they will reduce the pollination process. This information may be valuable in developing strategies to mitigate the spread of viruses and protect the global industry.