Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Ecotoxicol Environ Saf ; 267: 115659, 2023 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-37944467

RESUMO

Plastic pollution has become a global issue nowadays. Due to the increased population in developing countries, we largely depend on fish from our aquaculture industry to meet the required protein demand. Though several studies documented plastic ingestion in freshwater and marine organisms, very limited studies have been conducted to elucidate microplastic (MP) contamination in commercial fish feed. Therefore, this study was designed to identify, quantify, and characterize microplastics (MPs) in commercial fish feeds in Bangladesh and assess possible health risks in fish consuming different commercial fish feeds. All fish feed samples were 100 % contaminated with MPs, where the mean abundance of MPs ranged between 500 and 2200 MPs/kg. No significant differences among different types of feeds (e.g., starter, grower, and finisher) were observed in terms of MPs abundance (F = 0.999, p = 0.385). This study revealed that fiber was the most dominant shape of MPs (90 %), while the most dominant color of MPs was red (34 %), followed by black (31 %) and blue (19 %). The 100-1500 µm size class covers 88 % of the total MPs in the collected fish feed samples. Identified polymers in the samples were polyethylene (PE, 37.71 %), polyvinyl chloride (PVC, 27.14 %), polypropylene (PP, 22.08 %), and polyethylene terephthalate (PET, 13.07 %), respectively, where PE and PVC fall under the risk category IV to V. The Pollution load index (PLI) values of all fish feed samples were <10, indicating the risk category of I (low risk). Therefore, this study highly recommended avoiding plastic materials in the packaging and storing purposes of feed ingredients in the feed mills to ensure contamination-free fish feed for sustainable aquaculture.


Assuntos
Países em Desenvolvimento , Poluentes Químicos da Água , Animais , Microplásticos , Plásticos , Aquicultura , Peixes , Polietileno , Monitoramento Ambiental
2.
Bioinformatics ; 36(20): 5120-5121, 2020 12 22.
Artigo em Inglês | MEDLINE | ID: mdl-32683454

RESUMO

SUMMARY: We present GeoBoost2, a natural language-processing pipeline for extracting the location of infected hosts for enriching metadata in nucleotide sequences repositories like National Center of Biotechnology Information's GenBank for downstream analysis including phylogeography and genomic epidemiology. The increasing number of pathogen sequences requires complementary information extraction methods for focused research, including surveillance within countries and between borders. In this article, we describe the enhancements from our earlier release including improvement in end-to-end extraction performance and speed, availability of a fully functional web-interface and state-of-the-art methods for location extraction using deep learning. AVAILABILITY AND IMPLEMENTATION: Application is freely available on the web at https://zodo.asu.edu/geoboost2. Source code, usage examples and annotated data for GeoBoost2 is freely available at https://github.com/ZooPhy/geoboost2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bases de Dados de Ácidos Nucleicos , Metadados , Genômica , Filogeografia , Software
3.
Bioinformatics ; 34(9): 1606-1608, 2018 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-29240889

RESUMO

Summary: GeoBoost is a command-line software package developed to address sparse or incomplete metadata in GenBank sequence records that relate to the location of the infected host (LOIH) of viruses. Given a set of GenBank accession numbers corresponding to virus GenBank records, GeoBoost extracts, integrates and normalizes geographic information reflecting the LOIH of the viruses using integrated information from GenBank metadata and related full-text publications. In addition, to facilitate probabilistic geospatial modeling, GeoBoost assigns probability scores for each possible LOIH. Availability and implementation: Binaries and resources required for running GeoBoost are packed into a single zipped file and freely available for download at https://tinyurl.com/geoboost. A video tutorial is included to help users quickly and easily install and run the software. The software is implemented in Java 1.8, and supported on MS Windows and Linux platforms. Contact: gragon@upenn.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Metadados , Vírus , Bases de Dados de Ácidos Nucleicos , Software
4.
Brief Bioinform ; 17(1): 33-42, 2016 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-26420781

RESUMO

Precision medicine will revolutionize the way we treat and prevent disease. A major barrier to the implementation of precision medicine that clinicians and translational scientists face is understanding the underlying mechanisms of disease. We are starting to address this challenge through automatic approaches for information extraction, representation and analysis. Recent advances in text and data mining have been applied to a broad spectrum of key biomedical questions in genomics, pharmacogenomics and other fields. We present an overview of the fundamental methods for text and data mining, as well as recent advances and emerging applications toward precision medicine.


Assuntos
Mineração de Dados/tendências , Biologia Computacional/tendências , Interpretação Estatística de Dados , Reposicionamento de Medicamentos/estatística & dados numéricos , Genômica/estatística & dados numéricos , Humanos , Farmacogenética/estatística & dados numéricos , Medicina de Precisão/estatística & dados numéricos , Medicina de Precisão/tendências , Transdução de Sinais , Toxicologia/estatística & dados numéricos
5.
J Proteome Res ; 16(11): 3969-3977, 2017 11 03.
Artigo em Inglês | MEDLINE | ID: mdl-28938071

RESUMO

In recent studies involving NAPPA microarrays, extra-well fluorescence is used as a key measure for identifying disease biomarkers because there is evidence to support that it is better correlated with strong antibody responses than statistical analysis involving intraspot intensity. Because this feature is not well quantified by traditional image analysis software, identification and quantification of extra-well fluorescence is performed manually, which is both time-consuming and highly susceptible to variation between raters. A system that could automate this task efficiently and effectively would greatly improve the process of data acquisition in microarray studies, thereby accelerating the discovery of disease biomarkers. In this study, we experimented with different machine learning methods, as well as novel heuristics, for identifying spots exhibiting extra-well fluorescence (rings) in microarray images and assigning each ring a grade of 1-5 based on its intensity and morphology. The sensitivity of our final system for identifying rings was found to be 72% at 99% specificity and 98% at 92% specificity. Our system performs this task significantly faster than a human, while maintaining high performance, and therefore represents a valuable tool for microarray image analysis.


Assuntos
Automação/métodos , Processamento de Imagem Assistida por Computador/métodos , Análise em Microsséries/métodos , Humanos , Reconhecimento Automatizado de Padrão , Sensibilidade e Especificidade
6.
Bioinformatics ; 31(12): i348-56, 2015 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-26072502

RESUMO

UNLABELLED: Diseases caused by zoonotic viruses (viruses transmittable between humans and animals) are a major threat to public health throughout the world. By studying virus migration and mutation patterns, the field of phylogeography provides a valuable tool for improving their surveillance. A key component in phylogeographic analysis of zoonotic viruses involves identifying the specific locations of relevant viral sequences. This is usually accomplished by querying public databases such as GenBank and examining the geospatial metadata in the record. When sufficient detail is not available, a logical next step is for the researcher to conduct a manual survey of the corresponding published articles. MOTIVATION: In this article, we present a system for detection and disambiguation of locations (toponym resolution) in full-text articles to automate the retrieval of sufficient metadata. Our system has been tested on a manually annotated corpus of journal articles related to phylogeography using integrated heuristics for location disambiguation including a distance heuristic, a population heuristic and a novel heuristic utilizing knowledge obtained from GenBank metadata (i.e. a 'metadata heuristic'). RESULTS: For detecting and disambiguating locations, our system performed best using the metadata heuristic (0.54 Precision, 0.89 Recall and 0.68 F-score). Precision reaches 0.88 when examining only the disambiguation of location names. Our error analysis showed that a noticeable increase in the accuracy of toponym resolution is possible by improving the geospatial location detection. By improving these fundamental automated tasks, our system can be a useful resource to phylogeographers that rely on geospatial metadata of GenBank sequences. .


Assuntos
Filogeografia/métodos , Vírus/genética , Bases de Dados de Ácidos Nucleicos , Análise de Sequência
7.
Virus Evol ; 5(1): vey043, 2019 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-30838129

RESUMO

Discrete phylogeography using software such as BEAST considers the sampling location of each taxon as fixed; often to a single location without uncertainty. When studying viruses, this implies that there is no possibility that the location of the infected host for that taxa is somewhere else. Here, we relaxed this strong assumption and allowed for analytic integration of uncertainty for discrete virus phylogeography. We used automatic language processing methods to find and assign uncertainty to alternative potential locations. We considered two influenza case studies: H5N1 in Egypt; H1N1 pdm09 in North America. For each, we implemented scenarios in which 25 per cent of the taxa had different amounts of sampling uncertainty including 10, 30, and 50 per cent uncertainty and varied how it was distributed for each taxon. This includes scenarios that: (i) placed a specific amount of uncertainty on one location while uniformly distributing the remaining amount across all other candidate locations (correspondingly labeled 10, 30, and 50); (ii) assigned the remaining uncertainty to just one other location; thus 'splitting' the uncertainty among two locations (i.e. 10/90, 30/70, and 50/50); and (iii) eliminated uncertainty via two predefined heuristic approaches: assignment to a centroid location (CNTR) or the largest population in the country (POP). We compared all scenarios to a reference standard (RS) in which all taxa had known (absolutely certain) locations. From this, we implemented five random selections of 25 per cent of the taxa and used these for specifying uncertainty. We performed posterior analyses for each scenario, including: (a) virus persistence, (b) migration rates, (c) trunk rewards, and (d) the posterior probability of the root state. The scenarios with sampling uncertainty were closer to the RS than CNTR and POP. For H5N1, the absolute error of virus persistence had a median range of 0.005-0.047 for scenarios with sampling uncertainty-(i) and (ii) above-versus a range of 0.063-0.075 for CNTR and POP. Persistence for the pdm09 case study followed a similar trend as did our analyses of migration rates across scenarios (i) and (ii). When considering the posterior probability of the root state, we found all but one of the H5N1 scenarios with sampling uncertainty had agreement with the RS on the origin of the outbreak whereas both CNTR and POP disagreed. Our results suggest that assigning geospatial uncertainty to taxa benefits estimation of virus phylogeography as compared to ad-hoc heuristics. We also found that, in general, there was limited difference in results regardless of how the sampling uncertainty was assigned; uniform distribution or split between two locations did not greatly impact posterior results. This framework is available in BEAST v.1.10. In future work, we will explore viruses beyond influenza. We will also develop a web interface for researchers to use our language processing methods to find and assign uncertainty to alternative potential locations for virus phylogeography.

8.
AMIA Jt Summits Transl Sci Proc ; 2017: 114-122, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28815119

RESUMO

The field of phylogeography allows researchers to model the spread and evolution of viral genetic sequences. Phylogeography plays a major role in infectious disease surveillance, viral epidemiology and vaccine design. When conducting viral phylogeographic studies, researchers require the location of the infected host of the virus, which is often present in public databases such as GenBank. However, the geographic metadata in most GenBank records is not precise enough for many phylogeographic studies; therefore, researchers often need to search the articles linked to the records for more information, which can be a tedious process. Here, we describe two approaches for automatically detecting geographic location mentions in articles pertaining to virus-related GenBank records: a supervised sequence labeling approach with innovative features and a distant-supervision approach with novel noise- reduction methods. Evaluated on a manually annotated gold standard, our supervised sequence labeling and distant supervision approaches attained F-scores of 0.81 and 0.66, respectively.

10.
J Am Med Inform Assoc ; 23(5): 934-41, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-26911818

RESUMO

OBJECTIVE: The metadata reflecting the location of the infected host (LOIH) of virus sequences in GenBank often lacks specificity. This work seeks to enhance this metadata by extracting more specific geographic information from related full-text articles and mapping them to their latitude/longitudes using knowledge derived from external geographical databases. MATERIALS AND METHODS: We developed a rule-based information extraction framework for linking GenBank records to the latitude/longitudes of the LOIH. Our system first extracts existing geospatial metadata from GenBank records and attempts to improve it by seeking additional, relevant geographic information from text and tables in related full-text PubMed Central articles. The final extracted locations of the records, based on data assimilated from these sources, are then disambiguated and mapped to their respective geo-coordinates. We evaluated our approach on a manually annotated dataset comprising of 5728 GenBank records for the influenza A virus. RESULTS: We found the precision, recall, and f-measure of our system for linking GenBank records to the latitude/longitudes of their LOIH to be 0.832, 0.967, and 0.894, respectively. DISCUSSION: Our system had a high level of accuracy for linking GenBank records to the geo-coordinates of the LOIH. However, it can be further improved by expanding our database of geospatial data, incorporating spell correction, and enhancing the rules used for extraction. CONCLUSION: Our system performs reasonably well for linking GenBank records for the influenza A virus to the geo-coordinates of their LOIH based on record metadata and information extracted from related full-text articles.


Assuntos
Mineração de Dados/métodos , Bases de Dados de Ácidos Nucleicos , Geografia Médica , Vírus da Influenza A , Metadados , Animais , Humanos , Influenza Humana/epidemiologia , Processamento de Linguagem Natural , Zoonoses/epidemiologia
11.
AMIA Jt Summits Transl Sci Proc ; 2014: 102-11, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25717409

RESUMO

Zoonotic viruses represent emerging or re-emerging pathogens that pose significant public health threats throughout the world. It is therefore crucial to advance current surveillance mechanisms for these viruses through outlets such as phylogeography. Despite the abundance of zoonotic viral sequence data in publicly available databases such as GenBank, phylogeographic analysis of these viruses is often limited by the lack of adequate geographic metadata. However, many GenBank records include references to articles with more detailed information and automated systems may help extract this information efficiently and effectively. In this paper, we describe our efforts to determine the proportion of GenBank records with "insufficient" geographic metadata for seven well-studied viruses. We also evaluate the performance of four different Named Entity Recognition (NER) systems for automatically extracting related entities using a manually created gold-standard.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA