Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 458
Filtrar
1.
PLoS One ; 19(5): e0299583, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38696410

RESUMEN

The mapping of metabolite-specific data to pathways within cellular metabolism is a major data analysis step needed for biochemical interpretation. A variety of machine learning approaches, particularly deep learning approaches, have been used to predict these metabolite-to-pathway mappings, utilizing a training dataset of known metabolite-to-pathway mappings. A few such training datasets have been derived from the Kyoto Encyclopedia of Genes and Genomes (KEGG). However, several prior published machine learning approaches utilized an erroneous KEGG-derived training dataset that used SMILES molecular representations strings (KEGG-SMILES dataset) and contained a sizable proportion (~26%) duplicate entries. The presence of so many duplicates taint the training and testing sets generated from k-fold cross-validation of the KEGG-SMILES dataset. Therefore, the k-fold cross-validation performance of the resulting machine learning models was grossly inflated by the erroneous presence of these duplicate entries. Here we describe and evaluate the KEGG-SMILES dataset so that others may avoid using it. We also identify the prior publications that utilized this erroneous KEGG-SMILES dataset so their machine learning results can be properly and critically evaluated. In addition, we demonstrate the reduction of model k-fold cross-validation (CV) performance after de-duplicating the KEGG-SMILES dataset. This is a cautionary tale about properly vetting prior published benchmark datasets before using them in machine learning approaches. We hope others will avoid similar mistakes.


Asunto(s)
Redes y Vías Metabólicas , Aprendizaje Automático Supervisado , Humanos , Conjuntos de Datos como Asunto
2.
Metabolites ; 14(5)2024 May 07.
Artículo en Inglés | MEDLINE | ID: mdl-38786743

RESUMEN

A major limitation of most metabolomics datasets is the sparsity of pathway annotations for detected metabolites. It is common for less than half of the identified metabolites in these datasets to have a known metabolic pathway involvement. Trying to address this limitation, machine learning models have been developed to predict the association of a metabolite with a "pathway category", as defined by a metabolic knowledge base like KEGG. Past models were implemented as a single binary classifier specific to a single pathway category, requiring a set of binary classifiers for generating the predictions for multiple pathway categories. This past approach multiplied the computational resources necessary for training while diluting the positive entries in the gold standard datasets needed for training. To address these limitations, we propose a generalization of the metabolic pathway prediction problem using a single binary classifier that accepts the features both representing a metabolite and representing a pathway category and then predicts whether the given metabolite is involved in the corresponding pathway category. We demonstrate that this metabolite-pathway features pair approach not only outperforms the combined performance of training separate binary classifiers but demonstrates an order of magnitude improvement in robustness: a Matthews correlation coefficient of 0.784 ± 0.013 versus 0.768 ± 0.154.

3.
J Cheminform ; 16(1): 54, 2024 May 14.
Artículo en Inglés | MEDLINE | ID: mdl-38741211

RESUMEN

This work presents a proposed extension to the International Union of Pure and Applied Chemistry (IUPAC) International Chemical Identifier (InChI) standard that allows the representation of isotopically-resolved chemical entities at varying levels of ambiguity in isotope location. This extension includes an improved interpretation of the current isotopic layer within the InChI standard and a new isotopologue layer specification for representing chemical intensities with ambiguous isotope localization. Both improvements support the unique isotopically-resolved chemical identification of features detected and measured in analytical instrumentation, specifically nuclear magnetic resonance and mass spectrometry. SCIENTIFIC CONTRIBUTION: This new extension to the InChI standard would enable improved annotation of analytical datasets characterizing chemical entities, supporting the FAIR (Findable, Accessible, Interoperable, and Reusable) guiding principles of data stewardship for chemical datasets, ultimately promoting Open Science in chemistry.

4.
bioRxiv ; 2024 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-38617261

RESUMEN

A major limitation of most metabolomics datasets is the sparsity of pathway annotations of detected metabolites. It is common for less than half of identified metabolites in these datasets to have known metabolic pathway involvement. Trying to address this limitation, machine learning models have been developed to predict the association of a metabolite with a "pathway category", as defined by one of the metabolic knowledgebases like the Kyoto Encyclopedia of Gene and Genomes. Most of these models are implemented as a single binary classifier specific to a single pathway category, requiring a set of binary classifiers for generating predictions for multiple pathway categories. This single binary classifier per pathway category approach both multiplies the computational resources necessary for training while diluting the positive entries in gold standard datasets needed for training. To address the limitations of training separate classifiers, we propose a generalization of the metabolic pathway prediction problem using a single binary classifier that accepts both features representing a metabolite and features representing a generic pathway category and then predicts whether the given metabolite is involved in the corresponding pathway category. We demonstrate that this metabolite-pathway features-pair approach is not only competitive with the combined performance of training separate binary classifiers, but it outperforms the previous benchmark models.

5.
Can Vet J ; 65(1): 29-32, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38164377

RESUMEN

A 4.6-year-old spayed female German shepherd dog was admitted to a specialty hospital emergency service upon referral for suspected gastrointestinal foreign body obstruction. Free abdominal fluid was collected, and results of cytologic evaluation were consistent with a septic abdomen. An abdominal barium study revealed free gas and intraperitoneal barium, along with an obstructive gas pattern within the small bowel. Ultrasonography revealed a full-thickness jejunal perforation. On exploratory laparotomy, the perforation was noted to be located mid-jejunum with no associated mass or foreign material. A resection and anastomosis were completed. Histopathologic evaluation of the affected jejunal tissue showed aberrant gastric glandular epithelium consistent with a gastric choristoma, or heterotopic gastric tissue. Key clinical message: Clinicians should consider gastric glandular choristoma as a differential diagnosis in cases of seemingly idiopathic small intestinal perforation with no known cause (i.e., foreign body penetration, neoplasia, NSAID use), and histopathologic evaluation should always be done to obtain a definitive diagnosis.


Perforation jéjunale et abdomen septique résultant d'un choristome chez un chien. Une chienne berger allemand stérilisée âgée de 4,6 ans a été admise dans le service d'urgence d'un hôpital spécialisé après avoir été référée pour une suspicion d'obstruction gastro-intestinale par un corps étranger. Du liquide abdominal libre a été prélevé et les résultats de l'évaluation cytologique étaient compatibles avec un abdomen septique. Un examen abdominal à l'aide de baryum a révélé du gaz libre et du baryum intrapéritonéal, ainsi qu'un patron de gaz obstructif dans l'intestin grêle. L'échographie a révélé une perforation sur toute l'épaisseur jéjunale. Lors d'une laparotomie exploratoire, il a été constaté que la perforation était située au milieu du jéjunum, sans masse ni corps étranger associé. Une résection et une anastomose ont été réalisées. L'évaluation histopathologique du tissu jéjunal affecté a montré un épithélium glandulaire gastrique aberrant compatible avec un choristome gastrique ou un tissu gastrique hétérotopique.Message clinique clé :Les cliniciens doivent considérer le choristome glandulaire gastrique comme diagnostic différentiel dans les cas de perforation de l'intestin grêle apparemment idiopathique sans cause connue (i.e. pénétration d'un corps étranger, néoplasie, utilisation d'AINS), et une évaluation histopathologique doit toujours être effectuée pour obtenir un diagnostic définitif.(Traduit par Dr Serge Messier).


Asunto(s)
Coristoma , Enfermedades de los Perros , Cuerpos Extraños , Perforación Intestinal , Gastropatías , Animales , Perros , Femenino , Perforación Intestinal/diagnóstico , Perforación Intestinal/cirugía , Perforación Intestinal/veterinaria , Coristoma/complicaciones , Coristoma/diagnóstico , Coristoma/cirugía , Coristoma/veterinaria , Bario , Abdomen , Gastropatías/veterinaria , Cuerpos Extraños/veterinaria , Enfermedades de los Perros/diagnóstico , Enfermedades de los Perros/cirugía
6.
Metabolites ; 13(12)2023 Dec 17.
Artículo en Inglés | MEDLINE | ID: mdl-38132881

RESUMEN

A major challenge to integrating public metabolic resources is the use of different nomenclatures by individual databases. This paper presents md_harmonize, an open-source Python package for harmonizing compounds and metabolic reactions across various metabolic databases. The md_harmonize package utilizes a neighborhood-specific graph coloring method for generating a unique identifier for each compound via atom identifiers based on a compound's chemical structure. The resulting harmonized compounds and reactions can be used for various downstream analyses, including the construction of atom-resolved metabolic networks and models for metabolic flux analysis. Parts of the md_harmonize package have been optimized using a variety of computational techniques to allow certain NP-complete problems handled by the software to be tractable for these specific use-cases. The software is available on GitHub and through the Python Package Index, with end-user documentation hosted on GitHub Pages.

7.
Metabolites ; 13(11)2023 Nov 01.
Artículo en Inglés | MEDLINE | ID: mdl-37999216

RESUMEN

Metabolic pathways are a human-defined grouping of life sustaining biochemical reactions, metabolites being both the reactants and products of these reactions. But many public datasets include identified metabolites whose pathway involvement is unknown, hindering metabolic interpretation. To address these shortcomings, various machine learning models, including those trained on data from the Kyoto Encyclopedia of Genes and Genomes (KEGG), have been developed to predict the pathway involvement of metabolites based on their chemical descriptions; however, these prior models are based on old metabolite KEGG-based datasets, including one benchmark dataset that is invalid due to the presence of over 1500 duplicate entries. Therefore, we have developed a new benchmark dataset derived from the KEGG following optimal standards of scientific computational reproducibility and including all source code needed to update the benchmark dataset as KEGG changes. We have used this new benchmark dataset with our atom coloring methodology to develop and compare the performance of Random Forest, XGBoost, and multilayer perceptron with autoencoder models generated from our new benchmark dataset. Best overall weighted average performance across 1000 unique folds was an F1 score of 0.8180 and a Matthews correlation coefficient of 0.7933, which was provided by XGBoost binary classification models for 11 KEGG-defined pathway categories.

8.
bioRxiv ; 2023 Oct 09.
Artículo en Inglés | MEDLINE | ID: mdl-37873272

RESUMEN

Metabolic pathways are a human-defined grouping of life sustaining biochemical reactions, metabolites being both the reactants and products of these reactions. But many public datasets include identified metabolites whose pathway involvement is unknown, hindering metabolic interpretation. To address these shortcomings, various machine learning models, including those trained on data from the Kyoto Encyclopedia of Genes and Genomes (KEGG), have been developed to predict the pathway involvement of metabolites based on their chemical descriptions; however, these prior models are based on old metabolite KEGG-based datasets, including one benchmark dataset that is invalid due to the presence of over 1500 duplicate entries. Therefore, we have developed a new benchmark dataset derived from the KEGG following optimal standards of scientific computational reproducibility and including all source code needed to update the benchmark dataset as KEGG changes. We have used this new benchmark dataset with our atom coloring methodology to develop and compare the performance of Random Forest, XGBoost, and multilayer perceptron with autoencoder models generated from our new benchmark dataset. Best overall weighted average performance across 1000 unique folds was an F1-score of 0.8180 and Matthews correlation coefficient of 0.7933, which was provided by XGBoost binary classification models for 11 KEGG-defined pathway categories.

9.
Mol Phylogenet Evol ; 188: 107911, 2023 11.
Artículo en Inglés | MEDLINE | ID: mdl-37648182

RESUMEN

Marine planktonic ciliates are largely oligotrichs and choreotrichs, which are two subclasses of the class Spirotrichea. The current phylogenetic assignments of oligotrichs and choreotrichs are inconsistent with previous results based on morphological features, probably hindered by the limited information from a single gene locus. Here we provide 53 new sequences from small subunit ribosomal RNA (SSU rDNA), ITS1-5.8S rDNA-ITS2, and large subunit ribosomal RNA (LSU rDNA) gene loci in 25 oligotrich and choreotrich species. We also predict RNA secondary structures for the ITS2 regions in 55 species, 48 species of which are reported for the first time. Based on these novel data, we make a more comprehensive phylogenetic reconstruction, revealing consistency between morphological taxonomy and an updated phylogenetic system for oligotrichs and choreotrichs. With the addition of data from ciliature patterns and genes, the phylogenetic analysis of the subclass Oligotrichia suggests three evolutionary trajectories, among which: 1) Novistrombidium asserts an ancestral ciliary pattern in Oligotrichia; 2) the subgenera division of Novistrombidium and Parallelostrombidium are fully supported; 3) the three families (Tontoniidae, Pelagostrombidiidae and Cyrtostrombidiidae) all evolved from the most diverse family Strombidiidae, which explains why strombidiids consistently form polyphyletic clades. In the subclass Choreotrichia, Strombidinopsis likely possesses an ancestral position to other choreotrichs, and both phylogenetic analysis and RNA secondary structure prediction support the hypothesis that tintinnids may have evolved from Strombidinopsis. The results presented here offer an updated hypothesis for the evolutionary history of oligotrichs and choreotrichs based on new evidence obtained by expanding sampling of molecular information across multiple gene loci.


Asunto(s)
Cilióforos , Humanos , Filogenia , Cilióforos/genética , ADN Ribosómico , ARN , ARN Ribosómico
10.
bioRxiv ; 2023 Jun 26.
Artículo en Inglés | MEDLINE | ID: mdl-37425965

RESUMEN

A key regulator of collective cell migration is prostaglandin (PG) signaling. However, it remains largely unclear whether PGs act within the migratory cells or their microenvironment to promote migration. Here we use Drosophila border cell migration as a model to uncover the cell-specific roles of two PGs in collective migration. Prior work shows PG signaling is required for on-time migration and cluster cohesion. We find that the PGE2 synthase cPGES is required in the substrate, while the PGF2α synthase Akr1B is required in the border cells for on-time migration. Akr1B acts in both the border cells and their substrate to regulate cluster cohesion. One means by which Akr1B regulates border cell migration is by promoting integrin-based adhesions. Additionally, Akr1B limits myosin activity, and thereby cellular stiffness, in the border cells, whereas cPGES limits myosin activity in both the border cells and their substrate. Together these data reveal that two PGs, PGE2 and PGF2α, produced in different locations, play key roles in promoting border cell migration. These PGs likely have similar migratory versus microenvironment roles in other collective cell migrations.

11.
BMC Bioinformatics ; 24(1): 299, 2023 Jul 24.
Artículo en Inglés | MEDLINE | ID: mdl-37482620

RESUMEN

BACKGROUND: An updated version of the mwtab Python package for programmatic access to the Metabolomics Workbench (MetabolomicsWB) data repository was released at the beginning of 2021. Along with updating the package to match the changes to MetabolomicsWB's 'mwTab' file format specification and enhancing the package's functionality, the included validation facilities were used to detect and catalog file inconsistencies and errors across all publicly available datasets in MetabolomicsWB. RESULTS: The MetabolomicsWB File Status website was developed to provide continuous validation of MetabolomicsWB data files and a useful interface to all found inconsistencies and errors. This list of detectable issues/errors include format parsing errors, format compliance issues, access problems via MetabolomicsWB's REST interface, and other small inconsistencies that can hinder reusability. The website uses the mwtab Python package to pull down and validate each available analysis file and then generates an html report. The website is updated on a weekly basis. Moreover, the Python website design utilizes GitHub and GitHub.io, providing an easy to replicate template for implementing other metadata, virtual, and meta- repositories. CONCLUSIONS: The MetabolomicsWB File Status website provides a metadata repository of validation metadata to promote the FAIR use of existing metabolomics datasets from the MetabolomicsWB data repository.


Asunto(s)
Metadatos , Programas Informáticos , Metabolómica , Almacenamiento y Recuperación de la Información
12.
Metabolites ; 13(7)2023 Jul 12.
Artículo en Inglés | MEDLINE | ID: mdl-37512549

RESUMEN

In recent years, the FAIR guiding principles and the broader concept of open science has grown in importance in academic research, especially as funding entities have aggressively promoted public sharing of research products. Key to public research sharing is deposition of datasets into online data repositories, but it can be a chore to transform messy unstructured data into the forms required by these repositories. To help generate Metabolomics Workbench depositions, we have developed the MESSES (Metadata from Experimental SpreadSheets Extraction System) software package, implemented in the Python 3 programming language and supported on Linux, Windows, and Mac operating systems. MESSES helps transform tabular data from multiple sources into a Metabolomics Workbench specific deposition format. The package provides three commands, extract, validate, and convert, that implement a natural data transformation workflow. Moreover, MESSES facilitates richer metadata capture than is typically attempted by manual efforts. The source code and extensive documentation is hosted on GitHub and is also available on the Python Package Index for easy installation.

13.
Eur J Protistol ; 90: 126003, 2023 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-37453202

RESUMEN

Ciliates belonging to the class Plagiopylea are obligate anaerobes that are often neglected due to their cryptic lifestyles, difficulty of observation, and overall under-sampling. Here, we investigate two species, namely Trimyema claviforme Kahl, 1933 and Plagiopyla nasuta Stein, 1860, collected in China from marine and freshwater anaerobic sediments, respectively. A complete morphological dataset, together with SSU rRNA gene sequence data were obtained and used to diagnose the species. No molecular sequencing had ever been performed on Trimyema claviforme, with its ciliature also previously unknown. Based on these novel data presented here, the ciliate is characterized by a claviform cell shape, with a size of 35-45 × 10-20 µm in vivo, 28-39 longitudinal somatic ciliary rows forming five ciliary girdles (four complete girdles and a shorter one), two dikinetids left to anterior end of oral kinety 1, and an epaulet. A Chinese population of the well-known ciliate P. nasuta was investigated, and morphological comparisons revealed phenotypic stability of the species. The phylogenetic analyses supported previous findings about the monophyly of the families Trimyemidae and Plagiopylidae, with Trimyema claviforme branching off early in the genus Trimyema. The Chinese population of P. nasuta clusters together with two other populations with full support corroborating their conspecificity.


Asunto(s)
Cilióforos , China , Cilióforos/genética , Agua Dulce/microbiología , Filogenia , Agua de Mar/microbiología
14.
Protist ; 174(4): 125975, 2023 08.
Artículo en Inglés | MEDLINE | ID: mdl-37453254

RESUMEN

Ciliates in the order Pleurostomatida are found free-living in many habitats including within biofilms, but some (e.g. Pseudoamphileptus spp.) are ectocommensal on various hosts. Due to issues involving overall undersampling, the exact diversity and molecular phylogeny of this group remain largely underexplored. To combat this deficiency, detailed investigations were undertaken in northern China. As a result of these studies, we provide the morphological descriptions of two new species. Pseudoamphileptus apomacrostoma sp. nov., a new ectocommensal species, is characterized by the broadly oval cell shape, numerous scattered contractile vacuoles, and unique densely bounded extrusomes; Amphileptus qingdaoensis sp. nov., a marine form, is characterized by possessing oblong extrusomes with a conical anterior end, a single contractile vacuole and 5-7 left and 18-23 right kineties. In addition, a new population of Amphileptus orientalis Zhang et al., 2022, a freshwater representative, was documented and an improved diagnosis is provided. The phylogenetic analyses based on the SSU rDNA sequences imply that the genus Pseudoamphileptus is monophyletic whereas the genus Amphileptus is paraphyletic. The new molecular sequences presented here further support the establishment of two new species.


Asunto(s)
Cilióforos , Filogenia , ADN Ribosómico/genética , China , Agua Dulce
15.
Sci Data ; 10(1): 388, 2023 06 16.
Artículo en Inglés | MEDLINE | ID: mdl-37328532

RESUMEN

Exposure to per- and polyfluoroalkyl substances (PFAS) in drinking water is widely recognized as a public health concern. Decision-makers who are responsible for managing PFAS drinking water risks lack the tools to acquire the information they need. In response to this need, we provide a detailed description of a Kentucky dataset that allows decision-makers to visualize potential hot-spot areas and evaluate drinking water systems that may be susceptible to PFAS contamination. The dataset includes information extracted from publicly available sources to create five different maps in ArcGIS Online and highlights potential sources of PFAS contamination in the environment in relation to drinking water systems. As datasets of PFAS drinking water sampling continue to grow as part of evolving regulatory requirements, we used this Kentucky dataset as an example to promote the reuse of this dataset and others like it. We incorporated the FAIR (Findable, Accessible, Interoperable, and Reusable) principles by creating a Figshare item that includes all data and associated metadata with these five ArcGIS maps.


Asunto(s)
Agua Potable , Fluorocarburos , Contaminantes Químicos del Agua , Agua Potable/análisis , Contaminantes Químicos del Agua/análisis , Fluorocarburos/análisis , Salud Pública , Secuencia de Bases
16.
Sci Data ; 10(1): 389, 2023 06 16.
Artículo en Inglés | MEDLINE | ID: mdl-37328607

RESUMEN

We present a draft Minimum Information About Geospatial Information System (MIAGIS) standard for facilitating public deposition of geospatial information system (GIS) datasets that follows the FAIR (Findable, Accessible, Interoperable and Reusable) principles. The draft MIAGIS standard includes a deposition directory structure and a minimum javascript object notation (JSON) metadata formatted file that is designed to capture critical metadata describing GIS layers and maps as well as their sources of data and methods of generation. The associated miagis Python package facilitates the creation of this MIAGIS metadata file and directly supports metadata extraction from both Esri JSON and GEOJSON GIS data formats plus options for extraction from user-specified JSON formats. We also demonstrate their use in crafting two example depositions of ArcGIS generated maps. We hope this draft MIAGIS standard along with the supporting miagis Python package will assist in establishing a GIS standards group that will develop the draft into a full standard for the wider GIS community as well as a future public repository for GIS datasets.


Asunto(s)
Sistemas de Información , Metadatos
17.
Mar Life Sci Technol ; 5(2): 178-195, 2023 May.
Artículo en Inglés | MEDLINE | ID: mdl-37275546

RESUMEN

Ciliates in the class Prostomatea play an important role in the global microbial loop due to their significant abundances and broad feeding strategies at the foundation of food webs. Despite their importance in ecosystems, the taxonomy and systematics of this group of ciliates has long been poorly understood, with this being especially true for members of the family Lagynusidae. Here we examine four lagynusids collected from sandy beaches in China, using silver-staining and 18S rRNA gene sequencing techniques. These investigations revealed two new genera and two new species and provided details for two little known forms: Penardella marina gen. nov., sp. nov., Apolagynus cucumis (as reported by Penard. Études sur les infusoires d'eau douce. Georg and Cie, Genève, 1922) gen. nov., comb. nov., Lagynus minutus sp. nov., and Lagynus elegans (Engelmann in Z Wiss Zool 11:347-393, 1862) Quennerstedt (Acta Univ Lund 4:1-48, 1867). Penardella gen. nov. can be morphologically distinguished by having more than three dikinetidal perioral kineties. Apolagynus gen. nov. differs from the closely related genus Lagynus in the absence of a conspicuous neck-like region. The ciliature of Apolagynus cucumis is revealed here for the first time, which demonstrates the classification of this species within Lagynusidae. Furthermore, Apolagynus binucleatus (Jiang et al., 2021) comb. nov. is established according to the new finding. The results of our phylogenetic analyses based on the 18S rRNA gene support the establishment of two new genera and indicate that Lagynusidae is monophyletic, which further strengthens its valid taxonomic status.

18.
Environ Health Perspect ; 131(6): 65001, 2023 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-37352010

RESUMEN

BACKGROUND: Funding agencies, publishers, and other stakeholders are pushing environmental health science investigators to improve data sharing; to promote the findable, accessible, interoperable, and reusable (FAIR) principles; and to increase the rigor and reproducibility of the data collected. Accomplishing these goals will require significant cultural shifts surrounding data management and strategies to develop robust and reliable resources that bridge the technical challenges and gaps in expertise. OBJECTIVE: In this commentary, we examine the current state of managing data and metadata-referred to collectively as (meta)data-in the experimental environmental health sciences. We introduce new tools and resources based on in vivo experiments to serve as examples for the broader field. METHODS: We discuss previous and ongoing efforts to improve (meta)data collection and curation. These include global efforts by the Functional Genomics Data Society to develop metadata collection tools such as the Investigation, Study, Assay (ISA) framework, and the Center for Expanded Data Annotation and Retrieval. We also conduct a case study of in vivo data deposited in the Gene Expression Omnibus that demonstrates the current state of in vivo environmental health data and highlights the value of using the tools we propose to support data deposition. DISCUSSION: The environmental health science community has played a key role in efforts to achieve the goals of the FAIR guiding principles and is well positioned to advance them further. We present a proposed framework to further promote these objectives and minimize the obstacles between data producers and data scientists to maximize the return on research investments. https://doi.org/10.1289/EHP11484.


Asunto(s)
Salud Ambiental , Genómica , Reproducibilidad de los Resultados , Difusión de la Información , Metadatos
19.
Sci Rep ; 13(1): 6709, 2023 04 25.
Artículo en Inglés | MEDLINE | ID: mdl-37185591

RESUMEN

Particle therapy (PT) used for cancer treatment can spare healthy tissue and reduce treatment toxicity. However, full exploitation of the dosimetric advantages of PT is not yet possible due to range uncertainties, warranting development of range-monitoring techniques. This study proposes a novel range-monitoring technique introducing the yet unexplored concept of simultaneous detection and imaging of fast neutrons and prompt-gamma rays produced in beam-tissue interactions. A quasi-monolithic organic detector array is proposed, and its feasibility for detecting range shifts in the context of proton therapy is explored through Monte Carlo simulations of realistic patient models and detector resolution effects. The results indicate that range shifts of [Formula: see text] can be detected at relatively low proton intensities ([Formula: see text] protons/spot) when spatial information obtained through imaging of both particle species are used simultaneously. This study lays the foundation for multi-particle detection and imaging systems in the context of range verification in PT.


Asunto(s)
Terapia de Protones , Humanos , Terapia de Protones/métodos , Diagnóstico por Imagen , Protones , Rayos gamma , Dosificación Radioterapéutica , Método de Montecarlo , Fantasmas de Imagen
20.
BMC Bioinformatics ; 24(1): 78, 2023 Mar 04.
Artículo en Inglés | MEDLINE | ID: mdl-36870946

RESUMEN

BACKGROUND: The Kyoto Encyclopedia of Genes and Genomes (KEGG) provides organized genomic, biomolecular, and metabolic information and knowledge that is reasonably current and highly useful for a wide range of analyses and modeling. KEGG follows the principles of data stewardship to be findable, accessible, interoperable, and reusable (FAIR) by providing RESTful access to their database entries via their web-accessible KEGG API. However, the overall FAIRness of KEGG is often limited by the library and software package support available in a given programming language. While R library support for KEGG is fairly strong, Python library support has been lacking. Moreover, there is no software that provides extensive command line level support for KEGG access and utilization. RESULTS: We present kegg_pull, a package implemented in the Python programming language that provides better KEGG access and utilization functionality than previous libraries and software packages. Not only does kegg_pull include an application programming interface (API) for Python programming, it also provides a command line interface (CLI) that enables utilization of KEGG for a wide range of shell scripting and data analysis pipeline use-cases. As kegg_pull's name implies, both the API and CLI provide versatile options for pulling (downloading and saving) an arbitrary (user defined) number of database entries from the KEGG API. Moreover, this functionality is implemented to efficiently utilize multiple central processing unit cores as demonstrated in several performance tests. Many options are provided to optimize fault-tolerant performance across a single or multiple processes, with recommendations provided based on extensive testing and practical network considerations. CONCLUSIONS: The new kegg_pull package enables new flexible KEGG retrieval use cases not available in previous software packages. The most notable new feature that kegg_pull provides is its ability to robustly pull an arbitrary number of KEGG entries with a single API method or CLI command, including pulling an entire KEGG database. We provide recommendations to users for the most effective use of kegg_pull according to their network and computational circumstances.


Asunto(s)
Análisis de Datos , Genómica , Biblioteca de Genes , Bases de Datos Factuales , Conocimiento
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...