Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 129.732
Filtrar
1.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38980375

RESUMEN

Structural variation (SV) is an important form of genomic variation that influences gene function and expression by altering the structure of the genome. Although long-read data have been proven to better characterize SVs, SVs detected from noisy long-read data still include a considerable portion of false-positive calls. To accurately detect SVs in long-read data, we present SVDF, a method that employs a learning-based noise filtering strategy and an SV signature-adaptive clustering algorithm, for effectively reducing the likelihood of false-positive events. Benchmarking results from multiple orthogonal experiments demonstrate that, across different sequencing platforms and depths, SVDF achieves higher calling accuracy for each sample compared to several existing general SV calling tools. We believe that, with its meticulous and sensitive SV detection capability, SVDF can bring new opportunities and advancements to cutting-edge genomic research.


Asunto(s)
Algoritmos , Humanos , Análisis de Secuencia de ADN/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Genómica/métodos , Variación Estructural del Genoma , Programas Informáticos
2.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38966948

RESUMEN

Variants in cis-regulatory elements link the noncoding genome to human pathology; however, detailed analytic tools for understanding the association between cell-level brain pathology and noncoding variants are lacking. CWAS-Plus, adapted from a Python package for category-wide association testing (CWAS), enhances noncoding variant analysis by integrating both whole-genome sequencing (WGS) and user-provided functional data. With simplified parameter settings and an efficient multiple testing correction method, CWAS-Plus conducts the CWAS workflow 50 times faster than CWAS, making it more accessible and user-friendly for researchers. Here, we used a single-nuclei assay for transposase-accessible chromatin with sequencing to facilitate CWAS-guided noncoding variant analysis at cell-type-specific enhancers and promoters. Examining autism spectrum disorder WGS data (n = 7280), CWAS-Plus identified noncoding de novo variant associations in transcription factor binding sites within conserved loci. Independently, in Alzheimer's disease WGS data (n = 1087), CWAS-Plus detected rare noncoding variant associations in microglia-specific regulatory elements. These findings highlight CWAS-Plus's utility in genomic disorders and scalability for processing large-scale WGS data and in multiple-testing corrections. CWAS-Plus and its user manual are available at https://github.com/joonan-lab/cwas/ and https://cwas-plus.readthedocs.io/en/latest/, respectively.


Asunto(s)
Secuenciación Completa del Genoma , Humanos , Secuenciación Completa del Genoma/métodos , Enfermedad de Alzheimer/genética , Estudio de Asociación del Genoma Completo/métodos , Trastorno del Espectro Autista/genética , Variación Genética , Programas Informáticos , Cromatina/genética , Cromatina/metabolismo , Genoma Humano
3.
Sci Rep ; 14(1): 15811, 2024 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-38982221

RESUMEN

The Microsoft Kinect depth sensor, with its built-in software that automatically captures joint coordinates without markers, could be a potential tool for ergonomic studies. This study investigates the performance of Kinect in limb segment lengths using dual-energy X-ray absorptiometry (DXA) as a reference. Healthy children and adults (n = 76) were recruited for limb length measurements by Kinect and DXA. The results showed consistent ratios of arm, forearm, thigh, and leg lengths to height, which were 0.16, 0.14, 0.23, and 0.22 respectively, for both age groups and methods. Kinect exhibited perfect correlation among all limb lengths, indicating fixed proportions assumed by its algorithm. Comparing the two methods, there was a strong correlation (R = 0.850-0.985) and good to excellent agreement (ICC = 0.829-0.977), except for the right leg in adults, where agreement was slightly lower but still moderate (ICC = 0.712). The measurement bias between the methods ranged from - 1.455 to 0.536 cm. In conclusion, Kinect yields outcomes similar to DXA, indicating its potential utility as a tool for ergonomic studies. However, the built-in algorithm of Kinect assumes fixed limb proportions for individuals, which may not be ideal for studies focusing on investigating limb discrepancies or anatomical differences.


Asunto(s)
Absorciometría de Fotón , Humanos , Adulto , Masculino , Niño , Femenino , Absorciometría de Fotón/métodos , Adulto Joven , Algoritmos , Programas Informáticos , Adolescente , Persona de Mediana Edad , Antropometría/métodos
4.
Commun Biol ; 7(1): 834, 2024 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-38982263

RESUMEN

Chromatin spatial organization plays a crucial role in gene regulation. Recently developed and prospering multiplexed DNA FISH technologies enable direct visualization of chromatin conformation in the nucleus. However, incomplete data caused by limited detection efficiency can substantially complicate and impair downstream analysis. Here, we present SnapFISH-IMPUTE that imputes missing values in multiplexed DNA FISH data. Analysis on multiple published datasets shows that the proposed method preserves the distribution of pairwise distances between imaging loci, and the imputed chromatin conformations are indistinguishable from the observed conformations. Additionally, imputation greatly improves downstream analyses such as identifying enhancer-promoter loops and clustering cells into distinct cell types. SnapFISH-IMPUTE is freely available at https://github.com/hyuyu104/SnapFISH-IMPUTE .


Asunto(s)
Cromatina , ADN , Hibridación Fluorescente in Situ , Hibridación Fluorescente in Situ/métodos , Cromatina/genética , ADN/genética , Humanos , Animales , Programas Informáticos
5.
BMC Bioinformatics ; 25(1): 233, 2024 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-38982375

RESUMEN

BACKGROUND: Structural variations play an important role in bacterial genomes. They can mediate genome adaptation quickly in response to the external environment and thus can also play a role in antibiotic resistance. The detection of structural variations in bacteria is challenging, and the recognition of even small rearrangements can be important. Even though most detection tools are aimed at and benchmarked on eukaryotic genomes, they can also be used on prokaryotic genomes. The key features of detection are the ability to detect small rearrangements and support haploid genomes. Because of the limiting performance of a single detection tool, combining the detection abilities of multiple tools can lead to more robust results. There are already available workflows for structural variation detection for long-reads technologies and for the detection of single-nucleotide variation and indels, both aimed at bacteria. Yet we are unaware of structural variations detection workflows for the short-reads sequencing platform. Motivated by this gap we created our workflow. Further, we were interested in increasing the detection performance and providing more robust results. RESULTS: We developed an open-source bioinformatics pipeline, ProcaryaSV, for the detection of structural variations in bacterial isolates from paired-end short sequencing reads. Multiple tools, starting with quality control and trimming of sequencing data, alignment to the reference genome, and multiple structural variation detection tools, are integrated. All the partial results are then processed and merged with an in-house merging algorithm. Compared with a single detection approach, ProcaryaSV has improved detection performance and is a reproducible easy-to-use tool. CONCLUSIONS: The ProcaryaSV pipeline provides an integrative approach to structural variation detection from paired-end next-generation sequencing of bacterial samples. It can be easily installed and used on Linux machines. It is publicly available on GitHub at https://github.com/robinjugas/ProcaryaSV .


Asunto(s)
Genoma Bacteriano , Secuenciación de Nucleótidos de Alto Rendimiento , Programas Informáticos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Bacterias/genética
6.
BMC Bioinformatics ; 25(1): 232, 2024 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-38982382

RESUMEN

BACKGROUND: Characterization of microbial growth is of both fundamental and applied interest. Modern platforms can automate collection of high-throughput microbial growth curves, necessitating the development of computational tools to handle and analyze these data to produce insights. RESULTS: To address this need, here I present a newly-developed R package: gcplyr. gcplyr can flexibly import growth curve data in common tabular formats, and reshapes it under a tidy framework that is flexible and extendable, enabling users to design custom analyses or plot data with popular visualization packages. gcplyr can also incorporate metadata and generate or import experimental designs to merge with data. Finally, gcplyr carries out model-free (non-parametric) analyses. These analyses do not require mathematical assumptions about microbial growth dynamics, and gcplyr is able to extract a broad range of important traits, including growth rate, doubling time, lag time, maximum density and carrying capacity, diauxie, area under the curve, extinction time, and more. CONCLUSIONS: gcplyr makes scripted analyses of growth curve data in R straightforward, streamlines common data wrangling and analysis steps, and easily integrates with common visualization and statistical analyses.


Asunto(s)
Programas Informáticos , Biología Computacional/métodos , Análisis de Datos
7.
BMC Oral Health ; 24(1): 770, 2024 Jul 10.
Artículo en Inglés | MEDLINE | ID: mdl-38982396

RESUMEN

BACKGROUND: High precision intra-oral scans, coupled with advanced software, enable virtual bracket removal (VBR) from digital models. VBR allows the delivery of retainers and clear aligners promptly following debonding, thus reducing the patients' appointments and minimizing the likelihood of tooth movement. The objective of this study was to compare the enamel surface before bonding and after VBR using three different Computer-aided design (CAD) software and to compare their accuracy. METHODS: Maxillary scans of 20 participants starting orthodontic treatment were selected for inclusion in the study, who exhibited mild to moderate crowding and required bonding of brackets on the labial surface of permanent maxillary teeth (from the maxillary left first molar to the maxillary right first molar). Two intra-oral scans were conducted on the same day, before bonding and immediately after bonding using CEREC Omnicam (Sirona Dental Systems, Bensheim, Germany). The virtual removal of the brackets from the post-bonding models was performed using OrthoAnalyzer (3Shape, Copenhagen, Denmark), Meshmixer (Autodesk, San Rafael, Calif, USA), and EasyRx (LLC, Atlanta, GA, USA) software. The models that underwent VBR were superimposed on the pre-bonding models by Medit Link App (Medit, Seoul, South Korea) using surface-based registration. The changes in the enamel surface following VBR using the three software packages were quantified using the Medit Link App. RESULTS: There was a significant difference among the 3Shape, Meshmixer, and EasyRx software in tooth surface change following VBR. Specifically, EasyRx exhibited lower levels of accuracy compared to the other two VBR software programs (p<.001, p<.001). A significant difference in enamel surface change was observed between tooth segments across all software groups, in both incisors and molars, with VBR of the molars exhibiting the lowest level of accuracy (3Shape p=.002, Meshmixer p<.001, EasyRx p<.001). Regarding the direction of tooth surface changes following VBR, it was observed that all three groups exhibited a significant increase in the percentage of inadequate bracket removal across all teeth segments. CONCLUSIONS: 3Shape and Meshmixer manual VBR software were found to be more accurate than EasyRx automated software, however, the differences were minimal and clinically insignificant.


Asunto(s)
Soportes Ortodóncicos , Programas Informáticos , Humanos , Diseño Asistido por Computadora , Desconsolidación Dental/métodos , Femenino , Adolescente , Masculino , Modelos Dentales , Esmalte Dental , Maloclusión/terapia , Recubrimiento Dental Adhesivo/métodos
8.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38982642

RESUMEN

Inferring cell type proportions from bulk transcriptome data is crucial in immunology and oncology. Here, we introduce guided LDA deconvolution (GLDADec), a bulk deconvolution method that guides topics using cell type-specific marker gene names to estimate topic distributions for each sample. Through benchmarking using blood-derived datasets, we demonstrate its high estimation performance and robustness. Moreover, we apply GLDADec to heterogeneous tissue bulk data and perform comprehensive cell type analysis in a data-driven manner. We show that GLDADec outperforms existing methods in estimation performance and evaluate its biological interpretability by examining enrichment of biological processes for topics. Finally, we apply GLDADec to The Cancer Genome Atlas tumor samples, enabling subtype stratification and survival analysis based on estimated cell type proportions, thus proving its practical utility in clinical settings. This approach, utilizing marker gene names as partial prior information, can be applied to various scenarios for bulk data deconvolution. GLDADec is available as an open-source Python package at https://github.com/mizuno-group/GLDADec.


Asunto(s)
Programas Informáticos , Humanos , Perfilación de la Expresión Génica/métodos , Algoritmos , Transcriptoma , Biología Computacional/métodos , Neoplasias/genética , Biomarcadores de Tumor/genética , Marcadores Genéticos
9.
F1000Res ; 13: 556, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38984017

RESUMEN

Background: Determining the appropriate computational requirements and software performance is essential for efficient genomic surveillance. The lack of standardized benchmarking complicates software selection, especially with limited resources. Methods: We developed a containerized benchmarking pipeline to evaluate seven long-read assemblers-Canu, GoldRush, MetaFlye, Strainline, HaploDMF, iGDA, and RVHaplo-for viral haplotype reconstruction, using both simulated and experimental Oxford Nanopore sequencing data of HIV-1 and other viruses. Benchmarking was conducted on three computational systems to assess each assembler's performance, utilizing QUAST and BLASTN for quality assessment. Results: Our findings show that assembler choice significantly impacts assembly time, with CPU and memory usage having minimal effect. Assembler selection also influences the size of the contigs, with a minimum read length of 2,000 nucleotides required for quality assembly. A 4,000-nucleotide read length improves quality further. Canu was efficient among de novo assemblers but not suitable for multi-strain mixtures, while GoldRush produced only consensus assemblies. Strainline and MetaFlye were suitable for metagenomic sequencing data, with Strainline requiring high memory and MetaFlye operable on low-specification machines. Among reference-based assemblers, iGDA had high error rates, RVHaplo showed the best runtime and accuracy but became ineffective with similar sequences, and HaploDMF, utilizing machine learning, had fewer errors with a slightly longer runtime. Conclusions: The HIV-64148 pipeline, containerized using Docker, facilitates easy deployment and offers flexibility to select from a range of assemblers to match computational systems or study requirements. This tool aids in genome assembly and provides valuable information on HIV-1 sequences, enhancing viral evolution monitoring and understanding.


Asunto(s)
Biología Computacional , Genómica , VIH-1 , Programas Informáticos , VIH-1/genética , Biología Computacional/métodos , Genómica/métodos , Humanos , Genoma Viral/genética
10.
Bioinformatics ; 40(7)2024 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-38960860

RESUMEN

MOTIVATION: The increasing availability of complete genomes demands for models to study genomic variability within entire populations. Pangenome graphs capture the full genomic similarity and diversity between multiple genomes. In order to understand them, we need to see them. For visualization, we need a human-readable graph layout: a graph embedding in low (e.g. two) dimensional depictions. Due to a pangenome graph's potential excessive size, this is a significant challenge. RESULTS: In response, we introduce a novel graph layout algorithm: the Path-Guided Stochastic Gradient Descent (PG-SGD). PG-SGD uses the genomes, represented in the pangenome graph as paths, as an embedded positional system to sample genomic distances between pairs of nodes. This avoids the quadratic cost seen in previous versions of graph drawing by SGD. We show that our implementation efficiently computes the low-dimensional layouts of gigabase-scale pangenome graphs, unveiling their biological features. AVAILABILITY AND IMPLEMENTATION: We integrated PG-SGD in ODGI which is released as free software under the MIT open source license. Source code is available at https://github.com/pangenome/odgi.


Asunto(s)
Algoritmos , Programas Informáticos , Humanos , Genómica/métodos , Gráficos por Computador , Genoma
11.
Physiol Plant ; 176(4): e14407, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38973613

RESUMEN

Despite the abundance of species with transcriptomic data, a significant number of species still lack sequenced genomes, making it difficult to study gene function and expression in these organisms. While de novo transcriptome assembly can be used to assemble protein-coding transcripts from RNA-sequencing (RNA-seq) data, the datasets used often only feature samples of arbitrarily selected or similar experimental conditions, which might fail to capture condition-specific transcripts. We developed the Large-Scale Transcriptome Assembly Pipeline for de novo assembled transcripts (LSTrAP-denovo) to automatically generate transcriptome atlases of eukaryotic species. Specifically, given an NCBI TaxID, LSTrAP-denovo can (1) filter undesirable RNA-seq accessions based on read data, (2) select RNA-seq accessions via unsupervised machine learning to construct a sample-balanced dataset for download, (3) assemble transcripts via over-assembly, (4) functionally annotate coding sequences (CDS) from assembled transcripts and (5) generate transcriptome atlases in the form of expression matrices for downstream transcriptomic analyses. LSTrAP-denovo is easy to implement, written in Python, and is freely available at https://github.com/pengkenlim/LSTrAP-denovo/.


Asunto(s)
Eucariontes , Transcriptoma , Transcriptoma/genética , Eucariontes/genética , Programas Informáticos , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia de ARN/métodos
12.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38975895

RESUMEN

Spatial transcriptomics provides valuable insights into gene expression within the native tissue context, effectively merging molecular data with spatial information to uncover intricate cellular relationships and tissue organizations. In this context, deciphering cellular spatial domains becomes essential for revealing complex cellular dynamics and tissue structures. However, current methods encounter challenges in seamlessly integrating gene expression data with spatial information, resulting in less informative representations of spots and suboptimal accuracy in spatial domain identification. We introduce stCluster, a novel method that integrates graph contrastive learning with multi-task learning to refine informative representations for spatial transcriptomic data, consequently improving spatial domain identification. stCluster first leverages graph contrastive learning technology to obtain discriminative representations capable of recognizing spatially coherent patterns. Through jointly optimizing multiple tasks, stCluster further fine-tunes the representations to be able to capture complex relationships between gene expression and spatial organization. Benchmarked against six state-of-the-art methods, the experimental results reveal its proficiency in accurately identifying complex spatial domains across various datasets and platforms, spanning tissue, organ, and embryo levels. Moreover, stCluster can effectively denoise the spatial gene expression patterns and enhance the spatial trajectory inference. The source code of stCluster is freely available at https://github.com/hannshu/stCluster.


Asunto(s)
Perfilación de la Expresión Génica , Transcriptoma , Perfilación de la Expresión Génica/métodos , Biología Computacional/métodos , Algoritmos , Humanos , Animales , Programas Informáticos , Aprendizaje Automático
13.
BMC Public Health ; 24(1): 1802, 2024 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-38971769

RESUMEN

BACKGROUND: Loneliness is a serious public health concern. Although previous interventions have had some success in mitigating loneliness, the field is in search of novel, more effective, and more scalable solutions. Here, we focus on "relational agents", a form of software agents that are increasingly powered by artificial intelligence and large language models (LLMs). We report on a systematic review and meta-analysis to investigate the impact of relational agents on loneliness across age groups. METHODS: In this systematic review and meta-analysis, we searched 11 databases including Ovid MEDLINE and Embase from inception to Sep 16, 2022. We included randomised controlled trials and non-randomised studies of interventions published in English across all age groups. These loneliness interventions, typically attempt to improve social skills, social support, social interaction, and maladaptive cognitions. Peer-reviewed journal articles, books, book chapters, Master's and PhD theses, or conference papers were eligible for inclusion. Two reviewers independently screened studies, extracted data, and assessed risk of bias via the RoB 2 and ROBINS-I tools. We calculated pooled estimates of Hedge's g in a random-effects meta-analysis and conducted sensitivity and sub-group analyses. We evaluated publication bias via funnel plots, Egger's test, and a trim-and-fill algorithm. FINDINGS: Our search identified 3,935 records of which 14 met eligibility criteria and were included in our meta-analysis. Included studies comprised 286 participants with individual study sample sizes ranging from 4 to 42 participants (x̄ = 20.43, s = 11.58, x̃ = 20). We used a Bonferroni correction with αBonferroni = 0.05 / 4 = 0.0125 and applied Knapp-Hartung adjustments. Relational agents reduced loneliness significantly at an adjusted αBonferroni (g = -0.552; 95% Knapp-Hartung CI, -0.877 to -0.226; P = 0.003), which corresponds to a moderate reduction in loneliness. CONCLUSION: Our results are currently the most comprehensive of their kind and provide promising evidence for the efficacy of relational agents. Relational agents are a promising technology that can alleviate loneliness in a scalable way and that can be a meaningful complement to other approaches. The advent of LLMs should boost their efficacy, and further research is needed to explore the optimal design and use of relational agents. Future research could also address shortcomings of current results, such as small sample sizes and high risk of bias. Particularly young audiences have been overlooked in past research.


Asunto(s)
Soledad , Adulto , Anciano , Humanos , Factores de Edad , Inteligencia Artificial , Soledad/psicología , Programas Informáticos , Adulto Joven , Persona de Mediana Edad , Anciano de 80 o más Años
14.
Sci Rep ; 14(1): 15581, 2024 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-38971877

RESUMEN

In higher organisms, individual cells respond to signals and perturbations by epigenetic regulation and transcriptional adaptation. However, in addition to shifting the expression level of individual genes, the adaptive response of cells can also lead to shifts in the proportions of different cell types. Recent methods such as scRNA-seq allow for the interrogation of expression on the single-cell level, and can quantify individual cell type clusters within complex tissue samples. In order to identify clusters showing differential composition between different biological conditions, differential proportion analysis has recently been introduced. However, bioinformatics tools for robust proportion analysis of both replicated and unreplicated single-cell datasets are critically missing. In this manuscript, we present Scanpro, a modular tool for proportion analysis, seamlessly integrating into widely accepted frameworks in the Python environment. Scanpro is fast, accurate, supports datasets without replicates, and is intended to be used by bioinformatics experts and beginners alike.


Asunto(s)
Biología Computacional , Análisis de la Célula Individual , Programas Informáticos , Análisis de la Célula Individual/métodos , Biología Computacional/métodos , Humanos , Animales , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia de ARN/métodos
15.
Commun Biol ; 7(1): 822, 2024 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-38971889

RESUMEN

Translational studies benefit from experimental designs where laboratory organisms use human-relevant behaviors. One such behavior is decision-making, however studying complex decision-making in rodents is labor-intensive and typically restricted to two levels of cost/reward. We design a fully automated, inexpensive, high-throughput framework to study decision-making across multiple levels of rewards and costs: the REward-COst in Rodent Decision-making (RECORD) system. RECORD integrates three components: 1) 3D-printed arenas, 2) custom electronic hardware, and 3) software. We validated four behavioral protocols without employing any food or water restriction, highlighting the versatility of our system. RECORD data exposes heterogeneity in decision-making both within and across individuals that is quantifiably constrained. Using oxycodone self-administration and alcohol-consumption as test cases, we reveal how analytic approaches that incorporate behavioral heterogeneity are sensitive to detecting perturbations in decision-making. RECORD is a powerful approach to studying decision-making in rodents, with features that facilitate translational studies of decision-making in psychiatric disorders.


Asunto(s)
Conducta Animal , Toma de Decisiones , Animales , Masculino , Ratas , Ratones , Oxicodona/administración & dosificación , Recompensa , Consumo de Bebidas Alcohólicas/psicología , Conducta Alimentaria , Autoadministración , Programas Informáticos
16.
Commun Biol ; 7(1): 823, 2024 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-38971915

RESUMEN

Recent progress in image-based spatial RNA profiling enables to spatially resolve tens to hundreds of distinct RNA species with high spatial resolution. It presents new avenues for comprehending tissue organization. In this context, the ability to assign detected RNA transcripts to individual cells is crucial for downstream analyses, such as in-situ cell type calling. Yet, accurate cell segmentation can be challenging in tissue data, in particular in the absence of a high-quality membrane marker. To address this issue, we introduce ComSeg, a segmentation algorithm that operates directly on single RNA positions and that does not come with implicit or explicit priors on cell shape. ComSeg is applicable in complex tissues with arbitrary cell shapes. Through comprehensive evaluations on simulated and experimental datasets, we show that ComSeg outperforms existing state-of-the-art methods for in-situ single-cell RNA profiling and in-situ cell type calling. ComSeg is available as a documented and open source pip package at https://github.com/fish-quant/ComSeg .


Asunto(s)
Algoritmos , Perfilación de la Expresión Génica , Análisis de la Célula Individual , Transcriptoma , Perfilación de la Expresión Génica/métodos , Análisis de la Célula Individual/métodos , Procesamiento de Imagen Asistido por Computador/métodos , Humanos , Animales , Programas Informáticos , ARN/genética , Hibridación Fluorescente in Situ/métodos
17.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38975894

RESUMEN

Chimeric antigen receptor (CAR) therapy has emerged as a ground-breaking advancement in cancer treatment, harnessing the power of engineered human immune cells to target and eliminate cancer cells. The escalating interest and investment in CAR therapy in recent years emphasize its profound significance in clinical research, positioning it as a rapidly expanding frontier in the field of personalized cancer therapies. A crucial step in CAR therapy design is choosing the right target as it determines the therapy's effectiveness, safety and specificity against cancer cells, while sparing healthy tissues. Herein, we propose a suite of tools for the identification and analysis of potential CAR targets leveraging expression data from The Cancer Genome Atlas and Genotype-Tissue Expression Project, which are implemented in CARTAR website. These tools focus on pinpointing tumor-associated antigens, ensuring target selectivity and assessing specificity to avoid off-tumor toxicities and can be used to rationally designing dual CARs. In addition, candidate target expression can be explored in cancer cell lines using the expression data for the Cancer Cell Line Encyclopedia. To our best knowledge, CARTAR is the first website dedicated to the systematic search of suitable candidate targets for CAR therapy. CARTAR is publicly accessible at https://gmxenomica.github.io/CARTAR/.


Asunto(s)
Neoplasias , Receptores Quiméricos de Antígenos , Humanos , Receptores Quiméricos de Antígenos/genética , Receptores Quiméricos de Antígenos/metabolismo , Receptores Quiméricos de Antígenos/inmunología , Neoplasias/terapia , Neoplasias/genética , Inmunoterapia Adoptiva/métodos , Programas Informáticos , Internet , Biología Computacional/métodos , Bases de Datos Genéticas
18.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38975892

RESUMEN

Understanding the biological functions and processes of genes, particularly those not yet characterized, is crucial for advancing molecular biology and identifying therapeutic targets. The hypothesis guiding this study is that the 3D proximity of genes correlates with their functional interactions and relevance in prokaryotes. We introduced 3D-GeneNet, an innovative software tool that utilizes high-throughput sequencing data from chromosome conformation capture techniques and integrates topological metrics to construct gene association networks. Through a series of comparative analyses focused on spatial versus linear distances, we explored various dimensions such as topological structure, functional enrichment levels, distribution patterns of linear distances among gene pairs, and the area under the receiver operating characteristic curve by utilizing model organism Escherichia coli K-12. Furthermore, 3D-GeneNet was shown to maintain good accuracy compared to multiple algorithms (neighbourhood, co-occurrence, coexpression, and fusion) across multiple bacteria, including E. coli, Brucella abortus, and Vibrio cholerae. In addition, the accuracy of 3D-GeneNet's prediction of long-distance gene interactions was identified by bacterial two-hybrid assays on E. coli K-12 MG1655, where 3D-GeneNet not only increased the accuracy of linear genomic distance tripled but also achieved 60% accuracy by running alone. Finally, it can be concluded that the applicability of 3D-GeneNet will extend to various bacterial forms, including Gram-negative, Gram-positive, single-, and multi-chromosomal bacteria through Hi-C sequencing and analysis. Such findings highlight the broad applicability and significant promise of this method in the realm of gene association network. 3D-GeneNet is freely accessible at https://github.com/gaoyuanccc/3D-GeneNet.


Asunto(s)
Redes Reguladoras de Genes , Programas Informáticos , Algoritmos , Biología Computacional/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Escherichia coli/genética , Escherichia coli/metabolismo
19.
Nat Commun ; 15(1): 5734, 2024 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-38977664

RESUMEN

Metagenomic sequencing has provided great advantages in the characterisation of microbiomes, but currently available analysis tools lack the ability to combine subspecies-level taxonomic resolution and accurate abundance estimation with functional profiling of assembled genomes. To define the microbiome and its associations with human health, improved tools are needed to enable comprehensive understanding of the microbial composition and elucidation of the phylogenetic and functional relationships between the microbes. Here, we present MAGinator, a freely available tool, tailored for profiling of shotgun metagenomics datasets. MAGinator provides de novo identification of subspecies-level microbes and accurate abundance estimates of metagenome-assembled genomes (MAGs). MAGinator utilises the information from both gene- and contig-based methods yielding insight into both taxonomic profiles and the origin of genes and genetic content, used for inference of functional content of each sample by host organism. Additionally, MAGinator facilitates the reconstruction of phylogenetic relationships between the MAGs, providing a framework to identify clade-level differences.


Asunto(s)
Metagenoma , Metagenómica , Microbiota , Filogenia , Metagenómica/métodos , Metagenoma/genética , Humanos , Microbiota/genética , Programas Informáticos , Bacterias/genética , Bacterias/clasificación , Genoma Bacteriano/genética
20.
Syst Rev ; 13(1): 175, 2024 Jul 08.
Artículo en Inglés | MEDLINE | ID: mdl-38978084

RESUMEN

Software that employs screening prioritization through active learning (AL) has accelerated the screening process significantly by ranking an unordered set of records by their predicted relevance. However, failing to find a relevant paper might alter the findings of a systematic review, highlighting the importance of identifying elusive papers. The time to discovery (TD) measures how many records are needed to be screened to find a relevant paper, making it a helpful tool for detecting such papers. The main aim of this project was to investigate how the choice of the model and prior knowledge influence the TD values of the hard-to-find relevant papers and their rank orders. A simulation study was conducted, mimicking the screening process on a dataset containing titles, abstracts, and labels used for an already published systematic review. The results demonstrated that AL model choice, and mostly the choice of the feature extractor but not the choice of prior knowledge, significantly influenced the TD values and the rank order of the elusive relevant papers. Future research should examine the characteristics of elusive relevant papers to discover why they might take a long time to be found.


Asunto(s)
Aprendizaje Basado en Problemas , Humanos , Simulación por Computador , Programas Informáticos , Factores de Tiempo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...