Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Int J Mol Sci ; 25(9)2024 Apr 26.
Artículo en Inglés | MEDLINE | ID: mdl-38731932

RESUMEN

The serious drawback underlying the biological annotation of whole-genome sequence data is the p >> n problem, which means that the number of polymorphic variants (p) is much larger than the number of available phenotypic records (n). We propose a way to circumvent the problem by combining a LASSO logistic regression with deep learning to classify cows as susceptible or resistant to mastitis, based on single nucleotide polymorphism (SNP) genotypes. Among several architectures, the one with 204,642 SNPs was selected as the best. This architecture was composed of two layers with, respectively, 7 and 46 units per layer implementing respective drop-out rates of 0.210 and 0.358. The classification of the test data resulted in AUC = 0.750, accuracy = 0.650, sensitivity = 0.600, and specificity = 0.700. Significant SNPs were selected based on the SHapley Additive exPlanation (SHAP). As a final result, one GO term related to the biological process and thirteen GO terms related to molecular function were significantly enriched in the gene set that corresponded to the significant SNPs. Our findings revealed that the optimal approach can correctly predict susceptibility or resistance status for approximately 65% of cows. Genes marked by the most significant SNPs are related to the immune response and protein synthesis.


Asunto(s)
Aprendizaje Profundo , Mastitis Bovina , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma , Bovinos , Mastitis Bovina/genética , Animales , Femenino , Secuenciación Completa del Genoma/métodos , Predisposición Genética a la Enfermedad , Genotipo
2.
NAR Genom Bioinform ; 6(2): lqae040, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38686136

RESUMEN

This study compared computational approaches to parallelization of an SNP calling workflow. The data comprised DNA from five Holstein-Friesian cows sequenced with the Illumina platform. The pipeline consisted of quality control, alignment to the reference genome, post-alignment, and SNP calling. Three approaches to parallelization were compared: (i) a plain Bash script in which a pipeline for each cow was executed as separate processes invoked at the same time, (ii) a Bash script wrapped in a single Nextflow process and (iii) a Nextflow script with each component of the pipeline defined as a separate process. The results demonstrated that on average, the multi-process Nextflow script performed 15-27% faster depending on the number of assigned threads, with the biggest execution time advantage over the plain Bash approach observed with 10 threads. In terms of RAM usage, the most substantial variation was observed for the multi-process Nextflow, for which it increased with the number of assigned threads, while RAM consumption of the other setups did not depend much on the number of threads assigned for computations. Due to intermediate and log files generated, disk usage was markedly higher for the multi-process Nextflow than for the plain Bash and for the single-process Nextflow.

3.
Blood Cancer Discov ; 4(6): 440-451, 2023 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-37769148

RESUMEN

BCMA-targeted bispecific antibodies (BiAb) are efficacious in relapsed/refractory multiple myeloma; however, serious infections have emerged as important toxicities. In this retrospective study, we characterized all infections and their risk factors, and evaluated the impact of infection prophylaxis in patients treated with BCMA-targeted BiAbs. Among 37 patients, 15 (41%) experienced a grade 3-5 infection, with two infection-related deaths during deep remissions. Most (84%) infections occurred during disease remissions. The cumulative probability of grade 3-5 infection increased over time with no plateau. Among responders (n = 26), profound hypogammaglobulinemia occurred in 100% and continued throughout the entire duration of treatment. During periods when patients were receiving intravenous immunoglobulin (IVIg), the rate of grade 3-5 infections was 90% lower than during observation (incidence rate ratio, 0.10; 95% confidence interval, 0.01-0.80; P = 0.0307). No other risk factors for infection were identified. This study demonstrates that profound hypogammaglobulinemia is universal with BCMA-targeted BiAbs, with intravenous immunoglobulin potentially abrogating most of the infection risk. SIGNIFICANCE: To the best of our knowledge, this is the first study to comprehensively analyze risk factors and mitigation strategies to prevent infections in myeloma patients receiving anti-BCMA bispecific antibodies. Profound and prolonged hypogammaglobulinemia was universal among responders, while immunoglobulin replacement was associated with 90% lower rates of grade 3-5 infections. See related commentary by Garfall and Stadtmauer, p. 427 . This article is featured in Selected Articles from This Issue, p. 419.


Asunto(s)
Agammaglobulinemia , Anticuerpos Biespecíficos , Mieloma Múltiple , Humanos , Mieloma Múltiple/complicaciones , Mieloma Múltiple/tratamiento farmacológico , Inmunoglobulinas Intravenosas/uso terapéutico , Anticuerpos Biespecíficos/efectos adversos , Antígeno de Maduración de Linfocitos B/uso terapéutico , Agammaglobulinemia/tratamiento farmacológico , Estudios Retrospectivos
4.
PLoS One ; 18(1): e0279356, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-36662838

RESUMEN

Undoubtedly, genetic factors play an important role in susceptibility and resistance to COVID-19. In this study, we conducted the GWAS analysis. Out of 15,489,173 SNPs, we identified 18,191 significant SNPs for severe and 11,799 SNPs for resistant phenotype, showing that a great number of loci were significant in different COVID-19 representations. The majority of variants were synonymous (60.56% for severe, 58.46% for resistant phenotype) or located in introns (55.77% for severe, 59.83% for resistant phenotype). We identified the most significant SNPs for a severe outcome (in AJAP1 intron) and for COVID resistance (in FIG4 intron). We found no missense variants with a potential causal function on resistance to COVID-19; however, two missense variants were determined as significant a severe phenotype (in PM20D1 and LRP4 exons). None of the aforementioned SNPs and missense variants found in this study have been previously associated with COVID-19.


Asunto(s)
COVID-19 , Estudio de Asociación del Genoma Completo , Humanos , COVID-19/genética , Fenotipo , Mutación Missense , Exones , Polimorfismo de Nucleótido Simple , Predisposición Genética a la Enfermedad , Flavoproteínas/genética , Monoéster Fosfórico Hidrolasas/genética
5.
Genet Sel Evol ; 54(1): 80, 2022 Dec 16.
Artículo en Inglés | MEDLINE | ID: mdl-36526979

RESUMEN

Genome-wide association studies (GWAS) help identify polymorphic sites or genes linked to phenotypic variance, but a few identified genes and/or single nucleotide polymorphisms (SNPs) are unlikely to explain a large part of the phenotypic variability of complex traits. In this study, the focus was moved from single loci to functional units, expressed by the metabolic pathways as defined in the Kyoto Encyclopaedia of Genes and Genomes (KEGG) database. Consequently, the aim of this study was to estimate KEGG effects on stature in three Nordic dairy cattle breeds using SNP effects from GWAS as the dependent variable. The SNPs were annotated to genes, then the genes to KEGG pathways. The effects of KEGG pathways were estimated separately for each breed using a mixed linear model incorporating the similarity between pathways expressed by common genes. The KEGG pathway D-amino acid metabolism (map00473) was estimated to be significant for stature in two of the analysed breeds and revealed a borderline significance in the third breed. Thus, we demonstrate that the approach to statistical modelling of higher order functional effects on complex traits is useful, and provides evidence of the importance of D-amino acids for growth in cattle.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Bovinos/genética , Animales , Estudio de Asociación del Genoma Completo/veterinaria , Modelos Lineales , Sitios de Carácter Cuantitativo , Herencia Multifactorial
6.
Int J Mol Sci ; 23(15)2022 Aug 04.
Artículo en Inglés | MEDLINE | ID: mdl-35955824

RESUMEN

Background: Severe outcomes of COVID-19 account for up to 15% of all cases. The study aims to check if any gene variants related to cardiovascular (CVD) and pulmonary diseases (PD) are correlated with a severe outcome of COVID-19 in a Polish cohort of COVID-19 patients. Methods: In this study, a subset of 747 samples from unrelated individuals collected across Poland in 2020 and 2021 was used and whole-genome sequencing was performed. Results: The GWAS analysis of SNPs and short indels located in genes related to CVD identified one variant significant in COVID-19 severe outcome in the HADHA gene, while for the PD gene panel, we found two significant variants in the DRC1 gene. In this study, both potentially protective and risk variants were identified, of which variants in the HADHA gene deserve the most attention. Conclusions: This is the first study reporting the association between the HADHA and DRC1 genetic variants and COVID-19 severe outcome based on the cohort WGS analysis. Although all the identified variants are localised in introns, they may be correlated and therefore inherited along with other risk variants, potentially causative to severe outcome of COVID-19 but not discovered yet.


Asunto(s)
COVID-19 , Enfermedades Cardiovasculares , COVID-19/genética , Enfermedades Cardiovasculares/genética , Estudio de Asociación del Genoma Completo , Humanos , Mutación INDEL , Pulmón , Polimorfismo de Nucleótido Simple
7.
Int J Mol Sci ; 23(11)2022 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-35682950

RESUMEN

COVID-19 infections pose a serious global health concern so it is crucial to identify the biomarkers for the susceptibility to and resistance against this disease that could help in a rapid risk assessment and reliable decisions being made on patients' treatment and their potential hospitalisation. Several studies investigated the factors associated with severe COVID-19 outcomes that can be either environmental, population based, or genetic. It was demonstrated that the genetics of the host plays an important role in the various immune responses and, therefore, there are different clinical presentations of COVID-19 infection. In this study, we aimed to use variant descriptive statistics from GWAS (Genome-Wide Association Study) and variant genomic annotations to identify metabolic pathways that are associated with a severe COVID-19 infection as well as pathways related to resistance to COVID-19. For this purpose, we applied a custom-designed mixed linear model implemented into custom-written software. Our analysis of more than 12.5 million SNPs did not indicate any pathway that was significant for a severe COVID-19 infection. However, the Allograft rejection pathway (hsa05330) was significant (p = 0.01087) for resistance to the infection. The majority of the 27 SNP marking genes constituting the Allograft rejection pathway were located on chromosome 6 (19 SNPs) and the remainder were mapped to chromosomes 2, 3, 10, 12, 20, and X. This pathway comprises several immune system components crucial for the self versus non-self recognition, but also the components of antiviral immunity. Our study demonstrated that not only single variants are important for resistance to COVID-19, but also the cumulative impact of several SNPs within the same pathway matters.


Asunto(s)
COVID-19 , Estudio de Asociación del Genoma Completo , Aloinjertos , COVID-19/genética , Predisposición Genética a la Enfermedad , Humanos , Inmunidad Innata , Polimorfismo de Nucleótido Simple
8.
Sci Rep ; 12(1): 7671, 2022 05 10.
Artículo en Inglés | MEDLINE | ID: mdl-35538164

RESUMEN

Since global temperature is expected to rise by 2 °C in 2050 heat stress may become the most severe environmental factor. In the study, we illustrate the application of mixed linear models for the analysis of whole transcriptome expression in livers and adrenal tissues of Sprague-Dawley rats obtained by a heat stress experiment. By applying those models, we considered four sources of variation in transcript expression, comprising transcripts (1), genes (2), Gene Ontology terms (3), and Reactome pathways (4) and focussed on accounting for the similarity within each source, which was expressed as a covariance matrix. Models based on transcripts or genes levels explained a larger proportion of log2 fold change than models fitting the functional components of Gene Ontology terms or Reactome pathways. In the liver, among the most significant genes were PNKD and TRIP12. In the adrenal tissue, one transcript of the SUCO gene was expressed more strongly in the control group than in the heat-stress group. PLEC had two transcripts, which were significantly overexpressed in the heat-stress group. PER3 was significant only on gene level. Moving to the functional scale, five Gene Ontologies and one Reactome pathway were significant in the liver. They can be grouped into ontologies related to DNA repair, histone ubiquitination, the regulation of embryonic development and cytoplasmic translation. Linear mixed models are valuable tools for the analysis of high-throughput biological data. Their main advantages are the possibility to incorporate information on covariance between observations and circumventing the problem of multiple testing.


Asunto(s)
Perfilación de la Expresión Génica , Trastornos de Estrés por Calor , Animales , Biodiversidad , Respuesta al Choque Térmico/genética , Modelos Lineales , Ratas , Ratas Sprague-Dawley , Temperatura , Transcriptoma
9.
J Appl Genet ; 61(4): 617-618, 2020 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-33044661

RESUMEN

The original version on this paper contained an error. Figure 5 was published with the same image of Fig. 4.

10.
J Appl Genet ; 61(4): 607-616, 2020 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-32996082

RESUMEN

A downside of next-generation sequencing technology is the high technical error rate. We built a tool, which uses array-based genotype information to classify next-generation sequencing-based SNPs into the correct and the incorrect calls. The deep learning algorithms were implemented via Keras. Several algorithms were tested: (i) the basic, naïve algorithm, (ii) the naïve algorithm modified by pre-imposing different weights on incorrect and correct SNP class in calculating the loss metric and (iii)-(v) the naïve algorithm modified by random re-sampling (with replacement) of the incorrect SNPs to match 30%/60%/100% of the number of correct SNPs. The training data set was composed of data from three bulls and consisted of 2,227,995 correct (97.94%) and 46,920 incorrect SNPs, while the validation data set consisted of data from one bull with 749,506 correct (98.05%) and 14,908 incorrect SNPs. The results showed that for a rare event classification problem, like incorrect SNP detection in NGS data, the most parsimonious naïve model and a model with the weighting of SNP classes provided the best results for the classification of the validation data set. Both classified 19% of truly incorrect SNPs as incorrect and 99% of truly correct SNPs as correct and resulted in the F1 score of 0.21 - the highest among the compared algorithms. We conclude the basic models were less adapted to the specificity of a training data set and thus resulted in better classification of the independent, validation data set, than the other tested models.


Asunto(s)
Aprendizaje Profundo , Técnicas de Genotipaje/métodos , Polimorfismo de Nucleótido Simple/genética , Secuenciación Completa del Genoma/métodos , Algoritmos , Animales , Bovinos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...