Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Int J Mol Sci ; 21(15)2020 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-32759818

RESUMO

The current COronaVIrus Disease 2019 (COVID-19) pandemic started in December 2019. COVID-19 cases are confirmed by the detection of SARS-CoV-2 RNA in biological samples by RT-qPCR. However, limited numbers of SARS-CoV-2 genomes were available when the first RT-qPCR methods were developed in January 2020 for initial in silico specificity evaluation and to verify whether the targeted loci are highly conserved. Now that more whole genome data have become available, we used the bioinformatics tool SCREENED and a total of 4755 publicly available SARS-CoV-2 genomes, downloaded at two different time points, to evaluate the specificity of 12 RT-qPCR tests (consisting of a total of 30 primers and probe sets) used for SARS-CoV-2 detection and the impact of the virus' genetic evolution on four of them. The exclusivity of these methods was also assessed using the human reference genome and 2624 closely related other respiratory viral genomes. The specificity of the assays was generally good and stable over time. An exception is the first method developed by the China Center for Disease Control and prevention (CDC), which exhibits three primer mismatches present in 358 SARS-CoV-2 genomes sequenced mainly in Europe from February 2020 onwards. The best results were obtained for the assay of Chan et al. (2020) targeting the gene coding for the spiking protein (S). This demonstrates that our user-friendly strategy can be used for a first in silico specificity evaluation of future RT-qPCR tests, as well as verifying that the former methods are still capable of detecting circulating SARS-CoV-2 variants.


Assuntos
Betacoronavirus/genética , Infecções por Coronavirus/diagnóstico , Genoma Viral , Pneumonia Viral/diagnóstico , RNA Viral/metabolismo , Reação em Cadeia da Polimerase em Tempo Real/métodos , Betacoronavirus/isolamento & purificação , COVID-19 , Infecções por Coronavirus/virologia , Bases de Dados Genéticas , Humanos , Fases de Leitura Aberta/genética , Pandemias , Pneumonia Viral/virologia , Polimorfismo de Nucleotídeo Único , RNA Viral/análise , RNA Polimerase Dependente de RNA/genética , SARS-CoV-2 , Sensibilidade e Especificidade , Sequenciamento Completo do Genoma
2.
Genet Epidemiol ; 41(4): 332-340, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-28318110

RESUMO

For the association analysis of whole-genome sequencing (WGS) studies, we propose an efficient and fast spatial-clustering algorithm. Compared to existing analysis approaches for WGS data, that define the tested regions either by sliding or consecutive windows of fixed sizes along variants, a meaningful grouping of nearby variants into consecutive regions has the advantage that, compared to sliding window approaches, the number of tested regions is likely to be smaller. In comparison to consecutive, fixed-window approaches, our approach is likely to group nearby variants together. Given existing biological evidence that disease-associated mutations tend to physically cluster in specific regions along the chromosome, the identification of meaningful groups of nearby located variants could thus lead to a potential power gain for association analysis. Our algorithm defines consecutive genomic regions based on the physical positions of the variants, assuming an inhomogeneous Poisson process and groups together nearby variants. As parameters are estimated locally, the algorithm takes the differing variant density along the chromosome into account and provides locally optimal partitioning of variants into consecutive regions. An R-implementation of the algorithm is provided. We discuss the theoretical advances of our algorithm compared to existing, window-based approaches and show the performance and advantage of our introduced algorithm in a simulation study and by an application to Alzheimer's disease WGS data. Our analysis identifies a region in the ITGB3 gene that potentially harbors disease susceptibility loci for Alzheimer's disease. The region-based association signal of ITGB3 replicates in an independent data set and achieves formally genome-wide significance. Software Implementation: An implementation of the algorithm in R is available at: https://github.com/heidefier/cluster_wgs_data.


Assuntos
Estudo de Associação Genômica Ampla , Genoma , Análise de Sequência de DNA , Algoritmos , Doença de Alzheimer/genética , Análise por Conglomerados , Simulação por Computador , Genômica , Humanos , Modelos Genéticos , Software
3.
Comput Biol Med ; 169: 107820, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38113679

RESUMO

Using the accumulated whole-genome sequencing (WGS) data and assessing the functional effects of genetic variants, particularly non-coding variants, help identify new and rare variants and decipher the molecular mechanisms underlying diseases and traits but presents significant challenges. GwasWA is a comprehensive and efficient platform to identify causal variants and assess their functional effects based on WGS data. It covers the entire workflow from downloading and processing WGS data to detecting associated variants and assessing their functional effects with optimized configurations, standardized input/output formats, personalized analysis options, data visualization, and parallel processing capability that is crucial for large-scale studies. Applying GwasWA to real datasets identified three novel genes related to seed size and revealed the regulatory mechanism underlying the linkage between a human non-coding variant, rs80067372, and tumor necrosis factor levels. These results highlight the capability of GwasWA to detect novel variants based on WGS data and provide comprehensive insights into the molecular mechanisms underlying the association of variants with diseases and traits, thus contributing to medicine and biology. GwasWA and its documentation are freely available at https://github.com/unicorn-23/GwasWA.


Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Humanos , Fenótipo , Sequenciamento de Nucleotídeos em Larga Escala
4.
Animals (Basel) ; 13(24)2023 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-38136908

RESUMO

Enhancing the accuracy of genomic prediction is a key goal in genomic selection (GS) research. Integrating prior biological information into GS methods using appropriate models can improve prediction accuracy for complex traits. Genome-wide association study (GWAS) is widely utilized to identify potential candidate loci associated with complex traits in livestock and poultry, offering essential genomic insights. In this study, a GWAS was conducted on 685 Duroc × Landrace × Yorkshire (DLY) pigs to extract significant single-nucleotide polymorphisms (SNPs) as genomic features. We compared two GS models, genomic best linear unbiased prediction (GBLUP) and genomic feature BLUP (GFBLUP), by using imputed whole-genome sequencing (WGS) data on 651 Yorkshire pigs. The results revealed that the GBLUP model achieved prediction accuracies of 0.499 for backfat thickness (BFT) and 0.423 for loin muscle area (LMA). By applying the GFBLUP model with GWAS-based SNP preselection, the average prediction accuracies for BFT and LMA traits reached 0.491 and 0.440, respectively. Specifically, the GFBLUP model displayed a 4.8% enhancement in predicting LMA compared to the GBLUP model. These findings suggest that, in certain scenarios, the GFBLUP model may offer superior genomic prediction accuracy when compared to the GBLUP model, underscoring the potential value of incorporating genomic features to refine GS models.

5.
Evol Appl ; 15(12): 2054-2066, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-36540634

RESUMO

Integrating the single-nucleotide polymorphisms (SNPs) significantly affecting target traits from imputed whole-genome sequencing (iWGS) data into the genomic prediction (GP) model is an economic, efficient, and feasible strategy to improve prediction accuracy. The objective was to dissect the genetic architecture of intramuscular fat content (IFC) by genome wide association studies (GWAS) and to investigate the accuracy of GP based on pedigree-based BLUP (PBLUP) model, genomic best linear unbiased prediction (GBLUP) models and Bayesian mixture (BayesMix) models under different strategies. A total of 482 Suhuai pigs were genotyped using an 80 K SNP chip. Furthermore, 30 key samples were selected for resequencing and were used as a reference panel to impute the 80 K chip data to the WGS dataset. The 80 K data and iWGS data were used to perform GWAS and test GP accuracies under different scenarios. GWAS results revealed that there were four major regions affecting IFC. Two important functional candidate genes were found in the two most significant regions, including protein kinase C epsilon (PRKCE) and myosin light chain 2 (MYL2). The results of the predictions showed that the PBLUP model had the lowest reliability (0.096 ± 0.032). The reliability (0.229 ± 0.035) was improved by replacing pedigree information with 80 K chip data. Compared with using 80 K SNPs alone, pruning iWGS SNPs with the R-squared cutoff of linkage disequilibrium (0.55) led to a slight improvement (0.006), adding significant iWGS SNPs led to an improvement of reliability by 0.050 when using a one-component GBLUP, a further increase of 0.033 when using a two-component GBLUP model. For BayesMix models, compared with using 80 K SNPs alone, adding additional significant iWGS SNPs into one- or two-component BayesMix models led to improvements of reliabilities for IFC by 0.040 and 0.089, respectively. Our results may facilitate further identification of causal genes for IFC and may be beneficial for the improvement of IFC in pig breeding programs.

6.
Genes (Basel) ; 12(4)2021 04 13.
Artigo em Inglês | MEDLINE | ID: mdl-33924636

RESUMO

For 1 year now, the world is undergoing a coronavirus disease-2019 (COVID-19) pandemic due to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The most widely used method for COVID-19 diagnosis is the detection of viral RNA by RT-qPCR with a specific set of primers and probe. It is important to frequently evaluate the performance of these tests and this can be done first by an in silico approach. Previously, we reported some mismatches between the oligonucleotides of publicly available RT-qPCR assays and SARS-CoV-2 genomes collected from GISAID and NCBI, potentially impacting proper detection of the virus. In the present study, 11 primers and probe sets investigated during the first study were evaluated again with 84,305 new SARS-CoV-2 unique genomes collected between June 2020 and January 2021. The lower inclusivity of the China CDC assay targeting the gene N has continued to decrease with new mismatches detected, whereas the other evaluated assays kept their inclusivity above 99%. Additionally, some mutations specific to new SARS-CoV-2 variants of concern were found to be located in oligonucleotide annealing sites. This might impact the strategy to be considered for future SARS-CoV-2 testing. Given the potential threat of the new variants, it is crucial to assess if they can still be correctly targeted by the primers and probes of the RT-qPCR assays. Our study highlights that considering the evolution of the virus and the emergence of new variants, an in silico (re-)evaluation should be performed on a regular basis. Ideally, this should be done for all the RT-qPCR assays employed for SARS-CoV-2 detection, including also commercial tests, although the primer and probe sequences used in these kits are rarely disclosed, which impedes independent performance evaluation.


Assuntos
Teste de Ácido Nucleico para COVID-19/métodos , COVID-19/virologia , Reação em Cadeia da Polimerase em Tempo Real/métodos , SARS-CoV-2/genética , Simulação por Computador , Genoma Viral , Humanos , Mutação , Sondas RNA , Reação em Cadeia da Polimerase Via Transcriptase Reversa/métodos
7.
Front Genet ; 11: 243, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32318090

RESUMO

Poultry feed constitutes the largest cost in poultry production, estimated to be up to 70% of the total cost. Moreover, there is pressure on the poultry industry to increase production to meet the protein demand of humans and simultaneously reduce emissions to protect the environment. Therefore, improving feed efficiency plays an important role to improve profits and the environmental footprint in broiler production. In this study, using imputed whole-genome sequencing data, genome-wide association analysis (GWAS) was performed to identify single-nucleotide polymorphisms (SNPs) and genes associated with residual feed intake (RFI) and its component traits. Furthermore, a transcriptomic analysis between the high-RFI and the low-RFI groups was performed to validate the candidate genes from GWAS. The results showed that the heritability estimates of average daily gain (ADG), average daily feed intake (ADFI), and RFI were 0.29 (0.004), 0.37 (0.005), and 0.38 (0.004), respectively. Using imputed sequence-based GWAS, we identified seven significant SNPs and five candidate genes [MTSS I-BAR domain containing 1, folliculin, COP9 signalosome subunit 3, 5',3'-nucleotidase (mitochondrial), and gametocyte-specific factor 1] associated with RFI, 20 significant SNPs and one candidate gene (inositol polyphosphate multikinase) associated with ADG, and one significant SNP and one candidate gene (coatomer protein complex subunit alpha) associated with ADFI. After performing a transcriptomic analysis between the high-RFI and the low-RFI groups, both 38 up-regulated and 26 down-regulated genes were identified in the high-RFI group. Furthermore, integrating regional conditional GWAS and transcriptome analysis, ras-related dexamethasone induced 1 was the only overlapped gene associated with RFI, which also suggested that the region (GGA14: 4767015-4882318) is a new quantitative trait locus associated with RFI. In conclusion, using imputed sequence-based GWAS is an efficient method to identify significant SNPs and candidate genes in chicken. Our results provide valuable insights into the genetic mechanisms of RFI and its component traits, which would further improve the genetic gain of feed efficiency rapidly and cost-effectively in the context of marker-assisted breeding selection.

8.
Front Genet ; 10: 673, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31379929

RESUMO

Genomic prediction with imputed whole-genome sequencing (WGS) data is an attractive approach to improve predictive ability with low cost. However, high accuracy has not been realized using this method in livestock. In this study, we imputed 435 individuals from 600K single nucleotide polymorphism (SNP) chip data to WGS data using different reference panels. We also investigated the prediction accuracy of genomic best linear unbiased prediction (GBLUP) using imputed WGS data from different reference panels, linkage disequilibrium (LD)-based marker pruning, and pre-selected variants based on Genome-wide association society (GWAS) results. Results showed that the imputation accuracies from 600K to WGS data were 0.873 ± 0.038, 0.906 ± 0.036, and 0.979 ± 0.010 for the internal, external, and combined reference panels, respectively. In most traits of chickens, the prediction accuracy of imputed WGS data obtained from the internal reference panel was greater than or equal to that of the combined reference panel; the external reference panel had the lowest prediction accuracy. Compared with 600K chip data, GBLUP with imputed WGS data had only a small increase (1-3%) in prediction accuracy. Using only variants selected from imputed WGS data based on GWAS results resulted in almost no increase for most traits and even increased the bias of the regression coefficient. The impact of the degree of LD of selected and remaining variants on prediction accuracy was different. For average daily gain (ADG), residual feed intake (RFI), intestine length (IL), and body weight in 91 days (BW91), the accuracy of GBLUP increased as the degree of LD of selected variants decreased, but the opposite relationship occurred for the remaining variants. But for breast muscle weight (BMW) and average daily feed intake (ADFI), the accuracy of GBLUP increased as the degree of LD of selected variants increased, and the degree of LD of remaining variants had a small effect on prediction accuracy. Overall, the optimal imputation strategy to obtain WGS data for genomic prediction should consider the relationship between selected individuals and target population individuals to avoid heterogeneity of imputation. LD-based marker pruning can be used to improve the accuracy of genomic prediction using imputed WGS data.

9.
Meta Gene ; 5: 105-14, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-26137446

RESUMO

Whole genome sequencing (WGS) using next generation sequencing technologies paves the way to sequence the mitochondrial genomes with greater ease and lesser time. Here, we used the WGS data of Clarias batrachus, generated from Roche 454 and Ion Torrent sequencing platforms, to assemble the complete mitogenome using both de novo and reference based approaches. Both the methods yielded almost similar results and the best assembled mitogenome was of 16,510 bp size (GenBank Acc. No. KM259918). The mitogenome annotation resulted in 13 coding genes, 22 tRNA genes, 2 rRNA genes and one control region, and the gene order was found to be identical with other catfishes. Variation analyses between assembled and the reference (GenBank Acc. No. NC_023923) mitogenome revealed 51 variations. The phylogenetic analysis of coding DNA sequences and tRNA supports the monophyly of catfishes. Two SSRs were identified in C. batrachus mitogenome, out of which one was unique to this species. Based on the relative rate of gene evolution, protein coding mitochondrial genes were found to evolve at a much faster pace than the d-loop, which in turn are followed by the rRNAs; the tRNAs showed wide variability in the rate of sequence evolution, and on average evolve the slowest. Among the coding genes, ND2 evolves most rapidly. The variations present in the coding regions of the mitogenome and their comparative analyses with other catfish species may be useful in species conservation and management programs.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA