Pesquisa | BVS IEC

Pan-African genome demonstrates how population-specific genome graphs improve high-throughput sequencing data analysis.

Tetikol, H Serhat; Turgut, Deniz; Narci, Kubra; Budak, Gungor; Kalay, Ozem; Arslan, Elif; Demirkaya-Budak, Sinem; Dolgoborodov, Alexey; Kabakci-Zorlu, Duygu; Semenyuk, Vladimir; Jain, Amit; Davis-Dusenbery, Brandi N.

Nat Commun ; 13(1): 4384, 2022 08 04.

Artigo em Inglês | MEDLINE | ID: mdl-35927245

RESUMO

Graph-based genome reference representations have seen significant development, motivated by the inadequacy of the current human genome reference to represent the diverse genetic information from different human populations and its inability to maintain the same level of accuracy for non-European ancestries. While there have been many efforts to develop computationally efficient graph-based toolkits for NGS read alignment and variant calling, methods to curate genomic variants and subsequently construct genome graphs remain an understudied problem that inevitably determines the effectiveness of the overall bioinformatics pipeline. In this study, we discuss obstacles encountered during graph construction and propose methods for sample selection based on population diversity, graph augmentation with structural variants and resolution of graph reference ambiguity caused by information overload. Moreover, we present the case for iteratively augmenting tailored genome graphs for targeted populations and demonstrate this approach on the whole-genome samples of African ancestry. Our results show that population-specific graphs, as more representative alternatives to linear or generic graph references, can achieve significantly lower read mapping errors and enhanced variant calling sensitivity, in addition to providing the improvements of joint variant calling without the need of computationally intensive post-processing steps.

Assuntos

Análise de Dados , Sequenciamento de Nucleotídeos em Larga Escala , Genoma Humano/genética , Genômica/métodos , Humanos , Análise de Sequência de DNA/métodos , Software

The application of Lempel-Ziv and Titchener complexity analysis for equine telemetric electrocardiographic recordings.

Alexeenko, Vadim; Fraser, James A; Dolgoborodov, Alexey; Bowen, Mark; Huang, Christopher L-H; Marr, Celia M; Jeevaratnam, Kamalan.

Sci Rep ; 9(1): 2619, 2019 02 22.

Artigo em Inglês | MEDLINE | ID: mdl-30796330

RESUMO

The analysis of equine electrocardiographic (ECG) recordings is complicated by the absence of agreed abnormality classification criteria. We explore the applicability of several complexity analysis methods for characterization of non-linear aspects of electrocardiographic recordings. We here show that complexity estimates provided by Lempel-Ziv '76, Titchener's T-complexity and Lempel-Ziv '78 analysis of ECG recordings of healthy Thoroughbred horses are highly dependent on the duration of analysed ECG fragments and the heart rate. The results provide a methodological basis and a feasible reference point for the complexity analysis of equine telemetric ECG recordings that might be applied to automate detection of equine arrhythmias in equine clinical practice.

Assuntos

Algoritmos , Eletrocardiografia , Cavalos/fisiologia , Análise de Sistemas , Telemetria , Animais , Frequência Cardíaca/fisiologia , Processamento de Sinais Assistido por Computador

Fast and accurate genomic analyses using genome graphs.

Rakocevic, Goran; Semenyuk, Vladimir; Lee, Wan-Ping; Spencer, James; Browning, John; Johnson, Ivan J; Arsenijevic, Vladan; Nadj, Jelena; Ghose, Kaushik; Suciu, Maria C; Ji, Sun-Gou; Demir, Gülfem; Li, Lizao; Toptas, Berke Ç; Dolgoborodov, Alexey; Pollex, Björn; Spulber, Iosif; Glotova, Irina; Kómár, Péter; Stachyra, Andrew L; Li, Yilong; Popovic, Milos; Källberg, Morten; Jain, Amit; Kural, Deniz.

Nat Genet ; 51(2): 354-362, 2019 02.

Artigo em Inglês | MEDLINE | ID: mdl-30643257

RESUMO

The human reference genome serves as the foundation for genomics by providing a scaffold for alignment of sequencing reads, but currently only reflects a single consensus haplotype, thus impairing analysis accuracy. Here we present a graph reference genome implementation that enables read alignment across 2,800 diploid genomes encompassing 12.6 million SNPs and 4.0 million insertions and deletions (indels). The pipeline processes one whole-genome sequencing sample in 6.5 h using a system with 36 CPU cores. We show that using a graph genome reference improves read mapping sensitivity and produces a 0.5% increase in variant calling recall, with unaffected specificity. Structural variations incorporated into a graph genome can be genotyped accurately under a unified framework. Finally, we show that iterative augmentation of graph genomes yields incremental gains in variant calling accuracy. Our implementation is an important advance toward fulfilling the promise of graph genomes to radically enhance the scalability and accuracy of genomic analyses.

Assuntos

Genoma Humano/genética , Genômica/métodos , Humanos , Polimorfismo de Nucleotídeo Único/genética , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Deleção de Sequência/genética , Sequenciamento Completo do Genoma/métodos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA