Your browser doesn't support javascript.
loading
Efficient estimation for large-scale linkage disequilibrium patterns of the human genome.
Huang, Xin; Zhu, Tian-Neng; Liu, Ying-Chao; Qi, Guo-An; Zhang, Jian-Nan; Chen, Guo-Bo.
Afiliação
  • Huang X; Institute of Bioinformatics, Zhejiang University, Hangzhou, China.
  • Zhu TN; Center for General Practice Medicine, Department of General Practice Medicine, Zhejiang Provincial People's Hospital, People's Hospital of Hangzhou Medical College, Hangzhou, China.
  • Liu YC; Center for Reproductive Medicine, Department of Genetic and Genomic Medicine, and Clinical Research Institute, Zhejiang Provincial People's Hospital, People's Hospital of Hangzhou Medical College, Zhejiang, China.
  • Qi GA; Institute of Bioinformatics, Zhejiang University, Hangzhou, China.
  • Zhang JN; Institute of Bioinformatics, Zhejiang University, Hangzhou, China.
  • Chen GB; Institute of Bioinformatics, Zhejiang University, Hangzhou, China.
Elife ; 122023 Dec 27.
Article em En | MEDLINE | ID: mdl-38149842
ABSTRACT
In this study, we proposed an efficient algorithm (X-LD) for estimating linkage disequilibrium (LD) patterns for a genomic grid, which can be of inter-chromosomal scale or of small segments. Compared with conventional methods, the proposed method was significantly faster, dropped from O(nm2) to O(n2m)-n the sample size and m the number of SNPs, and consequently we were permitted to explore in depth unknown or reveal long-anticipated LD features of the human genome. Having applied the algorithm for 1000 Genome Project (1KG), we found (1) the extended LD, driven by population structure, universally existed, and the strength of inter-chromosomal LD was about 10% of their respective intra-chromosomal LD in relatively homogeneous cohorts, such as FIN, and to nearly 56% in admixed cohort, such as ASW. (2) After splitting each chromosome into upmost of more than a half million grids, we elucidated the LD of the HLA region was nearly 42 folders higher than chromosome 6 in CEU and 11.58 in ASW; on chromosome 11, we observed that the LD of its centromere was nearly 94.05 folders higher than chromosome 11 in YRI and 42.73 in ASW. (3) We uncovered the long-anticipated inversely proportional linear relationship between the length of a chromosome and the strength of chromosomal LD, and their Pearson's correlation was on average over 0.80 for 26 1KG cohorts. However, this linear norm was so far perturbed by chromosome 11 given its more completely sequenced centromere region. Uniquely chromosome 8 of ASW was found most deviated from the linear norm than any other autosomes. The proposed algorithm has been realized in C++ (called X-LD) and is available at https//github.com/gc5k/gear2, and can be applied to explore LD features in any sequenced populations.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genoma Humano / Polimorfismo de Nucleotídeo Único Limite: Humans Idioma: En Revista: Elife Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genoma Humano / Polimorfismo de Nucleotídeo Único Limite: Humans Idioma: En Revista: Elife Ano de publicação: 2023 Tipo de documento: Article