Identical repeated backbone of the human genome.
Zepeda-Mendoza, Cinthya J; Lemus, Tzitziki; Yáñez, Omar; García, Delfino; Valle-García, David; Meza-Sosa, Karla F; Gutiérrez-Arcelus, María; Márquez-Ortiz, Yamile; Domínguez-Vidaña, Rocío; Gonzaga-Jauregui, Claudia; Flores, Margarita; Palacios, Rafael.
Affiliation
  • Zepeda-Mendoza CJ; Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, 62210, México. czepeda@lcg.unam.mx
BMC Genomics ; 11: 60, 2010 Jan 23.
Article in En | MEDLINE | ID: mdl-20096123
BACKGROUND: Identical sequences with a minimal length of about 300 base pairs (bp) have been implicated in the generation of various meiotic/mitotic genomic rearrangements through non-allelic homologous recombination (NAHR) events. Genomic disorders and structural variation, together with gene remodelling processes, have been associated with many of these rearrangements. Based on these observations, we identified all the 100% identical repeats of at least 300 bp in the NCBI version 36.2 human genome reference assembly and integrated them into non-overlapping regions, thus defining the Identical Repeated Backbone (IRB) of the reference human genome.

RESULTS: The IRB sequences are distributed all over the genome in 66,600 regions, which correspond to approximately 2% of the total NCBI human genome reference assembly. Important structural and functional elements such as common repeats, segmental duplications, and genes are contained in the IRB. About 80% of the IRB bp overlap with known copy-number variants (CNVs). By analyzing the genes embedded in the IRB, we were able to detect some identical genes not previously included in the Ensembl release 50 annotation of human genes. In addition, we found evidence of IRB gene copy-number polymorphisms in raw sequence reads of two sequenced diploid genomes.

CONCLUSIONS: In general, the IRB offers new insight into the complex organization of the identical repeated sequences of the human genome. It provides an accurate map of potential NAHR sites, which could be used in targeting the study of novel CNVs, predicting DNA copy-number variation in newly sequenced genomes, and improving genome annotation.
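
The abstract describes two interval operations: integrating repeat hits into non-overlapping regions, and measuring how many of the resulting base pairs overlap known CNVs. The following Python sketch illustrates both on toy data. It is not the authors' pipeline: the function names, the half-open coordinate convention, the choice to merge abutting intervals, and every coordinate below are assumptions made for illustration only.

```python
from typing import List, Tuple

# Assumed convention: half-open [start, end) coordinates on a single chromosome.
Interval = Tuple[int, int]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Collapse overlapping or abutting intervals into non-overlapping regions."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps or touches the previous region: extend that region.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_bp(regions: List[Interval]) -> int:
    """Total base pairs covered by a list of non-overlapping regions."""
    return sum(end - start for start, end in regions)


def overlap_bp(a: List[Interval], b: List[Interval]) -> int:
    """Base pairs shared by two sorted, non-overlapping region lists."""
    i = j = total = 0
    while i < len(a) and j < len(b):
        lo = max(a[i][0], b[j][0])
        hi = min(a[i][1], b[j][1])
        if lo < hi:
            total += hi - lo
        # Advance whichever region ends first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return total


# Hypothetical repeat hits (each at least 300 bp) and hypothetical CNV calls.
repeat_hits = [(100, 500), (400, 900), (2000, 2300)]
cnv_calls = [(0, 600), (2100, 2500)]

backbone = merge_intervals(repeat_hits)   # [(100, 900), (2000, 2300)]
cnvs = merge_intervals(cnv_calls)

total = covered_bp(backbone)              # 1100
shared = overlap_bp(backbone, cnvs)       # 700
print(f"{len(backbone)} regions, {total} bp, {shared / total:.0%} overlapping CNVs")
```

On a real assembly the same steps would be run per chromosome, and the total covered length divided by the assembly length would give a genome fraction comparable to the approximately 2% reported above.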
Subjects

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Repetitive Sequences, Nucleic Acid / Genome, Human Limit: Humans Language: En Journal: BMC Genomics Journal subject: GENETICS Year of publication: 2010 Document type: Article
