Pesquisa | Biblioteca Virtual em Saúde

Precise detection of de novo single nucleotide variants in human genomes.

Gómez-Romero, Laura; Palacios-Flores, Kim; Reyes, José; García, Delfino; Boege, Margareta; Dávila, Guillermo; Flores, Margarita; Schatz, Michael C; Palacios, Rafael.

Proc Natl Acad Sci U S A ; 115(21): 5516-5521, 2018 05 22.

Artigo em Inglês | MEDLINE | ID: mdl-29735690

RESUMO

The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent-offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use.

Assuntos

Variações do Número de Cópias de DNA , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Pais , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/métodos , Software , Algoritmos , Criança , Humanos

Context-dependent individualization of nucleotides and virtual genomic hybridization allow the precise location of human SNPs.

Reyes, José; Gómez-Romero, Laura; Ibarra-Soria, Ximena; Palacios-Flores, Kim; Arriola, Luis R; Wences, Alejandro; García, Delfino; Boege, Margareta; Dávila, Guillermo; Flores, Margarita; Palacios, Rafael.

Proc Natl Acad Sci U S A ; 108(37): 15294-9, 2011 Sep 13.

Artigo em Inglês | MEDLINE | ID: mdl-21876154

RESUMO

We have entered the era of individual genomic sequencing, and can already see exponential progress in the field. It is of utmost importance to exclude false-positive variants from reported datasets. However, because of the nature of the used algorithms, this task has not been optimized to the required level of precision. This study presents a unique strategy for identifying SNPs, called COIN-VGH, that largely minimizes the presence of false-positives in the generated data. The algorithm was developed using the X-chromosome-specific regions from the previously sequenced genomes of Craig Venter and James Watson. The algorithm is based on the concept that a nucleotide can be individualized if it is analyzed in the context of its surrounding genomic sequence. COIN-VGH consists of defining the most comprehensive set of nucleotide strings of a defined length that map with 100% identity to a unique position within the human reference genome (HRG). Such set is used to retrieve sequence reads from a query genome (QG), allowing the production of a genomic landscape that represents a draft HRG-guided assembly of the QG. This landscape is analyzed for specific signatures that indicate the presence of SNPs. The fidelity of the variation signature was assessed using simulation experiments by virtually altering the HRG at defined positions. Finally, the signature regions identified in the HRG and in the QG reads are aligned and the precise nature and position of the corresponding SNPs are detected. The advantages of COIN-VGH over previous algorithms are discussed.

Assuntos

Simulação por Computador , Genoma Humano/genética , Hibridização de Ácido Nucleico/métodos , Nucleotídeos/genética , Polimorfismo de Nucleotídeo Único/genética , Cromossomos Humanos X/genética , Sondas de DNA/metabolismo , Humanos , Padrões de Referência

Identical repeated backbone of the human genome.

Zepeda-Mendoza, Cinthya J; Lemus, Tzitziki; Yáñez, Omar; García, Delfino; Valle-García, David; Meza-Sosa, Karla F; Gutiérrez-Arcelus, María; Márquez-Ortiz, Yamile; Domínguez-Vidaña, Rocío; Gonzaga-Jauregui, Claudia; Flores, Margarita; Palacios, Rafael.

BMC Genomics ; 11: 60, 2010 Jan 23.

Artigo em Inglês | MEDLINE | ID: mdl-20096123

RESUMO

BACKGROUND: Identical sequences with a minimal length of about 300 base pairs (bp) have been involved in the generation of various meiotic/mitotic genomic rearrangements through non-allelic homologous recombination (NAHR) events. Genomic disorders and structural variation, together with gene remodelling processes have been associated with many of these rearrangements. Based on these observations, we identified and integrated all the 100% identical repeats of at least 300 bp in the NCBI version 36.2 human genome reference assembly into non-overlapping regions, thus defining the Identical Repeated Backbone (IRB) of the reference human genome. RESULTS: The IRB sequences are distributed all over the genome in 66,600 regions, which correspond to approximately 2% of the total NCBI human genome reference assembly. Important structural and functional elements such as common repeats, segmental duplications, and genes are contained in the IRB. About 80% of the IRB bp overlap with known copy-number variants (CNVs). By analyzing the genes embedded in the IRB, we were able to detect some identical genes not previously included in the Ensembl release 50 annotation of human genes. In addition, we found evidence of IRB gene copy-number polymorphisms in raw sequence reads of two diploid sequenced genomes. CONCLUSIONS: In general, the IRB offers new insight into the complex organization of the identical repeated sequences of the human genome. It provides an accurate map of potential NAHR sites which could be used in targeting the study of novel CNVs, predicting DNA copy-number variation in newly sequenced genomes, and improve genome annotation.

Assuntos

Genoma Humano , Sequências Repetitivas de Ácido Nucleico , Variações do Número de Cópias de DNA , Humanos , Análise de Sequência de DNA

DNA diversification in two Sinorhizobium species.

Guo, Xianwu; Flores, Margarita; Morales, Lucía; García, Delfino; Bustos, Patricia; González, Víctor; Palacios, Rafael; Dávila, Guillermo.

J Bacteriol ; 189(17): 6474-6, 2007 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-17601787

RESUMO

The comparative analysis of genomic characteristics and single-nucleotide polymorphism patterns from large fragments borne on different replicons of Sinorhizobium spp. genomes clearly demonstrate that DNA recombination among closely related bacteria is a major event in the diversification of this genome, especially in pSymA, resulting in mosaic structure.

Assuntos

Variação Genética , Genoma Bacteriano , Polimorfismo Genético , Sinorhizobium/genética , DNA Bacteriano/química , DNA Bacteriano/genética , Evolução Molecular , Dados de Sequência Molecular , Recombinação Genética , Análise de Sequência de DNA

Recurrent DNA inversion rearrangements in the human genome.

Flores, Margarita; Morales, Lucía; Gonzaga-Jauregui, Claudia; Domínguez-Vidaña, Rocío; Zepeda, Cinthya; Yañez, Omar; Gutiérrez, María; Lemus, Tzitziki; Valle, David; Avila, Ma Carmen; Blanco, Daniel; Medina-Ruiz, Sofía; Meza, Karla; Ayala, Erandi; García, Delfino; Bustos, Patricia; González, Víctor; Girard, Lourdes; Tusie-Luna, Teresa; Dávila, Guillermo; Palacios, Rafael.

Proc Natl Acad Sci U S A ; 104(15): 6099-106, 2007 Apr 10.

Artigo em Inglês | MEDLINE | ID: mdl-17389356

RESUMO

Several lines of evidence suggest that reiterated sequences in the human genome are targets for nonallelic homologous recombination (NAHR), which facilitates genomic rearrangements. We have used a PCR-based approach to identify breakpoint regions of rearranged structures in the human genome. In particular, we have identified intrachromosomal identical repeats that are located in reverse orientation, which may lead to chromosomal inversions. A bioinformatic workflow pathway to select appropriate regions for analysis was developed. Three such regions overlapping with known human genes, located on chromosomes 3, 15, and 19, were analyzed. The relative proportion of wild-type to rearranged structures was determined in DNA samples from blood obtained from different, unrelated individuals. The results obtained indicate that recurrent genomic rearrangements occur at relatively high frequency in somatic cells. Interestingly, the rearrangements studied were significantly more abundant in adults than in newborn individuals, suggesting that such DNA rearrangements might start to appear during embryogenesis or fetal life and continue to accumulate after birth. The relevance of our results in regard to human genomic variation is discussed.

Assuntos

Inversão Cromossômica/genética , Cromossomos Humanos/genética , Rearranjo Gênico/genética , Componentes Genômicos/genética , Genoma Humano/genética , Adulto , Fatores Etários , Clonagem Molecular , Biologia Computacional/métodos , Humanos , Recém-Nascido , Reação em Cadeia da Polimerase/métodos , Sequências Repetitivas de Ácido Nucleico/genética , Análise de Sequência de DNA

Diversification of DNA sequences in the symbiotic genome of Rhizobium etli.

Flores, Margarita; Morales, Lucia; Avila, Agustín; González, Víctor; Bustos, Patricia; García, Delfino; Mora, Yolanda; Guo, Xianwu; Collado-Vides, Julio; Piñero, Daniel; Dávila, Guillermo; Mora, Jaime; Palacios, Rafael.

J Bacteriol ; 187(21): 7185-92, 2005 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-16237002

RESUMO

Bacteria of the genus Rhizobium and related genera establish nitrogen-fixing symbioses with the roots of leguminous plants. The genetic elements that participate in the symbiotic process are usually compartmentalized in the genome, either as independent replicons (symbiotic plasmids) or as symbiotic regions or islands in the chromosome. The complete nucleotide sequence of the symbiotic plasmid of Rhizobium etli model strain CFN42, symbiont of the common bean plant, has been reported. To better understand the basis of DNA sequence diversification of this symbiotic compartment, we analyzed the distribution of single-nucleotide polymorphisms in homologous regions from different Rhizobium etli strains. The distribution of polymorphisms is highly asymmetric in each of the different strains, alternating regions containing very few changes with regions harboring an elevated number of substitutions. The regions showing high polymorphism do not correspond with discrete genetic elements and are not the same in the different strains, indicating that they are not hypervariable regions of functional genes. Most interesting, some highly polymorphic regions share exactly the same nucleotide substitutions in more than one strain. Furthermore, in different regions of the symbiotic compartment, different sets of strains share the same substitutions. The data indicate that the majority of nucleotide substitutions are spread in the population by recombination and that the contribution of new mutations to polymorphism is relatively low. We propose that the horizontal transfer of homologous DNA segments among closely related organisms is a major source of genomic diversification.

Assuntos

DNA Bacteriano/genética , Genoma Bacteriano , Plasmídeos/genética , Polimorfismo de Nucleotídeo Único , Rhizobium etli/genética , Simbiose/genética , DNA Bacteriano/química , Transferência Genética Horizontal , Dados de Sequência Molecular , Fixação de Nitrogênio , Recombinação Genética , Análise de Sequência de DNA , Homologia de Sequência do Ácido Nucleico

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA