Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Más filtros

Bases de datos
Tipo del documento
Asunto de la revista
País de afiliación
Intervalo de año de publicación
1.
Proc Natl Acad Sci U S A ; 115(21): 5516-5521, 2018 05 22.
Artículo en Inglés | MEDLINE | ID: mdl-29735690

RESUMEN

The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent-offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use.


Asunto(s)
Variaciones en el Número de Copia de ADN , Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Padres , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Algoritmos , Niño , Humanos
2.
Proc Natl Acad Sci U S A ; 108(37): 15294-9, 2011 Sep 13.
Artículo en Inglés | MEDLINE | ID: mdl-21876154

RESUMEN

We have entered the era of individual genomic sequencing, and can already see exponential progress in the field. It is of utmost importance to exclude false-positive variants from reported datasets. However, because of the nature of the used algorithms, this task has not been optimized to the required level of precision. This study presents a unique strategy for identifying SNPs, called COIN-VGH, that largely minimizes the presence of false-positives in the generated data. The algorithm was developed using the X-chromosome-specific regions from the previously sequenced genomes of Craig Venter and James Watson. The algorithm is based on the concept that a nucleotide can be individualized if it is analyzed in the context of its surrounding genomic sequence. COIN-VGH consists of defining the most comprehensive set of nucleotide strings of a defined length that map with 100% identity to a unique position within the human reference genome (HRG). Such set is used to retrieve sequence reads from a query genome (QG), allowing the production of a genomic landscape that represents a draft HRG-guided assembly of the QG. This landscape is analyzed for specific signatures that indicate the presence of SNPs. The fidelity of the variation signature was assessed using simulation experiments by virtually altering the HRG at defined positions. Finally, the signature regions identified in the HRG and in the QG reads are aligned and the precise nature and position of the corresponding SNPs are detected. The advantages of COIN-VGH over previous algorithms are discussed.


Asunto(s)
Simulación por Computador , Genoma Humano/genética , Hibridación de Ácido Nucleico/métodos , Nucleótidos/genética , Polimorfismo de Nucleótido Simple/genética , Cromosomas Humanos X/genética , Sondas de ADN/metabolismo , Humanos , Estándares de Referencia
3.
BMC Genomics ; 11: 60, 2010 Jan 23.
Artículo en Inglés | MEDLINE | ID: mdl-20096123

RESUMEN

BACKGROUND: Identical sequences with a minimal length of about 300 base pairs (bp) have been involved in the generation of various meiotic/mitotic genomic rearrangements through non-allelic homologous recombination (NAHR) events. Genomic disorders and structural variation, together with gene remodelling processes have been associated with many of these rearrangements. Based on these observations, we identified and integrated all the 100% identical repeats of at least 300 bp in the NCBI version 36.2 human genome reference assembly into non-overlapping regions, thus defining the Identical Repeated Backbone (IRB) of the reference human genome. RESULTS: The IRB sequences are distributed all over the genome in 66,600 regions, which correspond to approximately 2% of the total NCBI human genome reference assembly. Important structural and functional elements such as common repeats, segmental duplications, and genes are contained in the IRB. About 80% of the IRB bp overlap with known copy-number variants (CNVs). By analyzing the genes embedded in the IRB, we were able to detect some identical genes not previously included in the Ensembl release 50 annotation of human genes. In addition, we found evidence of IRB gene copy-number polymorphisms in raw sequence reads of two diploid sequenced genomes. CONCLUSIONS: In general, the IRB offers new insight into the complex organization of the identical repeated sequences of the human genome. It provides an accurate map of potential NAHR sites which could be used in targeting the study of novel CNVs, predicting DNA copy-number variation in newly sequenced genomes, and improve genome annotation.


Asunto(s)
Genoma Humano , Secuencias Repetitivas de Ácidos Nucleicos , Variaciones en el Número de Copia de ADN , Humanos , Análisis de Secuencia de ADN
4.
J Bacteriol ; 189(17): 6474-6, 2007 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-17601787

RESUMEN

The comparative analysis of genomic characteristics and single-nucleotide polymorphism patterns from large fragments borne on different replicons of Sinorhizobium spp. genomes clearly demonstrate that DNA recombination among closely related bacteria is a major event in the diversification of this genome, especially in pSymA, resulting in mosaic structure.


Asunto(s)
Variación Genética , Genoma Bacteriano , Polimorfismo Genético , Sinorhizobium/genética , ADN Bacteriano/química , ADN Bacteriano/genética , Evolución Molecular , Datos de Secuencia Molecular , Recombinación Genética , Análisis de Secuencia de ADN
5.
Proc Natl Acad Sci U S A ; 104(15): 6099-106, 2007 Apr 10.
Artículo en Inglés | MEDLINE | ID: mdl-17389356

RESUMEN

Several lines of evidence suggest that reiterated sequences in the human genome are targets for nonallelic homologous recombination (NAHR), which facilitates genomic rearrangements. We have used a PCR-based approach to identify breakpoint regions of rearranged structures in the human genome. In particular, we have identified intrachromosomal identical repeats that are located in reverse orientation, which may lead to chromosomal inversions. A bioinformatic workflow pathway to select appropriate regions for analysis was developed. Three such regions overlapping with known human genes, located on chromosomes 3, 15, and 19, were analyzed. The relative proportion of wild-type to rearranged structures was determined in DNA samples from blood obtained from different, unrelated individuals. The results obtained indicate that recurrent genomic rearrangements occur at relatively high frequency in somatic cells. Interestingly, the rearrangements studied were significantly more abundant in adults than in newborn individuals, suggesting that such DNA rearrangements might start to appear during embryogenesis or fetal life and continue to accumulate after birth. The relevance of our results in regard to human genomic variation is discussed.


Asunto(s)
Inversión Cromosómica/genética , Cromosomas Humanos/genética , Reordenamiento Génico/genética , Componentes Genómicos/genética , Genoma Humano/genética , Adulto , Factores de Edad , Clonación Molecular , Biología Computacional/métodos , Humanos , Recién Nacido , Reacción en Cadena de la Polimerasa/métodos , Secuencias Repetitivas de Ácidos Nucleicos/genética , Análisis de Secuencia de ADN
6.
J Bacteriol ; 187(21): 7185-92, 2005 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-16237002

RESUMEN

Bacteria of the genus Rhizobium and related genera establish nitrogen-fixing symbioses with the roots of leguminous plants. The genetic elements that participate in the symbiotic process are usually compartmentalized in the genome, either as independent replicons (symbiotic plasmids) or as symbiotic regions or islands in the chromosome. The complete nucleotide sequence of the symbiotic plasmid of Rhizobium etli model strain CFN42, symbiont of the common bean plant, has been reported. To better understand the basis of DNA sequence diversification of this symbiotic compartment, we analyzed the distribution of single-nucleotide polymorphisms in homologous regions from different Rhizobium etli strains. The distribution of polymorphisms is highly asymmetric in each of the different strains, alternating regions containing very few changes with regions harboring an elevated number of substitutions. The regions showing high polymorphism do not correspond with discrete genetic elements and are not the same in the different strains, indicating that they are not hypervariable regions of functional genes. Most interesting, some highly polymorphic regions share exactly the same nucleotide substitutions in more than one strain. Furthermore, in different regions of the symbiotic compartment, different sets of strains share the same substitutions. The data indicate that the majority of nucleotide substitutions are spread in the population by recombination and that the contribution of new mutations to polymorphism is relatively low. We propose that the horizontal transfer of homologous DNA segments among closely related organisms is a major source of genomic diversification.


Asunto(s)
ADN Bacteriano/genética , Genoma Bacteriano , Plásmidos/genética , Polimorfismo de Nucleótido Simple , Rhizobium etli/genética , Simbiosis/genética , ADN Bacteriano/química , Transferencia de Gen Horizontal , Datos de Secuencia Molecular , Fijación del Nitrógeno , Recombinación Genética , Análisis de Secuencia de ADN , Homología de Secuencia de Ácido Nucleico
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA