Search | Biblioteca Virtual em Saúde Fiocruz

Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells.

Peters, Brock A; Kermani, Bahram G; Sparks, Andrew B; Alferov, Oleg; Hong, Peter; Alexeev, Andrei; Jiang, Yuan; Dahl, Fredrik; Tang, Y Tom; Haas, Juergen; Robasky, Kimberly; Zaranek, Alexander Wait; Lee, Je-Hyuk; Ball, Madeleine Price; Peterson, Joseph E; Perazich, Helena; Yeung, George; Liu, Jia; Chen, Linsu; Kennemer, Michael I; Pothuraju, Kaliprasad; Konvicka, Karel; Tsoupko-Sitnikov, Mike; Pant, Krishna P; Ebert, Jessica C; Nilsen, Geoffrey B; Baccash, Jonathan; Halpern, Aaron L; Church, George M; Drmanac, Radoje.

Nature ; 487(7406): 190-5, 2012 Jul 11.

Article in English | MEDLINE | ID: mdl-22785314

ABSTRACT

Recent advances in whole-genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, long fragment read (LFR) technology, which is similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes. In this study, ten LFR libraries were made using only ~100 picograms of human DNA per sample. Up to 97% of the heterozygous single nucleotide variants were assembled into long haplotype contigs. Removal of false positive single nucleotide variants not phased by multiple LFR haplotypes resulted in a final genome error rate of 1 in 10 megabases. Cost-effective and accurate genome sequencing and haplotyping from 10-20 human cells, as demonstrated here, will enable comprehensive genetic studies and diverse clinical applications.

Subjects

Genome, Human; Genomics/methods; Sequence Analysis, DNA/methods; Alleles; Cell Line; Female; Gene Silencing; Genetic Variation; Haplotypes; Humans; Mutation; Reproducibility of Results; Sequence Analysis, DNA/economics; Sequence Analysis, DNA/standards

Type 2 diabetes risk alleles demonstrate extreme directional differentiation among human populations, compared to other diseases.

Chen, Rong; Corona, Erik; Sikora, Martin; Dudley, Joel T; Morgan, Alex A; Moreno-Estrada, Andres; Nilsen, Geoffrey B; Ruau, David; Lincoln, Stephen E; Bustamante, Carlos D; Butte, Atul J.

PLoS Genet ; 8(4): e1002621, 2012.

Article in English | MEDLINE | ID: mdl-22511877

ABSTRACT

Many disease-susceptible SNPs exhibit significant disparity in ancestral and derived allele frequencies across worldwide populations. While previous studies have examined population differentiation of alleles at specific SNPs, global ethnic patterns of ensembles of disease risk alleles across human diseases are unexamined. To examine these patterns, we manually curated ethnic disease association data from 5,065 papers on human genetic studies representing 1,495 diseases, recording the precise risk alleles and their measured population frequencies and estimated effect sizes. We systematically compared the population frequencies of cross-ethnic risk alleles for each disease across 1,397 individuals from 11 HapMap populations, 1,064 individuals from 53 HGDP populations, and 49 individuals with whole-genome sequences from 10 populations. Type 2 diabetes (T2D) demonstrated extreme directional differentiation of risk allele frequencies across human populations, compared with null distributions of European-frequency matched control genomic alleles and risk alleles for other diseases. Most T2D risk alleles share a consistent pattern of decreasing frequencies along human migration into East Asia. Furthermore, we show that these patterns contribute to disparities in predicted genetic risk across 1,397 HapMap individuals, T2D genetic risk being consistently higher for individuals in the African populations and lower in the Asian populations, irrespective of the ethnicity considered in the initial discovery of risk alleles. We observed a similar pattern in the distribution of T2D Genetic Risk Scores, which are associated with an increased risk of developing diabetes in the Diabetes Prevention Program cohort, for the same individuals. This disparity may be attributable to the promotion of energy storage and usage appropriate to environments and inconsistent energy intake. Our results indicate that the differential frequencies of T2D risk alleles may contribute to the observed disparity in T2D incidence rates across ethnic populations.
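As a rough illustration of how a per-individual genetic risk score of the kind mentioned in this abstract can be tallied, the sketch below sums risk-allele counts across SNPs and optionally weights each count by the natural log of an odds ratio. The function name, the genotype encoding (0, 1, or 2 copies of the risk allele per SNP), and the weighting scheme are assumptions made for illustration; the abstract does not state the scoring formula used in the study.

```python
import math

def genetic_risk_score(risk_allele_counts, odds_ratios=None):
    """Sum risk-allele counts (0, 1, or 2 per SNP) for one individual.

    If odds_ratios is given, each count is weighted by ln(odds ratio),
    a common but here purely illustrative choice of weight.
    """
    if odds_ratios is None:
        # Unweighted score: simply the number of risk alleles carried.
        return float(sum(risk_allele_counts))
    if len(odds_ratios) != len(risk_allele_counts):
        raise ValueError("need exactly one odds ratio per SNP")
    return sum(c * math.log(o) for c, o in zip(risk_allele_counts, odds_ratios))
```

Comparing the distribution of such scores between groups of individuals is one simple way to see how differences in risk-allele frequency translate into differences in predicted risk.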

Subjects

Diabetes Mellitus, Type 2/genetics; Gene Frequency; Genetic Predisposition to Disease; Genetics, Population; Polymorphism, Single Nucleotide/genetics; Asian People/genetics; Black People/genetics; Gene Frequency/genetics; Genome, Human; Genome-Wide Association Study; HapMap Project; Haplotypes; Humans; Linkage Disequilibrium; Risk Factors; White People/genetics

A public resource facilitating clinical use of genomes.

Ball, Madeleine P; Thakuria, Joseph V; Zaranek, Alexander Wait; Clegg, Tom; Rosenbaum, Abraham M; Wu, Xiaodi; Angrist, Misha; Bhak, Jong; Bobe, Jason; Callow, Matthew J; Cano, Carlos; Chou, Michael F; Chung, Wendy K; Douglas, Shawn M; Estep, Preston W; Gore, Athurva; Hulick, Peter; Labarga, Alberto; Lee, Je-Hyuk; Lunshof, Jeantine E; Kim, Byung Chul; Kim, Jong-Il; Li, Zhe; Murray, Michael F; Nilsen, Geoffrey B; Peters, Brock A; Raman, Anugraha M; Rienhoff, Hugh Y; Robasky, Kimberly; Wheeler, Matthew T; Vandewege, Ward; Vorhaus, Daniel B; Yang, Joyce L; Yang, Luhan; Aach, John; Ashley, Euan A; Drmanac, Radoje; Kim, Seong-Jin; Li, Jin Billy; Peshkin, Leonid; Seidman, Christine E; Seo, Jeong-Sun; Zhang, Kun; Rehm, Heidi L; Church, George M.

Proc Natl Acad Sci U S A ; 109(30): 11920-7, 2012 Jul 24.

Article in English | MEDLINE | ID: mdl-22797899

ABSTRACT

Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board-approved "open consent" process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain; we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research.

Subjects

Databases, Genetic; Genetic Variation; Genome, Human/genetics; Phenotype; Precision Medicine/methods; Software; Cell Line; Data Collection; Humans; Precision Medicine/trends; Sequence Analysis, DNA

A sequence-based variation map of 8.27 million SNPs in inbred mouse strains.

Frazer, Kelly A; Eskin, Eleazar; Kang, Hyun Min; Bogue, Molly A; Hinds, David A; Beilharz, Erica J; Gupta, Robert V; Montgomery, Julie; Morenzoni, Matt M; Nilsen, Geoffrey B; Pethiyagoda, Charit L; Stuve, Laura L; Johnson, Frank M; Daly, Mark J; Wade, Claire M; Cox, David R.

Nature ; 448(7157): 1050-3, 2007 Aug 30.

Article in English | MEDLINE | ID: mdl-17660834

ABSTRACT

A dense map of genetic variation in the laboratory mouse genome will provide insights into the evolutionary history of the species and lead to an improved understanding of the relationship between inter-strain genotypic and phenotypic differences. Here we resequence the genomes of four wild-derived and eleven classical strains. We identify 8.27 million high-quality single nucleotide polymorphisms (SNPs) densely distributed across the genome, and determine the locations of the high (divergent subspecies ancestry) and low (common subspecies ancestry) SNP-rate intervals for every pairwise combination of classical strains. Using these data, we generate a genome-wide haplotype map containing 40,898 segments, each with an average of three distinct ancestral haplotypes. For the haplotypes in the classical strains that are unequivocally assigned ancestry, the genetic contributions of the Mus musculus subspecies--M. m. domesticus, M. m. musculus, M. m. castaneus and the hybrid M. m. molossinus--are 68%, 6%, 3% and 10%, respectively; the remaining 13% of haplotypes are of unknown ancestral origin. The considerable regional redundancy of the SNP data will facilitate imputation of the majority of these genotypes in less-densely typed classical inbred strains to provide a complete view of variation in additional strains.

Subjects

Mice, Inbred Strains/genetics; Polymorphism, Single Nucleotide/genetics; Animals; Chromosomes, Mammalian/genetics; DNA Mutational Analysis; Databases, Genetic; Genome/genetics; Genomics; Haplotypes/genetics; Mice; Mice, Inbred C57BL; Oligonucleotide Array Sequence Analysis

A Systematic Comparison of Traditional and Multigene Panel Testing for Hereditary Breast and Ovarian Cancer Genes in More Than 1000 Patients.

Lincoln, Stephen E; Kobayashi, Yuya; Anderson, Michael J; Yang, Shan; Desmond, Andrea J; Mills, Meredith A; Nilsen, Geoffrey B; Jacobs, Kevin B; Monzon, Federico A; Kurian, Allison W; Ford, James M; Ellisen, Leif W.

J Mol Diagn ; 17(5): 533-44, 2015 Sep.

Article in English | MEDLINE | ID: mdl-26207792

ABSTRACT

Gene panels for hereditary breast and ovarian cancer risk assessment are gaining acceptance, even though the clinical utility of these panels is not yet fully defined. Technical questions remain, however, about the performance and clinical interpretation of gene panels in comparison with traditional tests. We tested 1105 individuals using a 29-gene next-generation sequencing panel and observed 100% analytical concordance with traditional and reference data on >750 comparable variants. These 750 variants included technically challenging classes of sequence and copy number variation that together represent a significant fraction (13.4%) of the pathogenic variants observed. For BRCA1 and BRCA2, we also compared variant interpretations in traditional reports to those produced using only non-proprietary resources and following criteria based on recent (2015) guidelines. We observed 99.8% net report concordance, albeit with a slightly higher variant of uncertain significance rate. In 4.5% of BRCA-negative cases, we uncovered pathogenic variants in other genes, which appear clinically relevant. Previously unseen variants requiring interpretation accumulated rapidly, even after 1000 individuals had been tested. We conclude that next-generation sequencing panel testing can provide results highly comparable to traditional testing and can uncover potentially actionable findings that may be otherwise missed. Challenges remain for the broad adoption of panel tests, some of which will be addressed by the accumulation of large public databases of annotated clinical variants.

Subjects

Breast Neoplasms/genetics; Genes, Neoplasm; Genetic Testing/methods; High-Throughput Nucleotide Sequencing; Ovarian Neoplasms/genetics; Cohort Studies; DNA Copy Number Variations; DNA Mutational Analysis/methods; Female; Genes, BRCA1; Genes, BRCA2; Genetic Predisposition to Disease; Humans; Neoplastic Syndromes, Hereditary/genetics

Whole-genome sequencing of Asian lung cancers: second-hand smoke unlikely to be responsible for higher incidence of lung cancer among Asian never-smokers.

Krishnan, Vidhya G; Ebert, Philip J; Ting, Jason C; Lim, Elaine; Wong, Swee-Seong; Teo, Audrey S M; Yue, Yong G; Chua, Hui-Hoon; Ma, Xiwen; Loh, Gary S L; Lin, Yuhao; Tan, Joanna H J; Yu, Kun; Zhang, Shenli; Reinhard, Christoph; Tan, Daniel S W; Peters, Brock A; Lincoln, Stephen E; Ballinger, Dennis G; Laramie, Jason M; Nilsen, Geoffrey B; Barber, Thomas D; Tan, Patrick; Hillmer, Axel M; Ng, Pauline C.

Cancer Res ; 74(21): 6071-81, 2014 Nov 01.

Article in English | MEDLINE | ID: mdl-25189529

ABSTRACT

Asian nonsmoking populations have a higher incidence of lung cancer compared with their European counterparts. There is a long-standing hypothesis that the increase of lung cancer in Asian never-smokers is due to environmental factors such as second-hand smoke. We analyzed whole-genome sequencing of 30 Asian lung cancers. Unsupervised clustering of mutational signatures separated the patients into two categories of either all the never-smokers or all the smokers or ex-smokers. In addition, nearly one third of the ex-smokers and smokers classified with the never-smoker-like cluster. The somatic variant profiles of Asian lung cancers were similar to that of European origin with G.C>T.A being predominant in smokers. We found EGFR and TP53 to be the most frequently mutated genes with mutations in 50% and 27% of individuals, respectively. Among the 16 never-smokers, 69% had an EGFR mutation compared with 29% of 14 smokers/ex-smokers. Asian never-smokers had lung cancer signatures distinct from the smoker signature and their mutation profiles were similar to European never-smokers. The profiles of Asian and European smokers are also similar. Taken together, these results suggested that the same mutational mechanisms underlie the etiology for both ethnic groups. Thus, the high incidence of lung cancer in Asian never-smokers seems unlikely to be due to second-hand smoke or other carcinogens that cause oxidative DNA damage, implying that routine EGFR testing is warranted in the Asian population regardless of smoking status.

Subjects

DNA Damage/genetics; Lung Neoplasms/epidemiology; Lung Neoplasms/genetics; Tobacco Smoke Pollution/adverse effects; Asian People/genetics; ErbB Receptors/genetics; Female; Genome, Human; High-Throughput Nucleotide Sequencing; Humans; Lung Neoplasms/pathology; Male; Middle Aged; Mutation; Risk Factors; Tumor Suppressor Protein p53/genetics

Computational techniques for human genome resequencing using mated gapped reads.

Carnevali, Paolo; Baccash, Jonathan; Halpern, Aaron L; Nazarenko, Igor; Nilsen, Geoffrey B; Pant, Krishna P; Ebert, Jessica C; Brownley, Anushka; Morenzoni, Matt; Karpinchyk, Vitali; Martin, Bruce; Ballinger, Dennis G; Drmanac, Radoje.

J Comput Biol ; 19(3): 279-92, 2012 Mar.

Article in English | MEDLINE | ID: mdl-22175250

ABSTRACT

Unchained base reads on self-assembling DNA nanoarrays have recently emerged as a promising approach to low-cost, high-quality resequencing of human genomes. Because of unique characteristics of these mated pair reads, existing computational methods for resequencing assembly, such as those based on map-consensus calling, are not adequate for accurate variant calling. We describe novel computational methods developed for accurate calling of SNPs and short substitutions and indels (<100 bp); the same methods apply to evaluation of hypothesized larger, structural variations. We use an optimization process that iteratively adjusts the genome sequence to maximize its a posteriori probability given the observed reads. For each candidate sequence, this probability is computed using Bayesian statistics with a simple read generation model and simplifying assumptions that make the problem computationally tractable. The optimization process iteratively applies one-base substitutions, insertions, and deletions until convergence is achieved to an optimum diploid sequence. A local de novo assembly procedure that generalizes approaches based on De Bruijn graphs is used to seed the optimization process in order to reduce the chance of converging to local optima. Finally, a correlation-based filter is applied to reduce the false positive rate caused by the presence of repetitive regions in the reference genome.
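The iterative optimization described in this abstract can be pictured with a deliberately simplified sketch. It is not the authors' implementation: it assumes a haploid sequence, a flat prior, a single uniform per-base error rate `eps`, reads given as hypothetical `(start, bases)` pairs already placed on the sequence, and substitutions only (no insertions, deletions, or de novo seeding). Under those assumptions, maximizing the posterior reduces to greedily accepting any one-base change that raises the read log-likelihood until no change helps.

```python
import math

def log_posterior(seq, reads, eps=0.01):
    """Log-probability of the reads given seq, up to a constant (flat prior assumed)."""
    total = 0.0
    for start, bases in reads:
        for offset, base in enumerate(bases):
            # A matching base has probability 1 - eps; a mismatch is one of 3 errors.
            if seq[start + offset] == base:
                total += math.log(1 - eps)
            else:
                total += math.log(eps / 3)
    return total

def optimize(seq, reads, eps=0.01):
    """Apply score-improving one-base substitutions until convergence."""
    seq = list(seq)
    best = log_posterior(seq, reads, eps)
    improved = True
    while improved:
        improved = False
        for i in range(len(seq)):
            keep = seq[i]  # best base found so far at position i
            for base in "ACGT":
                if base == keep:
                    continue
                seq[i] = base
                score = log_posterior(seq, reads, eps)
                if score > best:
                    best, keep, improved = score, base, True
            seq[i] = keep
    return "".join(seq)
```

For example, `optimize("ACGT", [(0, "ACTT"), (0, "ACTT"), (1, "CTT")])` moves the third base from G to T because all three reads support T there. A greedy search like this can stall in a local optimum, which is the problem the abstract says the local de novo assembly seeding is meant to reduce.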

Subjects

Contig Mapping/methods; Genome, Human; Sequence Analysis, DNA/methods; Algorithms; Alleles; Base Sequence; Bayes Theorem; Chromosome Mapping; Computer Simulation; Data Interpretation, Statistical; Humans; Models, Genetic

Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays.

Drmanac, Radoje; Sparks, Andrew B; Callow, Matthew J; Halpern, Aaron L; Burns, Norman L; Kermani, Bahram G; Carnevali, Paolo; Nazarenko, Igor; Nilsen, Geoffrey B; Yeung, George; Dahl, Fredrik; Fernandez, Andres; Staker, Bryan; Pant, Krishna P; Baccash, Jonathan; Borcherding, Adam P; Brownley, Anushka; Cedeno, Ryan; Chen, Linsu; Chernikoff, Dan; Cheung, Alex; Chirita, Razvan; Curson, Benjamin; Ebert, Jessica C; Hacker, Coleen R; Hartlage, Robert; Hauser, Brian; Huang, Steve; Jiang, Yuan; Karpinchyk, Vitali; Koenig, Mark; Kong, Calvin; Landers, Tom; Le, Catherine; Liu, Jia; McBride, Celeste E; Morenzoni, Matt; Morey, Robert E; Mutch, Karl; Perazich, Helena; Perry, Kimberly; Peters, Brock A; Peterson, Joe; Pethiyagoda, Charit L; Pothuraju, Kaliprasad; Richter, Claudia; Rosenbaum, Abraham M; Roy, Shaunak; Shafto, Jay; Sharanhovich, Uladzislau.

Science ; 327(5961): 78-81, 2010 Jan 01.

Article in English | MEDLINE | ID: mdl-19892942

ABSTRACT

Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45- to 87-fold coverage per genome and identifying 3.2 to 4.5 million sequence variants per genome. Validation of one genome data set demonstrates a sequence accuracy of about 1 false variant per 100 kilobases. The high accuracy, affordable cost of $4400 for sequencing consumables, and scalability of this platform enable complete human genome sequencing for the detection of rare variants in large-scale genetic studies.

Subjects

DNA/chemistry; Genome, Human; Microarray Analysis; Sequence Analysis, DNA/methods; Base Sequence; Computational Biology; Costs and Cost Analysis; DNA/genetics; Databases, Nucleic Acid; Genomic Library; Genotype; Haplotypes; Human Genome Project; Humans; Male; Nanostructures; Nanotechnology; Nucleic Acid Amplification Techniques; Polymorphism, Single Nucleotide; Sequence Analysis, DNA/economics; Sequence Analysis, DNA/instrumentation; Sequence Analysis, DNA/standards; Software

Whole-genome patterns of common DNA variation in three human populations.

Hinds, David A; Stuve, Laura L; Nilsen, Geoffrey B; Halperin, Eran; Eskin, Eleazar; Ballinger, Dennis G; Frazer, Kelly A; Cox, David R.

Science ; 307(5712): 1072-9, 2005 Feb 18.

Article in English | MEDLINE | ID: mdl-15718463

ABSTRACT

Individual differences in DNA sequence are the genetic basis of human variability. We have characterized whole-genome patterns of common human DNA variation by genotyping 1,586,383 single-nucleotide polymorphisms (SNPs) in 71 Americans of European, African, and Asian ancestry. Our results indicate that these SNPs capture most common genetic variation as a result of linkage disequilibrium, the correlation among common SNP alleles. We observe a strong correlation between extended regions of linkage disequilibrium and functional genomic elements. Our data provide a tool for exploring many questions that remain regarding the causal role of common human DNA variation in complex human traits and for investigating the nature of genetic variation within and between human populations.
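Linkage disequilibrium, described in this abstract as the correlation among common SNP alleles, is often summarized for a pair of SNPs by the squared correlation r². The sketch below computes r² from phased haplotypes; the function name and the input encoding (one `(a, b)` pair of 0/1 alleles per chromosome) are assumptions for illustration, and the abstract does not say which LD statistic the study used.

```python
def ld_r_squared(haplotypes):
    """Squared allelic correlation r^2 between two biallelic SNPs.

    haplotypes: list of (a, b) pairs, one per chromosome, where a and b
    are 0 or 1 and give the allele carried at SNP A and at SNP B.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")
    p_a = sum(a for a, _ in haplotypes) / n    # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n    # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                       # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both SNPs must be polymorphic in the sample")
    return d * d / denom
```

When r² between two SNPs is close to 1, genotyping one of them largely predicts the other, which is why a subset of SNPs can capture most common variation.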

Subjects

Asian People/genetics; Black or African American/genetics; Genetic Variation; Genome, Human; Polymorphism, Single Nucleotide; White People/genetics; Algorithms; Case-Control Studies; Chromosome Mapping; Databases, Genetic; Female; Gene Frequency; Genetic Markers; Genetic Predisposition to Disease; Genotype; Haplotypes; Humans; Linkage Disequilibrium; Male; Multifactorial Inheritance; Recombination, Genetic; Risk Factors; Selection, Genetic
