Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 155
Filter
Add more filters

Publication year range
1.
Hum Genomics ; 15(1): 58, 2021 08 28.
Article in English | MEDLINE | ID: mdl-34454617

ABSTRACT

BACKGROUND: Low-pass sequencing (LPS) has been extensively investigated for applicability to various genetic studies due to its advantages over genotype array data including cost-effectiveness. Predicting the risk of complex diseases such as Parkinson's disease (PD) using polygenic risk score (PRS) based on the genetic variations has shown decent prediction accuracy. Although ultra-LPS has been shown to be effective in PRS calculation, array data has been favored to the majority of PRS analysis, especially for PD. RESULTS: Using eight high-coverage WGS, we assessed imputation approaches for downsampled LPS data ranging from 0.5 × to 7.0 × . We demonstrated that uncertain genotype calls of LPS diminished imputation accuracy, and an imputation approach using genotype likelihoods was plausible for LPS. Additionally, comparing imputation accuracies between LPS and simulated array illustrated that LPS had higher accuracies particularly at rare frequencies. To evaluate ultra-low coverage data in PRS calculation for PD, we prepared low-coverage WGS and genotype array of 87 PD cases and 101 controls. Genotype imputation of array and downsampled LPS were conducted using a population-specific reference panel, and we calculated risk scores based on the PD-associated SNPs from an East Asian meta-GWAS. The PRS models discriminated cases and controls as previously reported when both LPS and genotype array were used. Also strong correlations in PRS models for PD between LPS and genotype array were discovered. CONCLUSIONS: Overall, this study highlights the potentials of LPS under 1.0 × followed by genotype imputation in PRS calculation and suggests LPS as attractive alternatives to genotype array in the area of precision medicine for PD.


Subject(s)
Genetic Predisposition to Disease , Multifactorial Inheritance/genetics , Parkinson Disease/genetics , Whole Genome Sequencing/statistics & numerical data , Adult , Aged , Chromosome Mapping , Female , Genome-Wide Association Study , Genotype , Humans , Male , Middle Aged , Parkinson Disease/pathology , Polymorphism, Single Nucleotide/genetics , Risk Factors
2.
Hum Genomics ; 15(1): 28, 2021 05 10.
Article in English | MEDLINE | ID: mdl-33971976

ABSTRACT

BACKGROUND: Rare diseases are pathologies that affect less than 1 in 2000 people. They are difficult to diagnose due to their low frequency and their often highly heterogeneous symptoms. Rare diseases have in general a high impact on the quality of life and life expectancy of patients, which are in general children or young people. The advent of high-throughput sequencing techniques has improved diagnosis in several different areas, from pediatrics, achieving a diagnostic rate of 41% with whole genome sequencing (WGS) and 36% with whole exome sequencing, to neurology, achieving a diagnostic rate between 47 and 48.5% with WGS. This evidence has encouraged our group to pursue a molecular diagnosis using WGS for this and several other patients with rare diseases. RESULTS: We used whole genome sequencing to achieve a molecular diagnosis of a 7-year-old girl with a severe panvascular artery disease that remained for several years undiagnosed. We found a frameshift variant in one copy and a large deletion involving two exons in the other copy of a gene called YY1AP1. This gene is related to Grange syndrome, a recessive rare disease, whose symptoms include stenosis or occlusion of multiple arteries, congenital heart defects, brachydactyly, syndactyly, bone fragility, and learning disabilities. Bioinformatic analyses propose these mutations as the most likely cause of the disease, according to its frequency, in silico predictors, conservation analyses, and effect on the protein product. Additionally, we confirmed one mutation in each parent, supporting a compound heterozygous status in the child. CONCLUSIONS: In general, we think that this finding can contribute to the use of whole genome sequencing as a diagnosis tool of rare diseases, and in particular, it can enhance the set of known mutations associated with different diseases.


Subject(s)
Arterial Occlusive Diseases/genetics , Cell Cycle Proteins/genetics , Heart Defects, Congenital/genetics , Rare Diseases/genetics , Transcription Factors/genetics , Arterial Occlusive Diseases/diagnosis , Arterial Occlusive Diseases/pathology , Arteries/diagnostic imaging , Arteries/pathology , Child , Female , Frameshift Mutation/genetics , Heart Defects, Congenital/diagnosis , Heart Defects, Congenital/pathology , Homozygote , Humans , Pedigree , Rare Diseases/diagnosis , Rare Diseases/pathology , Whole Genome Sequencing
3.
Nature ; 538(7624): 243-247, 2016 Oct 13.
Article in English | MEDLINE | ID: mdl-27706134

ABSTRACT

Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.


Subject(s)
Asian People/genetics , Contig Mapping , Genome, Human/genetics , Genomics , Haplotypes/genetics , Sequence Analysis, DNA , Alleles , Chromosomes, Artificial, Bacterial/genetics , Cytochrome P-450 CYP2D6/genetics , Diploidy , Genetic Variation/genetics , Histocompatibility Antigens Class II/genetics , Humans , Precision Medicine , Reference Standards , Republic of Korea
4.
Mol Biol Evol ; 37(1): 2-10, 2020 Jan 01.
Article in English | MEDLINE | ID: mdl-31504792

ABSTRACT

Recent reports have identified differences in the mutational spectra across human populations. Although some of these reports have been replicated in other cohorts, most have been reported only in the 1000 Genomes Project (1kGP) data. While investigating an intriguing putative population stratification within the Japanese population, we identified a previously unreported batch effect leading to spurious mutation calls in the 1kGP data and to the apparent population stratification. Because the 1kGP data are used extensively, we find that the batch effects also lead to incorrect imputation by leading imputation servers and a small number of suspicious GWAS associations. Lower quality data from the early phases of the 1kGP thus continue to contaminate modern studies in hidden ways. It may be time to retire or upgrade such legacy sequencing data.


Subject(s)
Human Genome Project , Artifacts , Genome-Wide Association Study , Humans , Japan , Mutation
5.
Nature ; 516(7530): 192-7, 2014 Dec 11.
Article in English | MEDLINE | ID: mdl-25503232

ABSTRACT

Pluripotency is defined by the ability of a cell to differentiate to the derivatives of all the three embryonic germ layers: ectoderm, mesoderm and endoderm. Pluripotent cells can be captured via the archetypal derivation of embryonic stem cells or via somatic cell reprogramming. Somatic cells are induced to acquire a pluripotent stem cell (iPSC) state through the forced expression of key transcription factors, and in the mouse these cells can fulfil the strictest of all developmental assays for pluripotent cells by generating completely iPSC-derived embryos and mice. However, it is not known whether there are additional classes of pluripotent cells, or what the spectrum of reprogrammed phenotypes encompasses. Here we explore alternative outcomes of somatic reprogramming by fully characterizing reprogrammed cells independent of preconceived definitions of iPSC states. We demonstrate that by maintaining elevated reprogramming factor expression levels, mouse embryonic fibroblasts go through unique epigenetic modifications to arrive at a stable, Nanog-positive, alternative pluripotent state. In doing so, we prove that the pluripotent spectrum can encompass multiple, unique cell states.


Subject(s)
Cellular Reprogramming/genetics , Cellular Reprogramming/physiology , Epigenesis, Genetic , Induced Pluripotent Stem Cells/cytology , Induced Pluripotent Stem Cells/metabolism , Animals , Embryonic Stem Cells/cytology , Embryonic Stem Cells/metabolism , Female , Fibroblasts/classification , Fibroblasts/cytology , Fibroblasts/metabolism , Histone Deacetylases/metabolism , Induced Pluripotent Stem Cells/classification , Mice , Mice, Nude , Transcription Factors/genetics , Transcription Factors/metabolism , Transgenes/genetics
6.
Nature ; 516(7530): 198-206, 2014 Dec 11.
Article in English | MEDLINE | ID: mdl-25503233

ABSTRACT

Somatic cell reprogramming to a pluripotent state continues to challenge many of our assumptions about cellular specification, and despite major efforts, we lack a complete molecular characterization of the reprograming process. To address this gap in knowledge, we generated extensive transcriptomic, epigenomic and proteomic data sets describing the reprogramming routes leading from mouse embryonic fibroblasts to induced pluripotency. Through integrative analysis, we reveal that cells transition through distinct gene expression and epigenetic signatures and bifurcate towards reprogramming transgene-dependent and -independent stable pluripotent states. Early transcriptional events, driven by high levels of reprogramming transcription factor expression, are associated with widespread loss of histone H3 lysine 27 (H3K27me3) trimethylation, representing a general opening of the chromatin state. Maintenance of high transgene levels leads to re-acquisition of H3K27me3 and a stable pluripotent state that is alternative to the embryonic stem cell (ESC)-like fate. Lowering transgene levels at an intermediate phase, however, guides the process to the acquisition of ESC-like chromatin and DNA methylation signature. Our data provide a comprehensive molecular description of the reprogramming routes and is accessible through the Project Grandiose portal at http://www.stemformatics.org.


Subject(s)
Cellular Reprogramming/genetics , Genome/genetics , Induced Pluripotent Stem Cells/cytology , Induced Pluripotent Stem Cells/metabolism , Animals , Chromatin/chemistry , Chromatin/genetics , Chromatin/metabolism , Chromatin Assembly and Disassembly , DNA Methylation , Embryonic Stem Cells/cytology , Embryonic Stem Cells/metabolism , Epistasis, Genetic/genetics , Fibroblasts/cytology , Fibroblasts/metabolism , Histones/chemistry , Histones/metabolism , Internet , Mice , Proteome/genetics , Proteomics , RNA, Long Noncoding/genetics , Transcription Factors/genetics , Transcription Factors/metabolism , Transcription, Genetic/genetics , Transcriptome/genetics , Transgenes/genetics
7.
J Korean Med Sci ; 35(7): e84, 2020 Feb 24.
Article in English | MEDLINE | ID: mdl-32080990

ABSTRACT

Novel coronavirus (SARS-CoV-2) is found to cause a large outbreak started from Wuhan since December 2019 in China and SARS-CoV-2 infections have been reported with epidemiological linkage to China in 25 countries until now. We isolated SARS-CoV-2 from the oropharyngeal sample obtained from the patient with the first laboratory-confirmed SARS-CoV-2 infection in Korea. Cytopathic effects of SARS-CoV-2 in the Vero cell cultures were confluent 3 days after the first blind passage of the sample. Coronavirus was confirmed with spherical particle having a fringe reminiscent of crown on transmission electron microscopy. Phylogenetic analyses of whole genome sequences showed that it clustered with other SARS-CoV-2 reported from Wuhan.


Subject(s)
Betacoronavirus , Coronavirus Infections , Oropharynx/virology , Pneumonia, Viral , Whole Genome Sequencing , Adult , Betacoronavirus/genetics , Betacoronavirus/isolation & purification , COVID-19 , Coronavirus Infections/diagnosis , Female , Humans , Microscopy, Electron, Transmission , Phylogeny , Pneumonia, Viral/diagnosis , Republic of Korea , SARS-CoV-2
8.
Breast Cancer Res Treat ; 173(2): 385-395, 2019 Jan.
Article in English | MEDLINE | ID: mdl-30350268

ABSTRACT

PURPOSE: Triple-negative breast cancer (TNBC) accounts for 10-20% of all diagnosed BCs and it is enriched in BRCA1 mutation. Guidelines for Western countries suggest that BRCA 1/2 genetic testing should be done for patients with TNBC diagnosed less than 60 years, but there is lack of evidence supporting genetic testing in Asian populations. We determined the prevalence of germline BRCA 1/2 mutations among unselected Korean patients with TNBC and analyzed oncologic outcomes. METHODS: From among 1628 women with TNBC who underwent surgery at Samsung Medical Center (SMC) between Jul 2008 and Jan 2016, 999 samples were available in the SMC biobank for testing germline BRCA 1/2 mutations using next-generation DNA sequencing. RESULTS: Overall, 131 Korean patients (13.1%) had BRCA 1/2 mutations: 97 (9.7%) were in BRCA 1, and 35 (3.5%) were in BRCA 2. One patient had both BRCA 1 and BRCA 2 mutations. Overall, 68 distinct pathologic or likely pathogenic variants (43 BRCA1 and 25 BRCA2) were found. Among those diagnosed at ≤ 60 years, the prevalence of BRCA 1/2 mutation was 14.5%. The mean age of diagnosis of BRCA1/2 mutation carriers was significantly younger than that of non-carriers (45.6 vs. 50.1 years, p < 0.0001). The median follow-up duration was 53.6 months. There were no significant differences in disease-free survival, overall survival, or breast cancer-specific survival (p = 0.799, 0.092, and 0.124, respectively) between BRCA 1/2 carriers and non-carriers, although BRCA 1/2 carriers showed significantly worse contralateral breast cancer-free survival (p < 0.0001) than non-carriers. CONCLUSION: In unselected TNBC patients, we found BRCA 1/2 mutations in 13.1% of overall patients and 14.5% of patients ≤ 60 years. We suggest that Korean women with TNBC diagnosed at ≤ 60 years should be tested for BRCA1/2 mutation.


Subject(s)
BRCA1 Protein/genetics , BRCA2 Protein/genetics , Genetic Testing/statistics & numerical data , Triple Negative Breast Neoplasms/genetics , Adult , Age Factors , Asian People/genetics , Asian People/statistics & numerical data , DNA Mutational Analysis , Disease-Free Survival , Female , Follow-Up Studies , Genetic Testing/methods , High-Throughput Nucleotide Sequencing , Humans , Kaplan-Meier Estimate , Middle Aged , Republic of Korea/epidemiology , Retrospective Studies , Triple Negative Breast Neoplasms/diagnosis , Triple Negative Breast Neoplasms/mortality
9.
PLoS Genet ; 12(8): e1006239, 2016 08.
Article in English | MEDLINE | ID: mdl-27494611

ABSTRACT

Follicular thyroid carcinoma (FTC) and benign follicular adenoma (FA) are indistinguishable by preoperative diagnosis due to their similar histological features. Here we report the first RNA sequencing study of these tumors, with data for 30 minimally invasive FTCs (miFTCs) and 25 FAs. We also compared 77 classical papillary thyroid carcinomas (cPTCs) and 48 follicular variant of PTCs (FVPTCs) to observe the differences in their molecular properties. Mutations in H/K/NRAS, DICER1, EIF1AX, IDH1, PTEN, SOS1, and SPOP were identified in miFTC or FA. We identified a low frequency of fusion genes in miFTC (only one, PAX8-PPARG), but a high frequency of that in PTC (17.60%). The frequencies of BRAFV600E and H/K/NRAS mutations were substantially different in miFTC and cPTC, and those of FVPTC were intermediate between miFTC and cPTC. Gene expression analysis demonstrated three molecular subtypes regardless of their histological features, including Non-BRAF-Non-RAS (NBNR), as well as BRAF-like and RAS-like. The novel molecular subtype, NBNR, was associated with DICER1, EIF1AX, IDH1, PTEN, SOS1, SPOP, and PAX8-PPARG. The transcriptome of miFTC or encapsulated FVPTC was indistinguishable from that of FA, providing a molecular explanation for the similarly indolent behavior of these tumors. We identified upregulation of genes that are related to mitochondrial biogenesis including ESRRA and PPARGC1A in oncocytic follicular thyroid neoplasm. Arm-level copy number variations were correlated to histological and molecular characteristics. These results expanded the current molecular understanding of thyroid cancer and may lead to new diagnostic and therapeutic approaches to the disease.


Subject(s)
Adenoma/genetics , Carcinoma/genetics , Neoplasm Proteins/genetics , Thyroid Neoplasms/genetics , Transcriptome/genetics , Adenoma/diagnosis , Adenoma/pathology , Adult , Aged , Carcinoma/diagnosis , Carcinoma, Papillary , Female , Gene Expression Regulation, Neoplastic/genetics , High-Throughput Nucleotide Sequencing , Humans , Male , Middle Aged , Mutation , Neoplasm Proteins/biosynthesis , Thyroid Cancer, Papillary , Thyroid Neoplasms/diagnosis
10.
BMC Cancer ; 18(1): 494, 2018 05 02.
Article in English | MEDLINE | ID: mdl-29716549

ABSTRACT

BACKGROUND: Somatic amplifications of the LYL1 gene are relatively common occurrences in patients who develop uterine corpus endometrial carcinoma (UCEC) as opposed to other cancers. This study was undertaken to determine whether such genetic alterations affect survival outcomes of UCEC. METHODS: In 370 patients with UCEC, we analysed clinicopathologic characteristics and corresponding genomic data from The Cancer Genome Atlas database. Patients were stratified according to LYL1 gene status, grouped as amplification or non-amplification. Heightened levels of cancer-related genes expressed in concert with LYL1 amplification were similarly investigated through differentially expressed gene and gene set enrichment analyses. Factors associated with survival outcomes were also identified. RESULTS: Somatic LYL1 gene amplification was observed in 22 patients (5.9%) with UCEC. Patients displaying amplification (vs. non-amplification) were significantly older at the time of diagnosis and more often were marked by non-endometrioid, high-grade, or advanced disease. In survival analysis, the amplification subset showed poorer progression-free survival (PFS) and overall survival (OS) rates (3-year PFS: 34.4% vs. 79.9%, P = 0.031; 5-year OS: 25.1% vs. 84.9%, P = 0.014). However, multivariate analyses adjusted for tumor histologic type, grade, and stage did not confirm LYL1 gene amplification as an independent prognostic factor for either PFS or OS. Nevertheless, MAPK, WNT, and cell cycle pathways were significantly enriched by LYL1 gene amplification (P < 0.001, P = 0.002, and P = 0.004, respectively). CONCLUSIONS: Despite not being identified as an independent prognostic factor in UCEC, LYL1 gene amplification is associated with other poor prognostic factors and correlated with upregulation of cancer-related pathways.


Subject(s)
Basic Helix-Loop-Helix Transcription Factors/genetics , Biomarkers, Tumor , Endometrial Neoplasms/genetics , Endometrial Neoplasms/mortality , Gene Amplification , Neoplasm Proteins/genetics , Aged , Aged, 80 and over , Comorbidity , DNA Copy Number Variations , Endometrial Neoplasms/pathology , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Middle Aged , Neoplasm Grading , Neoplasm Staging , Prognosis , Risk Factors
11.
Gynecol Oncol ; 148(2): 375-382, 2018 02.
Article in English | MEDLINE | ID: mdl-29233531

ABSTRACT

OBJECTIVE: To analyze whole exome sequencing (WES) data on ovarian clear cell carcinoma (OCCC) in Korean patients via the technique of next generation sequencing (NGS). Genomic profiles were compared between endometriosis-associated OCCC (EMS-OCCC) and Non-EMS-OCCC. METHODS: We used serum samples and cancer tissues, stored at the Seoul National University Hospital Human Biobank, that were initially collected from women diagnosed with OCCC between 2012 and 2016. In total, 15 patients were enrolled: 5 with pathologically confirmed EMS-OCCC and 10 with Non-EMS-OCCC. We performed NGS WES on 15 fresh frozen OCCC tissues and matched serum samples, enabling comprehensive genomic characterization of OCCC. RESULTS: OCCC was characterized by complex genomic alterations, with a median of 178 exonic mutations (range, 111-25,798) and a median of 343 somatic copy number variations (range, 43-1,820) per tumor sample. In all, 54 somatic mutations were discovered across 14 genes, including PIK3CA (40%), ARID1A (40%), and KRAS (20%) in the 15 Korean OCCCs. Copy number gains in NTRK1 (33%), MYC (40%), and GNAS (47%) and copy number losses in TET2 (73%), TSC1 (67%), BRCA2 (60%), and SMAD4 (47%) were frequent. The significantly altered pathways were associated with proliferation and survival (including the PI3K/AKT, TP53, and ERBB2 pathways) in 87% of OCCCs and with chromatin remodeling in 47% of OCCCs. No significant differences in frequencies of genetic alterations were detected between EMS-OCCC and Non-EMS-OCCC groups. CONCLUSION: We successfully characterized the genomic landscape of 15 Korean patients with OCCC. We identified potential therapeutic targets for the treatment of this malignancy.


Subject(s)
Adenocarcinoma, Clear Cell/genetics , Ovarian Neoplasms/genetics , Adenocarcinoma, Clear Cell/ethnology , Adolescent , Adult , Aged , Class I Phosphatidylinositol 3-Kinases/genetics , DNA Copy Number Variations/genetics , DNA-Binding Proteins , Female , Humans , Middle Aged , Mutation/genetics , Nuclear Proteins/genetics , Ovarian Neoplasms/ethnology , Proto-Oncogene Proteins p21(ras)/genetics , Republic of Korea/ethnology , Transcription Factors/genetics , Exome Sequencing , Young Adult
12.
Prenat Diagn ; 38(11): 835-840, 2018 10.
Article in English | MEDLINE | ID: mdl-30048567

ABSTRACT

OBJECTIVE: The aim of this study was to develop a common targeted massively parallel sequencing platform for the noninvasive prenatal diagnosis (NIPD) of multiple X-linked diseases. METHOD: The custom capture probe was designed to target 33 genes and recombination hotspots. We tested the carrier mother and male proband pair of 6 families. Plasma DNA of the pregnant carrier mother was collected at different gestational weeks and sequenced. The fetal genotype of each family was determined by estimating the imbalance between the 2 maternal haplotypes constructed using a common custom-designed platform. RESULTS: The targeted sequencing of the maternal, proband, and fetal genomic DNAs and maternal plasma DNAs resulted in uniform coverage across the target region. Three to 5 recombination points were observed in each sample. However, these recombination points did not affect the haplotype dosage analysis for fetal genotype prediction. Consequently, all fetal genotypes in the 6 families obtained from haplotype dosage analysis of maternal plasma sequencing data were predicted correctly. CONCLUSIONS: Since a single platform that covers multiple diseases may prevent the need for disease-specific probes for the NIPD of individual disorders, this approach may provide a practical advantage for clinically implementing the NIPD of X-linked diseases.


Subject(s)
Genetic Diseases, X-Linked/diagnosis , Maternal Serum Screening Tests , DNA Mutational Analysis , Female , Humans , Pregnancy , Recombination, Genetic
13.
Cancer ; 122(9): 1370-9, 2016 05 01.
Article in English | MEDLINE | ID: mdl-26969876

ABSTRACT

BACKGROUND: Recent reports suggest that mutations in the promoter of the gene encoding telomerase reverse transcriptase (TERT) affect thyroid cancer outcomes. METHODS: In all, 551 patients with differentiated thyroid cancer (DTC) enrolled in this study. The median follow-up duration was 4.8 years (interquartile range, 3.4-10.6 years). RESULTS: TERT promoter mutations were detected in 25 DTCs (4.5%): 2.8% in neither BRAF-mutated nor RAS-mutated tumors, 4.8% in BRAF-mutated tumors, and 11.3% in RAS-mutated tumors. Moreover, they were frequently observed in American Thyroid Association (ATA) high-risk and TNM stage III/IV groups (9.1% and 12.9%, respectively). The coexistence of BRAF or RAS with TERT promoter mutations increased aggressive clinicopathologic features, recurrence (hazard ratio [HR] for BRAF, 4.64; 95% confidence interval [CI], 1.42-15.18; HR for RAS, 5.36; 95% CI, 1.20-24.02), and mortality (HR for BRAF, 15.13; 95% CI, 1.55-148.23; HR for RAS, 14.75; 95% CI, 1.30-167.00), even after adjustments for the age at diagnosis and sex, although the significance was lost after additional adjustments for pathologic characteristics. Furthermore, TERT promoter mutations significantly increased the risk of both recurrence and mortality in the ATA high-risk (HR for recurrence, 5.79; 95% CI, 2.07-16.18; HR for mortality, 16.16; 95% CI, 2.10-124.15) and TNM stage III/IV groups (HR for recurrence, 3.60; 95% CI, 1.19-10.85; HR for mortality, 9.06; 95% CI, 2.09-39.26). CONCLUSIONS: The coexistence of BRAF or RAS mutations enhanced the prognostic effects of TERT promoter mutations. Furthermore, TERT promoter mutations strengthened the predictions of mortality and recurrence by the ATA and TNM staging systems, particularly for high-risk patients with DTC. Cancer 2016;122:1370-1379. © 2016 American Cancer Society.


Subject(s)
Genes, ras , Mutation , Neoplasm Recurrence, Local/genetics , Proto-Oncogene Proteins B-raf/genetics , Telomerase/genetics , Thyroid Neoplasms/genetics , Adult , Female , Follow-Up Studies , Humans , Male , Middle Aged , Neoplasm Recurrence, Local/mortality , Neoplasm Recurrence, Local/pathology , Neoplasm Staging , Prognosis , Thyroid Neoplasms/mortality , Thyroid Neoplasms/pathology , Time Factors
16.
Genome Res ; 22(3): 436-45, 2012 Mar.
Article in English | MEDLINE | ID: mdl-22194472

ABSTRACT

The identification of the molecular events that drive cancer transformation is essential to the development of targeted agents that improve the clinical outcome of lung cancer. Many studies have reported genomic driver mutations in non-small-cell lung cancers (NSCLCs) over the past decade; however, the molecular pathogenesis of >40% of NSCLCs is still unknown. To identify new molecular targets in NSCLCs, we performed the combined analysis of massively parallel whole-genome and transcriptome sequencing for cancer and paired normal tissue of a 33-yr-old lung adenocarcinoma patient, who is a never-smoker and has no familial cancer history. The cancer showed no known driver mutation in EGFR or KRAS and no EML4-ALK fusion. Here we report a novel fusion gene between KIF5B and the RET proto-oncogene caused by a pericentric inversion of 10p11.22-q11.21. This fusion gene overexpresses chimeric RET receptor tyrosine kinase, which could spontaneously induce cellular transformation. We identified the KIF5B-RET fusion in two more cases out of 20 primary lung adenocarcinomas in the replication study. Our data demonstrate that a subset of NSCLCs could be caused by a fusion of KIF5B and RET, and suggest the chimeric oncogene as a promising molecular target for the personalized diagnosis and treatment of lung cancer.


Subject(s)
Adenocarcinoma/genetics , Kinesins/genetics , Lung Neoplasms/genetics , Oncogene Proteins, Fusion/genetics , Proto-Oncogene Proteins c-ret/genetics , Adult , Alternative Splicing , Base Sequence , Chromosome Breakpoints , Chromosomes, Human, Pair 10 , Exons , Gene Order , Genome, Human , High-Throughput Nucleotide Sequencing , Humans , Male , Models, Molecular , Oncogene Proteins, Fusion/chemistry , Protein Conformation , Protein-Tyrosine Kinases/chemistry , Protein-Tyrosine Kinases/genetics , Proto-Oncogene Mas , Transcriptome , Translocation, Genetic
17.
Genome Res ; 22(11): 2109-19, 2012 Nov.
Article in English | MEDLINE | ID: mdl-22975805

ABSTRACT

All cancers harbor molecular alterations in their genomes. The transcriptional consequences of these somatic mutations have not yet been comprehensively explored in lung cancer. Here we present the first large scale RNA sequencing study of lung adenocarcinoma, demonstrating its power to identify somatic point mutations as well as transcriptional variants such as gene fusions, alternative splicing events, and expression outliers. Our results reveal the genetic basis of 200 lung adenocarcinomas in Koreans including deep characterization of 87 surgical specimens by transcriptome sequencing. We identified driver somatic mutations in cancer genes including EGFR, KRAS, NRAS, BRAF, PIK3CA, MET, and CTNNB1. Candidates for novel driver mutations were also identified in genes newly implicated in lung adenocarcinoma such as LMTK2, ARID1A, NOTCH2, and SMARCA4. We found 45 fusion genes, eight of which were chimeric tyrosine kinases involving ALK, RET, ROS1, FGFR2, AXL, and PDGFRA. Among 17 recurrent alternative splicing events, we identified exon 14 skipping in the proto-oncogene MET as highly likely to be a cancer driver. The number of somatic mutations and expression outliers varied markedly between individual cancers and was strongly correlated with smoking history of patients. We identified genomic blocks within which gene expression levels were consistently increased or decreased that could be explained by copy number alterations in samples. We also found an association between lymph node metastasis and somatic mutations in TP53. These findings broaden our understanding of lung adenocarcinoma and may also lead to new diagnostic and therapeutic approaches.


Subject(s)
Adenocarcinoma/genetics , Lung Neoplasms/genetics , Mutation , Transcriptome , Exons , Female , Gene Expression Regulation, Neoplastic , Genes, Neoplasm , Genetic Association Studies , Humans , Lymphatic Metastasis/genetics , Male , Polymorphism, Single Nucleotide , Proto-Oncogene Mas , Republic of Korea , Smoking/adverse effects
18.
Clin Chem ; 61(6): 829-37, 2015 Jun.
Article in English | MEDLINE | ID: mdl-25847990

ABSTRACT

BACKGROUND: Noninvasive prenatal diagnosis of monogenic disorders using maternal plasma and targeted massively parallel sequencing is being investigated actively. We previously demonstrated that comprehensive genetic diagnosis of a Duchenne muscular dystrophy (DMD) patient is feasible using a single targeted sequencing platform. Here we demonstrate the applicability of this approach to carrier detection and noninvasive prenatal diagnosis. METHODS: Custom solution-based target enrichment was designed to cover the entire dystrophin (DMD) gene region. Targeted massively parallel sequencing was performed using genomic DNA from 4 mother and proband pairs to test whether carrier status could be detected reliably. Maternal plasma DNA at varying gestational weeks was collected from the same families and sequenced using the same targeted platform to predict the inheritance of the DMD mutation by their fetus. Overrepresentation of an inherited allele was determined by comparing the allele fraction of 2 phased haplotypes after examining and correcting for the recombination event. RESULTS: The carrier status of deletion/duplication and point mutations was detected reliably through using a single targeted massively parallel sequencing platform. Whether the fetus had inherited the DMD mutation was predicted correctly in all 4 families as early as 6 weeks and 5 days of gestation. In one of these, detection of the recombination event and reconstruction of the phased haplotype produced a correct diagnosis. CONCLUSIONS: Noninvasive prenatal diagnosis of DMD is feasible using a single targeted massively parallel sequencing platform with tiling design.


Subject(s)
Dystrophin/genetics , Genetic Carrier Screening/methods , Muscular Dystrophy, Duchenne/diagnosis , Muscular Dystrophy, Duchenne/genetics , Mutation , Prenatal Diagnosis/methods , DNA/blood , Female , Haplotypes , Heterozygote , Humans , Pregnancy , Sequence Analysis, DNA/methods
19.
Nature ; 460(7258): 1011-5, 2009 Aug 20.
Article in English | MEDLINE | ID: mdl-19587683

ABSTRACT

Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China. Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1. The genome of AK1 was determined by an exacting, combined approach that included whole-genome shotgun sequencing (27.8x coverage), targeted bacterial artificial chromosome sequencing, and high-resolution comparative genomic hybridization using custom microarrays featuring more than 24 million probes. Alignment to the NCBI reference, a composite of several ethnic clades, disclosed nearly 3.45 million single nucleotide polymorphisms (SNPs), including 10,162 non-synonymous SNPs, and 170,202 deletion or insertion polymorphisms (indels). SNP and indel densities were strongly correlated genome-wide. Applying very conservative criteria yielded highly reliable copy number variants for clinical considerations. Potential medical phenotypes were annotated for non-synonymous SNPs, coding domain indels, and structural variants. The integration of several human whole-genome sequences derived from several ethnic groups will assist in understanding genetic ancestry, migration patterns and population bottlenecks.


Subject(s)
Asian People/genetics , Genome, Human/genetics , Chromosomes, Artificial, Bacterial/genetics , Comparative Genomic Hybridization , Computational Biology , Humans , INDEL Mutation/genetics , Korea , Oligonucleotide Array Sequence Analysis , Polymorphism, Single Nucleotide/genetics , Sequence Analysis, DNA
20.
Proc Natl Acad Sci U S A ; 109(30): 11920-7, 2012 Jul 24.
Article in English | MEDLINE | ID: mdl-22797899

ABSTRACT

Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board- approved "open consent" process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain-we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research.


Subject(s)
Databases, Genetic , Genetic Variation , Genome, Human/genetics , Phenotype , Precision Medicine/methods , Software , Cell Line , Data Collection , Humans , Precision Medicine/trends , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL