RESUMEN
Monozygotic or 'identical' twins have been widely studied to dissect the relative contributions of genetics and environment in human diseases. In multiple sclerosis (MS), an autoimmune demyelinating disease and common cause of neurodegeneration and disability in young adults, disease discordance in monozygotic twins has been interpreted to indicate environmental importance in its pathogenesis. However, genetic and epigenetic differences between monozygotic twins have been described, challenging the accepted experimental model in disambiguating the effects of nature and nurture. Here we report the genome sequences of one MS-discordant monozygotic twin pair, and messenger RNA transcriptome and epigenome sequences of CD4(+) lymphocytes from three MS-discordant, monozygotic twin pairs. No reproducible differences were detected between co-twins among approximately 3.6 million single nucleotide polymorphisms (SNPs) or approximately 0.2 million insertion-deletion polymorphisms. Nor were any reproducible differences observed between siblings of the three twin pairs in HLA haplotypes, confirmed MS-susceptibility SNPs, copy number variations, mRNA and genomic SNP and insertion-deletion genotypes, or the expression of approximately 19,000 genes in CD4(+) T cells. Only 2 to 176 differences in the methylation of approximately 2 million CpG dinucleotides were detected between siblings of the three twin pairs, in contrast to approximately 800 methylation differences between T cells of unrelated individuals and several thousand differences between tissues or between normal and cancerous tissues. In the first systematic effort to estimate sequence variation among monozygotic co-twins, we did not find evidence for genetic, epigenetic or transcriptome differences that explained disease discordance. These are the first, to our knowledge, female, twin and autoimmune disease individual genome sequences reported.
Asunto(s)
Epigénesis Genética/genética , Genoma Humano/genética , Esclerosis Múltiple/genética , ARN Mensajero/genética , Gemelos Monocigóticos/genética , Adolescente , Adulto , Desequilibrio Alélico/genética , Mama/metabolismo , Neoplasias de la Mama/genética , Linfocitos T CD4-Positivos/metabolismo , Estudios de Casos y Controles , Islas de CpG/genética , Variaciones en el Número de Copia de ADN/genética , Metilación de ADN/genética , Femenino , Predisposición Genética a la Enfermedad/genética , Haplotipos/genética , Heterocigoto , Humanos , Mutación INDEL/genética , Pulmón/metabolismo , Neoplasias Pulmonares/genética , Masculino , Polimorfismo Genético/genética , Sitios de Carácter Cuantitativo/genética , ARN Mensajero/análisis , ARN Mensajero/metabolismoRESUMEN
Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China. Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1. The genome of AK1 was determined by an exacting, combined approach that included whole-genome shotgun sequencing (27.8x coverage), targeted bacterial artificial chromosome sequencing, and high-resolution comparative genomic hybridization using custom microarrays featuring more than 24 million probes. Alignment to the NCBI reference, a composite of several ethnic clades, disclosed nearly 3.45 million single nucleotide polymorphisms (SNPs), including 10,162 non-synonymous SNPs, and 170,202 deletion or insertion polymorphisms (indels). SNP and indel densities were strongly correlated genome-wide. Applying very conservative criteria yielded highly reliable copy number variants for clinical considerations. Potential medical phenotypes were annotated for non-synonymous SNPs, coding domain indels, and structural variants. The integration of several human whole-genome sequences derived from several ethnic groups will assist in understanding genetic ancestry, migration patterns and population bottlenecks.
Asunto(s)
Pueblo Asiatico/genética , Genoma Humano/genética , Cromosomas Artificiales Bacterianos/genética , Hibridación Genómica Comparativa , Biología Computacional , Humanos , Mutación INDEL/genética , Corea (Geográfico) , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo de Nucleótido Simple/genética , Análisis de Secuencia de ADNRESUMEN
Schizophrenia (SCZ) is a common, disabling mental illness with high heritability but complex, poorly understood genetic etiology. As the first phase of a genomic convergence analysis of SCZ, we generated 16.7 billion nucleotides of short read, shotgun sequences of cDNA from post-mortem cerebellar cortices of 14 patients and six, matched controls. A rigorous analysis pipeline was developed for analysis of digital gene expression studies. Sequences aligned to approximately 33,200 transcripts in each sample, with average coverage of 450 reads per gene. Following adjustments for confounding clinical, sample and experimental sources of variation, 215 genes differed significantly in expression between cases and controls. Golgi apparatus, vesicular transport, membrane association, Zinc binding and regulation of transcription were over-represented among differentially expressed genes. Twenty three genes with altered expression and involvement in presynaptic vesicular transport, Golgi function and GABAergic neurotransmission define a unifying molecular hypothesis for dysfunction in cerebellar cortex in SCZ.