RESUMEN
The rapid and dynamic implementation of Next-Generation Sequencing (NGS)-based assays has revolutionized genetic testing, and in the near future, nearly all molecular alterations of the human genome will be diagnosable via massive parallel sequencing. While this progress will further corroborate the central role of human genetics in the multidisciplinary management of patients with genetic disorders, it must be accompanied by quality assurance measures in order to allow the safe and optimal use of knowledge ascertained from genome diagnostics. To achieve this, several valuable tools and guidelines have been developed to support the quality of genome diagnostics. In this paper, authors with experience in diverse aspects of genomic analysis summarize the current status of quality assurance in genome diagnostics, with the aim of facilitating further standardization and quality improvement in one of the core competencies of the field.
RESUMEN
In 2016, guidelines for diagnostic Next Generation Sequencing (NGS) have been published by EuroGentest in order to assist laboratories in the implementation and accreditation of NGS in a diagnostic setting. These guidelines mainly focused on Whole Exome Sequencing (WES) and targeted (gene panels) sequencing detecting small germline variants (Single Nucleotide Variants (SNVs) and insertions/deletions (indels)). Since then, Whole Genome Sequencing (WGS) has been increasingly introduced in the diagnosis of rare diseases as WGS allows the simultaneous detection of SNVs, Structural Variants (SVs) and other types of variants such as repeat expansions. The use of WGS in diagnostics warrants the re-evaluation and update of previously published guidelines. This work was jointly initiated by EuroGentest and the Horizon2020 project Solve-RD. Statements from the 2016 guidelines have been reviewed in the context of WGS and updated where necessary. The aim of these recommendations is primarily to list the points to consider for clinical (laboratory) geneticists, bioinformaticians, and (non-)geneticists, to provide technical advice, aid clinical decision-making and the reporting of the results.
Asunto(s)
Exoma , Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Polimorfismo de Nucleótido Simple , Enfermedades Raras/diagnóstico , Enfermedades Raras/genética , Secuenciación Completa del GenomaRESUMEN
We present, on behalf of EuroGentest and the European Society of Human Genetics, guidelines for the evaluation and validation of next-generation sequencing (NGS) applications for the diagnosis of genetic disorders. The work was performed by a group of laboratory geneticists and bioinformaticians, and discussed with clinical geneticists, industry and patients' representatives, and other stakeholders in the field of human genetics. The statements that were written during the elaboration of the guidelines are presented here. The background document and full guidelines are available as supplementary material. They include many examples to assist the laboratories in the implementation of NGS and accreditation of this service. The work and ideas presented by others in guidelines that have emerged elsewhere in the course of the past few years were also considered and are acknowledged in the full text. Interestingly, a few new insights that have not been cited before have emerged during the preparation of the guidelines. The most important new feature is the presentation of a 'rating system' for NGS-based diagnostic tests. The guidelines and statements have been applauded by the genetic diagnostic community, and thus seem to be valuable for the harmonization and quality assurance of NGS diagnostics in Europe.
Asunto(s)
Acreditación/legislación & jurisprudencia , Enfermedades Genéticas Congénitas/diagnóstico , Pruebas Genéticas/normas , Secuenciación de Nucleótidos de Alto Rendimiento/normas , Proteínas/genética , Biomarcadores/metabolismo , Bases de Datos Genéticas , Europa (Continente) , Expresión Génica , Enfermedades Genéticas Congénitas/genética , Enfermedades Genéticas Congénitas/patología , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Secuenciación de Nucleótidos de Alto Rendimiento/estadística & datos numéricos , Humanos , Hallazgos Incidentales , Difusión de la Información/legislación & jurisprudencia , Consentimiento Informado , Proyectos de Investigación/normas , Sensibilidad y EspecificidadRESUMEN
BACKGROUND: Chlamydia pneumoniae (Cpn) are obligate intracellular bacteria that cause acute infections of the upper and lower respiratory tract and have been implicated in chronic inflammatory diseases. Although of significant clinical relevance, complete genome sequences of only four clinical Cpn strains have been obtained. All of them were isolated from the respiratory tract and shared more than 99% sequence identity. Here we investigate genetic differences on the whole-genome level that are related to Cpn tissue tropism and pathogenicity. RESULTS: We have sequenced the genomes of 18 clinical isolates from different anatomical sites (e.g. lung, blood, coronary arteries) of diseased patients, and one animal isolate. In total 1,363 SNP loci and 184 InDels have been identified in the genomes of all clinical Cpn isolates. These are distributed throughout the whole chlamydial genome and enriched in highly variable regions. The genomes show clear evidence of recombination in at least one potential region but no phage insertions. The tyrP gene was always encoded as single copy in all vascular isolates. Phylogenetic reconstruction revealed distinct evolutionary lineages containing primarily non-respiratory Cpn isolates. In one of these, clinical isolates from coronary arteries and blood monocytes were closely grouped together. They could be distinguished from all other isolates by characteristic nsSNPs in genes involved in RB to EB transition, inclusion membrane formation, bacterial stress response and metabolism. CONCLUSIONS: This study substantially expands the genomic data of Cpn and elucidates its evolutionary history. The translation of the observed Cpn genetic differences into biological functions and the prediction of novel pathogen-oriented diagnostic strategies have to be further explored.
Asunto(s)
Infecciones por Chlamydophila/microbiología , Chlamydophila pneumoniae/genética , Chlamydophila pneumoniae/aislamiento & purificación , Tropismo , Animales , Sangre/microbiología , Infecciones por Chlamydophila/veterinaria , Chlamydophila pneumoniae/crecimiento & desarrollo , Vasos Coronarios/microbiología , Genoma Bacteriano , Humanos , Mutación INDEL , Pulmón/microbiología , Filogenia , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN/métodosRESUMEN
CpG methylation in mammalian DNA is known to interfere with gene expression by inhibiting the binding of transactivators to their cognate sequence motifs or recruiting proteins involved in gene repression. An Epstein-Barr virus-encoded transcription factor, Zta, was the first example of a sequence-specific transcription factor that preferentially recognizes and selectively binds DNA sequence motifs with methylated CpG residues, reverses epigenetic silencing and activates gene transcription. The DNA binding domain of Zta is homologous to c-Fos, a member of the cellular AP-1 (activator protein 1) transcription factor family, which regulates cell proliferation and survival, apoptosis, transformation and oncogenesis. We have identified a novel AP-1 binding site termed meAP-1, which contains a CpG dinucleotide. If methylated, meAP-1 sites are preferentially bound by the AP-1 heterodimer c-Jun/c-Fos in vitro and in cellular chromatin in vivo. In activated human primary B cells, c-Jun/c-Fos locates to these methylated elements in promoter regions of transcriptionally activated genes. Reminiscent of the viral Zta protein, c-Jun/c-Fos is the first identified cellular member of the AP-1 family of transactivators that can induce expression of genes with methylated, hence repressed promoters, reversing epigenetic silencing.
Asunto(s)
Proteínas Proto-Oncogénicas c-fos/metabolismo , Proteínas Proto-Oncogénicas c-jun/metabolismo , Elementos Reguladores de la Transcripción , Factor de Transcripción AP-1/metabolismo , Activación Transcripcional , 5-Metilcitosina/metabolismo , Linfocitos B/metabolismo , Sitios de Unión , Línea Celular , Islas de CpG , ADN/química , ADN/metabolismo , Metilación de ADN , ADN Viral/química , ADN Viral/metabolismo , Dimerización , Genoma Humano , Herpesvirus Humano 4/genética , Humanos , Motivos de Nucleótidos , Regiones Promotoras GenéticasRESUMEN
Approximately 20 % of individuals with Parkinson's disease (PD) report a positive family history. Yet, a large portion of causal and disease-modifying variants is still unknown. We used exome sequencing in two affected individuals from a family with late-onset PD to identify 15 potentially causal variants. Segregation analysis and frequency assessment in 862 PD cases and 1,014 ethnically matched controls highlighted variants in EEF1D and LRRK1 as the best candidates. Mutation screening of the coding regions of these genes in 862 cases and 1,014 controls revealed several novel non-synonymous variants in both genes in cases and controls. An in silico multi-model bioinformatics analysis was used to prioritize identified variants in LRRK1 for functional follow-up. However, protein expression, subcellular localization, and cell viability were not affected by the identified variants. Although it has yet to be proven conclusively that variants in LRRK1 are indeed causative of PD, our data strengthen a possible role for LRRK1 in addition to LRRK2 in the genetic underpinnings of PD but, at the same time, highlight the difficulties encountered in the study of rare variants identified by next-generation sequencing in diseases with autosomal dominant or complex patterns of inheritance.
Asunto(s)
Variación Genética , Enfermedad de Parkinson/genética , Proteínas Serina-Treonina Quinasas/genética , Algoritmos , Supervivencia Celular , Análisis Mutacional de ADN , Exoma , Salud de la Familia , Femenino , Dosificación de Gen , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Genotipo , Alemania , Humanos , Masculino , Persona de Mediana Edad , Modelos Genéticos , Mutación , Análisis de Secuencia por Matrices de Oligonucleótidos , Factor 1 de Elongación Peptídica/genética , FenotipoRESUMEN
Approximately 20% of individuals with Parkinson's disease (PD) report a positive family history. Yet, a large portion of causal and disease-modifying variants is still unknown. We used exome sequencing in two affected individuals from a family with late-onset familial PD followed by frequency assessment in 975 PD cases and 1014 ethnically-matched controls and linkage analysis to identify potentially causal variants. Based on the predicted penetrance and the frequencies, a variant in PLXNA4 proved to be the best candidate and PLXNA4 was screened for additional variants in 862 PD cases and 940 controls, revealing an excess of rare non-synonymous coding variants in PLXNA4 in individuals with PD. Although we cannot conclude that the variant in PLXNA4 is indeed the causative variant, these findings are interesting in the light of a surfacing role of axonal guidance mechanisms in neurodegenerative disorders but, at the same time, highlight the difficulties encountered in the study of rare variants identified by next-generation sequencing in diseases with autosomal dominant or complex patterns of inheritance.
Asunto(s)
Variación Genética , Enfermedad de Parkinson/genética , Penetrancia , Receptores de Superficie Celular/genética , Estudios de Casos y Controles , Familia , Femenino , Ligamiento Genético , Humanos , MasculinoRESUMEN
Myocardial infarction, a leading cause of death in the Western world, usually occurs when the fibrous cap overlying an atherosclerotic plaque in a coronary artery ruptures. The resulting exposure of blood to the atherosclerotic material then triggers thrombus formation, which occludes the artery. The importance of genetic predisposition to coronary artery disease and myocardial infarction is best documented by the predictive value of a positive family history. Next-generation sequencing in families with several affected individuals has revolutionized mutation identification. Here we report the segregation of two private, heterozygous mutations in two functionally related genes, GUCY1A3 (p.Leu163Phefs*24) and CCT7 (p.Ser525Leu), in an extended myocardial infarction family. GUCY1A3 encodes the α1 subunit of soluble guanylyl cyclase (α1-sGC), and CCT7 encodes CCTη, a member of the tailless complex polypeptide 1 ring complex, which, among other functions, stabilizes soluble guanylyl cyclase. After stimulation with nitric oxide, soluble guanylyl cyclase generates cGMP, which induces vasodilation and inhibits platelet activation. We demonstrate in vitro that mutations in both GUCY1A3 and CCT7 severely reduce α1-sGC as well as ß1-sGC protein content, and impair soluble guanylyl cyclase activity. Moreover, platelets from digenic mutation carriers contained less soluble guanylyl cyclase protein and consequently displayed reduced nitric-oxide-induced cGMP formation. Mice deficient in α1-sGC protein displayed accelerated thrombus formation in the microcirculation after local trauma. Starting with a severely affected family, we have identified a link between impaired soluble-guanylyl-cyclase-dependent nitric oxide signalling and myocardial infarction risk, possibly through accelerated thrombus formation. Reversing this defect may provide a new therapeutic target for reducing the risk of myocardial infarction.
Asunto(s)
Susceptibilidad a Enfermedades/metabolismo , Infarto del Miocardio/metabolismo , Óxido Nítrico/metabolismo , Transducción de Señal , Animales , Chaperonina con TCP-1/genética , Chaperonina con TCP-1/metabolismo , GMP Cíclico/metabolismo , Exoma/genética , Femenino , Predisposición Genética a la Enfermedad , Guanilato Ciclasa/deficiencia , Guanilato Ciclasa/genética , Guanilato Ciclasa/metabolismo , Células HEK293 , Humanos , Masculino , Ratones , Mutación/genética , Infarto del Miocardio/genética , Infarto del Miocardio/fisiopatología , Linaje , Activación Plaquetaria , Receptores Citoplasmáticos y Nucleares/deficiencia , Receptores Citoplasmáticos y Nucleares/genética , Receptores Citoplasmáticos y Nucleares/metabolismo , Reproducibilidad de los Resultados , Solubilidad , Guanilil Ciclasa Soluble , Trombosis/metabolismo , VasodilataciónRESUMEN
SUMMARY: With the introduction of the next generation sequencing (NGS) technologies, remarkable new diagnostic applications have been established in daily routine. Implementation of NGS is challenging in clinical diagnostics, but definite advantages and new diagnostic possibilities make the switch to the technology inevitable. In addition to the higher sequencing capacity, clonal sequencing of single molecules, multiplexing of samples, higher diagnostic sensitivity, workflow miniaturization, and cost benefits are some of the valuable features of the technology. After the recent advances, NGS emerged as a proven alternative for classical Sanger sequencing in the typing of human leukocyte antigens (HLA). By virtue of the clonal amplification of single DNA molecules ambiguous typing results can be avoided. Simultaneously, a higher sample throughput can be achieved by tagging of DNA molecules with multiplex identifiers and pooling of PCR products before sequencing. In our experience, up to 380 samples can be typed for HLA-A, -B, and -DRB1 in high-resolution during every sequencing run. In molecular oncology, NGS shows a markedly increased sensitivity in comparison to the conventional Sanger sequencing and is developing to the standard diagnostic tool in detection of somatic mutations in cancer cells with great impact on personalized treatment of patients.
RESUMEN
BACKGROUND: Genome- and population-wide re-sequencing would allow for most efficient detection of causal trait variants. However, despite a strong decrease of costs for next-generation sequencing in the last few years, re-sequencing of large numbers of individuals is not yet affordable. We therefore resorted to re-sequencing of a limited number of bovine animals selected to explain a major proportion of the population's genomic variation, so called key animals, in order to provide a catalogue of functional variants and a substrate for population- and genome-wide imputation of variable sites. RESULTS: Forty-three animals accounting for about 69 percent of the genetic diversity of the Fleckvieh population, a cattle breed of Southern Germany and Austria, were sequenced with coverages ranging from 4.17 to 24.98 and averaging 7.46. After alignment to the reference genome (UMD3.1) and multi-sample variant calling, more than 17 million variant positions were identified, about 90 percent biallelic single nucleotide variants (SNVs) and 10 percent short insertions and deletions (InDels). The comparison with high-density chip data revealed a sensitivity of at least 92 percent and a specificity of 81 percent for sequencing based genotyping, and 97 percent and 93 percent when a imputation step was included. There are 91,733 variants in coding regions of 18,444 genes, 46 percent being non-synonymous exchanges, of which 575 variants are predicted to cause premature stop codons. Three variants are listed in the OMIA database as causal for specific phenotypes. CONCLUSIONS: Low- to medium-coverage re-sequencing of individuals explaining a major fraction of a population's genomic variation allows for the efficient and reliable detection of most variants. Imputation strongly improves genotype quality of lowly covered samples and thus enables maximum density genotyping by sequencing. The functional annotation of variants provides the basis for exhaustive genotype imputation in the population, e.g., for highest-resolution genome-wide association studies.
Asunto(s)
Variación Genética/genética , Genómica , Análisis de Secuencia de ADN/métodos , Animales , Bovinos , Genotipo , Mutación INDEL/genética , Masculino , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
We characterize a consanguineous Egyptian family with an autosomal recessively inherited familial cortical myoclonic tremor and epilepsy. We used multipoint linkage analysis to map the causative mutation to a 12.7 megabase interval within 1q31.3-q32.2 with a log of odds score of 3.6. For further investigation of the linked region in an efficient and unbiased manner, we performed exome sequencing. Within the suspected region we identified a homozygous single base pair deletion (c.503_503delG) leading to a frameshift in the coding region of the sixth exon of CNTN2 alias TAG-1 (p.Trp168fs), which segregated in the respective family. Many studies point towards an important role of the CNTN2 product contactin 2 in neuronal excitability. Contactin 2, a glycosylphosphatidylinositol-anchored neuronal membrane protein, and another transmembrane protein called contactin associated protein-like 2 (CNTNAP2 alias CASPR2) are together necessary to maintain voltage-gated potassium channels at the juxtaparanodal region. CNTN2 knockout mice were previously reported to suffer from spontaneous seizures and mutations in the CNTNAP2 gene have been described to cause epilepsy in humans. To further delineate the role of CNTN2 in patients with epilepsy, we sequenced the coding exons in 189 Caucasian patients with epilepsy. No recessive mutation was detected and heterozygote carriers of rare CNTN2 variants do not seem to be predisposed to epilepsy. Given the severity of the mutation and the proposed function of the gene, we consider this mutation as the most likely cause for cortical myoclonic tremor and epilepsy in this family.
Asunto(s)
Contactina 2/genética , Epilepsias Mioclónicas/genética , Mutación del Sistema de Lectura/genética , Adulto , Consanguinidad , Egipto , Epilepsias Mioclónicas/diagnóstico , Epilepsias Mioclónicas/fisiopatología , Exoma/genética , Femenino , Ligamiento Genético , Humanos , Masculino , Proteínas de la Membrana/genética , Persona de Mediana Edad , Proteínas del Tejido Nervioso/genética , Linaje , Temblor/diagnóstico , Temblor/genética , Temblor/fisiopatología , Adulto JovenRESUMEN
Metabolic bone disorders arise as primary diseases or may be secondary due to a multitude of organ malfunctions. Animal models are required to understand the molecular mechanisms responsible for the imbalances of bone metabolism in disturbed bone mineralization diseases. Here we present the isolation of mutant mouse models for metabolic bone diseases by phenotyping blood parameters that target bone turnover within the large-scale genome-wide Munich ENU Mutagenesis Project. A screening panel of three clinical parameters, also commonly used as biochemical markers in patients with metabolic bone diseases, was chosen. Total alkaline phosphatase activity and total calcium and inorganic phosphate levels in plasma samples of F1 offspring produced from ENU-mutagenized C3HeB/FeJ male mice were measured. Screening of 9,540 mice led to the identification of 257 phenodeviants of which 190 were tested by genetic confirmation crosses. Seventy-one new dominant mutant lines showing alterations of at least one of the biochemical parameters of interest were confirmed. Fifteen mutations among three genes (Phex, Casr, and Alpl) have been identified by positional-candidate gene approaches and one mutation of the Asgr1 gene, which was identified by next-generation sequencing. All new mutant mouse lines are offered as a resource for the scientific community.
Asunto(s)
Enfermedades Óseas Metabólicas/genética , Modelos Animales de Enfermedad , Ratones/genética , Fosfatasa Alcalina/genética , Secuencia de Aminoácidos , Animales , Secuencia de Bases , Enfermedades Óseas Metabólicas/sangre , Enfermedades Óseas Metabólicas/enzimología , Calcio/sangre , Cromosomas de los Mamíferos , Análisis Mutacional de ADN , Etilnitrosourea/farmacología , Femenino , Masculino , Ratones Endogámicos C3H , Ratones Endogámicos C57BL , Mutagénesis , Mutágenos/farmacología , Mutación , Endopeptidasa Neutra Reguladora de Fosfato PHEX/genética , Fenotipo , Fosfatos/sangre , Polimorfismo de Nucleótido Simple , Receptores Sensibles al Calcio/genética , Estadísticas no Paramétricas , Cromosoma XRESUMEN
To identify rare causal variants in late-onset Parkinson disease (PD), we investigated an Austrian family with 16 affected individuals by exome sequencing. We found a missense mutation, c.1858G>A (p.Asp620Asn), in the VPS35 gene in all seven affected family members who are alive. By screening additional PD cases, we saw the same variant cosegregating with the disease in an autosomal-dominant mode with high but incomplete penetrance in two further families with five and ten affected members, respectively. The mean age of onset in the affected individuals was 53 years. Genotyping showed that the shared haplotype extends across 65 kilobases around VPS35. Screening the entire VPS35 coding sequence in an additional 860 cases and 1014 controls revealed six further nonsynonymous missense variants. Three were only present in cases, two were only present in controls, and one was present in cases and controls. The familial mutation p.Asp620Asn and a further variant, c.1570C>T (p.Arg524Trp), detected in a sporadic PD case were predicted to be damaging by sequence-based and molecular-dynamics analyses. VPS35 is a component of the retromer complex and mediates retrograde transport between endosomes and the trans-Golgi network, and it has recently been found to be involved in Alzheimer disease.
Asunto(s)
Mutación Missense , Enfermedad de Parkinson/genética , Proteínas de Transporte Vesicular/genética , Adulto , Edad de Inicio , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Endosomas/genética , Endosomas/metabolismo , Femenino , Variación Genética , Haplotipos , Humanos , Enlace de Hidrógeno , Masculino , Persona de Mediana Edad , Enfermedad de Parkinson/metabolismo , Linaje , Conformación Proteica , Proteínas de Transporte Vesicular/metabolismo , Red trans-Golgi/metabolismoRESUMEN
Intellectual disability inherited in an autosomal-recessive fashion represents an important fraction of severe cognitive-dysfunction disorders. Yet, the extreme heterogeneity of these conditions markedly hampers gene identification. Here, we report on eight affected individuals who were from three consanguineous families and presented with severe intellectual disability, absent speech, shy character, stereotypic laughter, muscular hypotonia that progressed to spastic paraplegia, microcephaly, foot deformity, decreased muscle mass of the lower limbs, inability to walk, and growth retardation. Using a combination of autozygosity mapping and either Sanger sequencing of candidate genes or next-generation exome sequencing, we identified one mutation in each of three genes encoding adaptor protein complex 4 (AP4) subunits: a nonsense mutation in AP4S1 (NM_007077.3: c.124C>T, p.Arg42(∗)), a frameshift mutation in AP4B1 (NM_006594.2: c.487_488insTAT, p.Glu163_Ser739delinsVal), and a splice mutation in AP4E1 (NM_007347.3: c.542+1_542+4delGTAA, r.421_542del, p.Glu181Glyfs(∗)20). Adaptor protein complexes (AP1-4) are ubiquitously expressed, evolutionarily conserved heterotetrameric complexes that mediate different types of vesicle formation and the selection of cargo molecules for inclusion into these vesicles. Interestingly, two mutations affecting AP4M1 and AP4E1 have recently been found to cause cerebral palsy associated with severe intellectual disability. Combined with previous observations, these results support the hypothesis that AP4-complex-mediated trafficking plays a crucial role in brain development and functioning and demonstrate the existence of a clinically recognizable syndrome due to deficiency of the AP4 complex.
Asunto(s)
Complejo 4 de Proteína Adaptadora/genética , Carácter , Características Humanas , Discapacidad Intelectual/genética , Paraplejía/genética , Adolescente , Niño , Femenino , Ligamiento Genético , Humanos , Lactante , Masculino , Linaje , Adulto JovenRESUMEN
DNA methylation is the major modification of eukaryotic genomes and plays an essential role in mammalian gene regulation. In general, cytosine-phosphatidyl-guanosine (CpG)-methylated promoters are transcriptionally repressed and nuclear proteins such as MECP2, MBD1, MBD2, and MBD4 bind CpG-methylated DNA and contribute to epigenetic silencing. Methylation of viral DNA also regulates gene expression of Epstein-Barr virus (EBV), which is a model of herpes virus latency. In latently infected human B cells, the viral DNA is CpG-methylated, the majority of viral genes is repressed and virus synthesis is therefore abrogated. EBV's BZLF1 encodes a transcription factor of the AP-1 family (Zta) and is the master gene to overcome viral gene repression. In a genome-wide screen, we now identify and characterize those viral genes, which Zta regulates. Among them are genes essential for EBV's lytic phase, which paradoxically depend on strictly CpG-methylated promoters for their Zta-induced expression. We identified novel DNA recognition motifs, termed meZRE (methyl-Zta-responsive element), which Zta selectively binds in order to 'read' DNA in a methylation- and sequence-dependent manner unlike any other known protein. Zta is a homodimer but its binding characteristics to meZREs suggest a sequential, non-palindromic and bipartite DNA recognition element, which confers superior DNA binding compared to CpG-free ZREs. Our findings indicate that Zta has evolved to transactivate cytosine-methylated, hence repressed, silent promoters as a rule to overcome epigenetic silencing.
Asunto(s)
Islas de CpG/genética , Metilación de ADN , Epigénesis Genética , Regulación Viral de la Expresión Génica , Regiones Promotoras Genéticas/genética , Transactivadores/genética , Latencia del Virus/genética , Linfocitos B/patología , Linfocitos B/virología , Western Blotting , Células Cultivadas , Inmunoprecipitación de Cromatina , ADN Viral/genética , ADN Viral/metabolismo , Ensayo de Cambio de Movilidad Electroforética , Infecciones por Virus de Epstein-Barr/genética , Infecciones por Virus de Epstein-Barr/patología , Infecciones por Virus de Epstein-Barr/virología , Genes Virales , Herpesvirus Humano 4/fisiología , Humanos , Inmunoprecipitación , Riñón/citología , Riñón/metabolismo , Riñón/virología , Luciferasas/metabolismo , ARN Mensajero/genética , Elementos de Respuesta/genética , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Transactivadores/metabolismo , Factor de Transcripción AP-1/metabolismo , Replicación ViralRESUMEN
Mental retardation affects 2-3% of the population and shows a high heritability.Neurodevelopmental disorders that include pronounced impairment in language and speech skills occur less frequently. For most cases, the molecular basis of mental retardation with or without speech and language disorder is unknown due to the heterogeneity of underlying genetic factors.We have used molecular karyotyping on 1523 patients with mental retardation to detect copy number variations (CNVs) including deletions or duplications. These studies revealed three heterozygous overlapping deletions solely affecting the forkhead box P1 (FOXP1) gene. All three patients had moderate mental retardation and significant language and speech deficits. Since our results are consistent with a de novo occurrence of these deletions, we considered them as causal although we detected a single large deletion including FOXP1 and additional genes in 4104 ancestrally matched controls. These findings are of interest with regard to the structural and functional relationship between FOXP1 and FOXP2. Mutations in FOXP2 have been previously related to monogenic cases of developmental verbal dyspraxia. Both FOXP1 and FOXP2 are expressed in songbird and human brain regions that are important for the developmental processes that culminate in speech and language.
Asunto(s)
Factores de Transcripción Forkhead/genética , Discapacidad Intelectual/genética , Trastornos del Lenguaje/genética , Proteínas Represoras/genética , Eliminación de Secuencia , Trastornos del Habla/genética , Secuencia de Bases , Estudios de Casos y Controles , Niño , Preescolar , Cromosomas Artificiales Bacterianos/genética , Roturas del ADN , Cartilla de ADN/genética , Femenino , Heterocigoto , Humanos , Hibridación Fluorescente in Situ , Masculino , Reacción en Cadena de la PolimerasaRESUMEN
BACKGROUND: The majority of the 2 million bovine single nucleotide polymorphisms (SNPs) currently available in dbSNP have been identified in a single breed, Hereford cattle, during the bovine genome project. In an attempt to evaluate the variance of a second breed, we have produced a whole genome sequence at low coverage of a single Fleckvieh bull. RESULTS: We generated 24 gigabases of sequence, mainly using 36-bp paired-end reads, resulting in an average 7.4-fold sequence depth. This coverage was sufficient to identify 2.44 million SNPs, 82% of which were previously unknown, and 115,000 small indels. A comparison with the genotypes of the same animal, generated on a 50 k oligonucleotide chip, revealed a detection rate of 74% and 30% for homozygous and heterozygous SNPs, respectively. The false positive rate, as determined by comparison with genotypes determined for 196 randomly selected SNPs, was approximately 1.1%. We further determined the allele frequencies of the 196 SNPs in 48 Fleckvieh and 48 Braunvieh bulls. 95% of the SNPs were polymorphic with an average minor allele frequency of 24.5% and with 83% of the SNPs having a minor allele frequency larger than 5%. CONCLUSIONS: This work provides the first single cattle genome by next-generation sequencing. The chosen approach - low to medium coverage re-sequencing - added more than 2 million novel SNPs to the currently publicly available SNP resource, providing a valuable resource for the construction of high density oligonucleotide arrays in the context of genome-wide association studies.
Asunto(s)
Bovinos/genética , Genoma , Polimorfismo de Nucleótido Simple , Animales , Femenino , Mutación INDEL , Masculino , Análisis de Secuencia de ADNRESUMEN
There are several sequence-dependent factors regulating gene expression. Some of them have been extensively studied, among the most prominent are GC content and codon usage bias. Other factors hypothesized to have an impact on gene expression are gene length and the thermodynamic stability of mRNA secondary structure. In this work, we analyzed two different microarray datasets of Drosophila melanogaster gene expression and one dataset of Escherichia coli. To investigate the relationship between gene expression, codon usage bias and GC content of first, second and third codon position, gene length and mRNA stability we employed a multiple regression analysis using a comprehensive linear model. It is shown that codon usage bias and GC content of the first, second and third codon position show a significant influence on gene expression, whereas no significant effect of mRNA secondary structure stability is observed.