RESUMEN
Regulation of nuclear import is fundamental to eukaryotic biology. The majority of nuclear import pathways are mediated by importin-cargo interactions. Yet not all nuclear proteins interact with importins, necessitating the identification of a general importin-independent nuclear import pathway. Here, we identify a code that determines importin-independent nuclear import of ankyrin repeats (ARs), a structural motif found in over 250 human proteins with diverse functions. AR-containing proteins (ARPs) with a hydrophobic residue at the 13th position of two consecutive ARs bind RanGDP efficiently, and consequently enter the nucleus. This code, experimentally tested in 17 ARPs, predicts the nuclear-cytoplasmic localization of over 150 annotated human ARPs with high accuracy and is acquired by the most common familial melanoma-associated CDKN2A mutation, leading to nuclear accumulation of mutant p16ink4a. The RaDAR (RanGDP/AR) pathway represents a general importin-independent nuclear import pathway and is frequently used by AR-containing transcriptional regulators, especially those regulating NF-κB/p53.
Asunto(s)
Transporte Activo de Núcleo Celular , Repetición de Anquirina , Inhibidor p16 de la Quinasa Dependiente de Ciclina/metabolismo , Transporte de Proteínas , Inhibidor p16 de la Quinasa Dependiente de Ciclina/genética , Humanos , Modelos Moleculares , Complejos Multiproteicos/metabolismo , Proteínas/química , Proteínas/metabolismo , Proteína de Unión al GTP ran/metabolismoRESUMEN
Critical illness in COVID-19 is an extreme and clinically homogeneous disease phenotype that we have previously shown1 to be highly efficient for discovery of genetic associations2. Despite the advanced stage of illness at presentation, we have shown that host genetics in patients who are critically ill with COVID-19 can identify immunomodulatory therapies with strong beneficial effects in this group3. Here we analyse 24,202 cases of COVID-19 with critical illness comprising a combination of microarray genotype and whole-genome sequencing data from cases of critical illness in the international GenOMICC (11,440 cases) study, combined with other studies recruiting hospitalized patients with a strong focus on severe and critical disease: ISARIC4C (676 cases) and the SCOURGE consortium (5,934 cases). To put these results in the context of existing work, we conduct a meta-analysis of the new GenOMICC genome-wide association study (GWAS) results with previously published data. We find 49 genome-wide significant associations, of which 16 have not been reported previously. To investigate the therapeutic implications of these findings, we infer the structural consequences of protein-coding variants, and combine our GWAS results with gene expression data using a monocyte transcriptome-wide association study (TWAS) model, as well as gene and protein expression using Mendelian randomization. We identify potentially druggable targets in multiple systems, including inflammatory signalling (JAK1), monocyte-macrophage activation and endothelial permeability (PDE4A), immunometabolism (SLC2A5 and AK5), and host factors required for viral entry and replication (TMPRSS2 and RAB2A).
Asunto(s)
COVID-19 , Enfermedad Crítica , Predisposición Genética a la Enfermedad , Variación Genética , Estudio de Asociación del Genoma Completo , Humanos , COVID-19/genética , Predisposición Genética a la Enfermedad/genética , Variación Genética/genética , Genotipo , Técnicas de Genotipaje , Monocitos/metabolismo , Fenotipo , Proteínas de Unión al GTP rab/genética , Transcriptoma , Secuenciación Completa del GenomaRESUMEN
Thymic epithelial cell differentiation, growth and function depend on the expression of the transcription factor Foxn1; however, its target genes have never been physically identified. Using static and inducible genetic model systems and chromatin studies, we developed a genome-wide map of direct Foxn1 target genes for postnatal thymic epithelia and defined the Foxn1 binding motif. We determined the function of Foxn1 in these cells and found that, in addition to the transcriptional control of genes involved in the attraction and lineage commitment of T cell precursors, Foxn1 regulates the expression of genes involved in antigen processing and thymocyte selection. Thus, critical events in thymic lympho-stromal cross-talk and T cell selection are indispensably choreographed by Foxn1.
Asunto(s)
Células Epiteliales/fisiología , Factores de Transcripción Forkhead/metabolismo , Células Precursoras de Linfocitos T/fisiología , Linfocitos T/fisiología , Timo/fisiología , Animales , Presentación de Antígeno/genética , Comunicación Celular , Diferenciación Celular/genética , Linaje de la Célula/genética , Células Cultivadas , Selección Clonal Mediada por Antígenos/genética , Factores de Transcripción Forkhead/genética , Regulación de la Expresión Génica , Genoma/genética , Ratones , Ratones Endogámicos C57BL , Ratones Mutantes , Ratones TransgénicosRESUMEN
Not all bases in the human genome are equally prone to chance mutations. Michaelson et al. show that individuals can acquire clusters of de novo DNA changes and propose that the likelihood of single base pair change differs across three orders of magnitude.
RESUMEN
Although evidence exists for a causal association between 25-hydroxyvitamin D (25(OH)D) serum levels, and multiple sclerosis (MS), the role of variation in vitamin D receptor (VDR) binding in MS is unknown. Here, we leveraged previously identified variants associated with allele imbalance in VDR binding (VDR-binding variant; VDR-BV) in ChIP-exo data from calcitriol-stimulated lymphoblastoid cell lines and 25(OH)D serum levels from genome-wide association studies to construct genetic instrumental variables (GIVs). GIVs are composed of one or more genetic variants that serve as proxies for exposures of interest. Here, GIVs for both VDR-BVs and 25(OH)D were used in a two-sample Mendelian Randomization study to investigate the relationship between VDR binding at a locus, 25(OH)D serum levels, and MS risk. Data for 13,598 MS cases and 38,887 controls of European ancestry from Kaiser Permanente Northern California, Swedish MS studies, and the UK Biobank were included. We estimated the association between each VDR-BV GIV and MS. Significant interaction between a VDR-BV GIV and a GIV for serum 25OH(D) was evidence for a causal association between VDR-BVs and MS unbiased by pleiotropy. We observed evidence for associations between two VDR-BVs (rs2881514, rs2531804) and MS after correction for multiple tests. There was evidence of interaction between rs2881514 and a 25(OH)D GIV, providing evidence of a causal association between rs2881514 and MS. This study is the first to demonstrate evidence that variation in VDR binding at a locus contributes to MS risk. Our results are relevant to other autoimmune diseases in which vitamin D plays a role.
Asunto(s)
Esclerosis Múltiple , Humanos , Esclerosis Múltiple/genética , Receptores de Calcitriol/genética , Receptores de Calcitriol/metabolismo , Alelos , Estudio de Asociación del Genoma Completo , Vitamina D/metabolismo , Calcitriol , Polimorfismo de Nucleótido SimpleRESUMEN
Do long noncoding RNAs (lncRNAs) contribute little or substantively to human biology? To address how lncRNA loci and their transcripts, structures, interactions, and functions contribute to human traits and disease, we adopt a genome-wide perspective. We intend to provoke alternative interpretation of questionable evidence and thorough inquiry into unsubstantiated claims. We discuss pitfalls of lncRNA experimental and computational methods as well as opposing interpretations of their results. The majority of evidence, we argue, indicates that most lncRNA transcript models reflect transcriptional noise or provide minor regulatory roles, leaving relatively few human lncRNAs that contribute centrally to human development, physiology, or behavior. These important few tend to be spliced and better conserved but lack a simple syntax relating sequence to structure and mechanism, and so resist simple categorization. This genome-wide view should help investigators prioritize individual lncRNAs based on their likely contribution to human biology.
Asunto(s)
ARN Largo no Codificante , Genoma , Humanos , ARN Largo no Codificante/genéticaRESUMEN
In fission yeast, RNAi directs heterochromatin formation at centromeres, telomeres, and the mating type locus. Noncoding RNAs transcribed from repeat elements generate siRNAs that are incorporated into the Argonaute-containing RITS complex and direct it to nascent homologous transcripts. This leads to recruitment of the CLRC complex, including the histone methyltransferase Clr4, promoting H3K9 methylation and heterochromatin formation. A key question is what mediates the recruitment of Clr4/CLRC to transcript-bound RITS. We have identified a LIM domain protein, Stc1, that is required for centromeric heterochromatin integrity. Our analyses show that Stc1 is specifically required to establish H3K9 methylation via RNAi, and interacts both with the RNAi effector Ago1, and with the chromatin-modifying CLRC complex. Moreover, tethering Stc1 to a euchromatic locus is sufficient to induce silencing and heterochromatin formation independently of RNAi. We conclude that Stc1 associates with RITS on centromeric transcripts and recruits CLRC, thereby coupling RNAi to chromatin modification.
Asunto(s)
Proteínas Portadoras/metabolismo , Ensamble y Desensamble de Cromatina , Heterocromatina/metabolismo , Proteínas de Schizosaccharomyces pombe/metabolismo , Schizosaccharomyces/genética , Schizosaccharomyces/metabolismo , Proteínas de Ciclo Celular/genética , N-Metiltransferasa de Histona-Lisina , Metiltransferasas/genética , Interferencia de ARN , Proteínas de Schizosaccharomyces pombe/genéticaRESUMEN
Despite many decades of study, mitotic chromosome structure and composition remain poorly characterized. Here, we have integrated quantitative proteomics with bioinformatic analysis to generate a series of independent classifiers that describe the approximately 4,000 proteins identified in isolated mitotic chromosomes. Integrating these classifiers by machine learning uncovers functional relationships between protein complexes in the context of intact chromosomes and reveals which of the approximately 560 uncharacterized proteins identified here merits further study. Indeed, of 34 GFP-tagged predicted chromosomal proteins, 30 were chromosomal, including 13 with centromere-association. Of 16 GFP-tagged predicted nonchromosomal proteins, 14 were confirmed to be nonchromosomal. An unbiased analysis of the whole chromosome proteome from genetic knockouts of kinetochore protein Ska3/Rama1 revealed that the APC/C and RanBP2/RanGAP1 complexes depend on the Ska complex for stable association with chromosomes. Our integrated analysis predicts that up to 97 new centromere-associated proteins remain to be discovered in our data set.
Asunto(s)
Proteínas Cromosómicas no Histona/análisis , Cromosomas/química , Mitosis , Proteómica/métodos , Animales , Línea Celular , Proteínas Cromosómicas no Histona/metabolismo , Cromosomas/metabolismo , Humanos , Cinetocoros/metabolismo , Huso Acromático/metabolismoRESUMEN
To induce central T-cell tolerance, medullary thymic epithelial cells (mTEC) collectively express most protein-coding genes, thereby presenting an extensive library of tissue-restricted antigens (TRAs). To resolve mTEC diversity and whether promiscuous gene expression (PGE) is stochastic or coordinated, we sequenced transcriptomes of 6,894 single mTEC, enriching for 1,795 rare cells expressing either of two TRAs, TSPAN8 or GP2. Transcriptional heterogeneity allowed partitioning of mTEC into 15 reproducible subpopulations representing distinct maturational trajectories, stages and subtypes, including novel mTEC subsets, such as chemokine-expressing and ciliated TEC, which warrant further characterisation. Unexpectedly, 50 modules of genes were robustly defined each showing patterns of co-expression within individual cells, which were mainly not explicable by chromosomal location, biological pathway or tissue specificity. Further, TSPAN8+ and GP2+ mTEC were randomly dispersed within thymic medullary islands. Consequently, these data support observations that PGE exhibits ordered co-expression, although mechanisms underlying this instruction remain biologically indeterminate. Ordered co-expression and random spatial distribution of a diverse range of TRAs likely enhance their presentation and encounter with passing thymocytes, while maintaining mTEC identity.
Asunto(s)
Biomarcadores/metabolismo , Células Epiteliales/metabolismo , Análisis de la Célula Individual/métodos , Timo/metabolismo , Transcriptoma , Animales , Biomarcadores/análisis , Diferenciación Celular , Células Epiteliales/citología , Femenino , Perfilación de la Expresión Génica , Ratones , Ratones Endogámicos BALB C , Ratones Endogámicos C57BL , Especificidad de Órganos , Timo/citologíaRESUMEN
RNA is not only a messenger operating between DNA and protein. Transcription of essentially the entire eukaryotic genome generates a myriad of non-protein-coding RNA species that show complex overlapping patterns of expression and regulation. Although long noncoding RNAs (lncRNAs) are among the least well-understood of these transcript species, they cannot all be dismissed as merely transcriptional "noise." Here, we review the evolution of lncRNAs and their roles in transcriptional regulation, epigenetic gene regulation, and disease.
Asunto(s)
Evolución Molecular , Regulación de la Expresión Génica , ARN no Traducido/metabolismo , Animales , Humanos , ARN no Traducido/genéticaRESUMEN
Protein posttranslational modifications add great sophistication to biological systems. Citrullination, a key regulatory mechanism in human physiology and pathophysiology, is enigmatic from an evolutionary perspective. Although the citrullinating enzymes peptidylarginine deiminases (PADIs) are ubiquitous across vertebrates, they are absent from yeast, worms, and flies. Based on this distribution PADIs were proposed to have been horizontally transferred, but this has been contested. Here, we map the evolutionary trajectory of PADIs into the animal lineage. We present strong phylogenetic support for a clade encompassing animal and cyanobacterial PADIs that excludes fungal and other bacterial homologs. The animal and cyanobacterial PADI proteins share functionally relevant primary and tertiary synapomorphic sequences that are distinct from a second PADI type present in fungi and actinobacteria. Molecular clock calculations and sequence divergence analyses using the fossil record estimate the last common ancestor of the cyanobacterial and animal PADIs to be less than 1 billion years old. Additionally, under an assumption of vertical descent, PADI sequence change during this evolutionary time frame is anachronistically low, even when compared with products of likely endosymbiont gene transfer, mitochondrial proteins, and some of the most highly conserved sequences in life. The consilience of evidence indicates that PADIs were introduced from cyanobacteria into animals by horizontal gene transfer (HGT). The ancestral cyanobacterial PADI is enzymatically active and can citrullinate eukaryotic proteins, suggesting that the PADI HGT event introduced a new catalytic capability into the regulatory repertoire of animals. This study reveals the unusual evolution of a pleiotropic protein modification.
Asunto(s)
Cianobacterias , Transferencia de Gen Horizontal , Animales , Citrulinación , Secuencia Conservada , Cianobacterias/genética , Evolución Molecular , FilogeniaRESUMEN
Answering genetics' big data questions often needs an interdisciplinary team whose members freely share their diverse expertise in analysis, statistics, and computation. Sharing requires mutual trust and open acknowledgement of strengths and weaknesses, including those of established geneticists. Only then will newcomers to genetics contribute far beyond their entry-level expectations.
Asunto(s)
Biología Computacional/estadística & datos numéricos , Conducta Cooperativa , Genética/educación , Genética/normas , Servicios de Información/estadística & datos numéricos , Investigación Biomédica Traslacional , HumanosRESUMEN
To efficiently transform genetic associations into drug targets requires evidence that a particular gene, and its encoded protein, contribute causally to a disease. To achieve this, we employ a three-step proteome-by-phenome Mendelian Randomization (MR) approach. In step one, 154 protein quantitative trait loci (pQTLs) were identified and independently replicated. From these pQTLs, 64 replicated locally-acting variants were used as instrumental variables for proteome-by-phenome MR across 846 traits (step two). When its assumptions are met, proteome-by-phenome MR, is equivalent to simultaneously running many randomized controlled trials. Step 2 yielded 38 proteins that significantly predicted variation in traits and diseases in 509 instances. Step 3 revealed that amongst the 271 instances from GeneAtlas (UK Biobank), 77 showed little evidence of pleiotropy (HEIDI), and 92 evidence of colocalization (eCAVIAR). Results were wide ranging: including, for example, new evidence for a causal role of tyrosine-protein phosphatase non-receptor type substrate 1 (SHPS1; SIRPA) in schizophrenia, and a new finding that intestinal fatty acid binding protein (FABP2) abundance contributes to the pathogenesis of cardiovascular disease. We also demonstrated confirmatory evidence for the causal role of four further proteins (FGF5, IL6R, LPL, LTA) in cardiovascular disease risk.
Asunto(s)
Enfermedades Cardiovasculares/genética , Análisis de la Aleatorización Mendeliana , Proteoma/genética , Esquizofrenia/genética , Antígenos de Diferenciación/genética , Enfermedades Cardiovasculares/patología , Proteínas de Unión a Ácidos Grasos/genética , Femenino , Factor 5 de Crecimiento de Fibroblastos/genética , Estudios de Asociación Genética/métodos , Humanos , Lipoproteína Lipasa/genética , Linfotoxina-alfa/genética , Masculino , Sitios de Carácter Cuantitativo , Receptores Inmunológicos/genética , Receptores de Interleucina-6/genética , Esquizofrenia/patologíaRESUMEN
The Caenorhabditis elegans gene rec-1 was the first genetic locus identified in metazoa to affect the distribution of meiotic crossovers along the chromosome. We report that rec-1 encodes a distant paralog of HIM-5, which was discovered by whole-genome sequencing and confirmed by multiple genome-edited alleles. REC-1 is phosphorylated by cyclin-dependent kinase (CDK) in vitro, and mutation of the CDK consensus sites in REC-1 compromises meiotic crossover distribution in vivo. Unexpectedly, rec-1; him-5 double mutants are synthetic-lethal due to a defect in meiotic double-strand break formation. Thus, we uncovered an unexpected robustness to meiotic DSB formation and crossover positioning that is executed by HIM-5 and REC-1 and regulated by phosphorylation.
Asunto(s)
Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , Caenorhabditis elegans/metabolismo , Proteínas de Ciclo Celular/genética , Intercambio Genético/genética , Roturas del ADN de Doble Cadena , Animales , Proteínas de Caenorhabditis elegans/metabolismo , Proteínas de Ciclo Celular/metabolismo , Meiosis/genéticaRESUMEN
Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is a complex multisystem illness that lacks effective therapy and a biomedical understanding of its causes. Despite a prevalence of â¼0.2-0.4% and its high public health burden, and evidence that it has a heritable component, ME/CFS has not yet benefited from the advances in technology and analytical tools that have improved our understanding of many other complex diseases. Here we critically review existing evidence that genetic factors alter ME/CFS risk before concluding that most ME/CFS candidate gene associations are not replicated by the larger CFS cohort within the UK Biobank. Multiple genome-wide association studies of this cohort also have not yielded consistently significant associations. Ahead of upcoming larger genome-wide association studies, we discuss how these could generate new lines of enquiry into the DNA variants, genes and cell types that are causally involved in ME/CFS disease.
Asunto(s)
Epigénesis Genética , Síndrome de Fatiga Crónica/patología , Regulación de la Expresión Génica , Marcadores Genéticos , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Estudios de Cohortes , Síndrome de Fatiga Crónica/etiología , Síndrome de Fatiga Crónica/metabolismo , Humanos , Factores de RiesgoRESUMEN
Many long non-coding RNAs (lncRNAs) are expressed during central nervous system (CNS) development, yet their in vivo roles and mechanisms of action remain poorly understood. Paupar, a CNS-expressed lncRNA, controls neuroblastoma cell growth by binding and modulating the activity of transcriptional regulatory elements in a genome-wide manner. We show here that the Paupar lncRNA directly binds KAP1, an essential epigenetic regulatory protein, and thereby regulates the expression of shared target genes important for proliferation and neuronal differentiation. Paupar promotes KAP1 chromatin occupancy and H3K9me3 deposition at a subset of distal targets, through the formation of a ribonucleoprotein complex containing Paupar, KAP1 and the PAX6 transcription factor. Paupar-KAP1 genome-wide co-occupancy reveals a fourfold enrichment of overlap between Paupar and KAP1 bound sequences, the majority of which also appear to associate with PAX6. Furthermore, both Paupar and Kap1 loss-of-function in vivo disrupt olfactory bulb neurogenesis. These observations provide important conceptual insights into the trans-acting modes of lncRNA-mediated epigenetic regulation and the mechanisms of KAP1 genomic recruitment, and identify Paupar and Kap1 as regulators of neurogenesis in vivo.
Asunto(s)
Cromatina/genética , Células-Madre Neurales/citología , Neuroblastoma/patología , Neurogénesis , Bulbo Olfatorio/citología , ARN Largo no Codificante/metabolismo , Proteína 28 que Contiene Motivos Tripartito/metabolismo , Animales , Animales Recién Nacidos , Ciclo Celular , Proliferación Celular , Células Cultivadas , Epigénesis Genética , Genómica , Ratones , Células-Madre Neurales/metabolismo , Neuroblastoma/genética , Neuroblastoma/metabolismo , Bulbo Olfatorio/metabolismo , Factor de Transcripción PAX6/genética , Factor de Transcripción PAX6/metabolismo , ARN Largo no Codificante/genética , Elementos Reguladores de la Transcripción , Proteína 28 que Contiene Motivos Tripartito/genéticaRESUMEN
Accurate genome duplication underlies genetic homeostasis. Metazoan Mdm2 binding protein (MTBP) forms a main regulatory platform for origin firing together with Treslin/TICRR and TopBP1 (Topoisomerase II binding protein 1 (TopBP1)-interacting replication stimulating protein/TopBP1-interacting checkpoint and replication regulator). We report the first comprehensive analysis of MTBP and reveal conserved and metazoa-specific MTBP functions in replication. This suggests that metazoa have evolved specific molecular mechanisms to adapt replication principles conserved with yeast to the specific requirements of the more complex metazoan cells. We uncover one such metazoa-specific process: a new replication factor, cyclin-dependent kinase 8/19-cyclinC (Cdk8/19-cyclin C), binds to a central domain of MTBP. This interaction is required for complete genome duplication in human cells. In the absence of MTBP binding to Cdk8/19-cyclin C, cells enter mitosis with incompletely duplicated chromosomes, and subsequent chromosome segregation occurs inaccurately. Using remote homology searches, we identified MTBP as the metazoan orthologue of yeast synthetic lethal with Dpb11 7 (Sld7). This homology finally demonstrates that the set of yeast core factors sufficient for replication initiation in vitro is conserved in metazoa. MTBP and Sld7 contain two homologous domains that are present in no other protein, one each in the N and C termini. In MTBP the conserved termini flank the metazoa-specific Cdk8/19-cyclin C binding region and are required for normal origin firing in human cells. The N termini of MTBP and Sld7 share an essential origin firing function, the interaction with Treslin/TICRR or its yeast orthologue Sld3, respectively. The C termini may function as homodimerisation domains. Our characterisation of broadly conserved and metazoa-specific initiation processes sets the basis for further mechanistic dissection of replication initiation in vertebrates. It is a first step in understanding the distinctions of origin firing in higher eukaryotes.
Asunto(s)
Proteínas Portadoras/metabolismo , Proteínas Portadoras/fisiología , Proteínas de Saccharomyces cerevisiae/metabolismo , Proteínas Portadoras/genética , Proteínas de Ciclo Celular/metabolismo , Biología Computacional/métodos , Ciclina C/genética , Ciclina C/metabolismo , Quinasa 8 Dependiente de Ciclina/genética , Quinasa 8 Dependiente de Ciclina/metabolismo , Quinasa 8 Dependiente de Ciclina/fisiología , Quinasas Ciclina-Dependientes/metabolismo , Quinasas Ciclina-Dependientes/fisiología , Replicación del ADN/fisiología , Proteínas de Unión al ADN/metabolismo , Células HEK293 , Células HeLa , Humanos , Mitosis , Unión Proteica , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genéticaRESUMEN
BACKGROUND: Myalgic encephalomyelitis / chronic fatigue syndrome (ME/CFS) is a common, long-term condition characterised by post-exertional malaise, often with fatigue that is not significantly relieved by rest. ME/CFS has no confirmed diagnostic test or effective treatment and we lack knowledge of its causes. Identification of genes and cellular processes whose disruption adds to ME/CFS risk is a necessary first step towards development of effective therapy. METHODS: Here we describe DecodeME, an ongoing study co-produced by people with lived experience of ME/CFS and scientists. Together we designed the study and obtained funding and are now recruiting up to 25,000 people in the UK with a clinical diagnosis of ME/CFS. Those eligible for the study are at least 16 years old, pass international study criteria, and lack any alternative diagnoses that can result in chronic fatigue. These will include 5,000 people whose ME/CFS diagnosis was a consequence of SARS-CoV-2 infection. Questionnaires are completed online or on paper. Participants' saliva DNA samples are acquired by post, which improves participation by more severely-affected individuals. Digital marketing and social media approaches resulted in 29,000 people with ME/CFS in the UK pre-registering their interest in participating. We will perform a genome-wide association study, comparing participants' genotypes with those from UK Biobank as controls. This should generate hypotheses regarding the genes, mechanisms and cell types contributing to ME/CFS disease aetiology. DISCUSSION: The DecodeME study has been reviewed and given a favourable opinion by the North West - Liverpool Central Research Ethics Committee (21/NW/0169). Relevant documents will be available online ( www.decodeme.org.uk ). Genetic data will be disseminated as associated variants and genomic intervals, and as summary statistics. Results will be reported on the DecodeME website and via open access publications.