ABSTRACT
Human genomics is undergoing a step change from being a predominantly research-driven activity to one driven through health care as many countries in Europe now have nascent precision medicine programmes. To maximize the value of the genomic data generated, these data will need to be shared between institutions and across countries. In recognition of this challenge, 21 European countries recently signed a declaration to transnationally share data on at least 1 million human genomes by 2022. In this Roadmap, we identify the challenges of data sharing across borders and demonstrate that European research infrastructures are well-positioned to support the rapid implementation of widespread genomic data access.
Subject(s)
Biomedical Research , Genome, Human , Human Genome Project , Europe , HumansABSTRACT
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
ABSTRACT
The Ataxia Global Initiative (AGI) is a worldwide multi-stakeholder research platform to systematically enhance trial-readiness in degenerative ataxias. The next-generation sequencing (NGS) working group of the AGI aims to improve methods, platforms, and international standards for ataxia NGS analysis and data sharing, ultimately allowing to increase the number of genetically ataxia patients amenable for natural history and treatment trials. Despite extensive implementation of NGS for ataxia patients in clinical and research settings, the diagnostic gap remains sizeable, as approximately 50% of patients with hereditary ataxia remain genetically undiagnosed. One current shortcoming is the fragmentation of patients and NGS datasets on different analysis platforms and databases around the world. The AGI NGS working group in collaboration with the AGI associated research platforms-CAGC, GENESIS, and RD-Connect GPAP-provides clinicians and scientists access to user-friendly and adaptable interfaces to analyze genome-scale patient data. These platforms also foster collaboration within the ataxia community. These efforts and tools have led to the diagnosis of > 500 ataxia patients and the discovery of > 30 novel ataxia genes. Here, the AGI NGS working group presents their consensus recommendations for NGS data sharing initiatives in the ataxia field, focusing on harmonized NGS variant analysis and standardized clinical and metadata collection, combined with collaborative data and analysis tool sharing across platforms.
Subject(s)
Cerebellar Ataxia , Spinocerebellar Degenerations , Humans , Cerebellar Ataxia/genetics , Databases, Factual , High-Throughput Nucleotide Sequencing/methods , Information DisseminationABSTRACT
Consanguineous marriages have a prevalence rate of 24% in Turkey. These carry an increased risk of autosomal recessive genetic conditions, leading to severe disability or premature death, with a significant health and economic burden. A definitive molecular diagnosis could not be achieved in these children previously, as infrastructures and access to sophisticated diagnostic options were limited. We studied the cause of neurogenetic disease in 246 children from 190 consanguineous families recruited in three Turkish hospitals between 2016 and 2020. All patients underwent deep phenotyping and trio whole exome sequencing, and data were integrated in advanced international bioinformatics platforms. We detected causative variants in 119 known disease genes in 72% of families. Due to overlapping phenotypes 52% of the confirmed genetic diagnoses would have been missed on targeted diagnostic gene panels. Likely pathogenic variants in 27 novel genes in 14% of the families increased the diagnostic yield to 86%. Eighty-two per cent of causative variants (141/172) were homozygous, 11 of which were detected in genes previously only associated with autosomal dominant inheritance. Eight families carried two pathogenic variants in different disease genes. De novo (9.3%), X-linked recessive (5.2%) and compound heterozygous (3.5%) variants were less frequent compared to non-consanguineous populations. This cohort provided a unique opportunity to better understand the genetic characteristics of neurogenetic diseases in a consanguineous population. Contrary to what may be expected, causative variants were often not on the longest run of homozygosity and the diagnostic yield was lower in families with the highest degree of consanguinity, due to the high number of homozygous variants in these patients. Pathway analysis highlighted that protein synthesis/degradation defects and metabolic diseases are the most common pathways underlying paediatric neurogenetic disease. In our cohort 164 families (86%) received a diagnosis, enabling prevention of transmission and targeted treatments in 24 patients (10%). We generated an important body of genomic data with lasting impacts on the health and wellbeing of consanguineous families and economic benefit for the healthcare system in Turkey and elsewhere. We demonstrate that an untargeted next generation sequencing approach is far superior to a more targeted gene panel approach, and can be performed without specialized bioinformatics knowledge by clinicians using established pipelines in populations with high rates of consanguinity.
Subject(s)
Exome , Consanguinity , Exome/genetics , Homozygote , Humans , Mutation , Pedigree , Phenotype , Exome SequencingABSTRACT
Mantle cell lymphoma (MCL) is characterized by the t(11;14)(q13;q32) translocation resulting in overexpression of cyclin D1. However, a small subset of cyclin D1- MCL has been recognized, and approximately one-half of them harbor CCND2 translocations while the primary event in cyclin D1-/D2- MCL remains elusive. To identify other potential mechanisms driving MCL pathogenesis, we investigated 56 cyclin D1-/SOX11+ MCL by fluorescence in situ hybridization (FISH), whole-genome/exome sequencing, and gene-expression and copy-number arrays. FISH with break-apart probes identified CCND2 rearrangements in 39 cases (70%) but not CCND3 rearrangements. We analyzed 3 of these negative cases by whole-genome/exome sequencing and identified IGK (n = 2) and IGL (n = 1) enhancer hijackings near CCND3 that were associated with cyclin D3 overexpression. By specific FISH probes, including the IGK enhancer region, we detected 10 additional cryptic IGK juxtapositions to CCND3 (6 cases) and CCND2 (4 cases) in MCL that overexpressed, respectively, these cyclins. A minor subset of 4 cyclin D1- MCL cases lacked cyclin D rearrangements and showed upregulation of CCNE1 and CCNE2. These cases had blastoid morphology, high genomic complexity, and CDKN2A and RB1 deletions. Both genomic and gene-expression profiles of cyclin D1- MCL cases were indistinguishable from cyclin D1+ MCL. In conclusion, virtually all cyclin D1- MCLs carry CCND2/CCND3 rearrangements with immunoglobulin genes, including a novel IGK/L enhancer hijacking mechanism. A subset of cyclin D1-/D2-/D3- MCL with aggressive features has cyclin E dysregulation. Specific FISH probes may allow the molecular identification and diagnosis of cyclin D1- MCL.
Subject(s)
Cyclin D2/genetics , Cyclin D3/genetics , Enhancer Elements, Genetic , Gene Rearrangement , Immunoglobulin Light Chains/genetics , Lymphoma, Mantle-Cell/genetics , Aged , Cyclin D1/genetics , Cyclin D1/metabolism , Female , Humans , Lymphoma, Mantle-Cell/pathology , Male , Middle Aged , Prognosis , SOXC Transcription Factors/genetics , Translocation, GeneticABSTRACT
Plasmablastic lymphoma mutational profile is undescribed. Here we performed a targeted exonic NGS analysis of 30 plasmablastic lymphoma cases with a B cell lymphoma dedicated panel and FISH for the detection of MYC rearrangements. A complete phenotyping of the neoplastic and microenvironment cell populations was also performed. We have identified an enrichment in recurrent genetic events in MYC (69% with MYC translocation or amplification and 3 cases with missense point mutations), PRDM1/Blimp1 and STAT3 mutations. These gene mutations were more frequent in EBV positive disease. Other genetic events included mutations in BRAF, EP300, BCR (CD79A and CD79B), NOTCH pathway (NOTCH2, NOTCH1 and SGK1) and MYD88pL265P. Immunohistochemical analysis showed consistent MYC expression, higher in cases with MYC rearrangements together with phospho-STAT3 (Tyr705) overexpression in cases with STAT3 SH2 domain mutations. Microenvironment populations were heterogeneous and unrelated with EBV, with an enrichment of Tumor Associated Macrophages (TAM) and PD1 positive T cells. PD-L1 was expressed in all cases in the TAM population but only in 5 cases in the neoplastic cells (4 out of 14 EBV positive cases). HLA expression was absent in the majority of PBL cases. In summary, Plasmablastic lymphoma mutational profile is heterogeneous and related with EBV infection. Genetic events in MYC, STAT3 and PRDM1/Blimp1 are more frequent in EBV positive disease. An enrichment in TAM and PD1 reactive T lymphocytes is found in the microenvironment of PBL cases, that express PD-L1 in the neoplastic cells in a fraction of cases.
Subject(s)
Epstein-Barr Virus Infections , Plasmablastic Lymphoma , Carcinogenesis , Humans , Plasmablastic Lymphoma/diagnosis , Plasmablastic Lymphoma/genetics , Proto-Oncogene Proteins c-myc/genetics , Proto-Oncogene Proteins c-myc/metabolism , STAT3 Transcription Factor/genetics , Translocation, Genetic , Tumor Microenvironment/geneticsABSTRACT
The Human Phenotype Ontology (HPO)-a standardized vocabulary of phenotypic abnormalities associated with 7000+ diseases-is used by thousands of researchers, clinicians, informaticians and electronic health record systems around the world. Its detailed descriptions of clinical abnormalities and computable disease definitions have made HPO the de facto standard for deep phenotyping in the field of rare disease. The HPO's interoperability with other ontologies has enabled it to be used to improve diagnostic accuracy by incorporating model organism data. It also plays a key role in the popular Exomiser tool, which identifies potential disease-causing variants from whole-exome or whole-genome sequencing data. Since the HPO was first introduced in 2008, its users have become both more numerous and more diverse. To meet these emerging needs, the project has added new content, language translations, mappings and computational tooling, as well as integrations with external community data. The HPO continues to collaborate with clinical adopters to improve specific areas of the ontology and extend standardized disease descriptions. The newly redesigned HPO website (www.human-phenotype-ontology.org) simplifies browsing terms and exploring clinical features, diseases, and human genes.
Subject(s)
Biological Ontologies , Computational Biology/methods , Congenital Abnormalities/genetics , Genetic Predisposition to Disease/genetics , Knowledge Bases , Rare Diseases/genetics , Congenital Abnormalities/diagnosis , Databases, Genetic , Genetic Variation , Humans , Internet , Phenotype , Rare Diseases/diagnosis , Whole Genome Sequencing/methodsABSTRACT
Purpose: The aim of the present work is the molecular diagnosis of three patients with deafness and retinal degeneration. Methods: Three patients from two unrelated families were initially analyzed with custom gene panels for Usher genes, non-syndromic hearing loss, or inherited syndromic retinopathies and further investigated by means of clinical or whole exome sequencing. Results: The study allowed us to detect likely pathogenic variants in PEX6, a gene typically involved in peroxisomal biogenesis disorders (PBDs). Beside deaf-blindness, both families showed additional features: Siblings from Family 1 showed enamel alteration and abnormal peroxisome. In addition, the brother had mild neurodevelopmental delay and nephrolithiasis. The case II:1 from Family 2 showed intellectual disability, enamel alteration, and dysmorphism. Conclusions: We have reported three new cases with pathogenic variants in PEX6 presenting with milder forms of the Zellweger spectrum disorders (ZSD). The three cases showed distinct clinical features. Thus, expanding the phenotypic spectrum of PBDs and ascertaining exome sequencing is an effective strategy for an accurate diagnosis of clinically overlapping and genetically heterogeneous disorders such as deafness-blindness association.
Subject(s)
ATPases Associated with Diverse Cellular Activities/genetics , Hearing Loss, Sensorineural/genetics , Retinitis Pigmentosa/genetics , Zellweger Syndrome/genetics , Adult , Child , Craniofacial Abnormalities/genetics , Dental Enamel/abnormalities , Female , Humans , Intellectual Disability/genetics , Male , Mutation , Nephrolithiasis/genetics , Neurodevelopmental Disorders/genetics , Pedigree , Peroxisomes/genetics , Peroxisomes/metabolism , Peroxisomes/pathology , Exome SequencingABSTRACT
High-throughput screening of an ethyl methanesulfonate-generated mutant collection of Cucurbita pepo using the ethylene triple-response test resulted in the identification of two semi-dominant ethylene-insensitive mutants: etr1a and etr2b. Both mutations altered sex determination mechanisms, promoting conversion of female into bisexual or hermaphrodite flowers, and monoecy into andromonoecy, thereby delaying the transition to female flowering and reducing the number of pistillate flowers per plant. The mutations also altered the growth rate and maturity of petals and carpels in pistillate flowers, lengthening the time required for flowers to reach anthesis, as well as stimulating the growth rate of ovaries and the parthenocarpic development of fruits. Whole-genome sequencing allowed identification of the causal mutation of the phenotypes as two missense mutations in the coding region of CpETR1A and CpETR2B, each one corresponding to one of the duplicates of ethylene receptor genes highly homologous to Arabidopsis ETR1 and ETR2. The phenotypes of homozygous and heterozygous single- and double-mutant plants indicated that the two ethylene receptors cooperate in the control of the ethylene response. The level of ethylene insensitivity, which was determined by the strength of each mutant allele and the dose of wild-type and mutant etr1a and etr2b alleles, correlated with the degree of phenotypic changes in the mutants.
Subject(s)
Cucurbita/genetics , Flowers/growth & development , Gene Expression Regulation, Plant/physiology , Plant Proteins/genetics , Receptors, Cell Surface/genetics , Cucurbita/growth & development , Flowers/genetics , Plant Proteins/metabolism , Receptors, Cell Surface/metabolismABSTRACT
Phenotype-based filtering and prioritization contribute to the interpretation of genetic variants detected in exome sequencing. However, it is currently unclear how extensive this phenotypic annotation should be. In this study, we compare methods for incorporating phenotype into the interpretation process and assess the extent to which phenotypic annotation aids prioritization of the correct variant. Using a cohort of 29 patients with congenital myasthenic syndromes with causative variants in known or newly discovered disease genes, exome data and the Human Phenotype Ontology (HPO)-coded phenotypic profiles, we show that gene-list filters created from phenotypic annotations perform similarly to curated disease-gene virtual panels. We use Exomiser, a prioritization tool incorporating phenotypic comparisons, to rank candidate variants while varying phenotypic annotation. Analyzing 3,712 combinations, we show that increasing phenotypic annotation improved prioritization of the causative variant, from 62% ranked first on variant alone to 90% with seven HPO annotations. We conclude that any HPO-based phenotypic annotation aids variant discovery and that annotation with over five terms is recommended in our context. Although focused on a constrained cohort, this provides real-world validation of the utility of phenotypic annotation for variant prioritization. Further research is needed to extend this concept to other diseases and more diverse cohorts.
Subject(s)
Computational Biology/methods , Exome Sequencing , Exome , Molecular Sequence Annotation , Neuromuscular Diseases/diagnosis , Neuromuscular Diseases/genetics , Phenotype , Databases, Genetic , Genetic Association Studies , Genetic Predisposition to Disease , Humans , Rare Diseases/diagnosis , Rare Diseases/genetics , Reproducibility of ResultsABSTRACT
3-Methylglutaconic aciduria (3-MGA-uria) syndromes comprise a heterogeneous group of diseases associated with mitochondrial membrane defects. Whole-exome sequencing identified compound heterozygous mutations in TIMM50 (c.[341 G>A];[805 G>A]) in a boy with West syndrome, optic atrophy, neutropenia, cardiomyopathy, Leigh syndrome, and persistent 3-MGA-uria. A comprehensive analysis of the mitochondrial function was performed in fibroblasts of the patient to elucidate the molecular basis of the disease. TIMM50 protein was severely reduced in the patient fibroblasts, regardless of the normal mRNA levels, suggesting that the mutated residues might be important for TIMM50 protein stability. Severe morphological defects and ultrastructural abnormalities with aberrant mitochondrial cristae organization in muscle and fibroblasts were found. The levels of fully assembled OXPHOS complexes and supercomplexes were strongly reduced in fibroblasts from this patient. High-resolution respirometry demonstrated a significant reduction of the maximum respiratory capacity. A TIMM50-deficient HEK293T cell line that we generated using CRISPR/Cas9 mimicked the respiratory defect observed in the patient fibroblasts; notably, this defect was rescued by transfection with a plasmid encoding the TIMM50 wild-type protein. In summary, we demonstrated that TIMM50 deficiency causes a severe mitochondrial dysfunction by targeting key aspects of mitochondrial physiology, such as the maintenance of proper mitochondrial morphology, OXPHOS assembly, and mitochondrial respiratory capacity.
Subject(s)
Membrane Transport Proteins/genetics , Mitochondria/genetics , Mitochondria/metabolism , Mutation , Biomarkers , Electron Transport , Energy Metabolism , Fibroblasts/metabolism , Gene Expression , Genetic Predisposition to Disease , Humans , Infant , Male , Mitochondria/ultrastructure , Mitochondrial Diseases/genetics , Mitochondrial Precursor Protein Import Complex Proteins , Mitochondrial Proteins/genetics , Mitochondrial Proteins/metabolism , Muscle, Skeletal/metabolism , Muscle, Skeletal/ultrastructure , Phenotype , Protein Transport , Spasms, Infantile/diagnosis , Spasms, Infantile/genetics , Exome SequencingABSTRACT
Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of messenger RNA and microRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project--the first uniformly processed high-throughput RNA-sequencing data from multiple human populations with high-quality genome sequences. We discover extremely widespread genetic variation affecting the regulation of most genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on the cellular mechanisms of regulatory and loss-of-function variation, and allows us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.
Subject(s)
Genetic Variation/genetics , Genome, Human/genetics , High-Throughput Nucleotide Sequencing , Sequence Analysis, RNA , Transcriptome/genetics , Alleles , Cell Line, Transformed , Exons/genetics , Gene Expression Profiling , Humans , Polymorphism, Single Nucleotide/genetics , Quantitative Trait Loci/genetics , RNA, Messenger/analysis , RNA, Messenger/geneticsABSTRACT
Congenital myasthenic syndrome (CMS) is a heterogeneous disorder that causes fatigable muscle weakness. CMS has been associated with variants in the MuSK gene and, to date, 16 patients have been reported. MuSK-CMS patients present a different phenotypic pattern of limb girdle weakness. Here, we describe four additional patients and discuss the phenotypic and clinical relationship with those previously reported. Two novel damaging missense variants are described: c.1742T > A; p.I581N found in homozygosis, and c.1634T > C; p.L545P found in compound heterozygosis with p.R166*. The reported patients had predominant limb girdle weakness with symptom onset at 12, 17, 18, and 30 years of age, and the majority exhibited a good clinical response to Salbutamol therapy, but not to esterase inhibitors. Meta-analysis including previously reported variants revealed an increased likelihood of a severe, respiratory phenotype with null alleles. Missense variants exclusively affecting the kinase domain, but not the catalytic site, are associated with late onset. These data refine the phenotype associated with MuSK-related CMS.
Subject(s)
Genes, Recessive , Muscular Dystrophies, Limb-Girdle/genetics , Muscular Dystrophies, Limb-Girdle/pathology , Mutation , Myasthenic Syndromes, Congenital/genetics , Myasthenic Syndromes, Congenital/pathology , Receptor Protein-Tyrosine Kinases/genetics , Receptors, Cholinergic/genetics , Adult , Age of Onset , Child , Female , Humans , Male , Muscle Weakness/genetics , Muscle Weakness/pathology , Prognosis , Young AdultABSTRACT
Congenital disorders of glycosylation (CDG) are a heterogeneous and rapidly growing group of diseases caused by abnormal glycosylation of proteins and/or lipids. Mutations in genes involved in the homeostasis of the endoplasmic reticulum (ER), the Golgi apparatus (GA), and the vesicular trafficking from the ER to the ER-Golgi intermediate compartment (ERGIC) have been found to be associated with CDG. Here, we report a patient with defects in both N- and O-glycosylation combined with a delayed vesicular transport in the GA due to mutations in TRAPPC11, a subunit of the TRAPPIII complex. TRAPPIII is implicated in the anterograde transport from the ER to the ERGIC as well as in the vesicle export from the GA. This report expands the spectrum of genetic alterations associated with CDG, providing new insights for the diagnosis and the understanding of the physiopathological mechanisms underlying glycosylation disorders.
Subject(s)
Congenital Disorders of Glycosylation/diagnosis , Congenital Disorders of Glycosylation/genetics , Mutation , Vesicular Transport Proteins/genetics , Abnormalities, Multiple/diagnosis , Abnormalities, Multiple/genetics , Alleles , Amino Acid Substitution , Brain/abnormalities , Brain/diagnostic imaging , Genotype , Humans , Magnetic Resonance Imaging/methods , Male , Phenotype , Whole Genome SequencingABSTRACT
As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next-generation sequencing as standard practice in research and diagnostics. However, computing cost-performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state-of-the-art read aligners (BWA-MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available.
Subject(s)
Computational Biology/methods , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Exome , Genetic Variation , Genome, Human , Humans , SoftwareABSTRACT
BACKGROUND: Taste receptors (TASRs) are essential for the body's recognition of chemical compounds. In the tongue, TASRs sense the sweet and umami and the toxin-related bitter taste thus promoting a particular eating behaviour. Moreover, their relevance in other organs is now becoming evident. In the intestine, they regulate nutrient absorption and gut motility. Upon ligand binding, TASRs activate the appetite-reward circuitry to signal the nervous system and keep body homeostasis. With the aim to identify genetic variation in the swine TASRs and in the genes from the appetite and the reward pathways, we have sequenced the exons of 201 TASRs and appetite-reward genes from 304 pigs belonging to ten breeds, wild boars and to two phenotypically extreme groups from a F2 resource with data on growth and fat deposition. RESULTS: We identified 2,766 coding variants 395 of which were predicted to have a strong impact on protein sequence and function. 334 variants were present in only one breed and at predicted alternative allele frequency (pAAF) ≥ 0.1. The Asian pigs and the wild boars showed the largest proportion of breed specific variants. We also compared the pAAF of the two F2 groups and found that variants in TAS2R39 and CD36 display significant differences suggesting that these genes could influence growth and fat deposition. We developed a 128-variant genotyping assay and confirmed 57 of these variants. CONCLUSIONS: We have identified thousands of variants affecting TASRs as well as genes involved in the appetite and the reward mechanisms. Some of these genes have been already associated to taste preferences, appetite or behaviour in humans and mouse. We have also detected indications of a potential relationship of some of these genes with growth and fat deposition, which could have been caused by changes in taste preferences, appetite or reward and ultimately impact on food intake. A genotyping array with 57 variants in 31 of these genes is now available for genotyping and start elucidating the impact of genetic variation in these genes on pig biology and breeding.
Subject(s)
Appetite/genetics , Mutation , Receptors, G-Protein-Coupled/genetics , Taste/genetics , Animals , Breeding , Genetic Variation , Phenotype , Phylogeny , Polymorphism, Single Nucleotide , Reproducibility of Results , Selection, Genetic , Sequence Analysis, DNA , SwineABSTRACT
Plasmids of the incompatibility group HI1 (IncHI1) have been isolated from several Gram-negative pathogens and are associated with the spread of multidrug resistance. Their conjugation is tightly regulated and it is inhibited at temperatures higher than 30°C, indicating that conjugation occurs outside warm-blooded hosts. Using R27, the prototype of IncHI1 plasmids, we report that plasmid transfer efficiency in E. coli strongly depends on the physiological state of the donor cells. Conjugation frequency is high when cells are actively growing, dropping sharply when cells enter the stationary phase of growth. Accordingly, our transcriptomic assays show significant downregulation of numerous R27 genes during the stationary phase, including several tra (transfer) genes. Growth phase-dependent regulation of tra genes transcription is independent of H-NS, a silencer of horizontal gene transfer, and ppGpp and RpoS, regulators of the stationary phase, but highly dependent on the plasmid-encoded regulatory circuit TrhR/TrhY-HtdA. The metabolic sensor cAMP, whose synthesis is chromosomally encoded, is also involved in the growth phase regulation of R27 conjugation by modulating htdA expression. Our data suggest that the involvement of regulators encoded by both chromosome and plasmid are required for efficient physiological control of IncHI1 plasmid conjugation.
Subject(s)
Conjugation, Genetic , Cyclic AMP/metabolism , Escherichia coli Proteins/metabolism , Escherichia coli/growth & development , Escherichia coli/genetics , Plasmids/genetics , Escherichia coli/metabolism , Escherichia coli Proteins/genetics , Gene Expression Regulation, Bacterial , Gene Transfer, Horizontal , Plasmids/metabolism , RegulonABSTRACT
There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow.
Subject(s)
Genetic Predisposition to Disease/genetics , Information Dissemination/methods , Rare Diseases/genetics , Database Management Systems , Databases, Genetic , Genetic Association Studies , Humans , SoftwareABSTRACT
BACKGROUND: Modern horses represent heterogeneous populations specifically selected for appearance and performance. Genomic regions under high selective pressure show characteristic runs of homozygosity (ROH) which represent a low genetic diversity. This study aims at detecting the number and functional distribution of ROHs in different horse populations using next generation sequencing data. METHODS: Next generation sequencing was performed for two Sorraia, one Dülmen Horse, one Arabian, one Saxon-Thuringian Heavy Warmblood, one Thoroughbred and four Hanoverian. After quality control reads were mapped to the reference genome EquCab2.70. ROH detection was performed using PLINK, version 1.07 for a trimmed dataset with 11,325,777 SNPs and a mean read depth of 12. Stretches with homozygous genotypes of >40 kb as well as >400 kb were defined as ROHs. SNPs within consensus ROHs were tested for neutrality. Functional classification was done for genes annotated within ROHs using PANTHER gene list analysis and functional variants were tested for their distribution among breed or non-breed groups. RESULTS: ROH detection was performed using whole genome sequences of ten horses of six populations representing various breed types and non-breed horses. In total, an average number of 3492 ROHs were detected in windows of a minimum of 50 consecutive homozygous SNPs and an average number of 292 ROHs in windows of 500 consecutive homozygous SNPs. Functional analyses of private ROHs in each horse revealed a high frequency of genes affecting cellular, metabolic, developmental, immune system and reproduction processes. In non-breed horses, 198 ROHs in 50-SNP windows and seven ROHs in 500-SNP windows showed an enrichment of genes involved in reproduction, embryonic development, energy metabolism, muscle and cardiac development whereas all seven breed horses revealed only three common ROHs in 50-SNP windows harboring the fertility-related gene YES1. In the Hanoverian, a total of 18 private ROHs could be shown to be located in the region of genes potentially involved in neurologic control, signaling, glycogen balance and reproduction. Comparative analysis of homozygous stretches common in all ten horses displayed three ROHs which were all located in the region of KITLG, the ligand of KIT known to be involved in melanogenesis, haematopoiesis and gametogenesis. CONCLUSIONS: The results of this study give a comprehensive insight into the frequency and number of ROHs in various horses and their potential influence on population diversity and selection pressures. Comparisons of breed and non-breed horses suggest a significant artificial as well as natural selection pressure on reproduction performance in all types of horse populations.
Subject(s)
Breeding , Horses/genetics , Reproduction/genetics , Selection, Genetic , Animals , Genomics , Genotype , High-Throughput Nucleotide Sequencing , Homozygote , Phenotype , Polymorphism, Single NucleotideABSTRACT
BACKGROUND: Exome sequencing has become a popular method to evaluate undirected mutagenesis experiments in mice. However, the most suitable mouse strain for the biological model may be relatively distant from the standard mouse reference genome. For pinpointing causative variants, a matching reference with gene annotations is essential, but not always readily available. RESULTS: We present an approach that allows to use murine Ensembl annotations on alternative mouse strain assemblies. We resolved ENU-induced mutation screening for 8 phenotypic mutant lines generated on C3HeB/FeJ background aligning the sequences against the closely related, but not annotated reference of C3H/HeJ. Variants occurring in all strains were filtered out as specific for the C3HeB/FeJ strain but unrelated to mutagenesis. Variants occurring exclusively in all individuals of one mutant line and matching the inheritance model were selected as mutagenesis-related. These variants were annotated with gene and exon names lifted over from the standard murine reference mm9 to C3H/HeJ using megablast. For each mutant line, we could restrict the results to exonic variants in between 1 and 23 genes. CONCLUSIONS: The presented method of exonic annotation lift-over proved to be a valuable tool in the search for mutagenesis-derived coding genomic variants and the assessment of genotype-phenotype relationships.