RESUMO
SUMMARY: Missense mutations that change protein stability are strongly associated with human genetic disease. With the recent availability of predicted structures for all human proteins generated using the AlphaFold2 prediction model, genome-wide assessment of the stability effects of genetic variation can, for the first time, be easily performed. This facilitates the interrogation of personal genetic variation for potentially pathogenic effects through the application of stability metrics. Here, we present a novel tool to prioritize variants predicted to cause strong instability in essential proteins. We show that by filtering by ΔΔG values and then prioritizing by StabilitySort Z-scores, we are able to more accurately discriminate pathogenic, protein-destabilizing mutations from population variation, compared with other mutation effect predictors. AVAILABILITY AND IMPLEMENTATION: StabilitySort is available as a web service (https://www.stabilitysort.org), as a data download for integration with other tools (https://www.stabilitysort.org/download) or can be deployed as a standalone system from source code (https://gitlab.com/baaron/StabilitySort). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Genoma , Software , Humanos , Proteínas/genética , Estabilidade Proteica , Mutação , Variação GenéticaRESUMO
The diversity of color vision systems found in extant vertebrates suggests that different evolutionary selection pressures have driven specializations in photoreceptor complement and visual pigment spectral tuning appropriate for an animal's behavior, habitat, and life history. Aquatic vertebrates in particular show high variability in chromatic vision and have become important models for understanding the role of color vision in prey detection, predator avoidance, and social interactions. In this study, we examined the capacity for chromatic vision in elasmobranch fishes, a group that have received relatively little attention to date. We used microspectrophotometry to measure the spectral absorbance of the visual pigments in the outer segments of individual photoreceptors from several ray and shark species, and we sequenced the opsin mRNAs obtained from the retinas of the same species, as well as from additional elasmobranch species. We reveal the phylogenetically widespread occurrence of dichromatic color vision in rays based on two cone opsins, RH2 and LWS. We also confirm that all shark species studied to date appear to be cone monochromats but report that in different species the single cone opsin may be of either the LWS or the RH2 class. From this, we infer that cone monochromacy in sharks has evolved independently on multiple occasions. Together with earlier discoveries in secondarily aquatic marine mammals, this suggests that cone-based color vision may be of little use for large marine predators, such as sharks, pinnipeds, and cetaceans.
Assuntos
Opsinas/genética , Opsinas/metabolismo , Retina/metabolismo , Tubarões/metabolismo , Rajidae/metabolismo , Animais , Visão de Cores , Proteínas de Peixes/genética , Proteínas de Peixes/metabolismo , Perfilação da Expressão Gênica , Microespectrofotometria , Filogenia , Células Fotorreceptoras Retinianas Cones/metabolismo , Análise de Sequência de RNA , Tubarões/genética , Rajidae/genéticaRESUMO
OBJECTIVE: To compare the frequency of rare variants in genes of the pathophysiologically relevant endosomal Toll-like receptor (eTLR) pathway and any quantifiable differences in variant rarity, predicted deleteriousness, or molecular proximity in patients with systemic lupus erythematosus (SLE) and healthy controls. PATIENTS AND METHODS: 65 genes associated with the eTLR pathway were identified by literature search and pathway analysis. Using next generation sequencing techniques, these were compared in two randomised cohorts of patients with SLE (n = 114 and n = 113) with 197 healthy controls. Genetically determined ethnicity was used to normalise minor allele frequencies (MAF) for the identified genetic variants and these were then compared by their frequency: rare (MAF < 0.005), uncommon (MAF 0.005-0.02), and common (MAF >0.02). This was compared to the results for 65 randomly selected genes. RESULTS: Patients with SLE are more likely to carry a rare nonsynonymous variant affecting proteins within the eTLR pathway than healthy controls. Furthermore, individuals with SLE are more likely to have multiple rare variants in this pathway. There were no differences in rarity, Combined Annotation Dependent Depletion (CADD) score, or molecular proximity for rare eTLR pathway variants. CONCLUSIONS: Rare non-synonymous variants are enriched in patients with SLE in the eTLR pathway. This supports the hypothesis that SLE arises from several rare variants of relatively large effect rather than many common variants of small effect.
Assuntos
Lúpus Eritematoso Sistêmico , Receptores Toll-Like , Endossomos/genética , Frequência do Gene , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Lúpus Eritematoso Sistêmico/genética , Mutação , Receptores Toll-Like/genéticaRESUMO
T-cell immunity requires extremely rapid clonal proliferation of rare, antigen-specific T lymphocytes to form effector cells. Here we identify a critical role for ETAA1 in this process by surveying random germ line mutations in mice using exome sequencing and bioinformatic annotation to prioritize mutations in genes of unknown function with potential effects on the immune system, followed by breeding to homozygosity and testing for immune system phenotypes. Effector CD8+ and CD4+ T-cell formation following immunization, lymphocytic choriomeningitis virus (LCMV) infection, or herpes simplex virus 1 (HSV1) infection was profoundly decreased despite normal immune cell development in adult mice homozygous for two different Etaa1 mutations: an exon 2 skipping allele that deletes Gly78-Leu119, and a Cys166Stop truncating allele that eliminates most of the 877-aa protein. ETAA1 deficiency decreased clonal expansion cell autonomously within the responding T cells, causing no decrease in their division rate but increasing TP53-induced mRNAs and phosphorylation of H2AX, a marker of DNA replication stress induced by the ATM and ATR kinases. Homozygous ETAA1-deficient adult mice were otherwise normal, healthy, and fertile, although slightly smaller, and homozygotes were born at lower frequency than expected, consistent with partial lethality after embryonic day 12. Taken together with recently reported evidence in human cancer cell lines that ETAA1 activates ATR kinase through an exon 2-encoded domain, these findings reveal a surprisingly specific requirement for this ATR activator in adult mice restricted to rapidly dividing effector T cells. This specific requirement may provide new ways to suppress pathological T-cell responses in transplantation or autoimmunity.
Assuntos
Antígenos de Superfície/imunologia , Linfócitos T CD4-Positivos/imunologia , Linfócitos T CD8-Positivos/imunologia , Divisão Celular/imunologia , Imunidade Celular , Mutação , Animais , Antígenos de Superfície/genética , Proteínas Mutadas de Ataxia Telangiectasia/genética , Proteínas Mutadas de Ataxia Telangiectasia/imunologia , Linfócitos T CD4-Positivos/patologia , Linfócitos T CD8-Positivos/patologia , Divisão Celular/genética , Herpes Simples/genética , Herpes Simples/imunologia , Herpes Simples/patologia , Herpesvirus Humano 1/genética , Herpesvirus Humano 1/imunologia , Coriomeningite Linfocítica/genética , Coriomeningite Linfocítica/imunologia , Coriomeningite Linfocítica/patologia , Vírus da Coriomeningite Linfocítica/genética , Vírus da Coriomeningite Linfocítica/imunologia , Camundongos , Camundongos Mutantes , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/imunologiaRESUMO
BACKGROUND: Short-read resequencing of genomes produces abundant information of the genetic variation of individuals. Due to their numerous nature, these variants are rarely exhaustively validated. Furthermore, low levels of undetected variant miscalling will have a systematic and disproportionate impact on the interpretation of individual genome sequence information, especially should these also be carried through into in reference databases of genomic variation. RESULTS: We find that sequence variation from short-read sequence data is subject to recurrent-yet-intermittent miscalling that occurs in a sequence intrinsic manner and is very sensitive to sequence read length. The miscalls arise from difficulties aligning short reads to redundant genomic regions, where the rate of sequencing error approaches the sequence diversity between redundant regions. We find the resultant miscalled variants to be sensitive to small sequence variations between genomes, and thereby are often intrinsic to an individual, pedigree, strain or human ethnic group. In human exome sequences, we identify 2-300 recurrent false positive variants per individual, almost all of which are present in public databases of human genomic variation. From the exomes of non-reference strains of inbred mice, we identify 3-5000 recurrent false positive variants per mouse - the number of which increasing with greater distance between an individual mouse strain and the reference C57BL6 mouse genome. We show that recurrently miscalled variants may be reproduced for a given genome from repeated simulation rounds of read resampling, realignment and recalling. As such, it is possible to identify more than two-thirds of false positive variation from only ten rounds of simulation. CONCLUSION: Identification and removal of recurrent false positive variants from specific individual variant sets will improve overall data quality. Variant miscalls arising are highly sequence intrinsic and are often specific to an individual, pedigree or ethnicity. Further, read length is a strong determinant of whether given false variants will be called for any given genome - which has profound significance for cohort studies that pool datasets collected and sequenced at different points in time.
Assuntos
Genômica/métodos , Mutação de Sentido Incorreto/genética , Análise de Sequência de DNA/métodos , Animais , Bases de Dados Genéticas , Etnicidade/genética , Reações Falso-Positivas , Humanos , Camundongos , Projetos de PesquisaRESUMO
The repeat region of the Plasmodium falciparum circumsporozoite protein (CSP) is a major vaccine antigen because it can be targeted by parasite neutralizing antibodies; however, little is known about this interaction. We used isothermal titration calorimetry, X-ray crystallography and mutagenesis-validated modeling to analyze the binding of a murine neutralizing antibody to Plasmodium falciparum CSP. Strikingly, we found that the repeat region of CSP is bound by multiple antibodies. This repeating pattern allows multiple weak interactions of single FAB domains to accumulate and yield a complex with a dissociation constant in the low nM range. Because the CSP protein can potentially cross-link multiple B cell receptors (BCRs) we hypothesized that the B cell response might be T cell independent. However, while there was a modest response in mice deficient in T cell help, the bulk of the response was T cell dependent. By sequencing the BCRs of CSP-repeat specific B cells in inbred mice we found that these cells underwent somatic hypermutation and affinity maturation indicative of a T-dependent response. Last, we found that the BCR repertoire of responding B cells was limited suggesting that the structural simplicity of the repeat may limit the breadth of the immune response.
Assuntos
Anticorpos Antiprotozoários/imunologia , Linfócitos B/imunologia , Vacinas Antimaláricas/imunologia , Plasmodium falciparum/imunologia , Proteínas de Protozoários/imunologia , Linfócitos T/imunologia , Animais , Afinidade de Anticorpos , Cristalografia por Raios X , Feminino , Humanos , Vacinas Antimaláricas/administração & dosagem , Vacinas Antimaláricas/genética , Masculino , Camundongos , Camundongos Endogâmicos BALB C , Camundongos Endogâmicos C57BL , Plasmodium falciparum/química , Plasmodium falciparum/genética , Proteínas de Protozoários/administração & dosagem , Proteínas de Protozoários/química , Proteínas de Protozoários/genéticaRESUMO
Upon publication of the original article [1], the authors had flagged that Fig. 1 had been published twice, as both Fig. 1 and Additional file 3.
RESUMO
Plants have developed complex defense mechanisms to protect themselves against pathogens. A wide-host-range fungus, Austropuccinia psidii, which has caused severe damage to ecosystems and plantations worldwide, is a major threat to Australian ecosystems dominated by members of the family Myrtaceae. In particular, the east coast wetland foundation tree species Melaleuca quinquenervia, appears to be variably susceptible to this pathogen. Understanding the molecular basis of host resistance would enable better management of this rust disease. We identified resistant and susceptible individuals of M. quinquenervia and explored their differential gene expression in order to discover the molecular basis of resistance against A. psidii. Rust screening of germplasm showed a varying degree of response, with fully resistant to highly susceptible individuals. We used transcriptome profiling in samples collected before and at 5 days postinoculation (dpi). Differential gene expression analysis showed that numerous defense-related genes were induced in susceptible plants at 5 dpi. Mapping reads against the A. psidii genome showed that only susceptible plants contained fungal-derived transcripts. Resistant plants exhibited an overexpression of candidate A. psidii resistance-related genes such as receptor-like kinases, nucleotide-binding site leucine-rich repeat proteins, glutathione S-transferases, WRKY transcriptional regulators, and pathogenesis-related proteins. We identified large differences in the expression of defense-related genes among resistant individuals.
Assuntos
Basidiomycota/fisiologia , Resistência à Doença/genética , Melaleuca/genética , Doenças das Plantas/imunologia , Transcriptoma , Austrália , Ecossistema , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Melaleuca/imunologia , Melaleuca/microbiologia , Modelos Biológicos , Doenças das Plantas/microbiologia , Folhas de Planta/genética , Folhas de Planta/imunologia , Folhas de Planta/microbiologia , Análise de Sequência de RNA , ÁrvoresRESUMO
BACKGROUND: Knowledge of plant secondary cell wall (SCW) regulation and deposition is mainly based on the Arabidopsis model of a 'typical' lignocellulosic SCW. However, SCWs in other plants can vary from this. The SCW of mature cotton seed fibres is highly cellulosic and lacks lignification whereas xylem SCWs are lignocellulosic. We used cotton as a model to study different SCWs and the expression of the genes involved in their formation via RNA deep sequencing and chemical analysis of stem and seed fibre. RESULTS: Transcriptome comparisons from cotton xylem and pith as well as from a developmental series of seed fibres revealed tissue-specific and developmentally regulated expression of several NAC transcription factors some of which are likely to be important as top tier regulators of SCW formation in xylem and/or seed fibre. A so far undescribed hierarchy was identified between the top tier NAC transcription factors SND1-like and NST1/2 in cotton. Key SCW MYB transcription factors, homologs of Arabidopsis MYB46/83, were practically absent in cotton stem xylem. Lack of expression of other lignin-specific MYBs in seed fibre relative to xylem could account for the lack of lignin deposition in seed fibre. Expression of a MYB103 homolog correlated with temporal expression of SCW CesAs and cellulose synthesis in seed fibres. FLAs were highly expressed and may be important structural components of seed fibre SCWs. Finally, we made the unexpected observation that cell walls in the pith of cotton stems contained lignin and had a higher S:G ratio than in xylem, despite that tissue's lacking many of the gene transcripts normally associated with lignin biosynthesis. CONCLUSIONS: Our study in cotton confirmed some features of the currently accepted gene regulatory cascade for 'typical' plant SCWs, but also revealed substantial differences, especially with key downstream NACs and MYBs. The lignocellulosic SCW of cotton xylem appears to be achieved differently from that in Arabidopsis. Pith cell walls in cotton stems are compositionally very different from that reported for other plant species, including Arabidopsis. The current definition of a 'typical' primary or secondary cell wall might not be applicable to all cell types in all plant species.
Assuntos
Parede Celular/metabolismo , Perfilação da Expressão Gênica , Gossypium/citologia , Gossypium/genética , Celulose/biossíntese , Gossypium/metabolismo , Especificidade de Órgãos , Caules de Planta/crescimento & desenvolvimento , Caules de Planta/metabolismo , Propanóis/metabolismo , Fatores de Transcrição/metabolismo , Transcrição GênicaRESUMO
We applied high-throughput sequencing to eye tissue from several species of basal vertebrates (a hagfish, two species of lamprey, and five species of gnathostome fish), and we analyzed the mRNA sequences for the proteins underlying activation of the phototransduction cascade. The molecular phylogenies that we constructed from these sequences are consistent with the 2R WGD model of two rounds of whole genome duplication. Our analysis suggests that agnathans retain an additional representative (that has been lost in gnathostomes) in each of the gene families we studied; the evidence is strong for the G-protein α subunit (GNAT) and the cGMP phosphodiesterase (PDE6), and indicative for the cyclic nucleotide-gated channels (CNGA and CNGB). Two of the species (the hagfish Eptatretus cirrhatus and the lamprey Mordacia mordax) possess only a single class of photoreceptor, simplifying deductions about the composition of cascade protein isoforms utilized in their photoreceptors. For the other lamprey, Geotria australis, analysis of the ratios of transcript levels in downstream and upstream migrant animals permits tentative conclusions to be drawn about the isoforms used in four of the five spectral classes of photoreceptor. Overall, our results suggest that agnathan rod-like photoreceptors utilize the same GNAT1 as gnathostomes, together with a homodimeric PDE6 that may be agnathan-specific, whereas agnathan cone-like photoreceptors utilize a GNAT that may be agnathan-specific, together with the same PDE6C as gnathostomes. These findings help elucidate the evolution of the vertebrate phototransduction cascade from an ancestral chordate phototransduction cascade that existed prior to the vertebrate radiation.
Assuntos
Peixes/genética , Transdução de Sinal Luminoso/genética , Animais , Evolução Biológica , Evolução Molecular , Olho/metabolismo , Subunidades alfa de Proteínas de Ligação ao GTP/genética , Subunidades alfa de Proteínas de Ligação ao GTP/metabolismo , Genoma , Glucosídeos/genética , Glucosídeos/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala , Lampreias/genética , Fenóis/metabolismo , Filogenia , Células Fotorreceptoras Retinianas Cones/metabolismo , Células Fotorreceptoras Retinianas Cones/fisiologiaRESUMO
Datasets of hundreds or thousands of SNPs (Single Nucleotide Polymorphisms) from multiple individuals per species are increasingly used to study population structure, species delimitation and shallow phylogenetics. The principal software tool to infer species or population trees from SNP data is currently the BEAST template SNAPP which uses a Bayesian coalescent analysis. However, it is computationally extremely demanding and tolerates only small amounts of missing data. We used simulated and empirical SNPs from plants (Australian Craspedia, Asteraceae, and Pelargonium, Geraniaceae) to compare species trees produced (1) by SNAPP, (2) using SVD quartets, and (3) using Bayesian and parsimony analysis with several different approaches to summarising data from multiple samples into one set of traits per species. Our aims were to explore the impact of tree topology and missing data on the results, and to test which data summarising and analyses approaches would best approximate the results obtained from SNAPP for empirical data. SVD quartets retrieved the correct topology from simulated data, as did SNAPP except in the case of a very unbalanced phylogeny. Both methods failed to retrieve the correct topology when large amounts of data were missing. Bayesian analysis of species level summary data scoring the two alleles of each SNP as independent characters and parsimony analysis of data scoring each SNP as one character produced trees with branch length distributions closest to the true trees on which SNPs were simulated. For empirical data, Bayesian inference and Dollo parsimony analysis of data scored allele-wise produced phylogenies most congruent with the results of SNAPP. In the case of study groups divergent enough for missing data to be phylogenetically informative (because of additional mutations preventing amplification of genomic fragments or bioinformatic establishment of homology), scoring of SNP data as a presence/absence matrix irrespective of allele content might be an additional option. As this depends on sampling across species being reasonably even and a random distribution of non-informative instances of missing data, however, further exploration of this approach is needed. Properly chosen data summary approaches to inferring species trees from SNP data may represent a potential alternative to currently available individual-level coalescent analyses especially for quick data exploration and when dealing with computationally demanding or patchy datasets.
Assuntos
Simulação por Computador , Filogenia , Polimorfismo de Nucleotídeo Único/genética , Teorema de Bayes , Software , Especificidade da EspécieRESUMO
PREMISE OF THE STUDY: The Cape Floristic Region (CFR) of South Africa is renowned for its botanical diversity, but the evolutionary origins of this diversity remain controversial. Both neutral and adaptive processes have been implicated in driving diversification, but population-level studies of plants in the CFR are rare. Here, we investigate the limits to gene flow and potential environmental drivers of selection in Protea repens L. (Proteaceae L.), a widespread CFR species. METHODS: We sampled 19 populations across the range of P. repens and used genotyping by sequencing to identify 2066 polymorphic loci in 663 individuals. We used a Bayesian FST outlier analysis to identify single-nucleotide polymorphisms (SNPs) marking genomic regions that may be under selection; we used those SNPs to identify potential drivers of selection and excluded them from analyses of gene flow and genetic structure. RESULTS: A pattern of isolation by distance suggested limited gene flow between nearby populations. The populations of P. repens fell naturally into two or three groupings, which corresponded to an east-west split. Differences in rainfall seasonality contributed to diversification in highly divergent loci, as do barriers to gene flow that have been identified in other species. CONCLUSIONS: The strong pattern of isolation by distance is in contrast to the findings in the only other widespread species in the CFR that has been similarly studied, while the effects of rainfall seasonality are consistent with well-known patterns. Assessing the generality of these results will require investigations of other CFR species.
Assuntos
Meio Ambiente , Genótipo , Proteaceae/classificação , Isolamento Reprodutivo , Teorema de Bayes , Fluxo Gênico , Genética Populacional , Proteaceae/genética , África do SulRESUMO
Cisplatin is an effective anticancer drug; however, cisplatin use often leads to nephrotoxicity, which limits its clinical effectiveness. In this study, we determined the effect of dichloroacetate, a novel anticancer agent, in a mouse model of cisplatin-induced AKI. Pretreatment with dichloroacetate significantly attenuated the cisplatin-induced increase in BUN and serum creatinine levels, renal tubular apoptosis, and oxidative stress. Additionally, pretreatment with dichloroacetate accelerated tubular regeneration after cisplatin-induced renal damage. Whole transcriptome sequencing revealed that dichloroacetate prevented mitochondrial dysfunction and preserved the energy-generating capacity of the kidneys by preventing the cisplatin-induced downregulation of fatty acid and glucose oxidation, and of genes involved in the Krebs cycle and oxidative phosphorylation. Notably, dichloroacetate did not interfere with the anticancer activity of cisplatin in vivo. These data provide strong evidence that dichloroacetate preserves renal function when used in conjunction with cisplatin.
Assuntos
Antineoplásicos/efeitos adversos , Cisplatino/efeitos adversos , Ácido Dicloroacético/uso terapêutico , Nefropatias/induzido quimicamente , Nefropatias/prevenção & controle , Animais , Antineoplásicos/uso terapêutico , Cisplatino/uso terapêutico , Feminino , Masculino , Camundongos , Camundongos Endogâmicos BALB CRESUMO
BACKGROUND: The advent of genotyping by Next Generation Sequencing has enabled rapid discovery of thousands of single nucleotide polymorphism (SNP) markers and high throughput genotyping of large populations at an affordable cost. Genotyping by sequencing (GBS), a reduced representation library sequencing method, allows highly multiplexed sequencing of genomic subsets. This method has limitations for small organisms with low amounts of genomic DNA, such as the bluetongue virus (BTV) vectors, Culicoides midges. RESULTS: This study employed the GBS method to isolate SNP markers de novo from whole genome amplified Culicoides brevitarsis genomic DNA. The individuals were collected from regions representing two different Australian patterns of BTV strain distribution: the Northern Territory (NT) and the east coast. We isolated 8145 SNPs using GBS. Phylogenetic analysis conducted using the filtered 3263 SNPs revealed the presence of a distinct C. brevitarsis sub-population in the NT and this was confirmed by analysis of mitochondrial DNA. Two loci showed a very strong signal for selection and were unique to the NT population. Bayesian analysis with STRUCTURE indicated a possible two-population cluster. CONCLUSIONS: The results suggest that genotyping vectors with high density markers in combination with biological and environmental data is useful. However, more extensive sampling over a wider spatial and temporal range is needed. The presence of sub-structure in populations and loci under natural selection indicates the need for further investigation of the role of vectors in shaping the two Australian systems of BTV transmission. The described workflow is transferable to genotyping of small, non-model organisms, including arthropod vectors of pathogens of economic and medical importance.
Assuntos
Ceratopogonidae/genética , Genética Populacional , Genoma de Inseto , Genômica , Genótipo , Animais , Teorema de Bayes , Complexo IV da Cadeia de Transporte de Elétrons/genética , Biblioteca Gênica , Genes Mitocondriais , Genômica/métodos , Haplótipos , Northern Territory , Filogenia , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Seleção GenéticaRESUMO
Genomewide SNP data generated by nontargeted methods such as RAD and GBS are increasingly being used in phylogenetic and phylogeographic analyses. When these methods are used in the absence of a reference genome, however, little is known about the locations and evolution of the SNPs. In using such data to address phylogenetic questions, researchers risk drawing false conclusions, particularly if a representative number of SNPs is not obtained. Here, we empirically test the robustness of phylogenetic inference based on SNP data for closely related lineages. We conducted a genomewide analysis of 75 712 SNPs, generated via GBS, of southern bull-kelp (Durvillaea). Durvillaea chathamensis co-occurs with D. antarctica on Chatham Island, but the two species have previously been found to be so genetically similar that the status of the former has been questioned. Our results show that D. chathamensis, which differs from D. antarctica ecologically as well as morphologically, is indeed a reproductively isolated species. Furthermore, our replicated analyses show that D. chathamensis cannot be reliably distinguished phylogenetically from closely related D. antarctica using subsets (ranging in size from 400 to 10 000 sites) of the 40 912 parsimony-informative SNPs in our data set and that bootstrap values alone can give misleading impressions of the strength of phylogenetic inferences. These results highlight the importance of independently replicating SNP analyses to verify that phylogenetic inferences based on nontargeted SNP data are robust. Our study also demonstrates that modern genomic approaches can be used to identify cases of recent or incipient speciation that traditional approaches (e.g. Sanger sequencing of a few loci) may be unable to detect or resolve.
Assuntos
Kelp/genética , Filogenia , Polimorfismo de Nucleotídeo Único , Especiação Genética , Genômica , Análise de Sequência de DNARESUMO
Species delimitation has seen a paradigm shift as increasing accessibility of genomic-scale data enables separation of lineages with convergent morphological traits and the merging of recently diverged ecotypes that have distinguishing characteristics. We inferred the process of lineage formation among Australian species in the widespread and highly variable genus Pelargonium by combining phylogenomic and population genomic analyses along with breeding system studies and character analysis. Phylogenomic analysis and population genetic clustering supported seven of the eight currently described species but provided little evidence for differences in genetic structure within the most widely distributed group that containing P. australe. In contrast, morphometric analysis detected three deep lineages within Australian Pelargonium; with P. australe consisting of five previously unrecognized entities occupying separate geographic ranges. The genomic approach enabled elucidation of parallel evolution in some traits formerly used to delineate species, as well as identification of ecotypic morphological differentiation within recognized species. Highly variable morphology and trait convergence each contribute to the discordance between phylogenomic relationships and morphological taxonomy. Data suggest that genetic divergence among species within the Australian Pelargonium may result from allopatric speciation while morphological differentiation within and among species may be more strongly driven by environmental differences.
Assuntos
Evolução Biológica , Genética Populacional , Pelargonium/classificação , Filogenia , Austrália , GenótipoRESUMO
BACKGROUND: Multiple autoimmune syndrome (MAS), an extreme phenotype of autoimmune disorders, is a very well suited trait to tackle genomic variants of these conditions. Whole exome sequencing (WES) is a widely used strategy for detection of protein coding and splicing variants associated with inherited diseases. METHODS: The DNA of eight patients affected by MAS [all of whom presenting with Sjögren's syndrome (SS)], four patients affected by SS alone and 38 unaffected individuals, were subject to WES. Filters to identify novel and rare functional (pathogenic-deleterious) homozygous and/or compound heterozygous variants in these patients and controls were applied. Bioinformatics tools such as the Human gene connectome as well as pathway and network analysis were applied to test overrepresentation of genes harbouring these variants in critical pathways and networks involved in autoimmunity. RESULTS: Eleven novel and rare functional variants were identified in cases but not in controls, harboured in: MACF1, KIAA0754, DUSP12, ICA1, CELA1, LRP1/STAT6, GRIN3B, ANKLE1, TMEM161A, and FKRP. These were subsequently subject to network analysis and their functional relatedness to genes already associated with autoimmunity was evaluated. Notably, the LRP1/STAT6 novel mutation was homozygous in one MAS affected patient and heterozygous in another. LRP1/STAT6 disclosed the strongest plausibility for autoimmunity. LRP1/STAT6 are involved in extracellular and intracellular anti-inflammatory pathways that play key roles in maintaining the homeostasis of the immune system. Further; networks, pathways, and interaction analyses showed that LRP1 is functionally related to the HLA-B and IL10 genes and it has a substantial impact within immunological pathways and/or reaction to bacterial and other foreign proteins (phagocytosis, regulation of phospholipase A2 activity, negative regulation of apoptosis and response to lipopolysaccharides). Further, ICA1 and STAT6 were also closely related to AIRE and IRF5, two very well known autoimmunity genes. CONCLUSIONS: Novel and rare exonic mutations that may account for autoimmunity were identified. Among those, the LRP1/STAT6 novel mutation has the strongest case for being categorised as potentially causative of MAS given the presence of intriguing patterns of functional interaction with other major genes shaping autoimmunity.
Assuntos
Predisposição Genética para Doença , Genoma Humano , Mutação/genética , Síndrome de Sjogren/genética , Adulto , Idoso , Autoimunidade/genética , Sequência de Bases , Estudos de Casos e Controles , Conectoma , Feminino , Redes Reguladoras de Genes , Humanos , Pessoa de Meia-Idade , FenótipoRESUMO
Systemic lupus erythematosus (SLE) is a heterogeneous autoimmune disease with a clear genetic component. While most SLE patients carry rare gene variants in lupus risk genes, little is known about their contribution to disease pathogenesis. Amongst them, SH2B3-a negative regulator of cytokine and growth factor receptor signaling-harbors rare coding variants in over 5% of SLE patients. Here, we show that unlike the variant found exclusively in healthy controls, SH2B3 rare variants found in lupus patients are predominantly hypomorphic alleles, failing to suppress IFNGR signaling via JAK2-STAT1. The generation of two mouse lines carrying patients' variants revealed that SH2B3 is important in limiting the number of immature and transitional B cells. Furthermore, hypomorphic SH2B3 was shown to impair the negative selection of immature/transitional self-reactive B cells and accelerate autoimmunity in sensitized mice, at least in part due to increased IL-4R signaling and BAFF-R expression. This work identifies a previously unappreciated role for SH2B3 in human B cell tolerance and lupus risk.
Assuntos
Autoimunidade , Lúpus Eritematoso Sistêmico , Animais , Humanos , Camundongos , Autoimunidade/genética , Fator Ativador de Células B/metabolismo , Linfócitos B , Lúpus Eritematoso Sistêmico/genética , Células Precursoras de Linfócitos BRESUMO
Background and Objectives: Chronic kidney disease progression to ESKD is associated with a marked increase in mortality and morbidity. Its progression is highly variable and difficult to predict. Methods: This is an observational, retrospective, single-centre study. The cohort was patients attending hospital and nephrology clinic at The Canberra Hospital from September 1996 to March 2018. Demographic data, vital signs, kidney function test, proteinuria, and serum glucose were extracted. The model was trained on the featurised time series data with XGBoost. Its performance was compared against six nephrologists and the Kidney Failure Risk Equation (KFRE). Results: A total of 12,371 patients were included, with 2,388 were found to have an adequate density (three eGFR data points in the first 2 years) for subsequent analysis. Patients were divided into 80%/20% ratio for training and testing datasets.ML model had superior performance than nephrologist in predicting ESKD within 2 years with 93.9% accuracy, 60% sensitivity, 97.7% specificity, 75% positive predictive value. The ML model was superior in all performance metrics to the KFRE 4- and 8-variable models.eGFR and glucose were found to be highly contributing to the ESKD prediction performance. Conclusions: The computational predictions had higher accuracy, specificity and positive predictive value, which indicates the potential integration into clinical workflows for decision support.
RESUMO
Systemic lupus erythematosus (SLE) is the prototypic systemic autoimmune disease. It is thought that many common variant gene loci of weak effect act additively to predispose to common autoimmune diseases, while the contribution of rare variants remains unclear. Here we describe that rare coding variants in lupus-risk genes are present in most SLE patients and healthy controls. We demonstrate the functional consequences of rare and low frequency missense variants in the interacting proteins BLK and BANK1, which are present alone, or in combination, in a substantial proportion of lupus patients. The rare variants found in patients, but not those found exclusively in controls, impair suppression of IRF5 and type-I IFN in human B cell lines and increase pathogenic lymphocytes in lupus-prone mice. Thus, rare gene variants are common in SLE and likely contribute to genetic risk.