RESUMO
Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics have generated genetic variants at an unprecedented scale. However, efficient tools and resources are needed to link disparate data types-to 'map' variants onto protein structures, to better understand how the variation causes disease, and thereby design therapeutics. Here we present the Genomics 2 Proteins portal ( https://g2p.broadinstitute.org/ ): a human proteome-wide resource that maps 20,076,998 genetic variants onto 42,413 protein sequences and 77,923 structures, with a comprehensive set of structural and functional features. Additionally, the Genomics 2 Proteins portal allows users to interactively upload protein residue-wise annotations (for example, variants and scores) as well as the protein structure beyond databases to establish the connection between genomics to proteins. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotypes.
Assuntos
Bases de Dados de Proteínas , Genômica , Humanos , Genômica/métodos , Proteínas/genética , Proteínas/química , Proteoma/genética , Conformação Proteica , Software , Testes Genéticos/métodos , Variação Genética , Sequência de AminoácidosRESUMO
Multiple layers of regulation modulate the activity and localization of protein kinases. However, many details of kinase regulation remain incompletely understood. Here, we apply saturation mutagenesis and a chemical genetic method for allosterically modulating kinase global conformation to Src kinase, providing insight into known regulatory mechanisms and revealing a previously undiscovered interaction between Src's SH4 and catalytic domains. Abrogation of this interaction increased phosphotransferase activity, promoted membrane association, and provoked phosphotransferase-independent alterations in cell morphology. Thus, Src's SH4 domain serves as an intramolecular regulator coupling catalytic activity, global conformation, and localization, as well as mediating a phosphotransferase-independent function. Sequence conservation suggests that the SH4 domain regulatory interaction exists in other Src-family kinases. Our combined approach's ability to reveal a regulatory mechanism in one of the best-studied kinases suggests that it could be applied broadly to provide insight into kinase structure, regulation, and function.
Assuntos
Domínio Catalítico/genética , Mutagênese/genética , Conformação Proteica , Quinases da Família src/química , Regulação Alostérica/genética , Membrana Celular/química , Membrana Celular/enzimologia , Células HEK293 , Humanos , Fosforilação , Quinases da Família src/genéticaRESUMO
Magmatic iron-meteorite parent bodies are the earliest planetesimals in the Solar System, and they preserve information about conditions and planet-forming processes in the solar nebula. In this study, we include comprehensive elemental compositions and fractional-crystallization modeling for iron meteorites from the cores of five differentiated asteroids from the inner Solar System. Together with previous results of metallic cores from the outer Solar System, we conclude that asteroidal cores from the outer Solar System have smaller sizes, elevated siderophile-element abundances, and simpler crystallization processes than those from the inner Solar System. These differences are related to the formation locations of the parent asteroids because the solar protoplanetary disk varied in redox conditions, elemental distributions, and dynamics at different heliocentric distances. Using highly siderophile-element data from iron meteorites, we reconstruct the distribution of calcium-aluminum-rich inclusions (CAIs) across the protoplanetary disk within the first million years of Solar-System history. CAIs, the first solids to condense in the Solar System, formed close to the Sun. They were, however, concentrated within the outer disk and depleted within the inner disk. Future models of the structure and evolution of the protoplanetary disk should account for this distribution pattern of CAIs.
RESUMO
Cellular omics such as single-cell genomics, proteomics, and microbiomics allow the characterization of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to revealing markers of disease progression, such as cancer and pathogen infection. A dedicated statistical method for differential variability analysis is lacking for cellular omics data, and existing methods for differential composition analysis do not model some compositional data properties, suggesting there is room to improve model performance. Here, we introduce sccomp, a method for differential composition and variability analyses that jointly models data count distribution, compositionality, group-specific variability, and proportion mean-variability association, being aware of outliers. sccomp provides a comprehensive analysis framework that offers realistic data simulation and cross-study knowledge transfer. Here, we demonstrate that mean-variability association is ubiquitous across technologies, highlighting the inadequacy of the very popular Dirichlet-multinomial distribution. We show that sccomp accurately fits experimental data, significantly improving performance over state-of-the-art algorithms. Using sccomp, we identified differential constraints and composition in the microenvironment of primary breast cancer.
Assuntos
Genômica , Microbiota , Proteômica/métodos , Simulação por Computador , AlgoritmosRESUMO
Clinical interpretation of missense variants is challenging because the majority identified by genetic testing are rare and their functional effects are unknown. Consequently, most variants are of uncertain significance and cannot be used for clinical diagnosis or management. Although not much can be done to ameliorate variant rarity, multiplexed assays of variant effect (MAVEs), where thousands of single-nucleotide variant effects are simultaneously measured experimentally, provide functional evidence that can help resolve variants of unknown significance (VUSs). However, a rigorous assessment of the clinical value of multiplexed functional data for variant interpretation is lacking. Thus, we systematically combined previously published BRCA1, TP53, and PTEN multiplexed functional data with phenotype and family history data for 324 VUSs identified by a single diagnostic testing laboratory. We curated 49,281 variant functional scores from MAVEs for these three genes and integrated four different TP53 multiplexed functional datasets into a single functional prediction for each variant by using machine learning. We then determined the strength of evidence provided by each multiplexed functional dataset and reevaluated 324 VUSs. Multiplexed functional data were effective in driving variant reclassification when combined with clinical data, eliminating 49% of VUSs for BRCA1, 69% for TP53, and 15% for PTEN. Thus, multiplexed functional data, which are being generated for numerous genes, are poised to have a major impact on clinical variant interpretation.
Assuntos
Proteína BRCA1/genética , Testes Genéticos , Mutação de Sentido Incorreto , PTEN Fosfo-Hidrolase/genética , Proteína Supressora de Tumor p53/genética , Adulto , Coleta de Dados , Conjuntos de Dados como Assunto , Estudos de Associação Genética , Humanos , Anamnese , Fenótipo , Valor Preditivo dos TestesRESUMO
The distribution of fitness effects of mutation plays a central role in constraining protein evolution. The underlying mechanisms by which mutations lead to fitness effects are typically attributed to changes in protein specific activity or abundance. Here, we reveal the importance of a mutation's collateral fitness effects, which we define as effects that do not derive from changes in the protein's ability to perform its physiological function. We comprehensively measured the collateral fitness effects of missense mutations in the Escherichia coli TEM-1 ß-lactamase antibiotic resistance gene using growth competition experiments in the absence of antibiotic. At least 42% of missense mutations in TEM-1 were deleterious, indicating that for some proteins collateral fitness effects occur as frequently as effects on protein activity and abundance. Deleterious mutations caused improper posttranslational processing, incorrect disulfide-bond formation, protein aggregation, changes in gene expression, and pleiotropic effects on cell phenotype. Deleterious collateral fitness effects occurred more frequently in TEM-1 than deleterious effects on antibiotic resistance in environments with low concentrations of the antibiotic. The surprising prevalence of deleterious collateral fitness effects suggests they may play a role in constraining protein evolution, particularly for highly expressed proteins, for proteins under intermittent selection for their physiological function, and for proteins whose contribution to fitness is buffered against deleterious effects on protein activity and protein abundance.
Assuntos
Evolução Molecular , Aptidão Genética/genética , Mutação de Sentido Incorreto/genética , Mutação de Sentido Incorreto/fisiologia , Escherichia coli/enzimologia , Escherichia coli/genética , Escherichia coli/fisiologia , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , beta-Lactamases/química , beta-Lactamases/genética , beta-Lactamases/metabolismoRESUMO
Immune-stimulatory ligands, such as major histocompatibility complex molecules and the T-cell costimulatory ligand CD86, are central to productive immunity. Endogenous mammalian membrane-associated RING-CHs (MARCH) act on these and other targets to regulate antigen presentation and activation of adaptive immunity, whereas virus-encoded homologs target the same molecules to evade immune responses. Substrate specificity is encoded in or near the membrane-embedded domains of MARCHs and the proteins they regulate, but the exact sequences that distinguish substrates from nonsubstrates are poorly understood. Here, we examined the requirements for recognition of the costimulatory ligand CD86 by two different MARCH-family proteins, human MARCH1 and Kaposi's sarcoma herpesvirus modulator of immune recognition 2 (MIR2), using deep mutational scanning. We identified a highly specific recognition surface in the hydrophobic core of the CD86 transmembrane (TM) domain (TMD) that is required for recognition by MARCH1 and prominently features a proline at position 254. In contrast, MIR2 requires no specific sequences in the CD86 TMD but relies primarily on an aspartic acid at position 244 in the CD86 extracellular juxtamembrane region. Surprisingly, MIR2 recognized CD86 with a TMD composed entirely of valine, whereas many different single amino acid substitutions in the context of the native TM sequence conferred MIR2 resistance. These results show that the human and viral proteins evolved completely different recognition modes for the same substrate. That some TM sequences are incompatible with MIR2 activity, even when no specific recognition motif is required, suggests a more complicated mechanism of immune modulation via CD86 than was previously appreciated.
Assuntos
Antígeno B7-2/química , Ubiquitina-Proteína Ligases/metabolismo , Proteínas Virais/metabolismo , Antígeno B7-2/genética , Antígeno B7-2/metabolismo , Membrana Celular/metabolismo , Regulação para Baixo , Células HEK293 , Células HeLa , Humanos , Mutação , Domínios Proteicos , Transporte ProteicoRESUMO
SUMMARY: Multiplexed assays of variant effect (MAVEs) are capable of experimentally testing all possible single nucleotide or amino acid variants in selected genomic regions, generating 'variant effect maps', which provide biochemical insight and functional evidence to enable more rapid and accurate clinical interpretation of human variation. Because the international community applying MAVE approaches is growing rapidly, we developed the online MaveRegistry platform to catalyze collaboration, reduce redundant efforts, allow stakeholders to nominate targets and enable tracking and sharing of progress on ongoing MAVE projects. AVAILABILITY AND IMPLEMENTATION: MaveRegistry service: https://registry.varianteffect.org. MaveRegistry source code: https://github.com/kvnkuang/maveregistry-front-end.
RESUMO
The single transmembrane domain (TMD) of the human thrombopoietin receptor (TpoR/myeloproliferative leukemia [MPL] protein), encoded by exon 10 of the MPL gene, is a hotspot for somatic mutations associated with myeloproliferative neoplasms (MPNs). Approximately 6% and 14% of JAK2 V617F- essential thrombocythemia and primary myelofibrosis patients, respectively, have "canonical" MPL exon 10 driver mutations W515L/K/R/A or S505N, which generate constitutively active receptors and consequent loss of Tpo dependence. Other "noncanonical" MPL exon 10 mutations have also been identified in patients, both alone and in combination with canonical mutations, but, in almost all cases, their functional consequences and relevance to disease are unknown. Here, we used a deep mutational scanning approach to evaluate all possible single amino acid substitutions in the human TpoR TMD for their ability to confer cytokine-independent growth in Ba/F3 cells. We identified all currently recognized driver mutations and 7 novel mutations that cause constitutive TpoR activation, and a much larger number of second-site mutations that enhance S505N-driven activation. We found examples of both of these categories in published and previously unpublished MPL exon 10 sequencing data from MPN patients, demonstrating that some, if not all, of the new mutations reported here represent likely drivers or modifiers of myeloproliferative disease.
Assuntos
Substituição de Aminoácidos , Transtornos Mieloproliferativos/genética , Receptores de Trombopoetina/genética , Animais , Linhagem Celular , Éxons , Humanos , Camundongos , Modelos Moleculares , Mutação , Domínios Proteicos , Receptores de Trombopoetina/químicaRESUMO
Bromodomain and extra terminal protein (BET) inhibitors are first-in-class targeted therapies that deliver a new therapeutic opportunity by directly targeting bromodomain proteins that bind acetylated chromatin marks. Early clinical trials have shown promise, especially in acute myeloid leukaemia, and therefore the evaluation of resistance mechanisms is crucial to optimize the clinical efficacy of these drugs. Here we use primary mouse haematopoietic stem and progenitor cells immortalized with the fusion protein MLL-AF9 to generate several single-cell clones that demonstrate resistance, in vitro and in vivo, to the prototypical BET inhibitor, I-BET. Resistance to I-BET confers cross-resistance to chemically distinct BET inhibitors such as JQ1, as well as resistance to genetic knockdown of BET proteins. Resistance is not mediated through increased drug efflux or metabolism, but is shown to emerge from leukaemia stem cells both ex vivo and in vivo. Chromatin-bound BRD4 is globally reduced in resistant cells, whereas the expression of key target genes such as Myc remains unaltered, highlighting the existence of alternative mechanisms to regulate transcription. We demonstrate that resistance to BET inhibitors, in human and mouse leukaemia cells, is in part a consequence of increased Wnt/ß-catenin signalling, and negative regulation of this pathway results in restoration of sensitivity to I-BET in vitro and in vivo. Together, these findings provide new insights into the biology of acute myeloid leukaemia, highlight potential therapeutic limitations of BET inhibitors, and identify strategies that may enhance the clinical utility of these unique targeted therapies.
Assuntos
Benzodiazepinas/farmacologia , Resistencia a Medicamentos Antineoplásicos/efeitos dos fármacos , Leucemia Mieloide Aguda/tratamento farmacológico , Leucemia Mieloide Aguda/metabolismo , Células-Tronco Neoplásicas/efeitos dos fármacos , Células-Tronco Neoplásicas/patologia , Proteínas Nucleares/antagonistas & inibidores , Fatores de Transcrição/antagonistas & inibidores , Animais , Azepinas/farmacologia , Proteínas de Ciclo Celular , Linhagem Celular Tumoral , Células Cultivadas , Cromatina/metabolismo , Células Clonais/efeitos dos fármacos , Células Clonais/metabolismo , Células Clonais/patologia , Resistencia a Medicamentos Antineoplásicos/genética , Epigênese Genética , Regulação Neoplásica da Expressão Gênica/efeitos dos fármacos , Genes myc/genética , Células-Tronco Hematopoéticas/citologia , Células-Tronco Hematopoéticas/efeitos dos fármacos , Células-Tronco Hematopoéticas/metabolismo , Humanos , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/patologia , Camundongos , Terapia de Alvo Molecular , Células-Tronco Neoplásicas/metabolismo , Proteínas Nucleares/metabolismo , Fatores de Transcrição/metabolismo , Transcrição Gênica/efeitos dos fármacos , Triazóis/farmacologia , Via de Sinalização Wnt/efeitos dos fármacos , beta Catenina/metabolismoRESUMO
Devil facial tumour disease (DFTD) comprises two genetically distinct transmissible cancers (DFT1 and DFT2) endangering the survival of the Tasmanian devil (Sarcophilus harrisii) in the wild. DFT1 first arose from a cell of the Schwann cell lineage; however, the tissue-of-origin of the recently discovered DFT2 cancer is unknown. In this study, we compared the transcriptome and proteome of DFT2 tumours to DFT1 and normal Tasmanian devil tissues to determine the tissue-of-origin of the DFT2 cancer. Our findings demonstrate that DFT2 expresses a range of Schwann cell markers and exhibits expression patterns consistent with a similar origin to the DFT1 cancer. Furthermore, DFT2 cells express genes associated with the repair response to peripheral nerve damage. These findings suggest that devils may be predisposed to transmissible cancers of Schwann cell origin. The combined effect of factors such as frequent nerve damage from biting, Schwann cell plasticity and low genetic diversity may allow these cancers to develop on rare occasions. The emergence of two independent transmissible cancers from the same tissue in the Tasmanian devil presents an unprecedented opportunity to gain insight into cancer development, evolution and immune evasion in mammalian species.
Assuntos
Biomarcadores Tumorais/metabolismo , Neoplasias Faciais/veterinária , Marsupiais/fisiologia , Proteoma/análise , Células de Schwann/patologia , Transcriptoma , Animais , Biomarcadores Tumorais/genética , Neoplasias Faciais/genética , Neoplasias Faciais/metabolismo , Neoplasias Faciais/patologia , Humanos , Células de Schwann/metabolismoRESUMO
Palliative care initiatives are needed in nephrology, yet implementation is lacking. We created a 6-hour workshop to teach the skills of active listening, responding to emotion, and exploring goals and values to nurses and social workers working in dialysis units. The workshop consisted of interactive didactics and structured role play with trained simulated patients. We assessed preparedness using a Likert scale and utilized paired t tests to measure the impact using a self-assessment survey following the training. Ten nurses and two social workers from six dialysis units completed the training. Mean scores improved in all domains: demonstrating empathic behaviors, responding to emotion and end-of-life concerns, eliciting family's concerns at end-of-life and patient's goals, and discussing spiritual concerns. Further testing in larger samples may help to confirm these results.
Assuntos
Planejamento Antecipado de Cuidados , Cuidados Paliativos , Comunicação , Humanos , Diálise Renal , Assistentes SociaisRESUMO
OBJECTIVE: Few studies have examined the impact of a single clinical evidence technology (CET) on provider practice or patient outcomes from the provider's perspective. A previous cluster-randomized controlled trial with patient-reported data tested the effectiveness of a CET (i.e., VisualDx) in improving skin problem outcomes but found no significant effect. The objectives of this follow-up study were to identify barriers and facilitators to the use of the CET from the perspective of primary care providers (PCPs) and to identify reasons why the CET did not affect outcomes in the trial. METHODS: Using a convergent mixed methods design, the authors had PCPs complete a post-trial survey and participate in interviews about using the CET for managing patients' skin problems. Data from both methods were integrated. RESULTS: PCPs found the CET somewhat easy to use but only occasionally useful. Less experienced PCPs used the CET more frequently. Data from interviews revealed barriers and facilitators at four steps of evidence-based practice: clinical question recognition, information acquisition, appraisal of relevance, and application with patients. Facilitators included uncertainty in dermatology, intention for use, convenience of access, diagnosis and treatment support, and patient communication. Barriers included confidence in dermatology, preference for other sources, interface difficulties, presence of irrelevant information, and lack of decision impact. CONCLUSION: PCPs found the CET useful for diagnosis, treatment support, and patient communication. However, the barriers of interface difficulties, irrelevant search results, and preferred use of other sources limited its positive impact on patient skin problem management.
Assuntos
Atitude do Pessoal de Saúde , Medicina Baseada em Evidências/instrumentação , Atenção Primária à Saúde/métodos , Dermatopatias/terapia , Feminino , Seguimentos , Humanos , Masculino , Inquéritos e QuestionáriosRESUMO
PURPOSE: Despite most modern colour spaces treating colour as three-dimensional (3-D), colour data is usually not visualised in 3-D (and two-dimensional (2-D) projection-plane segments and multiple 2-D perspective views are used instead). The objectives of this article are firstly, to introduce a truly 3-D percept of colour space using stereo-pairs, secondly to view colour discrimination data using that platform, and thirdly to apply formal statistics and multivariate methods to analyse the data in 3-D. This is the first demonstration of the software that generated stereo-pairs of RGB colour space, as well as of a new computerised procedure that investigated colour discrimination by measuring colour just noticeable differences (JND). METHODS: An initial pilot study and thorough investigation of instrument repeatability were performed. Thereafter, to demonstrate the capabilities of the software, five colour-normal and one colour-deficient subject were examined using the JND procedure and multivariate methods of data analysis. RESULTS: Scatter plots of responses were meaningfully examined in 3-D and were useful in evaluating multivariate normality as well as identifying outliers. The extent and direction of the difference between each JND response and the stimulus colour point was calculated and appreciated in 3-D. Ellipsoidal surfaces of constant probability density (distribution ellipsoids) were fitted to response data; the volumes of these ellipsoids appeared useful in differentiating the colour-deficient subject from the colour-normals. Hypothesis tests of variances and covariances showed many statistically significant differences between the results of the colour-deficient subject and those of the colour-normals, while far fewer differences were found when comparing within colour-normals. CONCLUSIONS: The 3-D visualisation of colour data using stereo-pairs, as well as the statistics and multivariate methods of analysis employed, were found to be unique and useful tools in the representation and study of colour. Many additional studies using these methods along with the JND and other procedures have been identified and will be reported in future publications.
Assuntos
Percepção de Cores/fisiologia , Discriminação Psicológica/fisiologia , Percepção Espacial/fisiologia , Adulto , Percepção de Profundidade/fisiologia , Feminino , Humanos , Masculino , Análise Multivariada , Projetos Piloto , Software , Adulto JovemRESUMO
A planetary scientist recounts an audacious mission to retrieve mineral samples from space.
RESUMO
Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics technologies have enabled the detection and generation of variants at an unprecedented scale. However, efficient tools and resources are needed to link these two disparate data types - to "map" variants onto protein structures, to better understand how the variation causes disease and thereby design therapeutics. Here we present the Genomics 2 Proteins Portal (G2P; g2p.broadinstitute.org/): a human proteome-wide resource that maps 19,996,443 genetic variants onto 42,413 protein sequences and 77,923 structures, with a comprehensive set of structural and functional features. Additionally, the G2P portal generalizes the capability of linking genomics to proteins beyond databases by allowing users to interactively upload protein residue-wise annotations (variants, scores, etc.) as well as the protein structure to establish the connection. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotype.
RESUMO
Multiplexed assays of variant effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines have led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.
Assuntos
Metadados , Projetos de Pesquisa , Reprodutibilidade dos TestesRESUMO
The large-scale experimental measures of variant functional assays submitted to MaveDB have the potential to provide key information for resolving variants of uncertain significance, but the reporting of results relative to assayed sequence hinders their downstream utility. The Atlas of Variant Effects Alliance mapped multiplexed assays of variant effect data to human reference sequences, creating a robust set of machine-readable homology mappings. This method processed approximately 2.5 million protein and genomic variants in MaveDB, successfully mapping 98.61% of examined variants and disseminating data to resources such as the UCSC Genome Browser and Ensembl Variant Effect Predictor.
RESUMO
Background: Multiplexed Assays of Variant Effects (MAVEs) can test all possible single variants in a gene of interest. The resulting saturation-style data may help resolve variant classification disparities between populations, especially for variants of uncertain significance (VUS). Methods: We analyzed clinical significance classifications in 213,663 individuals of European-like genetic ancestry versus 206,975 individuals of non-European-like genetic ancestry from All of Us and the Genome Aggregation Database. Then, we incorporated clinically calibrated MAVE data into the Clinical Genome Resource's Variant Curation Expert Panel rules to automate VUS reclassification for BRCA1, TP53, and PTEN . Results: Using two orthogonal statistical approaches, we show a higher prevalence ( p ≤5.95e-06) of VUS in individuals of non-European-like genetic ancestry across all medical specialties assessed in all three databases. Further, in the non-European-like genetic ancestry group, higher rates of Benign or Likely Benign and variants with no clinical designation ( p ≤2.5e-05) were found across many medical specialties, whereas Pathogenic or Likely Pathogenic assignments were higher in individuals of European-like genetic ancestry ( p ≤2.5e-05). Using MAVE data, we reclassified VUS in individuals of non-European-like genetic ancestry at a significantly higher rate in comparison to reclassified VUS from European-like genetic ancestry ( p =9.1e-03) effectively compensating for the VUS disparity. Further, essential code analysis showed equitable impact of MAVE evidence codes but inequitable impact of allele frequency ( p =7.47e-06) and computational predictor ( p =6.92e-05) evidence codes for individuals of non-European-like genetic ancestry. Conclusions: Generation of saturation-style MAVE data should be a priority to reduce VUS disparities and produce equitable training data for future computational predictors.