ABSTRACT
Somatic variation is a major type of genetic variation contributing to human diseases, including cancer. Although vast numbers of somatic variants have been identified, the functional impact of many of them, in particular the missense variants, remains unclear. This lack of functional information prevents the translation of rich variation data into clinical applications. We previously developed a method named Ramachandran Plot-Molecular Dynamics Simulations (RP-MDS) to predict the function of germline missense variants based on their effects on protein structure stability, and successfully applied it to predict the deleteriousness of unclassified germline missense variants in multiple cancer genes. We hypothesized that, regardless of their different genetic origins, somatic and germline missense variants could have similar effects on the stability of the affected protein structure. As such, the RP-MDS method designed for germline missense variants should also be applicable to predicting the function of somatic missense variants. In the current study, we tested this hypothesis using the somatic missense variants in TP53 as a model. Of the 397 somatic missense variants analyzed, RP-MDS predicted that 195 (49.1%) were deleterious because they significantly disturbed the p53 structure. The results were largely validated with a p53-p21 promoter-green fluorescent protein (GFP) reporter gene assay. Our study demonstrates that deleterious somatic missense variants can be identified by their effects on protein structural stability.
Subjects
Missense Mutation, Protein Stability, Tumor Suppressor Protein p53, Humans, Tumor Suppressor Protein p53/genetics, Tumor Suppressor Protein p53/chemistry, Molecular Dynamics Simulation, Neoplasms/genetics, Protein Conformation
ABSTRACT
BACKGROUND: The mismatch repair (MMR) system is evolutionarily conserved for the maintenance of genome stability. Germline pathogenic variants (PVs) in MMR genes that lead to MMR functional deficiency are associated with high cancer risk. Knowing the evolutionary origin of germline PVs in human MMR genes will help to clarify the biological basis of MMR deficiency in cancer. However, systematic knowledge addressing this issue is lacking. In this study, we performed a comprehensive analysis to determine the evolutionary origin of human MMR PVs. METHODS: We retrieved MMR gene variants from the ClinVar database. The genomes of 100 vertebrates were collected from the UCSC genome browser, and ancient human sequencing data were obtained through comprehensive data mining. Cross-species conservation analysis was performed based on the phylogenetic relationship among the 100 vertebrates. Rescaled ancient sequencing data were used to perform variant calling for archeological analysis. RESULTS: Using the phylogenetic approach, we traced the 3369 MMR PVs identified in modern humans in 99 non-human vertebrate genomes but found no evidence for cross-species conservation as the source of human MMR PVs. Using the archeological approach, we searched for the human MMR PVs in over 5000 ancient human genomes dated from 45,045 to 100 years before present and identified a group of MMR PVs shared between modern and ancient humans, mostly within the last 10,000 years, with similar quantitative patterns. CONCLUSION: Our study reveals that MMR PVs in modern humans arose within recent human evolutionary history.
Subjects
Brain Neoplasms, Colorectal Neoplasms, DNA Mismatch Repair, Hereditary Neoplastic Syndromes, Humans, DNA Mismatch Repair/genetics, Phylogeny, Germ-Line Mutation/genetics, Germ Cells
ABSTRACT
BACKGROUND: Admixture occurs between different ethnic human populations. The global colonization by Europeans in recent centuries led to the most significant admixture in human history. While admixture may enhance genetic diversity for better fitness, it may also affect human health by transmitting genetic variants for disease susceptibility into the admixed population. The admixture resulting from Portuguese global exploration, initiated in the 15th century, has given rise to a Portuguese-heritage population of over 20 million worldwide. It provides a valuable model to study the impact of admixture on human health. BRCA1 and BRCA2 (BRCA) are two important tumor suppressor genes. Pathogenic variation (PV) in BRCA is well established as a cause of high risk of hereditary breast and ovarian cancer. Tracing the distribution of Portuguese BRCA PVs in the Portuguese-heritage population will help to understand the impact of admixture on cancer susceptibility in modern humans. In this study, we analyzed the distribution of Portuguese-originated BRCA variation in the Brazilian population, which has a high degree of Portuguese heritage. METHODS: By comprehensive data mining, standardization and annotation, we generated a Portuguese-derived BRCA variation dataset and a Brazilian-derived BRCA variation dataset. We compared the two BRCA variation datasets to identify the BRCA variants shared between the two populations. RESULTS: The Portuguese-derived BRCA variation dataset consists of 220 BRCA variants, including 78 PVs, from 11,482 Portuguese cancer patients, 93 (42.2%) in BRCA1 and 127 (57.7%) in BRCA2. Of the 556 Portuguese BRCA PV carriers carrying the 78 PVs, 331 (59.5%) carried the three Portuguese BRCA founder PVs: BRCA1 c.2037delinsCC, BRCA1 c.3331_3334del and BRCA2 c.156_157insAlu. The Brazilian-derived BRCA variation dataset consists of 255 BRCA PVs from 7,711 cancer patients, 136 (53.3%) in BRCA1 and 119 (46.6%) in BRCA2.
We developed an open database named dbBRCA-Portuguese ( https://genemutation.fhs.um.edu.mo/dbbrca-portuguese/ ) and an open database named dbBRCA-Brazilian ( https://genemutation.fhs.um.edu.mo/dbbrca-brazilian ) to host the BRCA variation data from the Portuguese and Brazilian populations. We compared the BRCA PV datasets between the Portuguese and Brazilian populations and identified 29 Portuguese-specific BRCA PVs shared between the two populations: 14 in BRCA1, including the Portuguese founder variants BRCA1 c.3331_3334del and BRCA1 c.2037delinsCC, and 15 in BRCA2, including the Portuguese founder variant BRCA2 c.156_157insAlu. Searching for the 78 Portuguese BRCA PVs in over 5,000 ancient human genomes identified an evolutionary origin for only 8 PVs, in Europeans dated between 37,470 and 3,818 years before present, confirming the Portuguese specificity of the Portuguese BRCA PVs. Comparing the 78 Portuguese BRCA PVs, 255 Brazilian BRCA PVs and 134 African BRCA PVs showed little overlap, ruling out the possibility that the BRCA PVs shared between the Portuguese and Brazilian populations were also contributed by African populations. CONCLUSION: Our study provides evidence that admixture in recent human history contributed to cancer susceptibility in modern humans.
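The dataset comparison described above amounts to intersecting two sets of variant identifiers. The following is a minimal sketch of that idea, not the authors' pipeline: the function names are hypothetical, the whitespace normalization is an assumed stand-in for the standardization step, and the "placeholder" entries are invented toy values. Only the three founder variants are taken from the abstract.

```python
from typing import Iterable, Set


def normalize(variant: str) -> str:
    """Collapse runs of whitespace so differently spaced spellings compare equal."""
    return " ".join(variant.split())


def shared_variants(a: Iterable[str], b: Iterable[str]) -> Set[str]:
    """Return the variants present in both collections after normalization."""
    return {normalize(v) for v in a} & {normalize(v) for v in b}


# Toy inputs. The three founder variants are named in the abstract; the
# "placeholder" entries are hypothetical population-specific variants.
portuguese = [
    "BRCA1 c.2037delinsCC",
    "BRCA1 c.3331_3334del",
    "BRCA2 c.156_157insAlu",
    "BRCA1 placeholder-PT-only",
]
brazilian = [
    "BRCA1 c.2037delinsCC ",
    "BRCA1  c.3331_3334del",
    "BRCA2 c.156_157insAlu",
    "BRCA2 placeholder-BR-only",
]

shared = shared_variants(portuguese, brazilian)
print(sorted(shared))
```

A real comparison would need a stricter standardization step (for example, a common transcript reference and HGVS nomenclature) before the intersection is meaningful.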
Subjects
BRCA1 Protein, BRCA2 Protein, Humans, BRCA2 Protein/genetics, BRCA1 Protein/genetics, Portugal, Female, Genetic Predisposition to Disease, Brazil, Genetic Variation, Breast Neoplasms/genetics, Ovarian Neoplasms/genetics
ABSTRACT
BACKGROUND: Genome stability is maintained by the DNA damage repair (DDR) system, which is composed of multiple DNA repair pathways involving hundreds of genes. Germline pathogenic variation (PV) in DDR genes damages the function of the affected genes, leading to genome instability and high risk of diseases, in particular cancer. Knowing the evolutionary origin of the PVs in human DDR genes is essential to understand the etiology of human diseases. However, the answer remains largely elusive. In this study, we analyzed the evolutionary origin of the PVs in human DDR genes. METHODS: We identified 169 DDR genes by referring to various databases and identified PVs in the DDR genes of modern humans from the ClinVar database. We performed a phylogenetic analysis of the conservation of human DDR PVs in 100 vertebrates through cross-species genomic data comparison using the phyloFit program of the PHAST package, and visualized the results using the GraphPad Prism software and the ggplot module. We identified DDR PVs from over 5000 ancient humans and developed a database to host the DDR PVs ( https://genemutation.fhs.um.edu.mo/dbDDR-AncientHumans ). Using the PV data, we performed a molecular archeological analysis to compare the DDR PVs between modern and ancient humans. We analyzed the evolutionary selection of DDR genes across 20 vertebrates using CodeML in PAML for phylogenetic analysis. RESULTS: Our phylogenetic analysis ruled out cross-species conservation as the origin of human DDR PVs. Our archeological approach identified abundant DDR PVs shared between modern and ancient humans, mostly dated within the last 5000 years. We also observed a similar pattern of quantitative PV distribution between modern and ancient humans. We further detected a set of ATM, BRCA2 and CHEK2 PVs shared between humans and Neanderthals. CONCLUSIONS: Our study reveals that human DDR PVs mostly arose in recent human history.
We propose that the high human cancer risk caused by DDR PVs can be a by-product of human evolution.
Subjects
DNA Repair, Neoplasms, Humans, Phylogeny, DNA Repair/genetics, BRCA2 Genes, Neoplasms/genetics, Genomic Instability, DNA Damage/genetics, Genetic Predisposition to Disease
ABSTRACT
Pathogenic variation in the DNA mismatch repair (MMR) gene MLH1 is associated with Lynch syndrome (LS), an autosomal dominant hereditary cancer. Of the 3798 MLH1 germline variants collected in the ClinVar database, 38.7% (1469) were missense variants, of which 81.6% (1199) were classified as Variants of Uncertain Significance (VUS) due to the lack of functional evidence. Further determination of the impact of VUS on MLH1 function is important for VUS carriers to take preventive action. We recently developed a protein structure-based method named "Deep Learning-Ramachandran Plot-Molecular Dynamics Simulation (DL-RP-MDS)" to evaluate the deleteriousness of MLH1 missense VUS. The method extracts protein structural information by using the Ramachandran plot-molecular dynamics simulation (RP-MDS) method, then combines the variation data with an unsupervised learning model composed of an auto-encoder and a neural network classifier to identify the variants causing significant change in protein structure. In this report, we applied the method to classify 447 MLH1 missense VUS. We predicted that 126/447 (28.2%) MLH1 missense VUS were deleterious. Our study demonstrates that DL-RP-MDS is able to classify missense VUS based solely on their impact on protein structure.
Subjects
Hereditary Nonpolyposis Colorectal Neoplasms, Deep Learning, MutL Protein Homolog 1, Humans, Hereditary Nonpolyposis Colorectal Neoplasms/genetics, Factual Databases, DNA Mismatch Repair, Molecular Dynamics Simulation, MutL Protein Homolog 1/genetics
ABSTRACT
Pathogenic variation in BRCA1 and BRCA2 (BRCA) causes high risk of breast and ovarian cancer, and BRCA variation data are important markers for BRCA-related clinical cancer applications. However, comprehensive BRCA variation data are lacking for the Asian population despite its large size, heterogeneous genetic background and diversified living environment across the Asian continent. We performed a systematic study of BRCA variation in the Asian population, including extensive data mining, standardization, annotation and characterization. We identified 7587 BRCA variants from 685 592 Asian individuals in 40 Asian countries and regions, including 1762 clinically actionable pathogenic variants and 4915 functionally unknown variants (https://genemutation.fhs.um.edu.mo/Asian-BRCA/). We observed the highly ethnic-specific nature of Asian BRCA variants, both between Asian and non-Asian populations and within Asian populations, highlighting that the current BRCA data based on populations of European descent are inadequate to reflect BRCA variation in the Asian population. We also provided archeological evidence for the evolutionary origin and arising time of Asian BRCA variation. We further provided structure-based evidence for the enrichment of deleterious variants within the functionally unknown Asian BRCA variants. The data from our study provide a current view of BRCA variation in the Asian population and a rich resource to guide clinical applications of BRCA-related cancer for the Asian population.
Subjects
Breast Neoplasms, Ovarian Neoplasms, Female, Humans, Asia/epidemiology, Asian, Asian People/genetics, BRCA1 Protein/genetics, Breast Neoplasms/genetics, Genetic Predisposition to Disease, Germ-Line Mutation, Ovarian Neoplasms/genetics
ABSTRACT
BACKGROUND: Identifying genetically disease-susceptible individuals through population screening is considered a promising approach for disease prevention. DNA mismatch repair (MMR) genes including MLH1, MSH2, MSH6 and PMS2 play essential roles in maintaining microsatellite stability through DNA mismatch repair, and pathogenic variation in MMR genes causes microsatellite instability and is the genetic predisposition for cancer as represented by Lynch syndrome. While the prevalence and spectrum of MMR variation have been extensively studied in cancer, they remain largely elusive in the general population. This lack of knowledge prevents effective prevention of MMR variation-caused cancer. In the current study, we addressed the issue by using the Chinese population as a model. METHODS: We performed extensive data mining to collect MMR variant data from 18 844 ethnic Chinese individuals and comprehensively analysed the collected MMR variants to determine the prevalence, spectrum and features of MMR variation in the Chinese population. RESULTS: We identified 17 687 distinct MMR variants. We observed substantial differences in MMR variation between the general Chinese population and Chinese patients with cancer, identified highly Chinese-specific MMR variation by comparing MMR data between Chinese and non-Chinese populations, predicted the enrichment of deleterious variants in the unclassified Chinese-specific MMR variants, determined an MMR pathogenic variant prevalence of 0.18% in the general Chinese population and determined that MMR variation in the general Chinese population is evolutionarily neutral. CONCLUSION: Our study provides a comprehensive view of MMR variation in the general Chinese population, a resource for biological study of human MMR variation, and a reference for MMR-related cancer applications.
Subjects
Hereditary Nonpolyposis Colorectal Neoplasms, DNA Mismatch Repair, China/epidemiology, Hereditary Nonpolyposis Colorectal Neoplasms/genetics, DNA Mismatch Repair/genetics, Germ-Line Mutation, Humans, Microsatellite Instability, Mismatch Repair Endonuclease PMS2/genetics, MutL Protein Homolog 1/genetics, MutS Homolog 2 Protein/genetics, Prevalence
ABSTRACT
BACKGROUND: Germline mutation in BRCA1 and BRCA2 (BRCA) is a genetic predisposition for breast and ovarian cancer. Identification of mutation carriers is a critical step in preventing and treating cancer in those carriers. Human BRCA variation was established as ethnic-specific by studies in Ashkenazi Jewish, Polish and Icelandic populations in the 1990s. However, sufficient evidence is lacking to determine whether ethnic-specific BRCA variation is also present in the Asian population, which is the largest and most diversified in modern humans. Our current study aims to investigate ethnic-specific BRCA variation in the Asian population. METHODS: We performed comprehensive data mining to collect BRCA variation data in Indian, Chinese, Korean and Japanese populations derived from over 78 000 cancer and 40 000 non-cancer cases. We standardised all BRCA variation data following the international standard. We made a systematic comparison between the datasets, including variant composition, variation spectrum, variant type, clinical class, founder mutations and high-frequency variants. RESULTS: Our analysis showed that over half of the Asian BRCA variants were Asian-specific, and significant differences were present between the four Asian populations in each category analysed. CONCLUSION: Data from our study reveal that ethnic-specific BRCA variation is commonly present in the Asian population, as it is in non-Asian populations. Our study indicates that ethnicity should be an important factor to consider in the prevention and treatment of BRCA mutation-related cancer in the Asian population. We recommend that the current BRCA variation databases include ethnic variation information in order to function as true global BRCA references.
Subjects
BRCA1 Protein/genetics, BRCA2 Protein/genetics, Genetic Variation, Neoplasms/genetics, Asian People/genetics, Founder Effect, Genetic Predisposition to Disease, Humans, India, Japan, Mutation
ABSTRACT
Functional classification of genetic variants is key to their clinical application in patient care. However, the abundance of variant data generated by next-generation DNA sequencing technologies limits the use of experimental methods for their classification. Here, we developed a protein structure and deep learning (DL)-based system for genetic variant classification, DL-RP-MDS, which comprises two principles: 1) extracting protein structural and thermodynamic information using the Ramachandran plot-molecular dynamics simulation (RP-MDS) method; and 2) combining those data with an unsupervised learning model of an auto-encoder and a neural network classifier to identify statistically significant patterns in the structural changes. We observed that DL-RP-MDS provided higher specificity than over 20 widely used in silico methods in classifying the variants of three DNA damage repair genes: TP53, MLH1, and MSH2. DL-RP-MDS offers a powerful platform for high-throughput genetic variant classification. The software and online application are available at https://genemutation.fhs.um.edu.mo/DL-RP-MDS/.
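For readers unfamiliar with the structural input, a Ramachandran plot charts the backbone dihedral angles phi and psi of each residue, and each of those is the torsion angle defined by four consecutive backbone atoms. The sketch below shows one standard way to compute such a torsion angle from four 3D coordinates. It is illustrative only and is not the DL-RP-MDS implementation: the function name `dihedral_deg` and the toy coordinates are assumptions, and the sign of the result follows this particular formula.

```python
import math
from typing import Tuple

Vec = Tuple[float, float, float]


def _sub(a: Vec, b: Vec) -> Vec:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a: Vec, b: Vec) -> float:
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a: Vec, b: Vec) -> Vec:
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _scale(a: Vec, s: float) -> Vec:
    return (a[0] * s, a[1] * s, a[2] * s)


def dihedral_deg(p0: Vec, p1: Vec, p2: Vec, p3: Vec) -> float:
    """Torsion angle in degrees about the p1-p2 bond, in the range [-180, 180]."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    # Unit vector along the central bond.
    b1 = _scale(b1, 1.0 / math.sqrt(_dot(b1, b1)))
    # Components of the outer bonds perpendicular to the central bond.
    v = _sub(b0, _scale(b1, _dot(b0, b1)))
    w = _sub(b2, _scale(b1, _dot(b2, b1)))
    x = _dot(v, w)
    y = _dot(_cross(b1, v), w)
    return math.degrees(math.atan2(y, x))


# Toy coordinates (hypothetical, not taken from any protein structure).
cis = dihedral_deg((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1))
quarter = dihedral_deg((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1))
trans = dihedral_deg((1, 0, 0), (0, 0, 0), (0, 0, 1), (-1, 0, 1))
print(cis, quarter, trans)
```

Applied to the backbone atoms C(i-1), N, CA, C it gives phi, and applied to N, CA, C, N(i+1) it gives psi; tracking these angles over simulation frames is the kind of structural signal a Ramachandran-based analysis works from.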
ABSTRACT
Two-dimensional (2D) transition metal dichalcogenides (TMDCs) draw much attention as critical semiconductor materials for 2D, optoelectronic, and spin electronic devices. Although controlled doping of 2D semiconductors can be used to tune their bandgap and carrier type and to further change their electronic, optical, and catalytic properties, it remains an ongoing challenge. Here, we successfully doped a series of metal elements (including Hf, Zr, Gd, and Dy) into monolayer MoS2 through single-step chemical vapor transport (CVT), and the atomic embedded structure was confirmed by probe-corrected scanning transmission electron microscopy (STEM). In addition, the host crystal is well preserved, and no random atomic aggregation is observed. More importantly, adjusting the band structure of MoS2 enhanced the fluorescence and the carrier effect. This work provides a growth method for doping non-like elements into 2D MoS2 and potentially many other 2D materials to modify their properties.
ABSTRACT
TP53 is crucial for maintaining genome stability and preventing oncogenesis. Germline pathogenic variation in TP53 damages its function, causing genome instability and increased cancer risk. Despite extensive study of TP53, the evolutionary origin of the human TP53 germline pathogenic variants remains largely unclear. In this study, we applied phylogenetic and archaeological approaches to identify the evolutionary origin of TP53 germline pathogenic variants in modern humans. In the phylogenetic analysis, we searched for 406 human TP53 germline pathogenic variants in 99 vertebrates distributed in eight clades (Primate, Euarchontoglires, Laurasiatheria, Afrotheria, Mammal, Aves, Sarcopterygii and Fish), but we observed no direct evidence for cross-species conservation as the origin. In the archaeological analysis, we searched for the variants in 5031 ancient human genomes dated between 45045 and 100 years before present, and identified 45 pathogenic variants in 62 ancient humans dated mostly within the last 8000 years. We also identified 6 pathogenic variants in 3 Neanderthals dated 44000 to 38515 years before present and 1 Denisovan dated 158 550 years before present. Our study reveals that TP53 germline pathogenic variants in modern humans likely originated in recent human history and were partially inherited from the extinct Neanderthals and Denisovans.
ABSTRACT
BRCA1 and BRCA2 (BRCA) play essential roles in maintaining genome stability. BRCA germline pathogenic variants increase cancer risk. However, the evolutionary origin of human BRCA pathogenic variants remains largely elusive. We tested the 2,972 human BRCA1 and 3,652 human BRCA2 pathogenic variants from the ClinVar database in 100 vertebrates across eight clades, but failed to find evidence of cross-species evolutionary conservation as the origin; we searched for the variants in 2,792 ancient human genomes, and identified 28 BRCA1 and 22 BRCA2 pathogenic variants in 44 cases dated from 45,000 to 300 yr ago; we analyzed the haplotype-dated human BRCA pathogenic founder variants, and observed that they mostly arose within the past 3,000 yr; we traced the ethnic distribution of human BRCA pathogenic variants, and found that the majority were present in a single or a few ethnic populations. Based on the data, we propose that human BRCA pathogenic variants most likely arose in recent human history after the latest out-of-Africa migration, and that the expansion of the modern human population could have largely increased the variation spectrum.
Subjects
BRCA1 Protein/genetics, BRCA2 Protein/genetics, Animals, Biological Evolution, Ancient DNA/analysis, Genetic Databases, Molecular Evolution, Female, Genetic Predisposition to Disease/genetics, Germ-Line Mutation/genetics, Haplotypes/genetics, Humans, Mutation
ABSTRACT
Formamidine-based hybrid perovskite is an excellent optoelectronic material; however, its intrinsic non-layered crystalline structure makes it hard to isolate the corresponding 2D counterparts. In this work, a unique liquid-epitaxy technique was introduced to grow micro-sized two-dimensional FAPbX3 perovskite sheets. Such ultrathin sheets exhibited excellent photo-induced carrier properties with high crystalline quality, and they provide new opportunities for next-generation optoelectronic devices.