RESUMO
An expanding range of genetic syndromes are characterized by genome-wide disruptions in DNA methylation profiles referred to as episignatures. Episignatures are distinct, highly sensitive, and specific biomarkers that have recently been applied in clinical diagnosis of genetic syndromes. Episignatures are contained within the broader disorder-specific genome-wide DNA methylation changes, which can share significant overlap among different conditions. In this study, we performed functional genomic assessment and comparison of disorder-specific and overlapping genome-wide DNA methylation changes related to 65 genetic syndromes with previously described episignatures. We demonstrate evidence of disorder-specific and recurring genome-wide differentially methylated probes (DMPs) and regions (DMRs). The overall distribution of DMPs and DMRs across the majority of the neurodevelopmental genetic syndromes analyzed showed substantial enrichment in gene promoters and CpG islands, and under-representation of the more variable intergenic regions. Analysis showed significant enrichment of the DMPs and DMRs in gene pathways and processes related to neurodevelopment, including neurogenesis, synaptic signaling and synaptic transmission. This study expands beyond the diagnostic utility of DNA methylation episignatures by demonstrating correlation between the function of the mutated genes and the consequent genomic DNA methylation profiles as a key functional element in the molecular etiology of genetic neurodevelopmental disorders.
Assuntos
Metilação de DNA , Transtornos do Neurodesenvolvimento , Ilhas de CpG/genética , Metilação de DNA/genética , DNA Intergênico , Epigênese Genética , Humanos , Transtornos do Neurodesenvolvimento/diagnóstico , Transtornos do Neurodesenvolvimento/genética , SíndromeRESUMO
Nucleosomes are basic units of DNA packing in eukaryotes. Their structure is well conserved from yeast to human and consists of the histone octamer core and 147 bp DNA wrapped around it. Nucleosomes are bound to a majority of the eukaryotic genomic DNA, including its regulatory regions. Hence, they also play a major role in gene regulation. For the latter, their precise positioning on DNA is essential. In the present paper, we describe Galaxy dnpatterntools-software package for nucleosome DNA sequence analysis and mapping. This software will be useful for computational biologists practitioners to conduct more profound studies of gene regulatory mechanisms.
Assuntos
Montagem e Desmontagem da Cromatina , Nucleossomos , DNA/metabolismo , Humanos , Nucleossomos/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Análise de Sequência de DNARESUMO
Background and Objectives: The pathogenic variants of SLC9A6 are a known cause of a rare, X-linked neurological disorder called Christianson syndrome (CS). The main characteristics of CS are developmental delay, intellectual disability, and neurological findings. This study investigated the genetic basis and explored the molecular changes that led to CS in two male siblings presenting with intellectual disability, epilepsy, behavioural problems, gastrointestinal dysfunction, poor height, and weight gain. Materials and Methods: Next-generation sequencing of a tetrad was applied to identify the DNA changes and Sanger sequencing of proband's cDNA was used to evaluate the impact of a splice site variant on mRNA structure. Bioinformatical tools were used to investigate SLC9A6 protein structure changes. Results: Sequencing and bioinformatical analysis revealed a novel donor splice site variant (NC_000023.11(NM_001042537.1):c.899 + 1G > A) that leads to a frameshift and a premature stop codon. Protein structure modelling showed that the truncated protein is unlikely to form any functionally relevant SLC9A6 dimers. Conclusions: Molecular and bioinformatical analysis revealed the impact of a novel donor splice site variant in the SLC9A6 gene that leads to truncated and functionally disrupted protein causing the phenotype of CS in the affected individuals.
Assuntos
Epilepsia , Deficiência Intelectual , Microcefalia , Ataxia , Epilepsia/genética , Doenças Genéticas Ligadas ao Cromossomo X , Humanos , Deficiência Intelectual/genética , Lituânia , Masculino , Microcefalia/genética , Transtornos da Motilidade OcularRESUMO
Whole exome sequencing (WES)-based assays undergo rigorous validation before being implemented in diagnostic laboratories. This validation process generates experimental evidence that allows laboratories to predict the performance of the intended assay. The NA12878 Genome in a Bottle (GIAB) HapMap reference sample is commonly used for validation in diagnostic laboratories. We investigated what data points should be taken into consideration when validating WES-based assays using the GIAB reference in a diagnostic setting. We delineate specific factors that require special consideration and identify OMIM genes associated with diseases that may 'bypass' validation. Four replicates of the NA12878 sample were sequenced at the CHEO Genetics Diagnostic Laboratory on a NextSeq 500; the data were analyzed using the bcbio_nexgen v1.1.2 pipeline. The hap.py validation engine, Real Time Genomics vcfeval tool, and high confidence (HC) variant calls in HC regions available for the GIAB sample were used to validate the obtained variant calls. The same validation process was then used to evaluate variant calls obtained for the same sample by two other clinical diagnostic laboratories. We showed that variant calls in NA12878 can be confidently measured only in the regions that intersect between the GIAB HC regions and the target regions of exome capture. Of the 4139 (as of October 2019) OMIM genes associated with a phenotype and having a known molecular basis of disease, 84 were fully outside of the GIAB HC regions and many of the remaining OMIM genes were only partially covered by the HC regions. A significant proportion of variants identified in the NA12878 sample outside of the HC regions have unknown (UNK) status due to the absence of HC reference alleles. Verification of such calls is possible either by an alternative truth set or by orthogonal testing. Similarly, many variants outside of exome capture regions, if not accounted for, will be deemed false negatives due to insufficient probe coverage. Our results demonstrate the importance of the intersection between genomic regions of interest, capture regions, and the high confidence regions. If not considered, false and ambiguous variant calls could have a negative impact on diagnostic accuracy of the intended WES-based diagnostic assay and increase the need for confirmatory testing. To enable laboratories to identify 'problematic' regions and optimize validation efforts, we have made our VCF and BED files available in UCSC Genome Browser: NA12878 WES Benchmark. Relevant genes and genome annotations are evolving, we implemented a general purpose algorithm to cross-reference OMIM genes with the genomic regions of interest that can be applied to capture genes/regions outside HC regions (see repository of data material section).
Assuntos
Sequenciamento do Exoma/métodos , Genoma Humano/genética , Alelos , Exoma/genética , Variação Genética/genética , Genômica/métodos , Humanos , Anotação de Sequência Molecular/métodosRESUMO
Nucleosome positioning DNA sequence patterns (NPS)-usually distributions of particular dinucleotides or other sequence elements in nucleosomal DNA-at least partially determine chromatin structure and arrangements of nucleosomes that in turn affect gene expression. Statistically, NPS are defined as oscillations of the dinucleotide periodicity of about 10 base pairs (bp) which reflects the double helix period. We compared the nucleosomal DNA patterns in mouse, human and yeast organisms and observed few distinctive patterns that can be termed as packing and regulatory referring to distinctive modes of chromatin function. For the first time the NPS patterns in nucleus accumbens cells (NAC) in mouse brain were characterized and compared to the patterns in human CD4+ and apoptotic lymphocyte cells and well studied patterns in yeast. The NPS patterns in human CD4+ cells and mouse brain cells had very high positive correlation. However, there was no correlation between them and patterns in human apoptotic lymphocyte cells and yeast, but the latter two were highly correlated with each other. By their dinucleotide arrangements the analyzed NPS patterns classified into stable canonical WW/SS (W = A or T and S = C or G dinucleotide) and less stable RR/YY (R = A or G and Y = C or T dinucleotide) patterns and anti-patterns. In the anti-patterns positioning of the dinucleotides is flipped compared to those in the regular patterns. Stable canonical WW/SS patterns and anti-patterns are ubiquitously observed in many organisms and they had high resemblance between yeast and human apoptotic cells. Less stable RR/YY patterns had higher positive correlation between mouse and normal human cells. Our analysis and evidence from scientific literature lead to idea that various distinct patterns in nucleosomal DNA can be related to the two roles of the chromatin: packing (WW/SS) and regulatory (RR/YY and "anti").
Assuntos
Cromatina , DNA , Nucleossomos , Animais , Sequência de Bases , Células Cultivadas , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Biologia Computacional , DNA/química , DNA/genética , DNA/metabolismo , Humanos , Camundongos , Nucleossomos/química , Nucleossomos/genética , Nucleossomos/metabolismo , Saccharomyces cerevisiae/genética , SoftwareRESUMO
Suicide research is directed at understanding social, economic, and biological causes of suicide thoughts and behaviors. (1) Background: Worldwide, certain countries have high suicide mortality rates (SMRs) compared to others. Age-standardized suicide mortality rates (SMRs) published by the World Health Organization (WHO) plus numerous bibliographic records of the Web of Science (WoS) database provide resources to understand these disparities between countries and regions. (2) Methods: Hierarchical clustering was applied to age-standardized suicide mortality rates per 100,000 population from 2000-2019. Keywords of country-specific suicide-related publications collected from WoS were analyzed by network and association rule mining. Keyword embedding was carried out using a recurrent neural network. (3) Results: Countries with similar SMR trends formed naturally distinct groups of high, medium, and low suicide mortality rates. Major themes in suicide research worldwide are depression, mental disorders, youth suicide, euthanasia, hopelessness, loneliness, unemployment, and drugs. Prominent themes differentiating countries and regions include: alcohol in post-Soviet countries; HIV/AIDS in Sub-Saharan Africa, war veterans and PTSD in the Middle East, students in East Asia, and many others. (4) Conclusion: Countries naturally group into high, medium, and low SMR categories characterized by different keyword-informed themes. The compiled dataset and presented methodology enable enrichment of analytical results by bibliographic data where observed results are difficult to interpret.
Assuntos
Aprendizado de Máquina , Suicídio , Suicídio/estatística & dados numéricos , Humanos , Saúde Global , Análise por ConglomeradosRESUMO
Introduction: The influence of physical frailty and sarcopenia (PFS) on the well-being of older people and continuous pressure on the healthcare systems has prompted a research on the pathophysiology and molecular mechanisms of these conditions. Nonetheless some biomarkers have been suggested as potential markers for PFS none of them have been shown to highlight the complex nature of PFS, which reveals that there is a need for an understanding of the possible biomarker candidates. The aim of this study was to identify the current research hotspots, status, and trends in the field of biomarkers and molecular mechanisms for PFS. Methods: The bibliometric and scientometric analyses were performed using VOSviewer (version 1.6.18) and open source software platform Cytoscape v.3.9 (for visualizing and constructing a network of keywords). Data of publications (from 1997 to 2023) related to biomarkers and molecular mechanisms of PFS were obtained (in May 2023) from the database of Science Citation Index Expanded of Web of Science, Scopus, and PubMed. The keywords obtained from the Scopus database were used to perform a meaningful keyword analysis. A network of keyword relationships was build using Cytoscape. Results: In this study, we present biomarker keywords for PFS in relation to other keywords potentially designating processes and mechanisms and reveal the biomarker identities and current contexts in which these biomarker identities are discussed. Conclusions: Over recent years, scientific interest in the field of PFS has increased and focused on the inflammatory process and probably will be concentrated on myokines (such as cytokines and small proteins) that are synthetized and released by skeletal muscles in response to physical activity. Moreover, proteomic and genetic markers are deeply involved in PFS.
RESUMO
The substantial decline in skeletal muscle mass, strength, and gait speed is a sign of severe sarcopenia, which may partly depend on genetic risk factors. So far, hundreds of genome-wide significant single nucleotide polymorphisms (SNPs) associated with handgrip strength, lean mass and walking pace have been identified in the UK Biobank cohort; however, their pleiotropic effects on all three phenotypes have not been investigated. By combining summary statistics of genome-wide association studies (GWAS) of handgrip strength, lean mass and walking pace, we have identified 78 independent SNPs (from 73 loci) associated with all three traits with consistent effect directions. Of the 78 SNPs, 55 polymorphisms were also associated with body fat percentage and 25 polymorphisms with type 2 diabetes (T2D), indicating that sarcopenia, obesity and T2D share many common risk alleles. Follow-up bioinformatic analysis revealed that sarcopenia risk alleles were associated with tiredness, falls in the last year, neuroticism, alcohol intake frequency, smoking, time spent watching television, higher salt, white bread, and processed meat intake; whereas protective alleles were positively associated with bone mineral density, serum testosterone, IGF1, and 25-hydroxyvitamin D levels, height, intelligence, cognitive performance, educational attainment, income, physical activity, ground coffee drinking and healthier diet (muesli, cereal, wholemeal or wholegrain bread, potassium, magnesium, cheese, oily fish, protein, water, fruit, and vegetable intake). Furthermore, the literature data suggest that single-bout resistance exercise may induce significant changes in the expression of 26 of the 73 implicated genes in m. vastus lateralis, which may partly explain beneficial effects of strength training in the prevention and treatment of sarcopenia. In conclusion, we have identified and characterized 78 SNPs associated with sarcopenia and 55 SNPs with sarcopenic obesity in European-ancestry individuals from the UK Biobank.
Assuntos
Diabetes Mellitus Tipo 2 , Sarcopenia , Humanos , Sarcopenia/genética , Sarcopenia/complicações , Força da Mão , Diabetes Mellitus Tipo 2/complicações , Estudo de Associação Genômica Ampla , Bancos de Espécimes Biológicos , Obesidade/epidemiologia , Obesidade/genética , Obesidade/complicações , Genômica , Reino Unido/epidemiologia , Músculo EsqueléticoRESUMO
Despite its poorly adapted codon usage, HIV-1 replicates and is expressed extremely well in human host cells. HIV-1 has recently been shown to package non-lysyl transfer RNAs (tRNAs) in addition to the tRNA(Lys) needed for priming reverse transcription and integration of the HIV-1 genome. By comparing the codon usage of HIV-1 genes with that of its human host, we found that tRNAs decoding codons that are highly used by HIV-1 but avoided by its host are overrepresented in HIV-1 virions. In particular, tRNAs decoding A-ending codons, required for the expression of HIV's A-rich genome, are highly enriched. Because the affinity of Gag-Pol for all tRNAs is nonspecific, HIV packaging is most likely passive and reflects the tRNA pool at the time of viral particle formation. Codon usage of HIV-1 early genes is similar to that of highly expressed host genes, but codon usage of HIV-1 late genes was better adapted to the selectively enriched tRNA pool, suggesting that alterations in the tRNA pool are induced late in viral infection. If HIV-1 genes are adapting to an altered tRNA pool, codon adaptation of HIV-1 may be better than previously thought.
Assuntos
HIV-1/genética , HIV-1/metabolismo , Biossíntese de Proteínas , RNA de Transferência/genética , RNA de Transferência/metabolismo , Adaptação Biológica/genética , Códon/genética , Regulação Viral da Expressão Gênica/genética , Infecções por HIV/metabolismo , Humanos , Montagem de Vírus/genéticaRESUMO
Sports genomics is the scientific discipline that focuses on the organization and function of the genome in elite athletes, and aims to develop molecular methods for talent identification, personalized exercise training, nutritional need and prevention of exercise-related diseases. It postulates that both genetic and environmental factors play a key role in athletic performance and related phenotypes. This update on the panel of genetic markers (DNA polymorphisms) associated with athlete status and soft-tissue injuries covers advances in research reported in recent years, including one whole genome sequencing (WGS) and four genome-wide association (GWAS) studies, as well as findings from collaborative projects and meta-analyses. At end of 2020, the total number of DNA polymorphisms associated with athlete status was 220, of which 97 markers have been found significant in at least two studies (35 endurance-related, 24 power-related, and 38 strength-related). Furthermore, 29 genetic markers have been linked to soft-tissue injuries in at least two studies. The most promising genetic markers include HFE rs1799945, MYBPC3 rs1052373, NFIA-AS2 rs1572312, PPARA rs4253778, and PPARGC1A rs8192678 for endurance; ACTN3 rs1815739, AMPD1 rs17602729, CPNE5 rs3213537, CKM rs8111989, and NOS3 rs2070744 for power; LRPPRC rs10186876, MMS22L rs9320823, PHACTR1 rs6905419, and PPARG rs1801282 for strength; and COL1A1 rs1800012, COL5A1 rs12722, COL12A1 rs970547, MMP1 rs1799750, MMP3 rs679620, and TIMP2 rs4789932 for soft-tissue injuries. It should be appreciated, however, that hundreds and even thousands of DNA polymorphisms are needed for the prediction of athletic performance and injury risk.
Assuntos
Desempenho Atlético , Estudo de Associação Genômica Ampla , Actinina/genética , Atletas , Genômica , Genótipo , Humanos , Peptídeos e Proteínas de Sinalização Intracelular/genéticaRESUMO
Human athletic performance is a complex phenotype influenced by environmental and genetic factors, with most exercise-related traits being polygenic in nature. The aim of this article is to outline some of the challenge faced by sports genetics as this relatively new field moves forward. This review summarizes recent advances in sports science and discusses the impact of the genome, epigenome and other omics (such as proteomics and metabolomics) on athletic performance. The article also highlights the current status of gene doping and examines the possibility of applying genetic knowledge to predict athletes' injury risk and to prevent the rare but alarming occurrence of sudden deaths during sporting events. Future research in large cohorts of athletes has the potential to detect new genetic variants and to confirm the previously identified DNA variants believed to explain the natural predisposition of some individuals to certain athletic abilities and health benefits. It is hoped that this article will be useful to sports scientists who seek a greater understanding of how genetics influences exercise science and how genomic and other multi-omics approaches might support performance analysis, coaching, personalizing nutrition, rehabilitation and sports medicine, as well as the potential to develop new rationale for future scientific investigation.
RESUMO
The MSTN gene is a negative regulator of muscle growth that is attracting attention as a candidate gene for physical performance traits. We hypothesised that variants of MSTN might be associated with the status of elite athlete. We therefore sought to study the potential role of MSTN in the physical performance of athletes by analysing the whole coding sequence of the MSTN gene in a cohort of Lithuanian elite athletes (n = 103) and non-athletes (n = 127). Consequently, two genetic variants were identified: the deletion of one of three adenines in the first intron (c.373+90delA, rs11333758) and a non-synonymous variant in the second exon (c.458A>G, p.Lys(K)153Arg(R), rs1805086). Among all samples, the MSTN rs1805086 Lys(K) allele was the most common form in both groups. Homozygous genotype for the less common Arg(R) allele was identified in only one elite canoe rower, and we could find no direct association between rs1805086 and successful results in elite athletes. Surprisingly, the intronic variant (rs11333758) was abundant among all samples. The main finding was that endurance-oriented athletes had 2.1 greater odds of being MSTN deletion genotype than non-athletes (13.6% vs. 0.8%). The present study confirms the association of the polymorphism rs11333758 with endurance performance status in Lithuanian elite athletes.
Assuntos
Atletas , Miostatina/genética , Resistência Física/genética , Polimorfismo de Nucleotídeo Único , Adulto , Feminino , Humanos , Masculino , FenótipoRESUMO
Total genotype score (TGS) reflects additive effect of genotypes on predicting a complex trait such as athletic performance. Scores assigned to genotypes in the TGS should represent an extent of the genotype's predisposition to the trait. Then, combination of genotypes highly ranks those individuals, who have a trait expressed. Usually, the genotypes are scored by the evidence of a genotype-phenotype relationship published in scientific studies. The scores can be revised computationally using genotype data of athletes, if available. From the available genotype data of 180 Lithuanian elite athletes we created an endurance-mixed-power performance TGS profile based on known ACE rs1799752, ACTN3 rs1815739, and AMPD1 rs17602729, and an emerging MB rs7293 gene markers. We analysed an ability of this TGS profile to stratify athletes according to the sport category that they practice. Logistic regression classifiers were trained to compute the genotype scores that represented the endurance versus power traits in the group of analysed athletes more accurately. We observed differences in TGS distributions in female and male group of athletes. The genotypes with possibly different effects on the athletic performance traits in females and males were described. Our data-driven analysis and TGS modelling tools are freely available to practitioners.
Assuntos
Atletas/estatística & dados numéricos , Desempenho Atlético/fisiologia , Marcadores Genéticos , Herança Multifatorial , Resistência Física , Polimorfismo Genético , Adulto , Estudos de Casos e Controles , Feminino , Frequência do Gene , Genótipo , Humanos , Masculino , FenótipoRESUMO
Biallelic mutations in the high-temperature requirement A serine peptidase 1 (HTRA1) gene are known to cause an extremely rare cerebral autosomal recessive arteriopathy with subcortical infarcts and leukoencephalopathy (CARASIL), which belongs to the group of hereditary cerebral small vessel diseases and is mainly observed in the Japanese population. Even though this pathology is inherited in an autosomal recessive manner, recent studies have described symptomatic carriers with heterozygous HTRA1 mutations who have milder symptoms than patients with biallelic HTRA1 mutations. We present the case of a Lithuanian male patient who had a stroke at the age of 36, experienced several transient ischemic attacks, and developed an early onset, progressing dementia. These clinical symptoms were associated with extensive leukoencephalopathy, lacunar infarcts, and microbleeds based on brain magnetic resonance imaging (MRI). A novel heterozygous in-frame HTRA1 gene deletion (NM_002775.5:c.533_535del; NP_002766.1:p.(Lys178del)) was identified by next generation sequencing. The variant was consistent with the patient's phenotype, which could not be explained by alternative causes, appeared highly deleterious after in silico analysis, and was not reported in the medical literature or population databases to date.
Assuntos
Demência/complicações , Deleção de Genes , Serina Peptidase 1 de Requerimento de Alta Temperatura A/genética , Acidente Vascular Cerebral/patologia , Humanos , Masculino , Pessoa de Meia-Idade , Prognóstico , Acidente Vascular Cerebral/etiologia , Acidente Vascular Cerebral/metabolismoRESUMO
The prehistory of the Lithuanian population and genetic relationship to other populations are poorly studied. Thus, the Lithuanian population, as an object of study, is interesting due to its partial isolation with genetic distinctiveness within the European context and with preserved ancient genetic composition. The main objects of this study was to infer demographic parameters, effective population size (Ne), and divergence time using high-density single nucleotide polymorphism (SNP) genotyping data generated with the Illumina HumanOmmiExpress-12v1.1 array in 295 individuals from the Lithuanian population and to compare our data with other populations from the Human Genome Cell Line Diversity Panel (HGDP-CEPH). We also aimed to reconstruct past events between the main ethnolinguistic regions-Aukstaitija and Zemaitija of Lithuania. Historically, these regions probably developed as two independent Baltic tribes. Our results of Ne in the Lithuanian population through time demonstrated a substantial reduction of Ne over the 150,000-25,000 years before present (YBP). The estimated long-term Ne of the Lithuanian population is quite low-it equals 5404, which likely is a consequence of the bottlenecks associated with the last glacial period of 25,000-12,000 YBP in Europe. The obtained divergence time estimates between the study populations are in agreement with recent studies. The reconstructed past events in Aukstaitija and Zemaitija showed significant differences between these two regions of Lithuania.
Assuntos
Evolução Molecular , Genótipo , População/genética , Feminino , Genoma Humano , Migração Humana , Humanos , Lituânia , Masculino , Pessoa de Meia-Idade , Polimorfismo GenéticoRESUMO
A cohort of 1242 individuals tested in a clinical diagnostic laboratory was used to test whether the filtering allele frequencies (FAFs)-based framework, recently recommended for MHY7-associated cardiomyopathy, is extendable to 45 cardiomyopathy genes. Statistical analysis revealed a threshold of 0.00164% for the extreme outlier allele frequencies (AFs), based on the Genome Aggregation Database (exome fraction) total AFs of 138 unique pathogenic and likely pathogenic variants; 135 of them (97.8%) had AFs of <0.004%, the recommended threshold to apply moderate pathogenicity evidence for MYH7-associated cardiomyopathy. Of the 460 cases reported with only variant(s) of unknown clinical significance (VUCSs), 97 (21%) solely had VUCSs with FAFs >0.03%, frequencies above which were estimated herein as strong evidence against pathogenicity. Interestingly, 74.5% (172/231) of the unique VUCSs with FAFs >0.03% had Genome Aggregation Database maximum allele frequencies across all populations AFs >0.1%, deemed herein as stand-alone evidence against pathogenicity. Accordingly, using an FAF threshold of >0.1%, compared with AF >1%, led us to issue considerably more (25.9% versus 41.3%) negative patient reports. Also, 82.7% (N = 629) of the unique classified benign or likely benign variants with AFs <1% had FAFs >0.1%, reinforcing the use of this filtering strategy. Together, these data demonstrate that implementing FAF thresholds may considerably decrease the amount of variant interpretations and significantly reduce the cost of genetic testing for clinical genetic laboratories, without compromising the accuracy of genetic diagnostic services.
Assuntos
Frequência do Gene , Testes Genéticos , Variação Genética , Laboratórios , Alelos , Cardiomiopatias/diagnóstico , Cardiomiopatias/genética , Análise Custo-Benefício , Testes Genéticos/métodos , Sequenciamento de Nucleotídeos em Larga Escala , HumanosRESUMO
BACKGROUND: Blepharophimosis is a fixed reduction in the vertical distance between the upper and lower eyelids with short palpebral fissures. It is a rare facial malformation and is considered an important diagnostic feature in dysmorphic analysis. It is likely that many patients with blepharophimosis-mental retardation syndrome have submicroscopic chromosomal rearrangements, and the use of molecular karyotyping can narrow the known blepharophimosis-mental retardation-critical regions or clarify the effect of the haploinsufficiency of the involved genes on the phenotype. MATERIALS AND METHODS: A female patient presented with bilateral blepharophimosis, ptosis, epicanthus inversus, telecanthus, low-set and small ears, other minor anomalies, hypotonia and psychomotor developmental delay. Metabolic investigations and array CGH analysis were performed. The results of molecular karyotyping were confirmed by real-time PCR analysis. RESULTS: Molecular karyotyping revealed a 5.2 Mb deletion in the 10q22.1q22.3 region. Real-time PCR analysis of the proband and her parents confirmed the deletion in the proband and revealed its de novo origin. CONCLUSIONS: With ptosis, hypotonia, and developmental delay as the main diagnostic features of our patient, the effect of histone acetyltransferase-encoding KAT6B gene haploinsufficiency was suspected to have a significant role in determining the phenotype. Detailed clinical characterization of the patient provided additional information on the clinical manifestation of the 10q22 deletion.
Assuntos
Blefarofimose/diagnóstico , Blefarofimose/genética , Deleção Cromossômica , Cromossomos Humanos Par 10/genética , Histona Acetiltransferases/genética , Anormalidades da Pele/diagnóstico , Anormalidades da Pele/genética , Anormalidades Urogenitais/diagnóstico , Anormalidades Urogenitais/genética , Pré-Escolar , Deficiências do Desenvolvimento/diagnóstico , Deficiências do Desenvolvimento/genética , Feminino , Humanos , Deficiência Intelectual/diagnóstico , Deficiência Intelectual/genética , Cariotipagem , Reação em Cadeia da Polimerase em Tempo RealRESUMO
OBJECTIVE: Demonstrate that incorporating domain knowledge into feature selection methods helps identify interpretable features with predictive capability comparable to a state-of-the-art classifier. METHODS: Two feature selection methods, one using a genetic algorithm (GA) the other a L(1)-norm support vector machine (SVM), were investigated on three real-world biomedical magnetic resonance (MR) spectral datasets of increasing difficulty. Consensus sets of the feature sets obtained by the two methods were also assessed. RESULTS AND CONCLUSIONS: Features identified independently by the two methods and by their consensus, determine class-discriminatory groups or individual features, whose predictive power compares favorably with that of a state-of-the-art classifier. Furthermore, the identified feature signatures form stable groupings at definite spectral positions, hence are readily interpretable. This is a useful and important practical result for generating hypothesis for the domain expert.
Assuntos
Algoritmos , Inteligência Artificial , Líquidos Corporais , Candida albicans/classificação , Candida tropicalis/classificação , HumanosRESUMO
BACKGROUND: Every next generation sequencing (NGS) platform relies on proprietary and open source computational tools to analyze sequencing data. NGS tools for Illumina platforms are well documented which is not the case with AB SOLiD systems. We applied several computational and variant calling pipelines to analyse targeted exome sequencing data obtained using AB SOLiD 5500 system. Our investigated tools comprised proprietary LifeScope's pipeline in combination with open source color-space competent mapping programs and a variant caller. We present instrumental details of the pipelines that were used and quantitative comparative analysis of variant lists generated by LifeScope's pipeline versus open source tools. RESULTS: Sufficient coverage of targeted regions was achieved by all investigated pipelines. High variability was observed in identities of variants across the mapping programs. We observed less than 50% concordance of variant lists produced by approaches based on different mapping algorithms. We summarized different approaches with regards to coverage (DP) and quality (QUAL) properties of the variants provided by GATK and found that LifeScope's computational pipeline is superior. Fusion of information on mapping profiles (pileup) at genomic positions of variants in several different alignments proved to be a useful strategy to assess questionable singleton variants. CONCLUSIONS: We quantitatively supported a conclusion that Lifescope's pipeline is superior for processing sequencing data obtained by AB SOLiD 5500 system. Nevertheless the use of alternative pipelines is encouraged because aggregation of information from other mapping and variant calling approaches helps to resolve questionable calls and increases the confidence of the call. It was noted that a coverage threshold for variant to be considered for further analysis has to be chosen in data-driven way to prevent a loss of important information.
Assuntos
Biologia Computacional/métodos , Exoma/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Algoritmos , Genoma Humano/genética , Genômica/métodos , Humanos , Polimorfismo de Nucleotídeo Único , Reprodutibilidade dos Testes , SoftwareRESUMO
A major goal of cell therapy for vascular diseases is to promote revascularization through the injection of endothelial stem/progenitor cells. The gene regulatory mechanisms that underlie endothelial progenitor-mediated vascular repair, however, remain elusive. Here, we identify the transcription factor TAL1/SCL as a key mediator of the vascular repair function of primary human endothelial colony-forming cells (ECFCs). Genome-wide analyses in ECFCs demonstrate that TAL1 activates a transcriptional program that promotes cell adhesion and migration. At the mechanistic level, we show that TAL1 upregulates the expression of migratory and adhesion genes through recruitment of the histone acetyltransferase p300. Based on these findings, we establish a strategy that enhances the revascularization efficiency of ECFCs after ischemia through ex vivo priming with the histone deacetylase inhibitor TSA. Thus, small molecule epigenetics drugs are effective tools for modifying the epigenome of stem/progenitor cells prior to transplantation as a means to enhance their therapeutic potential.