Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 39
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Am J Hum Genet ; 110(12): 1996-2002, 2023 Dec 07.
Article in English | MEDLINE | ID: mdl-37995684

ABSTRACT

In this perspective we discuss the current lack of genetic and environmental diversity in functional genomics datasets. There is a well-described Eurocentric bias in genetic and functional genomic research that has a clear impact on the benefit this research can bring to underrepresented populations. Current research focused on genetic variant-to-function experiments aims to identify molecular QTLs, but the lack of data from genetically diverse individuals has limited analyses to mostly populations of European ancestry. Although some efforts have been established to increase diversity in functional genomic studies, much remains to be done to consistently generate data for underrepresented populations from now on. We discuss the major barriers for this continuity and suggest actionable insights, aiming to empower research and researchers from underserved populations.


Subject(s)
Genomics , Population Groups , Humans
2.
Bioinformatics ; 40(Supplement_1): i11-i19, 2024 Jun 28.
Article in English | MEDLINE | ID: mdl-38940154

ABSTRACT

MOTIVATION: Wikipedia is a vital open educational resource in computational biology. The quality of computational biology coverage in English-language Wikipedia has improved steadily in recent years. However, there is an increasingly large 'knowledge gap' between computational biology resources in English-language Wikipedia, and Wikipedias in non-English languages. Reducing this knowledge gap by providing educational resources in non-English languages would reduce language barriers which disadvantage non-native English speaking learners across multiple dimensions in computational biology. RESULTS: Here, we provide a comprehensive assessment of computational biology coverage in Spanish-language Wikipedia, the second most accessed Wikipedia worldwide. Using Spanish-language Wikipedia as a case study, we generate quantitative and qualitative data before and after a targeted educational event, specifically, a Spanish-focused student editing competition. Our data demonstrates how such events and activities can narrow the knowledge gap between English and non-English educational resources, by improving existing articles and creating new articles. Finally, based on our analysis, we suggest ways to prioritize future initiatives to improve open educational resources in other languages. AVAILABILITY AND IMPLEMENTATION: Scripts for data analysis are available at: https://github.com/ISCBWikiTeam/spanish.


Subject(s)
Computational Biology , Computational Biology/methods , Humans , Language , Internet
3.
Twin Res Hum Genet ; 27(2): 85-96, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38699821

ABSTRACT

TwinsMX registry is a national research initiative in Mexico that aims to understand the complex interplay between genetics and environment in shaping physical and mental health traits among the country's population. With a multidisciplinary approach, TwinsMX aims to advance our knowledge of the genetic and environmental mechanisms underlying ethnic variations in complex traits and diseases, including behavioral, psychometric, anthropometric, metabolic, cardiovascular and mental disorders. With information gathered from over 2800 twins, this article updates the prevalence of several complex traits; and describes the advances and novel ideas we have implemented such as magnetic resonance imaging. The future expansion of the TwinsMX registry will enhance our comprehension of the intricate interplay between genetics and environment in shaping health and disease in the Mexican population. Overall, this report describes the progress in the building of a solid database that will allow the study of complex traits in the Mexican population, valuable not only for our consortium, but also for the worldwide scientific community, by providing new insights of understudied genetically admixed populations.


Subject(s)
Gene-Environment Interaction , Registries , Humans , Mexico/epidemiology , Male , Female , Adult , Diseases in Twins/genetics , Diseases in Twins/epidemiology , Middle Aged , Twins, Monozygotic/genetics , Twins, Dizygotic/genetics , Mental Disorders/genetics , Mental Disorders/epidemiology , Cardiovascular Diseases/genetics , Cardiovascular Diseases/epidemiology
4.
Nucleic Acids Res ; 50(W1): W670-W676, 2022 07 05.
Article in English | MEDLINE | ID: mdl-35544234

ABSTRACT

RSAT (Regulatory Sequence Analysis Tools) enables the detection and the analysis of cis-regulatory elements in genomic sequences. This software suite performs (i) de novo motif discovery (including from genome-wide datasets like ChIP-seq/ATAC-seq) (ii) genomic sequences scanning with known motifs, (iii) motif analysis (quality assessment, comparisons and clustering), (iv) analysis of regulatory variations and (v) comparative genomics. RSAT comprises 50 tools. Six public Web servers (including a teaching server) are offered to meet the needs of different biological communities. RSAT philosophy and originality are: (i) a multi-modal access depending on the user needs, through web forms, command-line for local installation and programmatic web services, (ii) a support for virtually any genome (animals, bacteria, plants, totalizing over 10 000 genomes directly accessible). Since the 2018 NAR Web Software Issue, we have developed a large REST API, extended the support for additional genomes and external motif collections, enhanced some tools and Web forms, and developed a novel tool that builds or refine gene regulatory networks using motif scanning (network-interactions). The RSAT website provides extensive documentation, tutorials and published protocols. RSAT code is under open-source license and now hosted in GitHub. RSAT is available at http://www.rsat.eu/.


Subject(s)
Genomics , Transcription Factors , Animals , Transcription Factors/genetics , Genomics/methods , Software , Sequence Analysis, DNA/methods , Gene Regulatory Networks
5.
Twin Res Hum Genet ; : 1-9, 2023 Oct 16.
Article in English | MEDLINE | ID: mdl-37842863

ABSTRACT

Astigmatism and myopia are two common ocular refractive errors that can impact daily life, including learning and productivity. Current knowledge suggests that the etiology of these conditions is the result of a complex interplay between genetic and environmental factors. Studies in populations of European ancestry have demonstrated a higher concordance of refractive errors in monozygotic (MZ) twins compared to dizygotic (DZ) twins. However, there is a lack of studies on genetically informative samples of multi-ethnic ancestry. This study aimed to estimate the genetic contribution to astigmatism and myopia in the Mexican population. A sample of 1399 families, including 243 twin pairs and 1156 single twins, completed a medical questionnaire about their own and their co-twin's diagnosis of astigmatism and myopia. Concordance rates for astigmatism and myopia were estimated, and heritability and genetic correlations were determined using a bivariate ACE Cholesky decomposition method, decomposed into A (additive genetic), C (shared environmental) and E (unique environmental) components. The results showed a higher concordance rate for astigmatism and myopia for MZ twins (.74 and .74, respectively) than for DZ twins (.50 and .55). The AE model, instead of the ACE model, best fitted the data. Based on this, heritability estimates were .81 for astigmatism and .81 for myopia, with a cross-trait genetic correlation of rA = .80, nonshared environmental correlation rE = .89, and a phenotypic correlation of rP = .80. These results are consistent with previous findings in other populations, providing evidence for a similar genetic architecture of these conditions in the multi-ethnic Mexican population.

6.
Trends Biochem Sci ; 43(6): 452-468, 2018 06.
Article in English | MEDLINE | ID: mdl-29673772

ABSTRACT

Gene expression in higher eukaryotes is precisely regulated in time and space through the interplay between promoters and gene-distal regulatory regions, known as enhancers. The original definition of enhancers implies the ability to activate gene expression remotely, while promoters entail the capability to locally induce gene expression. Despite the conventional distinction between them, promoters and enhancers share many genomic and epigenomic features. One intriguing finding in the gene regulation field comes from the observation that many core promoter regions display enhancer activity. Recent high-throughput reporter assays along with clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-related approaches have indicated that this phenomenon is common and might have a strong impact on our global understanding of genome organisation and gene expression regulation.


Subject(s)
Enhancer Elements, Genetic/genetics , Promoter Regions, Genetic/genetics , Animals , Gene Expression Regulation/genetics , High-Throughput Screening Assays , Humans
7.
Development ; 144(13): 2428-2444, 2017 07 01.
Article in English | MEDLINE | ID: mdl-28536097

ABSTRACT

The transcriptional pathways activated downstream of vascular endothelial growth factor (VEGF) signaling during angiogenesis remain incompletely characterized. By assessing the signals responsible for induction of the Notch ligand delta-like 4 (DLL4) in endothelial cells, we find that activation of the MAPK/ERK pathway mirrors the rapid and dynamic induction of DLL4 transcription and that this pathway is required for DLL4 expression. Furthermore, VEGF/ERK signaling induces phosphorylation and activation of the ETS transcription factor ERG, a prerequisite for DLL4 induction. Transcription of DLL4 coincides with dynamic ERG-dependent recruitment of the transcriptional co-activator p300. Genome-wide gene expression profiling identified a network of VEGF-responsive and ERG-dependent genes, and ERG chromatin immunoprecipitation (ChIP)-seq revealed the presence of conserved ERG-bound putative enhancer elements near these target genes. Functional experiments performed in vitro and in vivo confirm that this network of genes requires ERK, ERG and p300 activity. Finally, genome-editing and transgenic approaches demonstrate that a highly conserved ERG-bound enhancer located upstream of HLX (which encodes a transcription factor implicated in sprouting angiogenesis) is required for its VEGF-mediated induction. Collectively, these findings elucidate a novel transcriptional pathway contributing to VEGF-dependent angiogenesis.


Subject(s)
E1A-Associated p300 Protein/metabolism , Extracellular Signal-Regulated MAP Kinases/metabolism , Gene Expression Regulation/drug effects , Gene Regulatory Networks/drug effects , Transcription, Genetic/drug effects , Vascular Endothelial Growth Factor A/pharmacology , Adaptor Proteins, Signal Transducing , Animals , Calcium-Binding Proteins , Cattle , Enhancer Elements, Genetic/genetics , Female , Gene Expression Regulation, Developmental/drug effects , Human Umbilical Vein Endothelial Cells/drug effects , Human Umbilical Vein Endothelial Cells/metabolism , Humans , Intercellular Signaling Peptides and Proteins/metabolism , Introns/genetics , MAP Kinase Signaling System/drug effects , Male , Mice , Neovascularization, Physiologic/genetics , Transcriptional Regulator ERG/metabolism , Zebrafish/embryology
8.
Nucleic Acids Res ; 46(W1): W209-W214, 2018 07 02.
Article in English | MEDLINE | ID: mdl-29722874

ABSTRACT

RSAT (Regulatory Sequence Analysis Tools) is a suite of modular tools for the detection and the analysis of cis-regulatory elements in genome sequences. Its main applications are (i) motif discovery, including from genome-wide datasets like ChIP-seq/ATAC-seq, (ii) motif scanning, (iii) motif analysis (quality assessment, comparisons and clustering), (iv) analysis of regulatory variations, (v) comparative genomics. Six public servers jointly support 10 000 genomes from all kingdoms. Six novel or refactored programs have been added since the 2015 NAR Web Software Issue, including updated programs to analyse regulatory variants (retrieve-variation-seq, variation-scan, convert-variations), along with tools to extract sequences from a list of coordinates (retrieve-seq-bed), to select motifs from motif collections (retrieve-matrix), and to extract orthologs based on Ensembl Compara (get-orthologs-compara). Three use cases illustrate the integration of new and refactored tools to the suite. This Anniversary update gives a 20-year perspective on the software suite. RSAT is well-documented and available through Web sites, SOAP/WSDL (Simple Object Access Protocol/Web Services Description Language) web services, virtual machines and stand-alone programs at http://www.rsat.eu/.


Subject(s)
Regulatory Sequences, Nucleic Acid , Software , Genetic Variation , Genomics/history , High-Throughput Nucleotide Sequencing/history , History, 20th Century , History, 21st Century , Internet , Nucleotide Motifs , Software/history
9.
Twin Res Hum Genet ; 22(6): 611-616, 2019 12.
Article in English | MEDLINE | ID: mdl-31931895

ABSTRACT

TwinsMX is a national twin registry in Mexico recently created with institutional support from the Universidad Nacional Autónoma de México. It aims to serve as a platform to advance epidemiological and genetic research in the country and to disentangle the genetic and environmental contributions to health and disease in the admixed Mexican population. Here, we describe our recruitment and data collection strategies and discuss both the progress to date and future directions. More information about the registry is available on our website: https://twinsmxofficial.unam.mx/ (content in Spanish).


Subject(s)
Diseases in Twins/epidemiology , Diseases in Twins/genetics , Gene-Environment Interaction , Registries/statistics & numerical data , Twins, Dizygotic/genetics , Twins, Monozygotic/genetics , Adolescent , Adult , Female , Follow-Up Studies , Humans , Male , Mexico/epidemiology , Middle Aged , Patient Selection , Young Adult
10.
Genet Epidemiol ; 41(5): 455-466, 2017 07.
Article in English | MEDLINE | ID: mdl-28421636

ABSTRACT

Tissue factor pathway inhibitor (TFPI) regulates the formation of intravascular blood clots, which manifest clinically as ischemic heart disease, ischemic stroke, and venous thromboembolism (VTE). TFPI plasma levels are heritable, but the genetics underlying TFPI plasma level variability are poorly understood. Herein we report the first genome-wide association scan (GWAS) of TFPI plasma levels, conducted in 251 individuals from five extended French-Canadian Families ascertained on VTE. To improve discovery, we also applied a hypothesis-driven (HD) GWAS approach that prioritized single nucleotide polymorphisms (SNPs) in (1) hemostasis pathway genes, and (2) vascular endothelial cell (EC) regulatory regions, which are among the highest expressers of TFPI. Our GWAS identified 131 SNPs with suggestive evidence of association (P-value < 5 × 10-8 ), but no SNPs reached the genome-wide threshold for statistical significance. Hemostasis pathway genes were not enriched for TFPI plasma level associated SNPs (global hypothesis test P-value = 0.147), but EC regulatory regions contained more TFPI plasma level associated SNPs than expected by chance (global hypothesis test P-value = 0.046). We therefore stratified our genome-wide SNPs, prioritizing those in EC regulatory regions via stratified false discovery rate (sFDR) control, and reranked the SNPs by q-value. The minimum q-value was 0.27, and the top-ranked SNPs did not show association evidence in the MARTHA replication sample of 1,033 unrelated VTE cases. Although this study did not result in new loci for TFPI, our work lays out a strategy to utilize epigenomic data in prioritization schemes for future GWAS studies.


Subject(s)
Biomarkers/blood , Lipoproteins/blood , Lipoproteins/genetics , Polymorphism, Single Nucleotide/genetics , Regulatory Sequences, Nucleic Acid/genetics , Venous Thromboembolism/blood , Venous Thromboembolism/genetics , Adult , Canada , Cells, Cultured , Endothelium, Vascular/metabolism , Endothelium, Vascular/pathology , Epigenomics , Female , Genome-Wide Association Study/methods , Humans , Male , Venous Thromboembolism/diagnosis
11.
Nucleic Acids Res ; 44(D1): D133-43, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26527724

ABSTRACT

RegulonDB (http://regulondb.ccg.unam.mx) is one of the most useful and important resources on bacterial gene regulation,as it integrates the scattered scientific knowledge of the best-characterized organism, Escherichia coli K-12, in a database that organizes large amounts of data. Its electronic format enables researchers to compare their results with the legacy of previous knowledge and supports bioinformatics tools and model building. Here, we summarize our progress with RegulonDB since our last Nucleic Acids Research publication describing RegulonDB, in 2013. In addition to maintaining curation up-to-date, we report a collection of 232 interactions with small RNAs affecting 192 genes, and the complete repertoire of 189 Elementary Genetic Sensory-Response units (GENSOR units), integrating the signal, regulatory interactions, and metabolic pathways they govern. These additions represent major progress to a higher level of understanding of regulated processes. We have updated the computationally predicted transcription factors, which total 304 (184 with experimental evidence and 120 from computational predictions); we updated our position-weight matrices and have included tools for clustering them in evolutionary families. We describe our semiautomatic strategy to accelerate curation, including datasets from high-throughput experiments, a novel coexpression distance to search for 'neighborhood' genes to known operons and regulons, and computational developments.


Subject(s)
Databases, Genetic , Escherichia coli K12/genetics , Gene Expression Regulation, Bacterial , Regulon , Cluster Analysis , Escherichia coli K12/metabolism , Gene Regulatory Networks , Operon , Position-Specific Scoring Matrices , RNA, Small Untranslated/metabolism , Transcription Factors/classification
12.
Nucleic Acids Res ; 43(W1): W50-6, 2015 Jul 01.
Article in English | MEDLINE | ID: mdl-25904632

ABSTRACT

RSAT (Regulatory Sequence Analysis Tools) is a modular software suite for the analysis of cis-regulatory elements in genome sequences. Its main applications are (i) motif discovery, appropriate to genome-wide data sets like ChIP-seq, (ii) transcription factor binding motif analysis (quality assessment, comparisons and clustering), (iii) comparative genomics and (iv) analysis of regulatory variations. Nine new programs have been added to the 43 described in the 2011 NAR Web Software Issue, including a tool to extract sequences from a list of coordinates (fetch-sequences from UCSC), novel programs dedicated to the analysis of regulatory variants from GWAS or population genomics (retrieve-variation-seq and variation-scan), a program to cluster motifs and visualize the similarities as trees (matrix-clustering). To deal with the drastic increase of sequenced genomes, RSAT public sites have been reorganized into taxon-specific servers. The suite is well-documented with tutorials and published protocols. The software suite is available through Web sites, SOAP/WSDL Web services, virtual machines and stand-alone programs at http://www.rsat.eu/.


Subject(s)
Regulatory Elements, Transcriptional , Software , Binding Sites , Genetic Variation , Genomics , Humans , Internet , Nucleotide Motifs , Transcription Factors/metabolism
13.
Nucleic Acids Res ; 42(20): e156, 2014 Nov 10.
Article in English | MEDLINE | ID: mdl-25249628

ABSTRACT

Understanding the role of a given transcription factor (TF) in regulating gene expression requires precise mapping of its binding sites in the genome. Chromatin immunoprecipitation-exo, an emerging technique using λ exonuclease to digest TF unbound DNA after ChIP, is designed to reveal transcription factor binding site (TFBS) boundaries with near-single nucleotide resolution. Although ChIP-exo promises deeper insights into transcription regulation, no dedicated bioinformatics tool exists to leverage its advantages. Most ChIP-seq and ChIP-chip analytic methods are not tailored for ChIP-exo, and thus cannot take full advantage of high-resolution ChIP-exo data. Here we describe a novel analysis framework, termed MACE (model-based analysis of ChIP-exo) dedicated to ChIP-exo data analysis. The MACE workflow consists of four steps: (i) sequencing data normalization and bias correction; (ii) signal consolidation and noise reduction; (iii) single-nucleotide resolution border peak detection using the Chebyshev Inequality and (iv) border matching using the Gale-Shapley stable matching algorithm. When applied to published human CTCF, yeast Reb1 and our own mouse ONECUT1/HNF6 ChIP-exo data, MACE is able to define TFBSs with high sensitivity, specificity and spatial resolution, as evidenced by multiple criteria including motif enrichment, sequence conservation, direct sequence pileup, nucleosome positioning and open chromatin states. In addition, we show that the fundamental advance of MACE is the identification of two boundaries of a TFBS with high resolution, whereas other methods only report a single location of the same event. The two boundaries help elucidate the in vivo binding structure of a given TF, e.g. whether the TF may bind as dimers or in a complex with other co-factors.


Subject(s)
Chromatin Immunoprecipitation/methods , Sequence Analysis, DNA/methods , Transcription Factors/metabolism , Algorithms , Animals , Binding Sites , CCCTC-Binding Factor , Computer Simulation , DNA-Binding Proteins/metabolism , Exodeoxyribonucleases , Genome , Hepatocyte Nuclear Factor 6/metabolism , Humans , Male , Mice , Mice, Inbred C57BL , Repressor Proteins/metabolism , Saccharomyces cerevisiae Proteins/metabolism
14.
Nucleic Acids Res ; 41(Database issue): D203-13, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23203884

ABSTRACT

This article summarizes our progress with RegulonDB (http://regulondb.ccg.unam.mx/) during the past 2 years. We have kept up-to-date the knowledge from the published literature regarding transcriptional regulation in Escherichia coli K-12. We have maintained and expanded our curation efforts to improve the breadth and quality of the encoded experimental knowledge, and we have implemented criteria for the quality of our computational predictions. Regulatory phrases now provide high-level descriptions of regulatory regions. We expanded the assignment of quality to various sources of evidence, particularly for knowledge generated through high-throughput (HT) technology. Based on our analysis of most relevant methods, we defined rules for determining the quality of evidence when multiple independent sources support an entry. With this latest release of RegulonDB, we present a new highly reliable larger collection of transcription start sites, a result of our experimental HT genome-wide efforts. These improvements, together with several novel enhancements (the tracks display, uploading format and curational guidelines), address the challenges of incorporating HT-generated knowledge into RegulonDB. Information on the evolutionary conservation of regulatory elements is also available now. Altogether, RegulonDB version 8.0 is a much better home for integrating knowledge on gene regulation from the sources of information currently available.


Subject(s)
Databases, Genetic , Escherichia coli K12/genetics , Gene Expression Regulation, Bacterial , Regulatory Elements, Transcriptional , Transcription, Genetic , Bacterial Proteins/metabolism , Databases, Genetic/standards , Evolution, Molecular , Genomics , Internet , Promoter Regions, Genetic , Regulon , Repressor Proteins/metabolism , Sequence Analysis, RNA , Transcription Factors/metabolism , Transcription Initiation Site
15.
J Bacteriol ; 196(2): 325-36, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24187088

ABSTRACT

The small RNAs CsrB and CsrC of Salmonella indirectly control the expression of numerous genes encoding widespread cellular functions, including virulence. The expression of csrB and csrC genes, which are located in different chromosomal regions, is coordinated by positive transcriptional control mediated by the two-component regulatory system BarA/SirA. Here, we identified by computational analysis an 18-bp inverted repeat (IR) sequence located far upstream from the promoter of Salmonella enterica serovar Typhimurium csrB and csrC genes. Deletion analysis and site-directed mutagenesis of the csrB and csrC regulatory regions revealed that this IR sequence is required for transcriptional activation of both genes. Protein-DNA and protein-protein interaction assays showed that the response regulator SirA specifically binds to the IR sequence and provide evidence that SirA acts as a dimer. Interestingly, whereas the IR sequence was essential for the SirA-mediated expression of csrB, our results revealed that SirA controls the expression of csrC not only by binding to the IR sequence but also by an indirect mode involving the Csr system. Additional computational, biochemical, and genetic analyses demonstrated that the integration host factor (IHF) global regulator positively controls the expression of csrB, but not of csrC, by interacting with a sequence located between the promoter and the SirA-binding site. These findings contribute to the better understanding of the regulatory mechanism controlling the expression of CsrB and CsrC.


Subject(s)
Gene Expression Regulation, Bacterial , Genes, Bacterial , RNA, Small Untranslated/biosynthesis , Regulatory Elements, Transcriptional , Salmonella typhimurium/genetics , Bacterial Proteins/metabolism , Computational Biology , DNA Mutational Analysis , DNA, Bacterial/genetics , DNA, Bacterial/metabolism , Mutagenesis, Site-Directed , Protein Binding , Protein Multimerization , RNA, Small Untranslated/genetics , Sequence Deletion , Trans-Activators/metabolism
16.
J Parkinsons Dis ; 14(3): 483-493, 2024.
Article in English | MEDLINE | ID: mdl-38457145

ABSTRACT

Background: Depression is a common symptom in Parkinson's disease (PD), resulting from underlying neuropathological processes and psychological factors. However, the extent to which shared genetic risk factors contribute to the relationship between depression and PD is poorly understood. Objective: To examine the effects of common genetic variants influencing the etiology of PD and depression risk at the genome-wide and local genomic regional level. Methods: We comprehensively investigated the genetic relationship between PD and depression using genome-wide association studies data. First, we estimated the genetic correlation at the genome-wide level using linkage-disequilibrium score regression, followed by local genetic correlation analysis using the GWAS-pairwise method and functional annotation to identify genes that may jointly influence the risk for both traits. Also, we performed Latent Causal Variable, Latent Heritable Confounder Mendelian Randomization, and traditional Mendelian Randomization analyses to investigate the potential causal relationship. Results: Although the genetic correlation between PD and depression was not statistically significant at the genome-wide level, GWAS-pairwise analyses identified 16 genomic segments associated with PD and depression, implicating nine genes. Further analyses revealed distinct patterns within individual genes, suggesting an intricate pattern. These genes involve various biological processes, including neurotransmitter regulation, senescence, and nucleo-cytoplasmic transport mechanisms. We did not observe genetic evidence of causality between PD and depression. Conclusions: Our findings did not support a genome-wide genetic correlation or a causal association between both conditions. However, we identified genomic segments but identified genomic segments linked to distinct biological pathways influencing their etiology.Further research is needed to understand their functional consequences.


Subject(s)
Depression , Genome-Wide Association Study , Mendelian Randomization Analysis , Parkinson Disease , Humans , Parkinson Disease/genetics , Depression/genetics , Depression/etiology , Genetic Predisposition to Disease , Linkage Disequilibrium , Polymorphism, Single Nucleotide
17.
J Leukoc Biol ; 116(1): 64-76, 2024 Jun 28.
Article in English | MEDLINE | ID: mdl-38146769

ABSTRACT

The adaptive immune response is coordinated by CD4+ T cells, which determine the type and strength of the immune response and the effector cells involved. It has been reported that CD4+ T cells are less responsive in neonates, leading to low activation of the cellular response and poor antibody production by B cells. This low response is essential for the tolerant window that favors birth transition from the sterile environment in the womb to the outside world but leaves neonates vulnerable to infection, which is still an important health issue. Neonates have a high morbidity and mortality rate due to infections, and the molecular reasons are still understudied. We asked whether the neonatal naive CD4+ T cells have a genomic program that predisposes them to a low response. Therefore, we evaluated the transcriptome and epigenome of human neonatal and adult naive CD4+ T cells. Our results point to a gene expression profile forming a distinct regulatory network in neonatal cells, which favors proliferation and a low T-cell response. Such expression profile is supported by a characteristic epigenetic landscape of neonatal CD4+ T cells, which correlates with the characteristic transcriptome of the neonatal cells. These results were confirmed by experiments showing a low response to activation signals, higher proliferation, and lower expression of cytokines of neonatal CD4+ T cells as compared to adult cells. Understanding this network could lead to novel vaccine formulations and better deal with life-threatening diseases during this highly vulnerable period of our lives.


Subject(s)
CD4-Positive T-Lymphocytes , Cell Proliferation , Epigenome , Receptors, Antigen, T-Cell , Transcriptome , Humans , CD4-Positive T-Lymphocytes/immunology , Infant, Newborn , Receptors, Antigen, T-Cell/metabolism , Receptors, Antigen, T-Cell/immunology , Receptors, Antigen, T-Cell/genetics , Adult , Lymphocyte Activation/immunology , Female , Epigenesis, Genetic
18.
Front Digit Health ; 6: 1344103, 2024.
Article in English | MEDLINE | ID: mdl-38895515

ABSTRACT

Objective: To introduce MexOMICS, a Mexican Consortium focused on establishing electronic databases to collect, cross-reference, and share health-related and omics data on the Mexican population. Methods: Since 2019, the MexOMICS Consortium has established three electronic-based registries: the Mexican Twin Registry (TwinsMX), Mexican Lupus Registry (LupusRGMX), and the Mexican Parkinson's Research Network (MEX-PD), designed and implemented using the Research Electronic Data Capture web-based application. Participants were enrolled through voluntary participation and on-site engagement with medical specialists. We also acquired DNA samples and Magnetic Resonance Imaging scans in subsets of participants. Results: The registries have successfully enrolled a large number of participants from a variety of regions within Mexico: TwinsMX (n = 2,915), LupusRGMX (n = 1,761) and MEX-PD (n = 750). In addition to sociodemographic, psychosocial, and clinical data, MexOMICS has collected DNA samples to study the genetic biomarkers across the three registries. Cognitive function has been assessed with the Montreal Cognitive Assessment in a subset of 376 MEX-PD participants. Furthermore, a subset of 267 twins have participated in cognitive evaluations with the Creyos platform and in MRI sessions acquiring structural, functional, and spectroscopy brain imaging; comparable evaluations are planned for LupusRGMX and MEX-PD. Conclusions: The MexOMICS registries offer a valuable repository of information concerning the potential interplay of genetic and environmental factors in health conditions among the Mexican population.

19.
Nucleic Acids Res ; 39(3): 808-24, 2011 Feb.
Article in English | MEDLINE | ID: mdl-20923783

ABSTRACT

Position-specific scoring matrices (PSSMs) are routinely used to predict transcription factor (TF)-binding sites in genome sequences. However, their reliability to predict novel binding sites can be far from optimum, due to the use of a small number of training sites or the inappropriate choice of parameters when building the matrix or when scanning sequences with it. Measures of matrix quality such as E-value and information content rely on theoretical models, and may fail in the context of full genome sequences. We propose a method, implemented in the program 'matrix-quality', that combines theoretical and empirical score distributions to assess reliability of PSSMs for predicting TF-binding sites. We applied 'matrix-quality' to estimate the predictive capacity of matrices for bacterial, yeast and mouse TFs. The evaluation of matrices from RegulonDB revealed some poorly predictive motifs, and allowed us to quantify the improvements obtained by applying multi-genome motif discovery. Interestingly, the method reveals differences between global and specific regulators. It also highlights the enrichment of binding sites in sequence sets obtained from high-throughput ChIP-chip (bacterial and yeast TFs), and ChIP-seq and experiments (mouse TFs). The method presented here has many applications, including: selecting reliable motifs before scanning sequences; improving motif collections in TFs databases; evaluating motifs discovered using high-throughput data sets.


Subject(s)
Position-Specific Scoring Matrices , Promoter Regions, Genetic , Sequence Analysis, DNA , Transcription Factors/metabolism , Animals , Bacterial Proteins/metabolism , Binding Sites , Chromatin Immunoprecipitation , Genomics , Mice , Oligonucleotide Array Sequence Analysis , ROC Curve , Repressor Proteins/metabolism , Serine Endopeptidases/metabolism , Software
20.
Nucleic Acids Res ; 39(Web Server issue): W86-91, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21715389

ABSTRACT

RSAT (Regulatory Sequence Analysis Tools) comprises a wide collection of modular tools for the detection of cis-regulatory elements in genome sequences. Thirteen new programs have been added to the 30 described in the 2008 NAR Web Software Issue, including an automated sequence retrieval from EnsEMBL (retrieve-ensembl-seq), two novel motif discovery algorithms (oligo-diff and info-gibbs), a 100-times faster version of matrix-scan enabling the scanning of genome-scale sequence sets, and a series of facilities for random model generation and statistical evaluation (random-genome-fragments, random-motifs, random-sites, implant-sites, sequence-probability, permute-matrix). Our most recent work also focused on motif comparison (compare-matrices) and evaluation of motif quality (matrix-quality) by combining theoretical and empirical measures to assess the predictive capability of position-specific scoring matrices. To process large collections of peak sequences obtained from ChIP-seq or related technologies, RSAT provides a new program (peak-motifs) that combines several efficient motif discovery algorithms to predict transcription factor binding motifs, match them against motif databases and predict their binding sites. Availability (web site, stand-alone programs and SOAP/WSDL (Simple Object Access Protocol/Web Services Description Language) web services): http://rsat.ulb.ac.be/rsat/.


Subject(s)
Regulatory Sequences, Nucleic Acid , Software , Binding Sites , Genomics , Regulatory Elements, Transcriptional , Transcription Factors/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL