Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
1.
Proc Natl Acad Sci U S A ; 116(11): 5021-5026, 2019 03 12.
Article in English | MEDLINE | ID: mdl-30796191

ABSTRACT

Major histocompatibility complex (MHC) genes encode proteins that initiate adaptive immune responses through the presentation of foreign antigens to T cells. The high polymorphism found at these genes, thought to be promoted and maintained by pathogen-mediated selection, contrasts with the limited number of MHC loci found in most vertebrates. Although expressing many diverse MHC genes should broaden the range of detectable pathogens, it has been hypothesized to also cause deletion of larger fractions of self-reactive T cells, leading to a detrimental reduction of the T cell receptor (TCR) repertoire. However, a key prediction of this TCR depletion hypothesis, that the TCR repertoire should be inversely related to the individual MHC diversity, has never been tested. Here, using high-throughput sequencing and advanced sequencing error correction, we provide evidence of such an association in a rodent species with high interindividual variation in the number of expressed MHC molecules, the bank vole (Myodes glareolus). Higher individual diversity of MHC class I, but not class II, was associated with smaller TCR repertoires. Our results thus provide partial support for the TCR depletion model, while also highlighting the complex, potentially MHC class-specific mechanisms by which autoreactivity may trade off against evolutionary expansion of the MHC gene family.


Subject(s)
Arvicolinae/genetics , Arvicolinae/immunology , Genetic Variation , Histocompatibility Antigens Class I/genetics , Receptors, Antigen, T-Cell/metabolism , Animals , Histocompatibility Antigens Class II/genetics , Linear Models
2.
Mol Cell Proteomics ; 18(6): 1085-1095, 2019 06.
Article in English | MEDLINE | ID: mdl-31154437

ABSTRACT

All but thirteen mammalian mitochondrial proteins are encoded by the nuclear genome, translated in the cytosol and then imported into the mitochondria. For a significant proportion of the mitochondrial proteins, import is coupled with the cleavage of a presequence called the transit peptide, and the formation of a new N-terminus. Determination of the neo N-termini has been investigated by proteomic approaches in several systems, but generally in a static way to compile as many N-termini as possible. In the present study, we have investigated how the mitochondrial proteome and N-terminome react to chemical stimuli that alter mitochondrial metabolism, namely zinc ions and rapamycin. To this end, we have used a strategy that analyzes both internal and N-terminal peptides in a single run, the dN-TOP approach. We used these two very different stressors to sort out what could be a generic response to stress and what is specific to each of these stressors. Rapamycin and zinc induced different changes in the mitochondrial proteome. However, convergent changes to key mitochondrial enzymatic activities such as pyruvate dehydrogenase, succinate dehydrogenase and citrate synthase were observed for both treatments. Other convergent changes were seen in components of the N-terminal processing system and mitochondrial proteases. Investigations into the generation of neo-N-termini in mitochondria showed that the processing system is robust, as indicated by the lack of change in neo N-termini under the conditions tested. Detailed analysis of the data revealed that zinc caused a slight reduction in the efficiency of the N-terminal trimming system and that both treatments increased the degradation of mitochondrial proteins. In conclusion, the use of this combined strategy allowed a detailed analysis of the dynamics of the mitochondrial N-terminome in response to treatments which impact the mitochondria.


Subject(s)
Mitochondria/metabolism , Proteomics/methods , Sirolimus/pharmacology , Zinc/pharmacology , Cluster Analysis , Humans , Mitochondria/drug effects , Mitochondria/enzymology , Mitochondrial Proteins/metabolism , Proteome/metabolism , U937 Cells
3.
Mol Ecol ; 27(11): 2594-2603, 2018 06.
Article in English | MEDLINE | ID: mdl-29654666

ABSTRACT

Pathogens are one of the main forces driving the evolution and maintenance of the highly polymorphic genes of the vertebrate major histocompatibility complex (MHC). Although MHC proteins are crucial in pathogen recognition, it is still poorly understood how pathogen-mediated selection promotes and maintains MHC diversity, and especially so in host species with highly duplicated MHC genes. Sedge warblers (Acrocephalus schoenobaenus) have highly duplicated MHC genes, and using data from high-throughput MHC genotyping, we were able to investigate to what extent avian malaria parasites explain temporal MHC class I supertype fluctuations in a long-term study population. We investigated infection status and infection intensities of two different strains of Haemoproteus, that is avian malaria parasites that are known to have significant fitness consequences in sedge warblers. We found that prevalence of avian malaria in carriers of specific MHC class I supertypes was a significant predictor of their frequency changes between years. This finding suggests that avian malaria infections partly drive the temporal fluctuations of the MHC class I supertypes. Furthermore, we found that individuals with a large number of different supertypes had higher resistance to avian malaria, but there was no evidence for an optimal MHC class I diversity. Thus, the two studied malaria parasite strains appear to select for a high MHC class I supertype diversity. Such selection may explain the maintenance of the extremely high number of MHC class I gene copies in sedge warblers and possibly also in other passerines where avian malaria is a common disease.


Subject(s)
Haemosporida/genetics , Major Histocompatibility Complex/genetics , Malaria, Avian/parasitology , Parasites/genetics , Songbirds/parasitology , Alleles , Animals , Genetic Variation/genetics , Selection, Genetic/genetics
4.
BMC Evol Biol ; 17(1): 159, 2017 Jul 05.
Article in English | MEDLINE | ID: mdl-28679358

ABSTRACT

BACKGROUND: Recent work suggests that gene duplications may play an important role in the evolution of immunity genes. Passerine birds, and in particular Sylvioidea warblers, have highly duplicated major histocompatibility complex (MHC) genes, which are key in immunity, compared to other vertebrates. However, reasons for this high MHC gene copy number are yet unclear. High-throughput sequencing (HTS) allows MHC genotyping even in individuals with extremely duplicated genes. This HTS data can reveal evidence of selection, which may help to unravel the putative functions of different gene copies, i.e. neofunctionalization. We performed exhaustive genotyping of MHC class I in a Sylvioidea warbler, the sedge warbler, Acrocephalus schoenobaenus, using the Illumina MiSeq technique on individuals from a wild study population. RESULTS: The MHC diversity in 863 genotyped individuals by far exceeds that of any other bird species described to date. A single individual could carry up to 65 different alleles, a large proportion of which are expressed (transcribed). The MHC alleles were of three different lengths differing in evidence of selection, diversity and divergence within our study population. Alleles without any deletions and alleles containing a 6 bp deletion showed characteristics of classical MHC genes, with evidence of multiple sites subject to positive selection and high sequence divergence. In contrast, alleles containing a 3 bp deletion had no sites subject to positive selection and had low divergence. CONCLUSIONS: Our results suggest that sedge warbler MHC alleles that either have no deletion, or contain a 6 bp deletion, encode classical antigen presenting MHC molecules. In contrast, MHC alleles containing a 3 bp deletion may encode molecules with a different function. This study demonstrates that highly duplicated MHC genes can be characterised with HTS and that selection patterns can be useful for revealing neofunctionalization. Importantly, our results highlight the need to consider the putative function of different MHC genes in future studies of MHC in relation to disease resistance and fitness.


Subject(s)
Evolution, Molecular , Genes, MHC Class I , Songbirds/genetics , Amino Acid Sequence , Animals , DNA, Complementary , Exons , Gene Duplication , Phylogeny , Selection, Genetic , Sequence Alignment
5.
Bioinformatics ; 30(2): 258-65, 2014 Jan 15.
Article in English | MEDLINE | ID: mdl-24234003

ABSTRACT

MOTIVATION: Traditional and high-throughput techniques for determining transcription factor (TF) binding specificities are generating large volumes of data of uneven quality, which are scattered across individual databases. RESULTS: FootprintDB integrates some of the most comprehensive freely available libraries of curated DNA binding sites and systematically annotates the binding interfaces of the corresponding TFs. The first release contains 2422 unique TF sequences, 10 112 DNA binding sites and 3662 DNA motifs. A survey of the included data sources, organisms and TF families was performed together with proprietary database TRANSFAC, finding that footprintDB has a similar coverage of multicellular organisms, while also containing bacterial regulatory data. A search engine has been designed that drives the prediction of DNA motifs for input TFs, or conversely of TF sequences that might recognize input regulatory sequences, by comparison with database entries. Such predictions can also be extended to a single proteome chosen by the user, and results are ranked in terms of interface similarity. Benchmark experiments with bacterial, plant and human data were performed to measure the predictive power of footprintDB searches, which were able to correctly recover 10, 55 and 90% of the tested sequences, respectively. Correctly predicted TFs had a higher interface similarity than the average, confirming its diagnostic value. AVAILABILITY AND IMPLEMENTATION: Web site implemented in PHP,Perl, MySQL and Apache. Freely available from http://floresta.eead.csic.es/footprintdb.


Subject(s)
Databases, Genetic , Gene Expression Regulation , Nucleotide Motifs/genetics , Regulatory Sequences, Nucleic Acid/genetics , Software , Transcription Factors/metabolism , Animals , Arabidopsis/genetics , Bacillus subtilis/genetics , Binding Sites , Drosophila melanogaster/genetics , Escherichia coli K12/genetics , Humans , Molecular Sequence Annotation , Protein Binding , Proteome/analysis , Transcription Factors/genetics
6.
Nucleic Acids Res ; 41(3): 1438-49, 2013 Feb 01.
Article in English | MEDLINE | ID: mdl-23268451

ABSTRACT

Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein-DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein-DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments.


Subject(s)
DNA/chemistry , Regulatory Elements, Transcriptional , Sequence Alignment/methods , Transcription Factors/chemistry , Binding Sites , DNA/metabolism , Nucleotide Motifs , Position-Specific Scoring Matrices , Transcription Factors/metabolism
7.
BMC Genomics ; 15: 317, 2014 Apr 28.
Article in English | MEDLINE | ID: mdl-24773781

ABSTRACT

BACKGROUND: Using motif detection programs it is fairly straightforward to identify conserved cis-sequences in promoters of co-regulated genes. In contrast, the identification of the transcription factors (TFs) interacting with these cis-sequences is much more elaborate. To facilitate this, we explore the possibility of using several bioinformatic and experimental approaches for TF identification. This starts with the selection of co-regulated gene sets and leads first to the prediction and then to the experimental validation of TFs interacting with cis-sequences conserved in the promoters of these co-regulated genes. RESULTS: Using the PathoPlant database, 32 up-regulated gene groups were identified with microarray data for drought-responsive gene expression from Arabidopsis thaliana. Application of the binding site estimation suite of tools (BEST) discovered 179 conserved sequence motifs within the corresponding promoters. Using the STAMP web-server, 49 sequence motifs were classified into 7 motif families for which similarities with known cis-regulatory sequences were identified. All motifs were subjected to a footprintDB analysis to predict interacting DNA binding domains from plant TF families. Predictions were confirmed by using a yeast-one-hybrid approach to select interacting TFs belonging to the predicted TF families. TF-DNA interactions were further experimentally validated in yeast and with a Physcomitrella patens transient expression system, leading to the discovery of several novel TF-DNA interactions. CONCLUSIONS: The present work demonstrates the successful integration of several bioinformatic resources with experimental approaches to predict and validate TFs interacting with conserved sequence motifs in co-regulated genes.


Subject(s)
Computational Biology , Gene Expression Regulation, Plant , Transcription Factors/metabolism , Arabidopsis/genetics , Arabidopsis/physiology , Droughts , Genes, Plant
8.
Methods Mol Biol ; 2809: 37-66, 2024.
Article in English | MEDLINE | ID: mdl-38907889

ABSTRACT

AmpliSAS and AmpliHLA are tools for automatic genotyping of MHC genes from high-throughput sequencing data. AmpliSAS is designed specifically to analyze amplicon sequencing data from non-model species and it is able to perform de novo genotyping without any previous knowledge of the reference alleles. AmpliHLA is a human specific version; it performs HLA typing by comparing sequenced variants against human reference alleles from the IMGT/HLA database. Both tools are available in AmpliSAT web-server as well as scripts for local/server installation. Here we describe the installation and deployment of AmpliSAS and AmpliHLA Perl scripts and dependencies on a local or a server computer. We will show how to run them in the command line using as examples four genotyping protocols: the first two use amplicon sequencing data to genotype the MHC genes of a passerine bird and human respectively; the third and fourth present the HLA typing of a human cell line starting from RNA and exome sequencing data respectively.


Subject(s)
High-Throughput Nucleotide Sequencing , Histocompatibility Testing , Software , Humans , High-Throughput Nucleotide Sequencing/methods , Histocompatibility Testing/methods , Animals , Major Histocompatibility Complex/genetics , Alleles , Genotyping Techniques/methods , Internet , Computational Biology/methods , Genotype , HLA Antigens/genetics
9.
Plant Mol Biol ; 82(4-5): 439-55, 2013 Jul.
Article in English | MEDLINE | ID: mdl-23703395

ABSTRACT

High salinity causes remarkable losses in rice productivity worldwide mainly because it inhibits growth and reduces grain yield. To cope with environmental changes, plants evolved several adaptive mechanisms, which involve the regulation of many stress-responsive genes. Among these, we have chosen OsRMC to study its transcriptional regulation in rice seedlings subjected to high salinity. Its transcription was highly induced by salt treatment and showed a stress-dose-dependent pattern. OsRMC encodes a receptor-like kinase described as a negative regulator of salt stress responses in rice. To investigate how OsRMC is regulated in response to high salinity, a salt-induced rice cDNA expression library was constructed and subsequently screened using the yeast one-hybrid system and the OsRMC promoter as bait. Thereby, two transcription factors (TFs), OsEREBP1 and OsEREBP2, belonging to the AP2/ERF family were identified. Both TFs were shown to bind to the same GCC-like DNA motif in OsRMC promoter and to negatively regulate its gene expression. The identified TFs were characterized regarding their gene expression under different abiotic stress conditions. This study revealed that OsEREBP1 transcript level is not significantly affected by salt, ABA or severe cold (5 °C) and is only slightly regulated by drought and moderate cold. On the other hand, the OsEREBP2 transcript level increased after cold, ABA, drought and high salinity treatments, indicating that OsEREBP2 may play a central role mediating the response to different abiotic stresses. Gene expression analysis in rice varieties with contrasting salt tolerance further suggests that OsEREBP2 is involved in salt stress response in rice.


Subject(s)
Oryza/metabolism , Transcription Factor AP-2/metabolism , Abscisic Acid/pharmacology , Droughts , Gene Expression Regulation, Plant/drug effects , Gene Expression Regulation, Plant/genetics , Oryza/drug effects , Oryza/genetics , Plant Proteins/genetics , Plant Proteins/metabolism , Sodium Chloride/pharmacology , Transcription Factor AP-2/genetics
10.
Immunol Cell Biol ; 88(5): 545-54, 2010 Jul.
Article in English | MEDLINE | ID: mdl-20125115

ABSTRACT

Induction of reactive oxygen species (ROS) is a hallmark of granzyme B (gzmB)-mediated pro-apoptotic processes and target cell death. However, it is unclear to what extent the generated ROS derive from mitochondrial and/or extra-mitochondrial sources. To clarify this point, we have produced a mutant EL4 cell line, termed EL4-rho(0), which lacks mitochondrial DNA, associated with a decreased mitochondrial membrane potential and a defective ROS production through the electron transport chain of oxidative phosphorylation. When incubated with either recombinant gzmB plus streptolysin or ex vivo gzmB(+) cytotoxic T cells, EL4-rho(0) cells showed phosphatydylserine translocation, caspase 3 activation, Bak conformational change, cytochrome c release and apoptotic morphology comparable to EL4 cells. Moreover, EL4-rho(0) cells produced ROS at levels similar to EL4 under these conditions. GzmB-mediated ROS production was almost totally abolished in both cell lines by the pan-caspase inhibitor, Z-VAD-fmk. However, addition of apocynin, a specific inhibitor of nicotinamide adenine dinucleotide phosphate (NADPH) oxidases, led to a significant reduction of ROS production and cell death only in EL4-rho(0) but not EL4 cells. These data suggest that gzmB-induced cell death is accompanied by a caspase-dependent pathway of extra-mitochondrial ROS production, most probably through activation of NADPH oxidase.


Subject(s)
Caspases/immunology , Granzymes/immunology , Reactive Oxygen Species/immunology , Signal Transduction/immunology , T-Lymphocytes, Cytotoxic/immunology , Animals , Apoptosis/immunology , Caspases/metabolism , Cell Line , Enzyme Activation/immunology , Granzymes/metabolism , Humans , Immunoblotting , Mice , Mice, Inbred C57BL , Mitochondria/immunology , Mitochondria/metabolism , NADPH Oxidases/immunology , NADPH Oxidases/metabolism , Reactive Oxygen Species/metabolism , Reverse Transcriptase Polymerase Chain Reaction , T-Lymphocytes, Cytotoxic/metabolism
11.
Sci Rep ; 8(1): 11613, 2018 08 02.
Article in English | MEDLINE | ID: mdl-30072736

ABSTRACT

In recent years, immune repertoire profiling with high-throughput sequencing (HTS) has advanced our understanding of adaptive immunity. However, fast progress in the field applied mostly to human and mouse research, with only few studies devoted to other model vertebrates. We present the first in-depth characterization of the T-cell receptor (TCR) repertoire in a non-model mammal (bank vole, Myodes glareolus), widely used in ecological and evolutionary research. We used RNA from spleens, 5'RACE and HTS to describe V and J segments of TCRß, qualitatively characterize preferential V-J segment usage and CDR3 length distribution. Overall orthology to murine genes was preserved, with 11 J and 37 V genes found in voles (although 3 V genes lacked a close orthologue). Further, we implemented unique molecular identifiers for quantitative analysis of CDR3 repertoire with stringent error correction. A conservative, lower bound estimation of the TCRß repertoire was similar to that found for mice (1.7-2.3 × 105 clonotypes). We hope that by providing an easy-to-follow molecular protocol and on-line bioinformatics tools that do not require reference sequences (AmpliTCR and AmpliCDR3), we will encourage HTS immune repertoire profiling in other non-model vertebrates, thus opening new research avenues in e.g. comparative immunology, ecology and evolutionary biology.


Subject(s)
Arvicolinae , Complementarity Determining Regions , High-Throughput Nucleotide Sequencing , Animals , Arvicolinae/genetics , Arvicolinae/immunology , Complementarity Determining Regions/genetics , Complementarity Determining Regions/immunology , Mice , Receptors, Antigen, T-Cell, alpha-beta/genetics , Receptors, Antigen, T-Cell, alpha-beta/immunology
12.
Methods Mol Biol ; 1802: 249-273, 2018.
Article in English | MEDLINE | ID: mdl-29858815

ABSTRACT

AmpliSAS and AmpliHLA are web server tools for automatic genotyping of MHC genes from high-throughput sequencing data. AmpliSAS is designed specifically to analyze amplicon sequencing data from non-model species and it is able to perform de-novo genotyping without any previous knowledge of the reference alleles. AmpliHLA is a human-specific version, it performs HLA typing by comparing sequenced variants against human reference alleles from the IMGT/HLA database. Here we describe four genotyping protocols: the first two use amplicon sequencing data to genotype the MHC genes of a passerine bird and human respectively; the third and fourth present the HLA typing of a human cell line starting from RNA and exome sequencing data respectively.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Histocompatibility Testing/methods , Internet , Major Histocompatibility Complex/genetics , Software , Alleles , Animals , Base Sequence , Cell Line , Exome/genetics , Genotyping Techniques , HLA Antigens/genetics , Humans , Passeriformes/genetics
13.
Mol Ecol Resour ; 17(4): 642-655, 2017 Jul.
Article in English | MEDLINE | ID: mdl-27762049

ABSTRACT

Characterization of highly duplicated genes, such as genes of the major histocompatibility complex (MHC), where multiple loci often co-amplify, has until recently been hindered by insufficient read depths per amplicon. Here, we used ultra-deep Illumina sequencing to resolve genotypes at exon 3 of MHC class I genes in the sedge warbler (Acrocephalus schoenobaenus). We sequenced 24 individuals in two replicates and used this data, as well as a simulated data set, to test the effect of amplicon coverage (range: 500-20 000 reads per amplicon) on the repeatability of genotyping using four different genotyping approaches. A third replicate employed unique barcoding to assess the extent of tag jumping, that is swapping of individual tag identifiers, which may confound genotyping. The reliability of MHC genotyping increased with coverage and approached or exceeded 90% within-method repeatability of allele calling at coverages of >5000 reads per amplicon. We found generally high agreement between genotyping methods, especially at high coverages. High reliability of the tested genotyping approaches was further supported by our analysis of the simulated data set, although the genotyping approach relying primarily on replication of variants in independent amplicons proved sensitive to repeatable errors. According to the most repeatable genotyping method, the number of co-amplifying variants per individual ranged from 19 to 42. Tag jumping was detectable, but at such low frequencies that it did not affect the reliability of genotyping. We thus demonstrate that gene families with many co-amplifying genes can be reliably genotyped using HTS, provided that there is sufficient per amplicon coverage.


Subject(s)
Genes, MHC Class I , Passeriformes/genetics , Animals , Genotyping Techniques , High-Throughput Nucleotide Sequencing , Reproducibility of Results , Sequence Analysis, DNA
14.
Methods Mol Biol ; 1482: 259-77, 2016.
Article in English | MEDLINE | ID: mdl-27557773

ABSTRACT

FootprintDB is a database and search engine that compiles regulatory sequences from open access libraries of curated DNA cis-elements and motifs, and their associated transcription factors (TFs). It systematically annotates the binding interfaces of the TFs by exploiting protein-DNA complexes deposited in the Protein Data Bank. Each entry in footprintDB is thus a DNA motif linked to the protein sequence of the TF(s) known to recognize it, and in most cases, the set of predicted interface residues involved in specific recognition. This chapter explains step-by-step how to search for DNA motifs and protein sequences in footprintDB and how to focus the search to a particular organism. Two real-world examples are shown where this software was used to analyze transcriptional regulation in plants. Results are described with the aim of guiding users on their interpretation, and special attention is given to the choices users might face when performing similar analyses.


Subject(s)
Binding Sites/genetics , Molecular Biology/methods , Promoter Regions, Genetic , Software , Arabidopsis/genetics , Databases, Protein , Gene Expression Regulation/genetics , Nucleotide Motifs/genetics , Regulatory Sequences, Nucleic Acid/genetics , Transcription Factors/genetics
15.
Mol Ecol Resour ; 16(2): 498-510, 2016 Mar.
Article in English | MEDLINE | ID: mdl-26257385

ABSTRACT

Next-generation sequencing (NGS) technologies are revolutionizing the fields of biology and medicine as powerful tools for amplicon sequencing (AS). Using combinations of primers and barcodes, it is possible to sequence targeted genomic regions with deep coverage for hundreds, even thousands, of individuals in a single experiment. This is extremely valuable for the genotyping of gene families in which locus-specific primers are often difficult to design, such as the major histocompatibility complex (MHC). The utility of AS is, however, limited by the high intrinsic sequencing error rates of NGS technologies and other sources of error such as polymerase amplification or chimera formation. Correcting these errors requires extensive bioinformatic post-processing of NGS data. Amplicon Sequence Assignment (AMPLISAS) is a tool that performs analysis of AS results in a simple and efficient way, while offering customization options for advanced users. AMPLISAS is designed as a three-step pipeline consisting of (i) read demultiplexing, (ii) unique sequence clustering and (iii) erroneous sequence filtering. Allele sequences and frequencies are retrieved in excel spreadsheet format, making them easy to interpret. AMPLISAS performance has been successfully benchmarked against previously published genotyped MHC data sets obtained with various NGS technologies.


Subject(s)
Computational Biology/methods , Genotyping Techniques , Internet , Multilocus Sequence Typing/methods , High-Throughput Nucleotide Sequencing
16.
PLoS One ; 10(10): e0141044, 2015.
Article in English | MEDLINE | ID: mdl-26484765

ABSTRACT

The control of growth and development of all living organisms is a complex and dynamic process that requires the harmonious expression of numerous genes. Gene expression is mainly controlled by the activity of sequence-specific DNA binding proteins called transcription factors (TFs). Amongst the various classes of eukaryotic TFs, the MYB superfamily is one of the largest and most diverse, and it has considerably expanded in the plant kingdom. R2R3-MYBs have been extensively studied over the last 15 years. However, DNA-binding specificity has been characterized for only a small subset of these proteins. Therefore, one of the remaining challenges is the exhaustive characterization of the DNA-binding specificity of all R2R3-MYB proteins. In this study, we have developed a library of Arabidopsis thaliana R2R3-MYB open reading frames, whose DNA-binding activities were assayed in vivo (yeast one-hybrid experiments) with a pool of selected cis-regulatory elements. Altogether 1904 interactions were assayed leading to the discovery of specific patterns of interactions between the various R2R3-MYB subgroups and their DNA target sequences and to the identification of key features that govern these interactions. The present work provides a comprehensive in vivo analysis of R2R3-MYB binding activities that should help in predicting new DNA motifs and identifying new putative target genes for each member of this very large family of TFs. In a broader perspective, the generated data will help to better understand how TF interact with their target DNA sequences.


Subject(s)
Arabidopsis Proteins/metabolism , Arabidopsis/genetics , DNA-Binding Proteins/genetics , Gene Expression Regulation, Plant/genetics , Transcription Factors/metabolism , Amino Acid Sequence , Base Sequence , Binding Sites/genetics , DNA, Plant/genetics , DNA, Plant/metabolism , Gene Library , Promoter Regions, Genetic , Two-Hybrid System Techniques
SELECTION OF CITATIONS
SEARCH DETAIL