Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 17 de 17
Filter
Add more filters










Publication year range
1.
Eur Urol Oncol ; 2024 Apr 18.
Article in English | MEDLINE | ID: mdl-38641541

ABSTRACT

Chemoradiation therapy (CRT) is a treatment for muscle-invasive bladder cancer (MIBC). Using a novel transcriptomic profiling panel, we validated prognostic immune biomarkers to CRT using 70 pretreatment tumor samples from prospective trials of MIBC (NRG/RTOG 0524 and 0712). Disease-free survival (DFS) and overall survival (OS) were estimated via the Kaplan-Meier method and stratified by genes correlated with immune cell activation. Cox proportional-hazards models were used to assess group differences. Clustering of gene expression profiles revealed that the cluster with high immune cell content was associated with longer DFS (hazard ratio [HR] 0.53, 95% confidence interval [CI] 0.26-1.10; p = 0.071) and OS (HR 0.48, 95% CI 0.24-0.97; p = 0.040) than the cluster with low immune cell content. Higher expression of T-cell infiltration genes (CD8A and ICOS) was associated with longer DFS (HR 0.40, 95% CI 0.21-0.75; p = 0.005) and OS (HR 0.49, 95% CI 0.25-0.94; p = 0.033). Higher IDO1 expression (IFNγ signature) was also associated with longer DFS (HR 0.44, 95% CI 0.24-0.88; p = 0.021) and OS (HR 0.49, 95% CI 0.24-0.99; p = 0.048). These findings should be validated in prospective CRT trials that include biomarkers, particularly for trials incorporating immunotherapy for MIBC. PATIENT SUMMARY: We analyzed patient samples from two clinical trials (NRG/RTOG 0524 and 0712) of chemoradiation for muscle-invasive bladder cancer using a novel method to assess immune cells in the tumor microenvironment. Higher expression of genes associated with immune activation and high overall immune-cell content were associated with better disease-free survival and overall survival for patients treated with chemoradiation.

2.
J Cancer Res Clin Oncol ; 149(15): 14125-14136, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37552307

ABSTRACT

PURPOSE: Anti-PD-1 therapy provides clinical benefit in 40-50% of patients with relapsed and/or metastatic head and neck squamous cell carcinoma (RM-HNSCC). Selection of anti- PD-1 therapy is typically based on patient PD-L1 immunohistochemistry (IHC) which has low specificity for predicting disease control. Therefore, there is a critical need for a clinical biomarker that will predict clinical benefit to anti-PD-1 treatment with high specificity. METHODS: Clinical treatment and outcomes data for 103 RM-HNSCC patients were paired with RNA-sequencing data from formalin-fixed patient samples. Using logistic regression methods, we developed a novel biomarker classifier based on expression patterns in the tumor immune microenvironment to predict disease control with monotherapy PD-1 inhibitors (pembrolizumab and nivolumab). The performance of the biomarker was internally validated using out-of-bag methods. RESULTS: The biomarker significantly predicted disease control (65% in predicted non-progressors vs. 17% in predicted progressors, p < 0.001) and was significantly correlated with overall survival (OS; p = 0.004). In addition, the biomarker outperformed PD-L1 IHC across numerous metrics including sensitivity (0.79 vs 0.64, respectively; p = 0.005) and specificity (0.70 vs 0.61, respectively; p = 0.009). CONCLUSION: This novel assay uses tumor immune microenvironment expression data to predict disease control and OS with high sensitivity and specificity in patients with RM-HNSCC treated with anti-PD-1 monotherapy.

3.
J Oral Biol Craniofac Res ; 12(6): 753-759, 2022.
Article in English | MEDLINE | ID: mdl-36118143

ABSTRACT

Aim: The purpose of this study was to evaluate the impact of gender and peripheral blood parameters on the characteristics of Leucocyte-and Platelet-Rich Fibrin (L-PRF) membranes and to describe histologically three different zones of L-PRF membranes. Methods: Blood was collected from twenty healthy donors (10 men and 10 women). Peripheral blood parameters including leucocyte and platelet counts, and fibrinogen levels were recorded. L-PRF membranes were prepared to quantify the release of growth factors (PDGF, VEGF, BMP-2, and BMP-9) at 1, 2, 3 and 7 days and for histological examination. Three zones within each L-PRF membrane (face, body, and tail) were analysed separately, quantifying the area of leucocytes, platelets, and fibrin in percentage. The Young's modulus of the membranes was also considered (during tensile and compression tests). Results: Women had significantly higher fibrinogen levels in their peripheral blood, and a higher release of BMP-9, whereas men showed a significantly higher Young's modulus in compression tests. The histology revealed significant differences in cellular content and fibrin concentration between the 3 areas, with the face being biologically the richest. Conclusion: Several factors influenced the final characteristics of L-PRF membranes. These need to be taken into consideration when interpreting the results of research, but especially in clinical practice.

4.
Sci Rep ; 12(1): 1342, 2022 01 25.
Article in English | MEDLINE | ID: mdl-35079117

ABSTRACT

Anti-PD-1 therapy can provide long, durable benefit to a fraction of patients. The on-label PD-L1 test, however, does not accurately predict response. To build a better biomarker, we created a method called T Cell Subtype Profiling (TCSP) that characterizes the abundance of T cell subtypes (TCSs) in FFPE specimens using five RNA models. These TCS RNA models are created using functional methods, and robustly discriminate between naïve, activated, exhausted, effector memory, and central memory TCSs, without the reliance on non-specific, classical markers. TCSP is analytically valid and corroborates associations between TCSs and clinical outcomes. Multianalyte biomarkers based on TCS estimates predicted response to anti-PD-1 therapy in three different cancers and outperformed the indicated PD-L1 test, as well as Tumor Mutational Burden. Given the utility of TCSP, we investigated the abundance of TCSs in TCGA cancers and created a portal to enable researchers to discover other TCSP-based biomarkers.


Subject(s)
CD8-Positive T-Lymphocytes/metabolism , Neoplasms/drug therapy , Programmed Cell Death 1 Receptor/metabolism , Biomarkers, Tumor/metabolism , CD8-Positive T-Lymphocytes/pathology , Cells, Cultured , Humans , Leukocytes, Mononuclear
5.
Int J Periodontics Restorative Dent ; 41(6): e287-e296, 2021.
Article in English | MEDLINE | ID: mdl-34818400

ABSTRACT

The aim of the present study was to compare leukocyte- and platelet-rich fibrin (L-PRF) membranes with a connective tissue graft (CTG) in combination with a coronally advanced flap (CAF) in the treatment of Miller Class I or II localized gingival recessions. A randomized controlled clinical trial with 17 recessions in each group was initiated; the control group received treatment with CAF+CTG, and the test group received CAF+L-PRF. The following variables were measured before treatment and after 1, 3, and 6 months: gingival recession depth (RD), gingival recession width (RW), gingival thickness (GT), probing depth (PD), clinical attachment level (CAL), and keratinized tissue height (KTH). Also, the root coverage percentage (RC), the pain score, postoperative complications, and the root coverage esthetic score (RES) were recorded after surgery. Both treatments presented significant improvements in the RD, RW, and CAL at 1, 3, and 6 months. CTG achieved a significantly higher RC at 1, 3, and 6 months and a significantly higher RES score at 6 months. L-PRF presented a significantly lower pain score and less postoperative complications. Both strategies were effective for the treatment of localized gingival recessions. The CTG obtained higher RC and esthetic results, and L-PRF had less pain and postsurgical complications.


Subject(s)
Gingival Recession , Platelet-Rich Fibrin , Connective Tissue , Gingiva , Gingival Recession/surgery , Humans , Tooth Root , Treatment Outcome
6.
J Mol Diagn ; 22(4): 555-570, 2020 04.
Article in English | MEDLINE | ID: mdl-32036085

ABSTRACT

As immuno-oncology drugs grow more popular in the treatment of cancer, better methods are needed to quantify the tumor immune cell component to determine which patients are most likely to benefit from treatment. Methods such as flow cytometry can accurately assess the composition of infiltrating immune cells; however, they show limited use in formalin-fixed, paraffin-embedded (FFPE) specimens. This article describes a novel hybrid-capture RNA sequencing assay, ImmunoPrism, that estimates the relative percentage abundance of eight immune cell types in FFPE solid tumors. Immune health expression models were generated using machine learning methods and used to uniquely identify each immune cell type using the most discriminatively expressed genes. The analytical performance of the assay was assessed using 101 libraries from 40 FFPE and 32 fresh-frozen samples. With defined samples, ImmunoPrism had a precision of ±2.72%, a total error of 2.75%, and a strong correlation (r2 = 0.81; P < 0.001) to flow cytometry. ImmunoPrism had similar performance in dissociated tumor cell samples (total error of 8.12%) and correlated strongly with immunohistochemistry (CD8: r2 = 0.83; P < 0.001) in FFPE samples. Other performance metrics were determined, including limit of detection, reportable range, and reproducibility. The approach used for analytical validation is shared here so that it may serve as a helpful framework for other laboratories when validating future complex RNA-based assays.


Subject(s)
Computational Biology/methods , Gene Expression Profiling/methods , Immunomodulation/genetics , Neoplasms/genetics , Neoplasms/immunology , Computational Biology/standards , Gene Expression Profiling/standards , High-Throughput Nucleotide Sequencing , Humans , Immunohistochemistry , Leukocytes, Mononuclear/immunology , Leukocytes, Mononuclear/metabolism , Lymphocytes/immunology , Lymphocytes/metabolism , Neoplasms/metabolism , Neoplasms/pathology , Reproducibility of Results , Sensitivity and Specificity , Sequence Analysis, RNA
7.
Sci Rep ; 8(1): 28, 2018 01 08.
Article in English | MEDLINE | ID: mdl-29311716

ABSTRACT

Massive amounts of metagenomics data are currently being produced, and in all such projects a sizeable fraction of the resulting data shows no or little homology to known sequences. It is likely that this fraction contains novel viruses, but identification is challenging since they frequently lack homology to known viruses. To overcome this problem, we developed a strategy to detect ORFan protein families in shotgun metagenomics data, using similarity-based clustering and a set of filters to extract bona fide protein families. We applied this method to 17 virus-enriched libraries originating from human nasopharyngeal aspirates, serum, feces, and cerebrospinal fluid samples. This resulted in 32 predicted putative novel gene families. Some families showed detectable homology to sequences in metagenomics datasets and protein databases after reannotation. Notably, one predicted family matches an ORF from the highly variable Torque Teno virus (TTV). Furthermore, follow-up from a predicted ORFan resulted in the complete reconstruction of a novel circular genome. Its organisation suggests that it most likely corresponds to a novel bacteriophage in the microviridae family, hence it was named bacteriophage HFM.


Subject(s)
Genome, Viral , Metagenome , Metagenomics , Viral Proteins/genetics , Base Sequence , Cluster Analysis , Computational Biology/methods , Humans , Markov Chains , Metagenomics/methods , Molecular Sequence Annotation , Open Reading Frames
8.
Proc Natl Acad Sci U S A ; 112(52): 15976-81, 2015 Dec 29.
Article in English | MEDLINE | ID: mdl-26598659

ABSTRACT

Horizontal gene transfer (HGT), or the transfer of genes between species, has been recognized recently as more pervasive than previously suspected. Here, we report evidence for an unprecedented degree of HGT into an animal genome, based on a draft genome of a tardigrade, Hypsibius dujardini. Tardigrades are microscopic eight-legged animals that are famous for their ability to survive extreme conditions. Genome sequencing, direct confirmation of physical linkage, and phylogenetic analysis revealed that a large fraction of the H. dujardini genome is derived from diverse bacteria as well as plants, fungi, and Archaea. We estimate that approximately one-sixth of tardigrade genes entered by HGT, nearly double the fraction found in the most extreme cases of HGT into animals known to date. Foreign genes have supplemented, expanded, and even replaced some metazoan gene families within the tardigrade genome. Our results demonstrate that an unexpectedly large fraction of an animal genome can be derived from foreign sources. We speculate that animals that can survive extremes may be particularly prone to acquiring foreign genes.


Subject(s)
Gene Transfer, Horizontal , Genome/genetics , Genomic Library , Sequence Analysis, DNA/methods , Tardigrada/genetics , Animals , DNA, Archaeal/chemistry , DNA, Archaeal/genetics , DNA, Bacterial/chemistry , DNA, Bacterial/genetics , DNA, Fungal/chemistry , DNA, Fungal/genetics , DNA, Plant/chemistry , DNA, Plant/genetics , DNA, Viral/chemistry , DNA, Viral/genetics , Phylogeny , Tardigrada/classification
9.
Plant Genome ; 8(2): eplantgenome2014.08.0040, 2015 Jul.
Article in English | MEDLINE | ID: mdl-33228321

ABSTRACT

Leaf rust, caused by Puccinia triticina Eriks., is one of the most widespread diseases of wheat and breeding for resistance is one of the most effective methods of control. Lr16 is a wheat leaf rust resistance gene (R-gene) that provides resistance at both the seedling and adult stages. Simple-sequence repeat (SSR) markers have been used to map Lr16 to the distal end of chromosome 2B. The objectives of this study were to use RNA sequencing (RNA-seq) and in silico subtraction to identify new R-gene analogs (RGAs) and use them as Lr16 markers. RNA was isolated from the susceptible wheat cultivar Thatcher (Tc) and the resistant Tc isolines TcLr10, TcLr16, TcLr21, and sequenced using Illumina technology. Using in silico subtraction, sequences from the resistant Tc isolines were aligned to a Tc reference expressed sequence tag (EST) set. Sequences not aligning to the Tc reference were assembled into contigs and analyzed using BLASTx to determine putative gene functions. Primer pairs were designed for 181 RGA sequences, of which, 137 amplified in at least one of the parents. A mapping population was developed with 165 F2 lines from a cross between the rust-susceptible cultivar Chinese Spring (CS) and TcLr16. Two RGA markers XTaLr16_RGA266585 and XTaLr16_RGA22128 were identified that mapped proximally 1.2 and 23.8 cM from Lr16, respectively. Three SSR markers Xwmc764, Xwmc661, and Xbarc35 mapped between these two RGA markers at distances of 5.0, 10.9, and 16.1 cM from Lr16, respectively. In silico subtraction is an effective technique for isolating RGAs linked to R-genes of interest.

10.
Brief Bioinform ; 12(5): 485-8, 2011 Sep.
Article in English | MEDLINE | ID: mdl-21666252

ABSTRACT

There is a great need for standards in the orthology field. Users must contend with different ortholog data representations from each provider, and the providers themselves must independently gather and parse the input sequence data. These burdensome and redundant procedures make data comparison and integration difficult. We have designed two XML-based formats, SeqXML and OrthoXML, to solve these problems. SeqXML is a lightweight format for sequence records-the input for orthology prediction. It stores the same sequence and metadata as typical FASTA format records, but overcomes common problems such as unstructured metadata in the header and erroneous sequence content. XML provides validation to prevent data integrity problems that are frequent in FASTA files. The range of applications for SeqXML is broad and not limited to ortholog prediction. We provide read/write functions for BioJava, BioPerl, and Biopython. OrthoXML was designed to represent ortholog assignments from any source in a consistent and structured way, yet cater to specific needs such as scoring schemes or meta-information. A unified format is particularly valuable for ortholog consumers that want to integrate data from numerous resources, e.g. for gene annotation projects. Reference proteomes for 61 organisms are already available in SeqXML, and 10 orthology databases have signed on to OrthoXML. Adoption by the entire field would substantially facilitate exchange and quality control of sequence and orthology information.


Subject(s)
Databases, Factual , Internet , Proteome/analysis , Software , Molecular Sequence Annotation , Proteome/standards , Sequence Analysis
11.
Nucleic Acids Res ; 38(Database issue): D196-203, 2010 Jan.
Article in English | MEDLINE | ID: mdl-19892828

ABSTRACT

The InParanoid project gathers proteomes of completely sequenced eukaryotic species plus Escherichia coli and calculates pairwise ortholog relationships among them. The new release 7.0 of the database has grown by an order of magnitude over the previous version and now includes 100 species and their collective 1.3 million proteins organized into 42.7 million pairwise ortholog groups. The InParanoid algorithm itself has been revised and is now both more specific and sensitive. Based on results from our recent benchmarking of low-complexity filters in homology assignment, a two-pass BLAST approach was developed that makes use of high-precision compositional score matrix adjustment, but avoids the alignment truncation that sometimes follows. We have also updated the InParanoid web site (http://InParanoid.sbc.su.se). Several features have been added, the response times have been improved and the site now sports a new, clearer look. As the number of ortholog databases has grown, it has become difficult to compare among these resources due to a lack of standardized source data and incompatible representations of ortholog relationships. To facilitate data exchange and comparisons among ortholog databases, we have developed and are making available two XML schemas: SeqXML for the input sequences and OrthoXML for the output ortholog clusters.


Subject(s)
Computational Biology/methods , Databases, Genetic , Databases, Nucleic Acid , Escherichia coli/genetics , Eukaryotic Cells/chemistry , Proteins/genetics , Algorithms , Animals , Cluster Analysis , Computational Biology/trends , Escherichia coli/metabolism , Genome, Bacterial , Humans , Information Storage and Retrieval/methods , Internet , Protein Structure, Tertiary , Proteomics/methods , Software
12.
BMC Bioinformatics ; 10: 314, 2009 Sep 28.
Article in English | MEDLINE | ID: mdl-19785723

ABSTRACT

BACKGROUND: Transmembrane (TM) proteins are proteins that span a biological membrane one or more times. As their 3-D structures are hard to determine, experiments focus on identifying their topology (i. e. which parts of the amino acid sequence are buried in the membrane and which are located on either side of the membrane), but only a few topologies are known. Consequently, various computational TM topology predictors have been developed, but their accuracies are far from perfect. The prediction quality can be improved by applying a consensus approach, which combines results of several predictors to yield a more reliable result. RESULTS: A novel TM consensus method, named MetaTM, is proposed in this work. MetaTM is based on support vector machine models and combines the results of six TM topology predictors and two signal peptide predictors. On a large data set comprising 1460 sequences of TM proteins with known topologies and 2362 globular protein sequences it correctly predicts 86.7% of all topologies. CONCLUSION: Combining several TM predictors in a consensus prediction framework improves overall accuracy compared to any of the individual methods. Our proposed SVM-based system also has higher accuracy than a previous consensus predictor. MetaTM is made available both as downloadable source code and as DAS server at http://MetaTM.sbc.su.se.


Subject(s)
Computational Biology/methods , Membrane Proteins/chemistry , Software , Algorithms , Databases, Protein , Protein Conformation , Sequence Analysis, Protein/methods
13.
Bioinformatics ; 25(10): 1333-4, 2009 May 15.
Article in English | MEDLINE | ID: mdl-19297349

ABSTRACT

SUMMARY: The rise in biological sequence data has led to a proliferation of separate, specialized databases. While there is great value in having many independent annotations, it is critical that there be a way to integrate them in one combined view. The Distributed Annotation System (DAS) was developed for that very purpose. There are currently no DAS clients that are open source, specialized for aggregating and comparing protein sequence annotation, and that can run as a self-contained application outside of a web browser. The speed, flexibility and extensibility that come with a stand-alone application motivated us to create DASher, an open-source Java DAS client. Given a UniProt sequence identifier, DASher automatically queries DAS-supporting servers worldwide for any information on that sequence and then displays the annotations in an interactive viewer for easy comparison. DASher is a fast, Java-based DAS client optimized for viewing protein sequence annotation and compliant with the latest DAS protocol specification 1.53E. AVAILABILITY: DASher is available for direct use and download at http://dasher.sbc.su.se including examples and source code under the GPLv3 licence. Java version 6 or higher is required.


Subject(s)
Computational Biology/methods , Proteins/chemistry , Sequence Analysis, Protein/methods , Software , Databases, Protein , Programming Languages , User-Computer Interface
14.
BMC Genomics ; 9: 525, 2008 Nov 04.
Article in English | MEDLINE | ID: mdl-18983679

ABSTRACT

BACKGROUND: Nematode.net http://www.nematode.net is a web-accessible resource for investigating gene sequences from parasitic and free-living nematode genomes. Beyond the well-characterized model nematode C. elegans, over 500,000 expressed sequence tags (ESTs) and nearly 600,000 genome survey sequences (GSSs) have been generated from 36 nematode species as part of the Parasitic Nematode Genomics Program undertaken by the Genome Center at Washington University School of Medicine. However, these sequencing data are not present in most publicly available protein databases, which only include sequences in Swiss-Prot. Swiss-Prot, in turn, relies on GenBank/Embl/DDJP for predicted proteins from complete genomes or full-length proteins. DESCRIPTION: Here we present the NemaPath pathway server, a web-based pathway-level visualization tool for navigating putative metabolic pathways for over 30 nematode species, including 27 parasites. The NemaPath approach consists of two parts: 1) a backend tool to align and evaluate nematode genomic sequences (curated EST contigs) against the annotated Kyoto Encyclopedia of Genes and Genomes (KEGG) protein database; 2) a web viewing application that displays annotated KEGG pathway maps based on desired confidence levels of primary sequence similarity as defined by a user. NemaPath also provides cross-referenced access to nematode genome information provided by other tools available on Nematode.net, including: detailed NemaGene EST cluster information; putative translations; GBrowse EST cluster views; links from nematode data to external databases for corresponding synonymous C. elegans counterparts, subject matches in KEGG's gene database, and also KEGG Ontology (KO) identification. CONCLUSION: The NemaPath server hosts metabolic pathway mappings for 30 nematode species and is available on the World Wide Web at http://nematode.net/cgi-bin/keggview.cgi. The nematode source sequences used for the metabolic pathway mappings are available via FTP http://www.nematode.net/FTP/index.php, as provided by the Genome Center at Washington University School of Medicine.


Subject(s)
Databases, Genetic , Metabolic Networks and Pathways/genetics , Nematoda/genetics , Software , Animals , Caenorhabditis elegans/genetics , Expressed Sequence Tags , Genomics , Nematoda/metabolism , Online Systems
15.
Genome Res ; 16(6): 768-75, 2006 Jun.
Article in English | MEDLINE | ID: mdl-16741162

ABSTRACT

We describe a targeted approach to improve the contiguity of whole-genome shotgun sequence (WGS) assemblies at run-time, using information from Bacterial Artificial Chromosome (BAC)-based physical maps. Clone sizes and overlaps derived from clone fingerprints are used for the calculation of length constraints between any two BAC neighbors sharing 40% of their size. These constraints are used to promote the linkage and guide the arrangement of sequence contigs within a sequence scaffold at the layout phase of WGS assemblies. This process is facilitated by FASSI, a stand-alone application that calculates BAC end and BAC overlap length constraints from clone fingerprint map contigs created by the FPC package. FASSI is designed to work with the assembly tool PCAP, but its output can be formatted to work with other WGS assembly algorithms able to use length constraints for individual clones. The FASSI method is simple to implement, potentially cost-effective, and has resulted in the increase of scaffold contiguity for both the Drosophila melanogaster and Cryptococcus gattii genomes when compared to a control assembly without map-derived constraints. A 6.5-fold coverage draft DNA sequence of the Pan troglodytes (chimpanzee) genome was assembled using map-derived constraints and resulted in a 26.1% increase in scaffold contiguity.


Subject(s)
Cryptococcus/genetics , Drosophila melanogaster/genetics , Genome , Pan troglodytes/genetics , Physical Chromosome Mapping , Sequence Analysis, DNA/methods , Animals , Chromosomes, Artificial, Bacterial/genetics , Databases, Nucleic Acid , Software
16.
Hum Mol Genet ; 14(7): 903-12, 2005 Apr 01.
Article in English | MEDLINE | ID: mdl-15703188

ABSTRACT

Craniofacial abnormalities are one of the most common birth defects in humans, but little is known about the human genes that control these important developmental processes. To identify relevant genes, we analyzed transcription profiles of human pharyngeal arch 1 (PA1), a conserved embryonic structure that develops into the palate and jaw. Using microdissected, normal human craniofacial structures, we constructed 12 SAGE (serial analysis of gene expression) libraries and sequenced 606 532 tags. We also performed Affymetrix microarray analysis on 25 craniofacial targets. Our data revealed not only genes "enriched" or differentially expressed in PA1 during fourth and fifth week of human development, but also 6927 genes newly identified to be expressed in human PA1. Many of these genes are involved in biosynthetic processes and have binding function and catalytic activity. We compared expression profiles of human genes with those of mouse homologs to look for genes more specific to human craniofacial development and found 766 genes expressed in human PA1, but not in mouse PA1. We also identified 1408 genes that were expressed in mouse as well as human PA1 and could be useful in creating mouse models for human conditions. We confirmed conservation of some human PA1 expression patterns in mouse embryonic samples with whole mount in situ hybridization and real-time RT-PCR. This comprehensive approach to expression profiling gives insights into the early development of the craniofacial region and provides markers for developmental structures and candidate genes, including SET and CCT3, for diseases such as orofacial clefting and micrognathia.


Subject(s)
Branchial Region/embryology , Embryonic Development , Gene Expression Regulation, Developmental , Animals , Catalysis , Craniofacial Abnormalities/genetics , DNA, Complementary/metabolism , Disease Models, Animal , Gene Library , Humans , In Situ Hybridization , Mice , Oligonucleotide Array Sequence Analysis , Reverse Transcriptase Polymerase Chain Reaction , Species Specificity , Time Factors
17.
Genome Res ; 14(10B): 2041-7, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15489324

ABSTRACT

Transcription factors (TFs) are essential regulators of gene expression, and mutated TF genes have been shown to cause numerous human genetic diseases. Yet to date, no single, comprehensive database of human TFs exists. In this work, we describe the collection of an essentially complete set of TF genes from one depiction of the human ORFeome, and the design of a microarray to interrogate their expression. Taking 1468 known TFs from TRANSFAC, InterPro, and FlyBase, we used this seed set to search the ScriptSure human transcriptome database for additional genes. ScriptSure's genome-anchored transcript clusters allowed us to work with a nonredundant high-quality representation of the human transcriptome. We used a high-stringency similarity search by using BLASTN, and a protein motif search of the human ORFeome by using hidden Markov models of DNA-binding domains known to occur exclusively or primarily in TFs. Four hundred ninety-four additional TF genes were identified in the overlap between the two searches, bringing our estimate of the total number of human TFs to 1962. Zinc finger genes are by far the most abundant family (762 members), followed by homeobox (199 members) and basic helix-loop-helix genes (117 members). We designed a microarray of 50-mer oligonucleotide probes targeted to a unique region of the coding sequence of each gene. We have successfully used this microarray to interrogate TF gene expression in species as diverse as chickens and mice, as well as in humans.


Subject(s)
Gene Expression Profiling , Genome, Human , Oligonucleotide Array Sequence Analysis , Open Reading Frames/genetics , Transcription Factors/chemistry , Transcription Factors/genetics , Humans , Markov Chains , Transcription Factors/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...