RESUMO
The recent development of whole genome association studies has lead to the robust identification of several loci involved in different common human diseases. Interestingly, some of the strongest signals of association observed in these studies arise from non-coding regions located in very large introns or far away from any annotated genes, raising the possibility that these regions are involved in the etiology of the disease through some unidentified regulatory mechanisms. These findings highlight the importance of better understanding the mechanisms leading to inter-individual differences in gene expression in humans. Most of the existing approaches developed to identify common regulatory polymorphisms are based on linkage/association mapping of gene expression to genotypes. However, these methods have some limitations, notably their cost and the requirement of extensive genotyping information from all the individuals studied which limits their applications to a specific cohort or tissue. Here we describe a robust and high-throughput method to directly measure differences in allelic expression for a large number of genes using the Illumina Allele-Specific Expression BeadArray platform and quantitative sequencing of RT-PCR products. We show that this approach allows reliable identification of differences in the relative expression of the two alleles larger than 1.5-fold (i.e., deviations of the allelic ratio larger than 60:40) and offers several advantages over the mapping of total gene expression, particularly for studying humans or outbred populations. Our analysis of more than 80 individuals for 2,968 SNPs located in 1,380 genes confirms that differential allelic expression is a widespread phenomenon affecting the expression of 20% of human genes and shows that our method successfully captures expression differences resulting from both genetic and epigenetic cis-acting mechanisms.
Assuntos
Epigênese Genética , Regulação da Expressão Gênica , Genoma Humano , Alelos , Desequilíbrio Alélico , Teste de Complementação Genética , Humanos , Íntrons , Análise de Sequência com Séries de Oligonucleotídeos , Polimorfismo de Nucleotídeo Único , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Reação em Cadeia da Polimerase Via Transcriptase ReversaAssuntos
Financiamento de Capital/economia , Financiamento de Capital/tendências , Educação Médica/economia , Educação Médica/tendências , Economia Hospitalar/tendências , Educação de Pós-Graduação em Medicina/economia , Educação de Pós-Graduação em Medicina/tendências , Hospitais de Ensino/economia , Hospitais de Ensino/tendências , Humanos , Medicina Interna/economia , Medicina Interna/educação , Medicina Interna/tendências , Internato e Residência/economia , Internato e Residência/tendências , Programas de Assistência Gerenciada/economia , Programas de Assistência Gerenciada/tendências , Medicare/economia , Medicare/tendências , Estados UnidosRESUMO
We have developed a simple and efficient algorithm to identify each member of a large collection of DNA-linked objects through the use of hybridization, and have applied it to the manufacture of randomly assembled arrays of beads in wells. Once the algorithm has been used to determine the identity of each bead, the microarray can be used in a wide variety of applications, including single nucleotide polymorphism genotyping and gene expression profiling. The algorithm requires only a few labels and several sequential hybridizations to identify thousands of different DNA sequences with great accuracy. We have decoded tens of thousands of arrays, each with 1520 sequences represented at approximately 30-fold redundancy by up to approximately 50,000 beads, with a median error rate of <1 x 10(-4) per bead. The approach makes use of error checking codes and provides, for the first time, a direct functional quality control of every element of each array that is manufactured. The algorithm can be applied to any spatially fixed collection of objects or molecules that are associated with specific DNA sequences.