RESUMEN
Genome-wide linkage analysis is a powerful tool for the identification of genes underlying single gene and complex genetic disorders. The most commonly used technique for performing genome wide scans for genetic studies in the rat is by analysis of simple sequence length polymorphism (SSLPs) or microsatellite markers. A sensitive and flexible method for high-throughput genotyping is described. Addition of an M13 tail to the SSLP primer eliminates the need for direct conjugation of the fluorescent dye to the primers, allowing for any combination of primer and fluorophor, and therefore for easy multiplexing of primers in the same reaction. With the use of three different dyes, it is possible to run more than five hundred genotypes in each run of the automatic sequencer. Automation in the fluorescent detection system and data tracking software for processing genotypes, contributes to the ability to genotype large number of samples rapidly and accurately.
Asunto(s)
Genoma , Polimorfismo Genético , Animales , Secuencia de Bases , Cartilla de ADN , Marcadores Genéticos , Repeticiones de Microsatélite/genética , Reacción en Cadena de la Polimerasa , Sitios de Carácter Cuantitativo , Ratas , Sensibilidad y EspecificidadRESUMEN
The genetic basis of type 2 diabetes remains incompletely defined despite the use of multiple genetic strategies. Multiparental populations such as heterogeneous stocks (HS) facilitate gene discovery by allowing fine mapping to only a few megabases, significantly decreasing the number of potential candidate genes compared to traditional mapping strategies. In the present work, we employed expression and sequence analysis in HS rats (Rattus norvegicus) to identify Tpcn2 as a likely causal gene underlying a 3.1-Mb locus for glucose and insulin levels. Global gene expression analysis on liver identified Tpcn2 as the only gene in the region that is differentially expressed between HS rats with glucose intolerance and those with normal glucose regulation. Tpcn2 also maps as a cis-regulating expression QTL and is negatively correlated with fasting glucose levels. We used founder sequence to identify variants within this region and assessed association between 18 variants and diabetic traits by conducting a mixed-model analysis, accounting for the complex family structure of the HS. We found that two variants were significantly associated with fasting glucose levels, including a nonsynonymous coding variant within Tpcn2. Studies in Tpcn2 knockout mice demonstrated a significant decrease in fasting glucose levels and insulin response to a glucose challenge relative to those in wild-type mice. Finally, we identified variants within Tpcn2 that are associated with fasting insulin in humans. These studies indicate that Tpcn2 is a likely causal gene that may play a role in human diabetes and demonstrate the utility of multiparental populations for positionally cloning genes within complex loci.
Asunto(s)
Canales de Calcio/genética , Diabetes Mellitus Tipo 2/genética , Animales , Glucemia/genética , Canales de Calcio/metabolismo , Femenino , Genoma Humano , Humanos , Insulina/sangre , Insulina/genética , Masculino , Ratones , Ratones Endogámicos C57BL , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Ratas , Ratas EndogámicasRESUMEN
The presence of copy number variants in normal genomes poses a challenge to identify small genuine somatic copy number changes in high-resolution cancer genome profiling studies due to the use of unpaired reference DNA. Another problem is the well-known rearrangements of immunoglobulin and T-cell receptor genes in lymphocytes (a commonly used reference), which may misdirect the researcher to a locus with no relevance in tumorigenesis. We here show real gains of the IgG heavy chain V gene region in carcinogen-induced rat mammary tumor samples after normalization to paired mammary gland, a tissue without lymphocyte infiltration. We further show that the segmental duplication region encompassing the IgG heavy chain V genes is a copy number variant between the susceptible (SS) and the resistant (BN) to mammary tumor development inbred rat strains. Our data suggest that the already inherently unstable genomic region is a convenient target for additional structural rearrangements (gains) at the somatic level when exposed to a carcinogen (7,12-dimethylbenz[a]anthracene), which subsequently seem to benefit tumor development in the mammary gland of the susceptible strain. Thus, the selection of an appropriate reference DNA enabled us to identify immunoglobulin genes as novel cancer targets playing a role in mammary tumor development. We conclude that control DNA in array-based comparative genomic hybridization experiments should be selected with care, and DNA from pooled spleen (contains immature lymphocytes and is used as reference in animal studies) or blood may not be the ideal control in the study of primary tumors.
Asunto(s)
Adenocarcinoma/genética , Neoplasias Mamarias Experimentales/genética , Hibridación de Ácido Nucleico/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos , 9,10-Dimetil-1,2-benzantraceno/toxicidad , Adenocarcinoma/inducido químicamente , Animales , Secuencia de Bases , Carcinógenos/toxicidad , Cartilla de ADN , Neoplasias Mamarias Experimentales/inducido químicamente , Reacción en Cadena de la Polimerasa , RatasRESUMEN
The laboratory rat is a major model organism for systems biology. To complement the cornucopia of physiological and pharmacological data generated in the rat, a large genomic toolset has been developed, culminating in the release of the rat draft genome sequence. The rat draft sequence used a variety of assembly packages, as well as data from the Radiation Hybrid (RH) map of the rat as part of their validation. As part of the Rat Genome Project, we have been building a high-density RH map to facilitate data integration from multiple maps and now to help validate the genome assembly. By incorporating vectors from our lab and several other labs, we have doubled the number of simple sequence length polymorphisms (SSLPs), genes, expressed sequence tags (ESTs), and sequence-tagged sites (STSs) compared to any other genome-wide rat map, a total of 24,437 elements. During the process, we also identified a novel approach for integrating the RH placement results from multiple maps. This new integrated RH map contains approximately 10 RH-mapped elements per Mb on the genome assembly, enabling the RH maps to serve as a scaffold for a variety of data visualization tools.