RESUMO
Pericentromeric heterochromatin is mainly composed of satellite DNA sequences. Although being historically associated with transcriptional repression, some pericentromeric satellite DNA sequences are transcribed. The transcription events of pericentromeric satellite sequences occur in highly flexible biological contexts. Hence, the apparent randomness of pericentromeric satellite transcription incites the discussion about the attribution of biological functions. However, pericentromeric satellite RNAs have clear roles in the organization of nuclear structure. Silencing pericentromeric heterochromatin depends on pericentromeric satellite RNAs, that, in a feedback mechanism, contribute to the repression of pericentromeric heterochromatin. Moreover, pericentromeric satellite RNAs can also act as scaffolding molecules in condensate subnuclear structures (e.g., nuclear stress bodies). Since the formation/dissociation of nuclear condensates provides cell adaptability, pericentromeric satellite RNAs can be an epigenetic platform for regulating (sub)nuclear structure. We review current knowledge about pericentromeric satellite RNAs that, irrespective of the meaning of biological function, should be functionally addressed in regular and disease settings. This article is categorized under: RNA Methods > RNA Analyses in Cells RNA in Disease and Development > RNA in Disease.
Assuntos
Heterocromatina , RNA Satélite , RNA Satélite/metabolismo , RNA Satélite/genética , Humanos , Heterocromatina/metabolismo , Heterocromatina/genética , Animais , Núcleo Celular/metabolismo , Núcleo Celular/genética , Centrômero/metabolismo , Centrômero/genética , DNA Satélite/metabolismo , DNA Satélite/genéticaRESUMO
Molecular combing is a technique used to stretch hundreds of consistent DNA molecules in parallel on a glass surface, with a resolution of two kilo-basepairs per micrometer. The combination of this approach with fluorescent in situ hybridization (FISH) has enabled the direct visualization of DNA structure and variations at an unprecedent high resolution. This technique has been successfully used in various studies such as the identification of copy number and genomic structural variations and the precise measurements of overlap and gap sizing between contigs in genome assemblies. Here, we describe the procedure for the preparation of DNA fibers by molecular combing and its applications in multicolor fiber-FISH.
Assuntos
DNA , Hibridização in Situ Fluorescente , Hibridização in Situ Fluorescente/métodos , DNA/genética , DNA/químicaRESUMO
Organisms are often subjected to conditions that promote cellular stress. Cell responses to stress include the activation of pathways to defend against and recover from the stress, or the initiation of programmed cell death to eliminate the damaged cells. One of the processes that can be triggered under stress is the transcription and variation in the number of copies of satellite DNA sequences (satDNA), which are involved in response mechanisms. Satellite DNAs are highly repetitive tandem sequences, mainly located in the centromeric and pericentromeric regions of eukaryotic chromosomes, where they form the constitutive heterochromatin. Satellite non-coding RNAs (satncRNAs) are important regulators of cell processes, and their deregulation has been associated with disease. Also, these transcripts have been associated with stress-response mechanisms in varied eukaryotic species. This review intends to explore the role of satncRNAs when cells are subjected to adverse conditions. Studying satDNA transcription under various stress conditions and deepening our understanding of where and how these sequences are involved could be a key factor in uncovering important facts about the functions of these sequences.
Assuntos
Apoptose , CogniçãoRESUMO
BACKGROUND: Pericentromeric regions of human chromosomes are composed of tandem-repeated and highly organized sequences named satellite DNAs. Human classical satellite DNAs are classified into three families named HSat1, HSat2, and HSat3, which have historically posed a challenge for the assembly of the human reference genome where they are misrepresented due to their repetitive nature. Although being known for a long time as the most AT-rich fraction of the human genome, classical satellite HSat1A has been disregarded in genomic and transcriptional studies, falling behind other human satellites in terms of functional knowledge. Here, we aim to characterize and provide an understanding on the biological relevance of HSat1A. RESULTS: The path followed herein trails with HSat1A isolation and cloning, followed by in silico analysis. Monomer copy number and expression data was obtained in a wide variety of human cell lines, with greatly varying profiles in tumoral/non-tumoral samples. HSat1A was mapped in human chromosomes and applied in in situ transcriptional assays. Additionally, it was possible to observe the nuclear organization of HSat1A transcripts and further characterize them by 3' RACE-Seq. Size-varying polyadenylated HSat1A transcripts were detected, which possibly accounts for the intricate regulation of alternative polyadenylation. CONCLUSION: As far as we know, this work pioneers HSat1A transcription studies. With the emergence of new human genome assemblies, acrocentric pericentromeres are becoming relevant characters in disease and other biological contexts. HSat1A sequences and associated noncoding RNAs will most certainly prove significant in the future of HSat research.
Assuntos
DNA Satélite , Sequências de Repetição em Tandem , Humanos , DNA Satélite/genética , RNA não Traduzido , Genômica , Genoma HumanoRESUMO
Glycophorins are transmembrane proteins of red blood cells (RBCs), heavily glycosylated on their external-facing surface. In humans, there are four glycophorin proteins, glycophorins A, B, C and D. Glycophorins A and B are encoded by two similar genes GYPA and GYPB, and glycophorin C and glycophorin D are encoded by a single gene, GYPC. The exact function of glycophorins remains unclear. However, given their abundance on the surface of RBCs, it is likely that they serve as a substrate for glycosylation, giving the RBC a negatively charged, complex glycan "coat". GYPB and GYPE (a closely related pseudogene) were generated from GYPA by two duplication events involving a 120-kb genomic segment between 10 and 15 million years ago. Non-allelic homologous recombination between these 120-kb repeats generates a variety of duplication alleles and deletion alleles, which have been systematically catalogued from genomic sequence data. One allele, called DUP4, encodes the Dantu NE blood type and is strongly protective against malaria as it alters the surface tension of the RBC membrane. Glycophorins interact with other infectious pathogens, including viruses, as well as the malarial parasite Plasmodium falciparum, but the role of glycophorin variation in mediating the effects of these pathogens remains underexplored.
Assuntos
Doenças Transmissíveis , Glicoforinas , Humanos , Glicoforinas/genética , Glicoforinas/metabolismo , Eritrócitos/metabolismo , Eritrócitos/parasitologia , Proteínas de Membrana/genética , Variação GenéticaRESUMO
Duchenne muscular dystrophy (DMD) is caused by dystrophin gene mutations leading to skeletal muscle weakness and wasting. Dystrophin is enriched at the neuromuscular junction (NMJ), but how NMJ abnormalities contribute to DMD pathogenesis remains unclear. Here, we combine transcriptome analysis and modeling of DMD patient-derived neuromuscular circuits with CRISPR-corrected isogenic controls in compartmentalized microdevices. We show that NMJ volumes and optogenetic motor neuronstimulated myofiber contraction are compromised in DMD neuromuscular circuits, which can be rescued by pharmacological inhibition of TGFß signaling, an observation validated in a 96-well human neuromuscular circuit coculture assay. These beneficial effects are associated with normalization of dysregulated gene expression in DMD myogenic transcriptomes affecting NMJ assembly (e.g., MUSK) and axon guidance (e.g., SLIT2 and SLIT3). Our study provides a new human microphysiological model for investigating NMJ defects in DMD and assessing candidate drugs and suggests that enhancing neuromuscular connectivity may be an effective therapeutic strategy.
RESUMO
(Peri)centromeric repetitive sequences and, more specifically, satellite DNA (satDNA) sequences, constitute a major human genomic component. SatDNA sequences can vary on a large number of features, including nucleotide composition, complexity, and abundance. Several satDNA families have been identified and characterized in the human genome through time, albeit at different speeds. Human satDNA families present a high degree of sub-variability, leading to the definition of various subfamilies with different organization and clustered localization. Evolution of satDNA analysis has enabled the progressive characterization of satDNA features. Despite recent advances in the sequencing of centromeric arrays, comprehensive genomic studies to assess their variability are still required to provide accurate and proportional representation of satDNA (peri)centromeric/acrocentric short arm sequences. Approaches combining multiple techniques have been successfully applied and seem to be the path to follow for generating integrated knowledge in the promising field of human satDNA biology.
Assuntos
DNA Satélite/genética , DNA Satélite/química , Evolução Molecular , Genoma Humano , Genômica/métodos , Genômica/tendências , Humanos , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/tendências , Fatores de TempoRESUMO
BACKGROUND: Approximately 5% of the human genome shows common structural variation, which is enriched for genes involved in the immune response and cell-cell interactions. A well-established region of extensive structural variation is the glycophorin gene cluster, comprising three tandemly-repeated regions about 120 kb in length and carrying the highly homologous genes GYPA, GYPB and GYPE. Glycophorin A (encoded by GYPA) and glycophorin B (encoded by GYPB) are glycoproteins present at high levels on the surface of erythrocytes, and they have been suggested to act as decoy receptors for viral pathogens. They are receptors for the invasion of the protist parasite Plasmodium falciparum, a causative agent of malaria. A particular complex structural variant, called DUP4, creates a GYPB-GYPA fusion gene known to confer resistance to malaria. Many other structural variants exist across the glycophorin gene cluster, and they remain poorly characterised. RESULTS: Here, we analyse sequences from 3234 diploid genomes from across the world for structural variation at the glycophorin locus, confirming 15 variants in the 1000 Genomes project cohort, discovering 9 new variants, and characterising a selection of these variants using fibre-FISH and breakpoint mapping at the sequence level. We identify variants predicted to create novel fusion genes and a common inversion duplication variant at appreciable frequencies in West Africans. We show that almost all variants can be explained by non-allelic homologous recombination and by comparing the structural variant breakpoints with recombination hotspot maps, confirm the importance of a particular meiotic recombination hotspot on structural variant formation in this region. CONCLUSIONS: We identify and validate large structural variants in the human glycophorin A-B-E gene cluster which may be associated with different clinical aspects of malaria.
Assuntos
Variação Estrutural do Genoma , Glicoforinas/genética , Malária Falciparum/genética , Pontos de Quebra do Cromossomo , Mapeamento Cromossômico , Bases de Dados Genéticas , Resistência à Doença , Humanos , Hibridização in Situ Fluorescente , Alinhamento de Sequência , Sequenciamento Completo do GenomaRESUMO
Repetitive DNA is a major organizational component of eukaryotic genomes, being intrinsically related with their architecture and evolution. Tandemly repeated satellite DNAs (satDNAs) can be found clustered in specific heterochromatin-rich chromosomal regions, building vital structures like functional centromeres and also dispersed within euchromatin. Interestingly, despite their association to critical chromosomal structures, satDNAs are widely variable among species due to their high turnover rates. This dynamic behavior has been associated with genome plasticity and chromosome rearrangements, leading to the reshaping of genomes. Here we present the current knowledge regarding satDNAs in the light of new genomic technologies, and the challenges in the study of these sequences. Furthermore, we discuss how these sequences, together with other repeats, influence genome architecture, impacting its evolution and association with disease.
Assuntos
Adaptação Fisiológica/genética , DNA Satélite/genética , DNA Satélite/metabolismo , Animais , Centrômero/genética , Centrômero/metabolismo , Cromossomos/genética , Elementos de DNA Transponíveis/genética , Eucariotos , Evolução Molecular , Rearranjo Gênico/genética , Genômica , Heterocromatina/genética , Heterocromatina/metabolismo , HumanosRESUMO
Dystroglycan, an extracellular matrix receptor, has essential functions in various tissues. Loss of α-dystroglycan-laminin interaction due to defective glycosylation of α-dystroglycan underlies a group of congenital muscular dystrophies often associated with brain malformations, referred to as dystroglycanopathies. The lack of isogenic human dystroglycanopathy cell models has limited our ability to test potential drugs in a human- and neural-specific context. Here, we generated induced pluripotent stem cells (iPSCs) from a severe dystroglycanopathy patient with homozygous FKRP (fukutin-related protein gene) mutation. We showed that CRISPR/Cas9-mediated gene correction of FKRP restored glycosylation of α-dystroglycan in iPSC-derived cortical neurons, whereas targeted gene mutation of FKRP in wild-type cells disrupted this glycosylation. In parallel, we screened 31,954 small molecule compounds using a mouse myoblast line for increased glycosylation of α-dystroglycan. Using human FKRP-iPSC-derived neural cells for hit validation, we demonstrated that compound 4-(4-bromophenyl)-6-ethylsulfanyl-2-oxo-3,4-dihydro-1H-pyridine-5-carbonitrile (4BPPNit) significantly augmented glycosylation of α-dystroglycan, in part through upregulation of LARGE1 glycosyltransferase gene expression. Together, isogenic human iPSC-derived cells represent a valuable platform for facilitating dystroglycanopathy drug discovery and therapeutic development.
Assuntos
Avaliação Pré-Clínica de Medicamentos , Distroglicanas/metabolismo , Células-Tronco Pluripotentes Induzidas/metabolismo , Sequência de Bases , Sistemas CRISPR-Cas , Células Cultivadas , Avaliação Pré-Clínica de Medicamentos/métodos , Distroglicanas/genética , Edição de Genes , Marcação de Genes , Loci Gênicos , Glicosilação/efeitos dos fármacos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Imagem Molecular , Distrofias Musculares/tratamento farmacológico , Distrofias Musculares/etiologia , Distrofias Musculares/metabolismo , Mutação , N-Acetilglucosaminiltransferases/genética , N-Acetilglucosaminiltransferases/metabolismo , Células-Tronco Neurais/metabolismo , Neurônios/metabolismo , Pentosiltransferases/genética , Pentosiltransferases/metabolismoRESUMO
BACKGROUND: Large palindromes (inverted repeats) make up substantial proportions of mammalian sex chromosomes, often contain genes, and have high rates of structural variation arising via ectopic recombination. As a result, they underlie many genomic disorders. Maintenance of the palindromic structure by gene conversion between the arms has been documented, but over longer time periods, palindromes are remarkably labile. Mechanisms of origin and loss of palindromes have, however, received little attention. RESULTS: Here, we use fiber-FISH, 10x Genomics Linked-Read sequencing, and breakpoint PCR sequencing to characterize the structural variation of the P8 palindrome on the human Y chromosome, which contains two copies of the VCY (Variable Charge Y) gene. We find a deletion of almost an entire arm of the palindrome, leading to death of the palindrome, a size increase by recruitment of adjacent sequence, and other complex changes including the formation of an entire new palindrome nearby. Together, these changes are found in ~ 1% of men, and we can assign likely molecular mechanisms to these mutational events. As a result, healthy men can have 1-4 copies of VCY. CONCLUSIONS: Gross changes, especially duplications, in palindrome structure can be relatively frequent and facilitate the evolution of sex chromosomes in humans, and potentially also in other mammalian species.
Assuntos
Cromossomos Humanos Y , Sequências Repetidas Invertidas , Proteínas Nucleares/genética , Sequência de Bases , Variações do Número de Cópias de DNA , HumanosRESUMO
Spiradenoma and cylindroma are distinctive skin adnexal tumors with sweat gland differentiation and potential for malignant transformation and aggressive behaviour. We present the genomic analysis of 75 samples from 57 representative patients including 15 cylindromas, 17 spiradenomas, 2 cylindroma-spiradenoma hybrid tumors, and 24 low- and high-grade spiradenocarcinoma cases, together with morphologically benign precursor regions of these cancers. We reveal somatic or germline alterations of the CYLD gene in 15/15 cylindromas and 5/17 spiradenomas, yet only 2/24 spiradenocarcinomas. Notably, we find a recurrent missense mutation in the kinase domain of the ALPK1 gene in spiradenomas and spiradenocarcinomas, which is mutually exclusive from mutation of CYLD and can activate the NF-κB pathway in reporter assays. In addition, we show that high-grade spiradenocarcinomas carry loss-of-function TP53 mutations, while cylindromas may have disruptive mutations in DNMT3A. Thus, we reveal the genomic landscape of adnexal tumors and therapeutic targets.
Assuntos
Carcinoma Adenoide Cístico/genética , Enzima Desubiquitinante CYLD/genética , Proteínas Quinases/genética , Neoplasias das Glândulas Sudoríparas/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Carcinoma Adenoide Cístico/patologia , Estudos de Coortes , DNA (Citosina-5-)-Metiltransferases/genética , DNA Metiltransferase 3A , Análise Mutacional de DNA , Feminino , Humanos , Mutação com Perda de Função , Masculino , Pessoa de Meia-Idade , Mutação de Sentido Incorreto , Domínios Proteicos/genética , Neoplasias das Glândulas Sudoríparas/patologia , Glândulas Sudoríparas/patologia , Proteína Supressora de Tumor p53/genética , Sequenciamento do ExomaRESUMO
Human RBMY1 genes are located in four variable-sized clusters on the Y chromosome, expressed in male germ cells and possibly associated with sperm motility. We have re-investigated the mutational background and evolutionary history of the RBMY1 copy number distribution in worldwide samples and its relevance to sperm parameters in an Estonian cohort of idiopathic male factor infertility subjects. We estimated approximate RBMY1 copy numbers in 1218 1000 Genomes Project phase 3 males from sequencing read-depth, then chose 14 for valid ation by multicolour fibre-FISH. These fibre-FISH samples provided accurate calibration standards for the entire panel and led to detailed insights into population variation and mutational mechanisms. RBMY1 copy number worldwide ranged from 3 to 13 with a mode of 8. The two larger proximal clusters were the most variable, and additional duplications, deletions and inversions were detected. Placing the copy number estimates onto the published Y-SNP-based phylogeny of the same samples suggested a minimum of 562 mutational changes, translating to a mutation rate of 2.20 × 10-3 (95% CI 1.94 × 10-3 to 2.48 × 10-3) per father-to-son Y-transmission, higher than many short tandem repeat (Y-STRs), and showed no evidence for selection for increased or decreased copy number, but possible copy number stabilizing selection. An analysis of RBMY1 copy numbers among 376 infertility subjects failed to replicate a previously reported association with sperm motility and showed no significant effect on sperm count and concentration, serum follicle stimulating hormone (FSH), luteinizing hormone (LH) and testosterone levels or testicular and semen volume. These results provide the first in-depth insights into the structural rearrangements underlying RBMY1 copy number variation across diverse human lineages.
Assuntos
Cromossomos Humanos Y , Variações do Número de Cópias de DNA , Evolução Molecular , Proteínas Nucleares/genética , Proteínas de Ligação a RNA/genética , Hibridização Genômica Comparativa , Genoma Humano , Genômica/métodos , Humanos , Hibridização in Situ Fluorescente , Masculino , Família Multigênica , Mutação , Filogenia , Espermatozoides/metabolismoRESUMO
BACKGROUND: CRISPR-Cas9 genome editing is widely used to study gene function, from basic biology to biomedical research. Structural rearrangements are a ubiquitous feature of cancer cells and their impact on the functional consequences of CRISPR-Cas9 gene-editing has not yet been assessed. RESULTS: Utilizing CRISPR-Cas9 knockout screens for 250 cancer cell lines, we demonstrate that targeting structurally rearranged regions, in particular tandem or interspersed amplifications, is highly detrimental to cellular fitness in a gene-independent manner. In contrast, amplifications caused by whole chromosomal duplication have little to no impact on fitness. This effect is cell line specific and dependent on the ploidy status. We devise a copy-number ratio metric that substantially improves the detection of gene-independent cell fitness effects in CRISPR-Cas9 screens. Furthermore, we develop a computational tool, called Crispy, to account for these effects on a single sample basis and provide corrected gene fitness effects. CONCLUSION: Our analysis demonstrates the importance of structural rearrangements in mediating the effect of CRISPR-Cas9-induced DNA damage, with implications for the use of CRISPR-Cas9 gene-editing in cancer cells.
Assuntos
Sistemas CRISPR-Cas , Variação Estrutural do Genoma , Genômica/métodos , Sequenciamento Completo do Genoma , Linhagem Celular Tumoral , Humanos , Neoplasias/genética , Ploidias , SoftwareRESUMO
Glycophorin A and glycophorin B are red blood cell surface proteins and are both receptors for the parasite Plasmodium falciparum, which is the principal cause of malaria in sub-Saharan Africa. DUP4 is a complex structural genomic variant that carries extra copies of a glycophorin A-glycophorin B fusion gene and has a dramatic effect on malaria risk by reducing the risk of severe malaria by up to 40%. Using fiber-FISH and Illumina sequencing, we validate the structural arrangement of the glycophorin locus in the DUP4 variant and reveal somatic variation in copy number of the glycophorin B-glycophorin A fusion gene. By developing a simple, specific, PCR-based assay for DUP4, we show that the DUP4 variant reaches a frequency of 13% in the population of a malaria-endemic village in south-eastern Tanzania. We genotype a substantial proportion of that village and demonstrate an association of DUP4 genotype with hemoglobin levels, a phenotype related to malaria, using a family-based association test. Taken together, we show that DUP4 is a complex structural variant that may be susceptible to somatic variation and show that DUP4 is associated with a malarial-related phenotype in a longitudinally followed population.
Assuntos
Variação Estrutural do Genoma/genética , Glicoforinas/genética , Hemoglobinas/genética , Malária/genética , Linhagem Celular , Criança , Pré-Escolar , Eritrócitos/metabolismo , Feminino , Genótipo , Humanos , Estudos Longitudinais , Masculino , Mosaicismo , Fenótipo , Plasmodium falciparum/genética , TanzâniaRESUMO
The poor correlation of mutational landscapes with phenotypes limits our understanding of the pathogenesis and metastasis of pancreatic ductal adenocarcinoma (PDAC). Here we show that oncogenic dosage-variation has a critical role in PDAC biology and phenotypic diversification. We find an increase in gene dosage of mutant KRAS in human PDAC precursors, which drives both early tumorigenesis and metastasis and thus rationalizes early PDAC dissemination. To overcome the limitations posed to gene dosage studies by the stromal richness of PDAC, we have developed large cell culture resources of metastatic mouse PDAC. Integration of cell culture genomes, transcriptomes and tumour phenotypes with functional studies and human data reveals additional widespread effects of oncogenic dosage variation on cell morphology and plasticity, histopathology and clinical outcome, with the highest KrasMUT levels underlying aggressive undifferentiated phenotypes. We also identify alternative oncogenic gains (Myc, Yap1 or Nfkb2), which collaborate with heterozygous KrasMUT in driving tumorigenesis, but have lower metastatic potential. Mechanistically, different oncogenic gains and dosages evolve along distinct evolutionary routes, licensed by defined allelic states and/or combinations of hallmark tumour suppressor alterations (Cdkn2a, Trp53, Tgfß-pathway). Thus, evolutionary constraints and contingencies direct oncogenic dosage gain and variation along defined routes to drive the early progression of PDAC and shape its downstream biology. Our study uncovers universal principles of Ras-driven oncogenesis that have potential relevance beyond pancreatic cancer.
Assuntos
Carcinoma Ductal Pancreático/genética , Carcinoma Ductal Pancreático/patologia , Evolução Molecular , Dosagem de Genes , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/patologia , Proteínas Proto-Oncogênicas p21(ras)/genética , Proteínas Adaptadoras de Transdução de Sinal/genética , Alelos , Animais , Carcinogênese/genética , Proteínas de Ciclo Celular , Inibidor p16 de Quinase Dependente de Ciclina/genética , Progressão da Doença , Feminino , Genes myc , Genes p53 , Humanos , Masculino , Camundongos , Mutação , Subunidade p52 de NF-kappa B/genética , Metástase Neoplásica/genética , Proteínas Nucleares/genética , Fenótipo , Fosfoproteínas/genética , Fatores de Transcrição/genética , Transcriptoma/genética , Fator de Crescimento Transformador beta1/genética , Proteínas de Sinalização YAPRESUMO
Haematopoietic stem cells renew blood. Accumulation of DNA damage in these cells promotes their decline, while misrepair of this damage initiates malignancies. Here we describe the features and mutational landscape of DNA damage caused by acetaldehyde, an endogenous and alcohol-derived metabolite. This damage results in DNA double-stranded breaks that, despite stimulating recombination repair, also cause chromosome rearrangements. We combined transplantation of single haematopoietic stem cells with whole-genome sequencing to show that this damage occurs in stem cells, leading to deletions and rearrangements that are indicative of microhomology-mediated end-joining repair. Moreover, deletion of p53 completely rescues the survival of aldehyde-stressed and mutated haematopoietic stem cells, but does not change the pattern or the intensity of genome instability within individual stem cells. These findings characterize the mutation of the stem-cell genome by an alcohol-derived and endogenous source of DNA damage. Furthermore, we identify how the choice of DNA-repair pathway and a stringent p53 response limit the transmission of aldehyde-induced mutations in stem cells.
Assuntos
Acetaldeído/metabolismo , Quebras de DNA de Cadeia Dupla/efeitos dos fármacos , Etanol/metabolismo , Etanol/farmacologia , Instabilidade Genômica/efeitos dos fármacos , Células-Tronco Hematopoéticas/efeitos dos fármacos , Células-Tronco Hematopoéticas/patologia , Mutação , Álcool Desidrogenase/deficiência , Álcool Desidrogenase/genética , Álcool Desidrogenase/metabolismo , Animais , Sobrevivência Celular/efeitos dos fármacos , Reparo do DNA por Junção de Extremidades , Etanol/administração & dosagem , Anemia de Fanconi/genética , Anemia de Fanconi/metabolismo , Anemia de Fanconi/patologia , Proteína do Grupo de Complementação D2 da Anemia de Fanconi/deficiência , Proteína do Grupo de Complementação D2 da Anemia de Fanconi/genética , Proteína do Grupo de Complementação D2 da Anemia de Fanconi/metabolismo , Feminino , Deleção de Genes , Genes p53/genética , Transplante de Células-Tronco Hematopoéticas , Células-Tronco Hematopoéticas/metabolismo , Autoantígeno Ku/metabolismo , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Reparo de DNA por Recombinação/efeitos dos fármacos , Proteína Supressora de Tumor p53/deficiência , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo , Sequenciamento Completo do GenomaRESUMO
We describe the variation in copy number of a ~ 10 kb region overlapping the long intergenic noncoding RNA (lincRNA) gene, TTTY22, within the IR3 inverted repeat on the short arm of the human Y chromosome, leading to individuals with 0-3 copies of this region in the general population. Variation of this CNV is common, with 266 individuals having 0 copies, 943 (including the reference sequence) having 1, 23 having 2 copies, and two having 3 copies, and was validated by breakpoint PCR, fibre-FISH, and 10× Genomics Chromium linked-read sequencing in subsets of 1234 individuals from the 1000 Genomes Project. Mapping the changes in copy number to the phylogeny of these Y chromosomes previously established by the Project identified at least 20 mutational events, and investigation of flanking paralogous sequence variants showed that the mutations involved flanking sequences in 18 of these, and could extend over > 30 kb of DNA. While either gene conversion or double crossover between misaligned sister chromatids could formally explain the 0-2 copy events, gene conversion is the more likely mechanism, and these events include the longest non-allelic gene conversion reported thus far. Chromosomes with three copies of this CNV have arisen just once in our data set via another mechanism: duplication of 420 kb that places the third copy 230 kb proximal to the existing proximal copy. Our results establish gene conversion as a previously under-appreciated mechanism of generating copy number changes in humans and reveal the exceptionally large size of the conversion events that can occur.
Assuntos
Cromossomos Humanos Y/genética , Variações do Número de Cópias de DNA , Conversão Gênica , Humanos , Filogenia , RNA Longo não Codificante/genética , Análise de Sequência de DNARESUMO
The human amylase gene cluster includes the human salivary (AMY1) and pancreatic amylase genes (AMY2A and AMY2B), and is a highly variable and dynamic region of the genome. Copy number variation (CNV) of AMY1 has been implicated in human dietary adaptation, and in population association with obesity, but neither of these findings has been independently replicated. Despite these functional implications, the structural genomic basis of CNV has only been defined in detail very recently. In this work, we use high-resolution analysis of copy number, and analysis of segregation in trios, to define new, independent allelic series of amylase CNVs in sub-Saharan Africans, including a series of higher-order expansions of a unit consisting of one copy each of AMY1, AMY2A, and AMY2B. We use fiber-FISH (fluorescence in situ hybridization) to define unexpected complexity in the accompanying rearrangements. These findings demonstrate recurrent involvement of the amylase gene region in genomic instability, involving at least five independent rearrangements of the pancreatic amylase genes (AMY2A and AMY2B). Structural features shared by fundamentally distinct lineages strongly suggest that the common ancestral state for the human amylase cluster contained more than one, and probably three, copies of AMY1.
Assuntos
Amilases/genética , Variações do Número de Cópias de DNA , Dosagem de Genes , Adulto , Alelos , Criança , Feminino , Ordem dos Genes , Estudos de Associação Genética , Loci Gênicos , Haplótipos , Humanos , Hibridização in Situ Fluorescente , Masculino , Repetições de Microssatélites , Família MultigênicaRESUMO
We report the sequences of 1,244 human Y chromosomes randomly ascertained from 26 worldwide populations by the 1000 Genomes Project. We discovered more than 65,000 variants, including single-nucleotide variants, multiple-nucleotide variants, insertions and deletions, short tandem repeats, and copy number variants. Of these, copy number variants contribute the greatest predicted functional impact. We constructed a calibrated phylogenetic tree on the basis of binary single-nucleotide variants and projected the more complex variants onto it, estimating the number of mutations for each class. Our phylogeny shows bursts of extreme expansion in male numbers that have occurred independently among each of the five continental superpopulations examined, at times of known migrations and technological innovations.