RESUMO
The 10q11.22 chromosomal region is a duplication-rich interval of the human genome and one of the last to be fully assembled. It carries copy number-variable genes associated with intellectual disability, bipolar disorder, and obesity. In this study, we characterized the structural diversity at this locus by analyzing 64 haploid assemblies produced by the Human Pangenome Reference Consortium. We identified 11 alternative haplotypes that differ in the copy number and/or orientation of large genomic segments, ranging from hundreds of kilobase pairs (kbp) to over one megabase pair (Mbp). We uncovered a 2.4 Mbp size difference between the shortest and longest haplotypes. Breakpoint analysis revealed that genomic instability results from nonallelic homologous recombination between segmental duplication (SD) pairs with varying similarity (94.4%-99.6%). Nonetheless, these pairs generally recombine at positions where their identity is higher (>99.6%). Recurrent inversions occur with different breakpoints within the same inverted SD pair. Inversion polymorphisms shuffle the entire SD arrangement, creating new predispositions to copy-number variations. The SD architecture is associated with a catarrhine-specific subgroup of the AGAP gene family, which likely triggered the accumulation of SDs at this locus over the past 25 million years of human evolution. Our results reveal extensive structural diversity and genomic instability at the 10q11.22 locus, and expand the general understanding of the mutational mechanisms behind SD-mediated rearrangements.
RESUMO
The ALF transcription factor paralogs, AFF1, AFF2, AFF3, and AFF4, are components of the transcriptional super elongation complex that regulates expression of genes involved in neurogenesis and development. We describe an autosomal dominant disorder associated with de novo missense variants in the degron of AFF3, a nine amino acid sequence important for its binding to ubiquitin ligase, or with de novo deletions of this region. The sixteen affected individuals we identified, along with two previously reported individuals, present with a recognizable pattern of anomalies, which we named KINSSHIP syndrome (KI for horseshoe kidney, NS for Nievergelt/Savarirayan type of mesomelic dysplasia, S for seizures, H for hypertrichosis, I for intellectual disability, and P for pulmonary involvement), partially overlapping the AFF4-associated CHOPS syndrome. Whereas homozygous Aff3 knockout mice display skeletal anomalies, kidney defects, brain malformations, and neurological anomalies, knockin animals modeling one of the microdeletions and the most common of the missense variants identified in affected individuals presented with lower mesomelic limb deformities like KINSSHIP-affected individuals and early lethality, respectively. Overexpression of AFF3 in zebrafish resulted in body axis anomalies, providing some support for the pathological effect of increased amount of AFF3. The only partial phenotypic overlap of AFF3- and AFF4-associated syndromes and the previously published transcriptome analyses of ALF transcription factors suggest that these factors are not redundant and each contributes uniquely to proper development.
Assuntos
Encefalopatias/genética , Epilepsia/genética , Rim Fundido/genética , Deficiência Intelectual/genética , Mutação de Sentido Incorreto , Proteínas Nucleares/genética , Osteocondrodisplasias/genética , Adolescente , Sequência de Aminoácidos , Animais , Encefalopatias/etiologia , Criança , Pré-Escolar , Epilepsia/complicações , Evolução Molecular , Feminino , Frequência do Gene , Humanos , Lactente , Masculino , Camundongos , Modelos Moleculares , Proteínas Nucleares/química , Proteínas Nucleares/deficiência , Fenótipo , Estabilidade Proteica , Síndrome , Fatores de Elongação da Transcrição/química , Fatores de Elongação da Transcrição/genética , Adulto Jovem , Peixe-Zebra/genéticaRESUMO
Non-Syndromic Hereditary Hearing Loss (NSHHL) is a genetically heterogeneous sensory disorder with about 120 genes already associated. Through exome sequencing (ES) and data aggregation, we identified a family with six affected individuals and one unrelated NSHHL patient with predicted-to-be deleterious missense variants in USP48. We also uncovered an eighth patient presenting unilateral cochlear nerve aplasia and a de novo splice variant in the same gene. USP48 encodes a ubiquitin carboxyl-terminal hydrolase under evolutionary constraint. Pathogenicity of the variants is supported by in vitro assays that showed that the mutated proteins are unable to hydrolyze tetra-ubiquitin. Correspondingly, three-dimensional representation of the protein containing the familial missense variant is situated in a loop that might influence the binding to ubiquitin. Consistent with a contribution of USP48 to auditory function, immunohistology showed that the encoded protein is expressed in the developing human inner ear, specifically in the spiral ganglion neurons, outer sulcus, interdental cells of the spiral limbus, stria vascularis, Reissner's membrane and in the transient Kolliker's organ that is essential for auditory development. Engineered zebrafish knocked-down for usp48, the USP48 ortholog, presented with a delayed development of primary motor neurons, less developed statoacoustic neurons innervating the ears, decreased swimming velocity and circling swimming behavior indicative of vestibular dysfunction and hearing impairment. Corroboratingly, acoustic startle response assays revealed a significant decrease of auditory response of zebrafish lacking usp48 at 600 and 800 Hz wavelengths. In conclusion, we describe a novel autosomal dominant NSHHL gene through a multipronged approach combining ES, animal modeling, immunohistology and molecular assays.
Assuntos
Perda Auditiva , Peixe-Zebra , Animais , Perda Auditiva/genética , Humanos , Hidrolases , Reflexo de Sobressalto , Ubiquitina , Proteases Específicas de Ubiquitina , Peixe-Zebra/genéticaRESUMO
Human centromeres are mainly composed of alpha satellite DNA hierarchically organized as higher-order repeats (HORs). Alpha satellite dynamics is shown by sequence homogenization in centromeric arrays and by its transfer to other centromeric locations, for example, during the maturation of new centromeres. We identified during prenatal aneuploidy diagnosis by fluorescent in situ hybridization a de novo insertion of alpha satellite DNA from the centromere of chromosome 18 (D18Z1) into cytoband 15q26. Although bound by CENP-B, this locus did not acquire centromeric functionality as demonstrated by the lack of constriction and the absence of CENP-A binding. The insertion was associated with a 2.8-kbp deletion and likely occurred in the paternal germline. The site was enriched in long terminal repeats and located â¼10 Mbp from the location where a centromere was ancestrally seeded and became inactive in the common ancestor of humans and apes 20-25 million years ago. Long-read mapping to the T2T-CHM13 human genome assembly revealed that the insertion derives from a specific region of chromosome 18 centromeric 12-mer HOR array in which the monomer size follows a regular pattern. The rearrangement did not directly disrupt any gene or predicted regulatory element and did not alter the methylation status of the surrounding region, consistent with the absence of phenotypic consequences in the carrier. This case demonstrates a likely rare but new class of structural variation that we name "alpha satellite insertion." It also expands our knowledge on alphoid DNA dynamics and conveys the possibility that alphoid arrays can relocate near vestigial centromeric sites.
Assuntos
Centrômero , Proteínas Cromossômicas não Histona , Centrômero/genética , Centrômero/metabolismo , Proteína B de Centrômero/genética , Proteína B de Centrômero/metabolismo , Proteínas Cromossômicas não Histona/genética , DNA Satélite/genética , Humanos , Hibridização in Situ FluorescenteRESUMO
Human-specific duplications at chromosome 16p11.2 mediate recurrent pathogenic 600 kbp BP4-BP5 copy-number variations, which are among the most common genetic causes of autism. These copy-number polymorphic duplications are under positive selection and include three to eight copies of BOLA2, a gene involved in the maturation of cytosolic iron-sulfur proteins. To investigate the potential advantage provided by the rapid expansion of BOLA2, we assessed hematological traits and anemia prevalence in 379,385 controls and individuals who have lost or gained copies of BOLA2: 89 chromosome 16p11.2 BP4-BP5 deletion carriers and 56 reciprocal duplication carriers in the UK Biobank. We found that the 16p11.2 deletion is associated with anemia (18/89 carriers, 20%, p = 4e-7, OR = 5), particularly iron-deficiency anemia. We observed similar enrichments in two clinical 16p11.2 deletion cohorts, which included 6/63 (10%) and 7/20 (35%) unrelated individuals with anemia, microcytosis, low serum iron, or low blood hemoglobin. Upon stratification by BOLA2 copy number, our data showed an association between low BOLA2 dosage and the above phenotypes (8/15 individuals with three copies, 53%, p = 1e-4). In parallel, we analyzed hematological traits in mice carrying the 16p11.2 orthologous deletion or duplication, as well as Bola2+/- and Bola2-/- animals. The Bola2-deficient mice and the mice carrying the deletion showed early evidence of iron deficiency, including a mild decrease in hemoglobin, lower plasma iron, microcytosis, and an increased red blood cell zinc-protoporphyrin-to-heme ratio. Our results indicate that BOLA2 participates in iron homeostasis in vivo, and its expansion has a potential adaptive role in protecting against iron deficiency.
Assuntos
Anemia/genética , Transtorno Autístico/genética , Duplicação Cromossômica/genética , Cromossomos Humanos Par 16/genética , Homeostase/genética , Proteínas/genética , Animais , Deleção Cromossômica , Transtornos Cromossômicos/genética , Variações do Número de Cópias de DNA/genética , Feminino , Genótipo , Heterozigoto , Humanos , Ferro , Masculino , FenótipoRESUMO
Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11. rearrangement show that more than 96% of breakpoints occur within the H. sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.
Assuntos
Cromossomos Humanos Par 16/genética , Variações do Número de Cópias de DNA/genética , Evolução Molecular , Predisposição Genética para Doença , Proteínas/genética , Animais , Transtorno Autístico/genética , Quebra Cromossômica , Duplicação Gênica , Homeostase/genética , Humanos , Ferro/metabolismo , Pan troglodytes/genética , Pongo/genética , Proteínas/análise , Recombinação Genética , Especificidade da Espécie , Fatores de TempoRESUMO
Copy-number changes in 16p11.2 contribute significantly to neuropsychiatric traits. Besides the 600 kb BP4-BP5 CNV found in 0.5%-1% of individuals with autism spectrum disorders and schizophrenia and whose rearrangement causes reciprocal defects in head size and body weight, a second distal 220 kb BP2-BP3 CNV is likewise a potent driver of neuropsychiatric, anatomical, and metabolic pathologies. These two CNVs are engaged in complex reciprocal chromatin looping, intimating a functional relationship between genes in these regions that might be relevant to pathomechanism. We assessed the drivers of the distal 16p11.2 duplication by overexpressing each of the nine encompassed genes in zebrafish. Only overexpression of LAT induced a reduction of brain proliferating cells and concomitant microcephaly. Consistently, suppression of the zebrafish ortholog induced an increase of proliferation and macrocephaly. These phenotypes were not unique to zebrafish; Lat knockout mice show brain volumetric changes. Consistent with the hypothesis that LAT dosage is relevant to the CNV pathology, we observed similar effects upon overexpression of CD247 and ZAP70, encoding members of the LAT signalosome. We also evaluated whether LAT was interacting with KCTD13, MVP, and MAPK3, major driver and modifiers of the proximal 16p11.2 600 kb BP4-BP5 syndromes, respectively. Co-injected embryos exhibited an increased microcephaly, suggesting the presence of genetic interaction. Correspondingly, carriers of 1.7 Mb BP1-BP5 rearrangements that encompass both the BP2-BP3 and BP4-BP5 loci showed more severe phenotypes. Taken together, our results suggest that LAT, besides its well-recognized function in T cell development, is a major contributor of the 16p11.2 220 kb BP2-BP3 CNV-associated neurodevelopmental phenotypes.
Assuntos
Proteínas Adaptadoras de Transdução de Sinal/genética , Transtorno Autístico/genética , Encéfalo/patologia , Transtornos Cromossômicos/genética , Cromossomos Humanos Par 16 , Variações do Número de Cópias de DNA , Deficiência Intelectual/genética , Proteínas de Membrana/genética , Microcefalia/genética , Microcefalia/patologia , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Proteínas Adaptadoras de Transdução de Sinal/fisiologia , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Animais , Transtorno Autístico/imunologia , Transtorno Autístico/patologia , Encéfalo/metabolismo , Criança , Pré-Escolar , Deleção Cromossômica , Transtornos Cromossômicos/imunologia , Transtornos Cromossômicos/patologia , Cromossomos Humanos Par 16/genética , Cromossomos Humanos Par 16/imunologia , Estudos de Coortes , Embrião não Mamífero/metabolismo , Embrião não Mamífero/patologia , Feminino , Regulação da Expressão Gênica no Desenvolvimento , Humanos , Lactente , Deficiência Intelectual/imunologia , Deficiência Intelectual/patologia , Masculino , Proteínas de Membrana/metabolismo , Proteínas de Membrana/fisiologia , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Knockout , Pessoa de Meia-Idade , Fenótipo , Fosfoproteínas/fisiologia , Transdução de Sinais , Adulto Jovem , Peixe-Zebra/embriologia , Peixe-Zebra/genética , Proteínas de Peixe-Zebra/genética , Proteínas de Peixe-Zebra/metabolismoRESUMO
We report five individuals with loss-of-function of the X-linked AMMECR1: a girl with a balanced X-autosome translocation and inactivation of the normal X-chromosome; two boys with maternally inherited and de novo nonsense variants; and two half-brothers with maternally inherited microdeletion variants. They present with short stature, cardiac and skeletal abnormalities, and hearing loss. Variants of unknown significance in AMMECR1 in four male patients from two families with partially overlapping phenotypes were previously reported. AMMECR1 is coexpressed with genes implicated in cell cycle regulation, five of which were previously associated with growth and bone alterations. Our knockdown of the zebrafish orthologous gene resulted in phenotypes reminiscent of patients' features. The increased transcript and encoded protein levels of AMMECR1L, an AMMECR1 paralog, in the t(X;9) patient's cells indicate a possible partial compensatory mechanism. AMMECR1 and AMMECR1L proteins dimerize and localize to the nucleus as suggested by their nucleic acid-binding RAGNYA folds. Our results suggest that AMMECR1 is potentially involved in cell cycle control and linked to a new syndrome with growth, bone, heart, and kidney alterations with or without elliptocytosis.
Assuntos
Osso e Ossos/fisiologia , Coração/fisiologia , Proteínas/genética , Animais , Western Blotting , Osso e Ossos/metabolismo , Ciclo Celular/genética , Ciclo Celular/fisiologia , Linhagem Celular , Exoma/genética , Feminino , Células HeLa , Humanos , Masculino , Sequenciamento Completo do Genoma , Peixe-ZebraRESUMO
Dicentric chromosomes are products of genomic rearrangements that place two centromeres on the same chromosome. Due to the presence of two primary constrictions, they are inherently unstable and overcome their instability by epigenetically inactivating and/or deleting one of the two centromeres, thus resulting in functionally monocentric chromosomes that segregate normally during cell division. Our understanding to date of dicentric chromosome formation, behavior and fate has been largely inferred from observational studies in plants and humans as well as artificially produced de novo dicentrics in yeast and in human cells. We investigate the most recent product of a chromosome fusion event fixed in the human lineage, human chromosome 2, whose stability was acquired by the suppression of one centromere, resulting in a unique difference in chromosome number between humans (46 chromosomes) and our most closely related ape relatives (48 chromosomes). Using molecular cytogenetics, sequencing, and comparative sequence data, we deeply characterize the relicts of the chromosome 2q ancestral centromere and its flanking regions, gaining insight into the ancestral organization that can be easily broadened to all acrocentric chromosome centromeres. Moreover, our analyses offered the opportunity to trace the evolutionary history of rDNA and satellite III sequences among great apes, thus suggesting a new hypothesis for the preferential inactivation of some human centromeres, including IIq. Our results suggest two possible centromere inactivation models to explain the evolutionarily stabilization of human chromosome 2 over the last 5-6 million years. Our results strongly favor centromere excision through a one-step process.
Assuntos
Centrômero/genética , Cromossomos Humanos Par 2 , Centrômero/fisiologia , DNA Antigo , Evolução Molecular , Humanos , Translocação GenéticaRESUMO
The 16p11.2 600 kb copy-number variants (CNVs) are associated with mirror phenotypes on BMI, head circumference, and brain volume and represent frequent genetic lesions in autism spectrum disorders (ASDs) and schizophrenia. Here we interrogated the transcriptome of individuals carrying reciprocal 16p11.2 CNVs. Transcript perturbations correlated with clinical endophenotypes and were enriched for genes associated with ASDs, abnormalities of head size, and ciliopathies. Ciliary gene expression was also perturbed in orthologous mouse models, raising the possibility that ciliary dysfunction contributes to 16p11.2 pathologies. In support of this hypothesis, we found structural ciliary defects in the CA1 hippocampal region of 16p11.2 duplication mice. Moreover, by using an established zebrafish model, we show genetic interaction between KCTD13, a key driver of the mirrored neuroanatomical phenotypes of the 16p11.2 CNV, and ciliopathy-associated genes. Overexpression of BBS7 rescues head size and neuroanatomical defects of kctd13 morphants, whereas suppression or overexpression of CEP290 rescues phenotypes induced by KCTD13 under- or overexpression, respectively. Our data suggest that dysregulation of ciliopathy genes contributes to the clinical phenotypes of these CNVs.
Assuntos
Transtornos Globais do Desenvolvimento Infantil/genética , Cromossomos Humanos Par 16/genética , Variações do Número de Cópias de DNA/genética , Esquizofrenia/genética , Animais , Encéfalo , Criança , Transtornos Globais do Desenvolvimento Infantil/patologia , Deleção Cromossômica , Corpo Ciliar/metabolismo , Corpo Ciliar/patologia , Regulação da Expressão Gênica , Humanos , Camundongos , Canais de Potássio de Abertura Dependente da Tensão da Membrana/genética , Esquizofrenia/patologia , Transcriptoma , Peixe-Zebra , Proteínas de Peixe-Zebra/genéticaRESUMO
Grapevine (Vitis vinifera L.) is one of the world's most important crop plants, which is of large economic value for fruit and wine production. There is much interest in identifying genomic variations and their functional effects on inter-varietal, phenotypic differences. Using an approach developed for the analysis of human and mammalian genomes, which combines high-throughput sequencing, array comparative genomic hybridization, fluorescent in situ hybridization and quantitative PCR, we created an inter-varietal atlas of structural variations and single nucleotide variants (SNVs) for the grapevine genome analyzing four economically and genetically relevant table grapevine varieties. We found 4.8 million SNVs and detected 8% of the grapevine genome to be affected by genomic variations. We identified more than 700 copy number variation (CNV) regions and more than 2000 genes subjected to CNV as potential candidates for phenotypic differences between varieties.
Assuntos
Genoma de Planta/genética , Vitis/genética , Hibridização Genômica Comparativa/métodos , Variações do Número de Cópias de DNA/genética , Hibridização in Situ Fluorescente , Reação em Cadeia da PolimeraseRESUMO
Human and chimpanzee genomes are 98.8% identical within comparable sequences. However, they differ structurally in nine pericentric inversions, one fusion that originated human chromosome 2, and content and localization of heterochromatin and lineage-specific segmental duplications. The possible functional consequences of these cytogenetic and structural differences are not fully understood and their possible involvement in speciation remains unclear. We show that subtelomeric regions--regions that have a species-specific organization, are more divergent in sequence, and are enriched in genes and recombination hotspots--are significantly enriched for species-specific histone modifications that decorate transcription start sites in different tissues in both human and chimpanzee. The human lineage-specific chromosome 2 fusion point and ancestral centromere locus as well as chromosome 1 and 18 pericentric inversion breakpoints showed enrichment of human-specific H3K4me3 peaks in the prefrontal cortex. Our results reveal an association between plastic regions and potential novel regulatory elements.
Assuntos
Centrômero/genética , Genoma Humano , Histonas/genética , Telômero/genética , Animais , Aberrações Cromossômicas , Pontos de Quebra do Cromossomo , Histonas/metabolismo , Humanos , Especificidade de Órgãos , Pan troglodytes/genética , Córtex Pré-Frontal/metabolismo , Especificidade da Espécie , Sítio de Iniciação de TranscriçãoRESUMO
Ape chromosomes homologous to human chromosomes 14 and 15 were generated by a fission event of an ancestral submetacentric chromosome, where the two chromosomes were joined head-to-tail. The hominoid ancestral chromosome most closely resembles the macaque chromosome 7. In this work, we provide insights into the evolution of human chromosomes 14 and 15, performing a comparative study between macaque boundary region 14/15 and the orthologous human regions. We construct a 1.6-Mb contig of macaque BAC clones in the region orthologous to the ancestral hominoid fission site and use it to define the structural changes that occurred on human 14q pericentromeric and 15q subtelomeric regions. We characterize the novel euchromatin-heterochromatin transition region (â¼20 Mb) acquired during the neocentromere establishment on chromosome 14, and find it was mainly derived through pericentromeric duplications from ancestral hominoid chromosomes homologous to human 2q14-qter and 10. Further, we show a relationship between evolutionary hotspots and low-copy repeat loci for chromosome 15, revealing a possible role of segmental duplications not only in mediating but also in "stitching" together rearrangement breakpoints.
Assuntos
Cromossomos Humanos Par 14/genética , Cromossomos Humanos Par 15/genética , Cromossomos de Mamíferos/genética , Evolução Molecular , Hominidae/genética , Duplicações Segmentares Genômicas , Animais , Pontos de Quebra do Cromossomo , Duplicação Cromossômica , Cromossomos Artificiais Bacterianos , Clonagem Molecular , Eucromatina/genética , Heterocromatina/genética , Humanos , Dados de Sequência Molecular , FilogeniaRESUMO
Core duplicons in the human genome represent ancestral duplication modules shared by the majority of intrachromosomal duplication blocks within a given chromosome. These cores are associated with the emergence of novel gene families in the hominoid lineage, but their genomic organization and gene characterization among other primates are largely unknown. Here, we investigate the genomic organization and expression of the core duplicon on chromosome 17 that led to the expansion of LRRC37 during primate evolution. A comparison of the LRRC37 gene family organization in human, orangutan, macaque, marmoset, and lemur genomes shows the presence of both orthologous and species-specific gene copies in all primate lineages. Expression profiling in mouse, macaque, and human tissues reveals that the ancestral expression of LRRC37 was restricted to the testis. In the hominid lineage, the pattern of LRRC37 became increasingly ubiquitous, with significantly higher levels of expression in the cerebellum and thymus, and showed a remarkable diversity of alternative splice forms. Transfection studies in HeLa cells indicate that the human FLAG-tagged recombinant LRRC37 protein is secreted after cleavage of a transmembrane precursor and its overexpression can induce filipodia formation.
Assuntos
Evolução Molecular , Família Multigênica/genética , Primatas/genética , Proteínas/genética , Processamento Alternativo , Animais , Sequência de Bases , Cerebelo/metabolismo , Cromossomos de Mamíferos/genética , DNA/química , Duplicação Gênica , Perfilação da Expressão Gênica , Genoma/genética , Células HeLa , Humanos , Proteínas de Repetições Ricas em Leucina , Masculino , Camundongos , Dados de Sequência Molecular , Especificidade de Órgãos , Proteínas/metabolismo , Testículo/metabolismo , Timo/metabolismo , Transcrição Gênica/genéticaRESUMO
Antisense RNAs (asRNAs) represent an underappreciated yet crucial layer of gene expression regulation. Generally thought to modulate their sense genes in cis through sequence complementarity or their act of transcription, asRNAs can also regulate different molecular targets in trans, in the nucleus or in the cytoplasm. Here, we performed an in-depth molecular characterization of NFYC Antisense 1 (NFYC-AS1), the asRNA transcribed head-to-head to NFYC subunit of the proliferation-associated NF-Y transcription factor. Our results show that NFYC-AS1 is a prevalently nuclear asRNA peaking early in the cell cycle. Comparative genomics suggests a narrow phylogenetic distribution, with a probable origin in the common ancestor of mammalian lineages. NFYC-AS1 is overexpressed pancancer, preferentially in association with RB1 mutations. Knockdown of NFYC-AS1 by antisense oligonucleotides impairs cell growth in lung squamous cell carcinoma and small cell lung cancer cells, a phenotype recapitulated by CRISPR/Cas9-deletion of its transcription start site. Surprisingly, expression of the sense gene is affected only when endogenous transcription of NFYC-AS1 is manipulated. This suggests that regulation of cell proliferation is at least in part independent of the in cis transcription-mediated effect on NFYC and is possibly exerted by RNA-dependent in trans effects converging on the regulation of G2/M cell cycle phase genes. Accordingly, NFYC-AS1-depleted cells are stuck in mitosis, indicating defects in mitotic progression. Overall, NFYC-AS1 emerged as a cell cycle-regulating asRNA with dual action, holding therapeutic potential in different cancer types, including the very aggressive RB1-mutated tumors.
Assuntos
Neoplasias Pulmonares , RNA Longo não Codificante , Animais , Humanos , Filogenia , Regulação Neoplásica da Expressão Gênica , RNA Antissenso/genética , Ciclo Celular/genética , Proliferação de Células/genética , Neoplasias Pulmonares/genética , RNA Longo não Codificante/genética , Linhagem Celular Tumoral , Movimento Celular , Mamíferos/genética , Fator de Ligação a CCAAT/genéticaRESUMO
The largest multi-gene family in metazoans is the family of olfactory receptor (OR) genes. Human ORs are organized in clusters over most chromosomes and seem to include >0.1% the human genome. Because 369 out of 856 OR genes are mapped on chromosome 11 (HSA11), we sought to determine whether they mediate structural rearrangements involving this chromosome. To this aim, we analyzed 220 specimens collected during diagnostic procedures involving structural rearrangements of chromosome 11. A total of 222 chromosomal abnormalities were included, consisting of inversions, deletions, translocations, duplications, and one insertion, detected by conventional chromosome analysis and/or fluorescence in situ hybridization (FISH) and array comparative genomic hybridization (array-CGH). We verified by bioinformatics and statistical approaches the occurrence of breakpoints in cytobands with or without OR genes. We found that OR genes are not involved in chromosome 11 reciprocal translocations, suggesting that different DNA motifs and mechanisms based on homology or non-homology recombination can cause chromosome 11 structural alterations. We also considered the proximity between the chromosomal territories of chromosome 11 and its partner chromosomes involved in the translocations by using the deposited Hi-C data concerning the possible occurrence of chromosome interactions. Interestingly, most of the breakpoints are located in regions highly involved in chromosome interactions. Further studies should be carried out to confirm the potential role of chromosome territories' proximity in promoting genome structural variation, so fundamental in our understanding of the molecular basis of medical genetics and evolutionary genetics.
Assuntos
Cromossomos Humanos Par 11 , Receptores Odorantes , Humanos , Hibridização Genômica Comparativa , Hibridização in Situ Fluorescente , Aberrações Cromossômicas , Translocação Genética/genética , Receptores Odorantes/genéticaRESUMO
The extent of human genomic structural variation suggests that there must be portions of the genome yet to be discovered, annotated and characterized at the sequence level. We present a resource and analysis of 2,363 new insertion sequences corresponding to 720 genomic loci. We found that a substantial fraction of these sequences are either missing, fragmented or misassigned when compared to recent de novo sequence assemblies from short-read next-generation sequence data. We determined that 18-37% of these new insertions are copy-number polymorphic, including loci that show extensive population stratification among Europeans, Asians and Africans. Complete sequencing of 156 of these insertions identified new exons and conserved noncoding sequences not yet represented in the reference genome. We developed a method to accurately genotype these new insertions by mapping next-generation sequencing datasets to the breakpoint, thereby providing a means to characterize copy-number status for regions previously inaccessible to single-nucleotide polymorphism microarrays.
Assuntos
Mapeamento de Sequências Contíguas/métodos , Genoma Humano , Análise de Sequência de DNA/métodos , Elementos de DNA Transponíveis/genética , Frequência do Gene , Variação Estrutural do Genoma/genética , Genótipo , Humanos , Hibridização in Situ Fluorescente , Dados de Sequência Molecular , Polimorfismo GenéticoRESUMO
Recurrent copy-number variations (CNVs) at chromosome 16p11.2 are associated with neurodevelopmental diseases, skeletal system abnormalities, anemia, and genitourinary defects. Among the 40 protein-coding genes encompassed within the rearrangement, some have roles in leukocyte biology and immunodeficiency, like SPN and CORO1A. We therefore investigated leukocyte differential counts and disease in 16p11.2 CNV carriers. In our clinically-recruited cohort, we identified three deletion carriers from two families (out of 32 families assessed) with neutropenia and lymphopenia. They had no deleterious single-nucleotide or indel variant in known cytopenia genes, suggesting a possible causative role of the deletion. Noticeably, all three individuals had the lowest copy number of the human-specific BOLA2 duplicon (copy-number range: 3-8). Consistent with the lymphopenia and in contrast with the neutropenia associations, adult deletion carriers from UK biobank (n = 74) showed lower lymphocyte (Padj = 0.04) and increased neutrophil (Padj = 8.31e-05) counts. Mendelian randomization studies pinpointed to reduced CORO1A, KIF22, and BOLA2-SMG1P6 expressions being causative for the lower lymphocyte counts. In conclusion, our data suggest that 16p11.2 deletion, and possibly also the lowest dosage of the BOLA2 duplicon, are associated with low lymphocyte counts. There is a trend between 16p11.2 deletion with lower copy-number of the BOLA2 duplicon and higher susceptibility to moderate neutropenia. Higher numbers of cases are warranted to confirm the association with neutropenia and to resolve the involvement of the deletion coupled with deleterious variants in other genes and/or with the structure and copy number of segments in the CNV breakpoint regions.
RESUMO
BACKGROUND: Segmental duplications (SDs) are blocks of genomic sequence of 1-200 kb that map to different loci in a genome and share a sequence identity > 90%. SDs show at the sequence level the same characteristics as other regions of the human genome: they contain both high-copy repeats and gene sequences. SDs play an important role in genome plasticity by creating new genes and modeling genome structure. Although data is plentiful for mammals, not much was known about the representation of SDs in plant genomes. In this regard, we performed a genome-wide analysis of high-identity SDs on the sequenced grapevine (Vitis vinifera) genome (PN40024). RESULTS: We demonstrate that recent SDs (> 94% identity and >= 10 kb in size) are a relevant component of the grapevine genome (85 Mb, 17% of the genome sequence). We detected mitochondrial and plastid DNA and genes (10% of gene annotation) in segmentally duplicated regions of the nuclear genome. In particular, the nine highest copy number genes have a copy in either or both organelle genomes. Further we showed that several duplicated genes take part in the biosynthesis of compounds involved in plant response to environmental stress. CONCLUSIONS: These data show the great influence of SDs and organelle DNA transfers in modeling the Vitis vinifera nuclear DNA structure as well as the impact of SDs in contributing to the adaptive capacity of grapevine and the nutritional content of grape products through genome variation. This study represents a step forward in the full characterization of duplicated genes important for grapevine cultural needs and human health.
Assuntos
Genoma de Planta , Duplicações Segmentares Genômicas , Vitis/genética , Núcleo Celular/genética , Variações do Número de Cópias de DNA , Genoma de Cloroplastos , Genoma Mitocondrial , Genômica/métodos , Análise de Sequência de DNARESUMO
POTE (prostate, ovary, testis, and placenta expressed) genes belong to a primate-specific gene family expressed in prostate, ovary, and testis as well as in several cancers including breast, prostate, and lung cancers. Due to their tumor-specific expression, POTEs are potential oncogenes, therapeutic targets, and biomarkers for these malignancies. This gene family maps within human and primate segmental duplications with a copy number ranging from two to 14 in different species. Due to the high sequence identity among the gene copies, specific efforts are needed to assemble these loci in order to correctly define the organization and evolution of the gene family. Using single-molecule, real-time (SMRT) sequencing, in silico analyses, and molecular cytogenetics, we characterized the structure, copy number, and chromosomal distribution of the POTE genes, as well as their expression in normal and disease tissues, and provided a comparative analysis of the POTE organization and gene structure in primate genomes. We were able, for the first time, to de novo sequence and assemble a POTE tandem duplication in marmoset that is misassembled and collapsed in the reference genome, thus revealing the presence of a second POTE copy. Taken together, our findings provide comprehensive insights into the evolutionary dynamics of the primate-specific POTE gene family, involving gene duplications, deletions, and long interspersed nuclear element (LINE) transpositions to explain the actual repertoire of these genes in human and primate genomes.