ABSTRACT
Recurrent DNA break clusters (RDCs) are replication-transcription collision hotspots; many are unique to neural progenitor cells. Through high-resolution replication sequencing and a capture-ligation assay in mouse neural progenitor cells experiencing replication stress, we unravel the replication features dictating RDC location and orientation. Most RDCs occur at the replication forks traversing timing transition regions (TTRs), where sparse replication origins connect unidirectional forks. Leftward-moving forks generate telomere-connected DNA double-strand breaks (DSBs), while rightward-moving forks lead to centromere-connected DSBs. Strand-specific mapping for DNA-bound RNA reveals co-transcriptional dual-strand DNA:RNA hybrids present at a higher density in RDC than in other actively transcribed long genes. In addition, mapping RNA polymerase activity uncovers that head-to-head interactions between replication and transcription machinery result in 60% DSB contribution to the head-on compared to 40% for co-directional. Taken together we reveal TTR as a fragile class and show how the linear interaction between transcription and replication impacts genome stability.
Subject(s)
DNA Breaks, Double-Stranded , DNA Replication , Genomic Instability , Transcription, Genetic , Animals , Mice , Neural Stem Cells/metabolism , DNA/metabolism , DNA/genetics , Replication Origin , Telomere/metabolism , Telomere/genetics , Centromere/metabolism , Centromere/geneticsABSTRACT
Recurrent DNA break clusters (RDCs) are replication-transcription collision hotspots; many are unique to neural progenitor cells. Through high-resolution replication sequencing and a capture-ligation assay in mouse neural progenitor cells experiencing replication stress, we unraveled the replication features dictating RDC location and orientation. Most RDCs occur at the replication forks traversing timing transition regions (TTRs), where sparse replication origins connect unidirectional forks. Leftward-moving forks generate telomere-connected DNA double-strand breaks (DSBs), while rightward-moving forks lead to centromere-connected DSBs. Strand-specific mapping for DNA-bound RNA revealed co-transcriptional dual-strand DNA:RNA hybrids present at a higher density in RDC than in other actively transcribed long genes. In addition, mapping RNA polymerase activity revealed that head-to-head interactions between replication and transcription machinery resulted in 60% DSB contribution to the head-on compared to 40% for co-directional. Our findings revealed TTR as a novel fragile class and highlighted how the linear interaction between transcription and replication impacts genome stability.
ABSTRACT
Biallelic pathogenic variants in PGAP3 cause a rare glycosylphosphatidyl-inositol biogenesis disorder, PGAP3-CDG. This multisystem condition presents with a predominantly neurological phenotype, including developmental delay, intellectual disability, seizures, and hyperphosphatemia. Here, we summarized the phenotype of sixty-five individuals including six unreported individuals from our CDG natural history study with a confirmed PGAP3-CDG diagnosis. Common additional features found in this disorder included brain malformations, behavioral abnormalities, cleft palate, and characteristic facial features. This report aims to review the genetic and metabolic findings and characterize the disease's phenotype while highlighting the necessary clinical approach to improve the management of this rare CDG.
Subject(s)
Abnormalities, Multiple , Congenital Disorders of Glycosylation , Intellectual Disability , Humans , Abnormalities, Multiple/genetics , Glycosylation , Phenotype , Intellectual Disability/genetics , Intellectual Disability/pathology , Seizures , Congenital Disorders of Glycosylation/genetics , Congenital Disorders of Glycosylation/diagnosis , Carboxylic Ester Hydrolases/genetics , Receptors, Cell Surface/geneticsABSTRACT
We present CARDIO:DE, the first freely available and distributable large German clinical corpus from the cardiovascular domain. CARDIO:DE encompasses 500 clinical routine German doctor's letters from Heidelberg University Hospital, which were manually annotated. Our prospective study design complies well with current data protection regulations and allows us to keep the original structure of clinical documents consistent. In order to ease access to our corpus, we manually de-identified all letters. To enable various information extraction tasks the temporal information in the documents was preserved. We added two high-quality manual annotation layers to CARDIO:DE, (1) medication information and (2) CDA-compliant section classes. To the best of our knowledge, CARDIO:DE is the first freely available and distributable German clinical corpus in the cardiovascular domain. In summary, our corpus offers unique opportunities for collaborative and reproducible research on natural language processing models for German clinical texts.