Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 1.665
Filter
1.
Am J Hum Genet ; 111(10): 2176-2189, 2024 Oct 03.
Article in English | MEDLINE | ID: mdl-39265574

ABSTRACT

We previously identified a homozygous Alu insertion variant (Alu_Ins) in the 3'-untranslated region (3'-UTR) of SPINK1 as the cause of severe infantile isolated exocrine pancreatic insufficiency. Although we established that Alu_Ins leads to the complete loss of SPINK1 mRNA expression, the precise mechanisms remained elusive. Here, we aimed to elucidate these mechanisms through a hypothesis-driven approach. Initially, we speculated that, owing to its particular location, Alu_Ins could independently disrupt mRNA 3' end formation and/or affect other post-transcriptional processes such as nuclear export and translation. However, employing a 3'-UTR luciferase reporter assay, Alu_Ins was found to result in only an ∼50% reduction in luciferase activity compared to wild type, which is insufficient to account for the severe pancreatic deficiency in the Alu_Ins homozygote. We then postulated that double-stranded RNA (dsRNA) structures formed between Alu elements, an upstream mechanism regulating gene expression, might be responsible. Using RepeatMasker, we identified two Alu elements within SPINK1's third intron, both oriented oppositely to Alu_Ins. Through RNAfold predictions and full-length gene expression assays, we investigated orientation-dependent interactions between these Alu repeats. We provide compelling evidence to link the detrimental effect of Alu_Ins to extensive dsRNA structures formed between Alu_Ins and pre-existing intronic Alu sequences, including the restoration of SPINK1 mRNA expression by aligning all three Alu elements in the same orientation. Given the widespread presence of Alu elements in the human genome and the potential for new Alu insertions at almost any locus, our findings have important implications for detecting and interpreting Alu insertions in disease genes.


Subject(s)
3' Untranslated Regions , Alu Elements , RNA, Double-Stranded , Alu Elements/genetics , Humans , RNA, Double-Stranded/genetics , 3' Untranslated Regions/genetics , Introns/genetics , Mutagenesis, Insertional/genetics , Homozygote , RNA, Messenger/genetics , RNA, Messenger/metabolism
2.
STAR Protoc ; 5(3): 103184, 2024 Sep 20.
Article in English | MEDLINE | ID: mdl-39180746

ABSTRACT

Generating a transgene with a reporter inserted into the genome helps us study endogenous gene expression patterns in model organisms. Here, using Drosophila melanogaster, we present a protocol for generating a P2A-Gal4 insertion through CRISPR-Cas9-mediated homology recombination. We describe the design strategy, steps for constructing the injection plasmids, and the fly-cross scheme for screening the transformants from the G0 generation. This protocol can also be applied to introduce mutations or various genetic tools into the fly genome. For complete details on the use and execution of this protocol, please refer to Li et al.1.


Subject(s)
CRISPR-Cas Systems , Drosophila Proteins , Drosophila melanogaster , Animals , CRISPR-Cas Systems/genetics , Drosophila melanogaster/genetics , Drosophila Proteins/genetics , Drosophila Proteins/metabolism , Transcription Factors/genetics , Transcription Factors/metabolism , Gene Expression/genetics , Animals, Genetically Modified , Mutagenesis, Insertional/methods , Mutagenesis, Insertional/genetics , Homologous Recombination/genetics
3.
J Assist Reprod Genet ; 41(9): 2257-2269, 2024 Sep.
Article in English | MEDLINE | ID: mdl-38951360

ABSTRACT

PURPOSE: Retrotransposons play important roles during early development when they are transiently de-repressed during epigenetic reprogramming. Long interspersed element-1 (L1), the only autonomous retrotransposon in humans, comprises 17% of the human genome. We applied the Single Cell Transposon Insertion Profiling by Sequencing (scTIPseq) to characterize and map L1 insertions in human embryos. METHODS: Sixteen cryopreserved, genetically tested, human blastocysts, were accessed from consenting couples undergoing IVF at NYU Langone Fertility Center. Additionally, four trios (father, mother, and embryos) were also evaluated. scTIPseq was applied to map L1 insertions in all samples, using L1 locations reported in the 1000 Genomes as controls. RESULTS: Twenty-nine unknown and unique insertions were observed in the sixteen embryos. Most were intergenic; no insertions were located in exons or immediately upstream of genes. The location or number of unknown insertions did not differ between euploid and aneuploid embryos, suggesting they are not merely markers of aneuploidy. Rather, scTIPseq provides novel information about sub-chromosomal structural variation in human embryos. Trio analyses showed a parental origin of all L1 insertions in embryos. CONCLUSION: Several studies have measured L1 expression at different stages of development in mice, but this study for the first time reports unknown insertions in human embryos that were inherited from one parent, confirming no de novo L1 insertions occurred in parental germline or during embryogenesis. Since one-third of euploid embryo transfers fail, future studies would be useful for understanding whether these sub-chromosomal genetic variants or de novo L1 insertions affect embryo developmental potential.


Subject(s)
Blastocyst , Long Interspersed Nucleotide Elements , Humans , Long Interspersed Nucleotide Elements/genetics , Blastocyst/metabolism , Female , Embryo, Mammalian/metabolism , Embryonic Development/genetics , Mutagenesis, Insertional/genetics , Aneuploidy , Genome, Human/genetics , Fertilization in Vitro , Male , Genetic Variation/genetics , Mice , Chromosome Mapping/methods
4.
J Med Genet ; 61(10): 950-958, 2024 09 24.
Article in English | MEDLINE | ID: mdl-38960580

ABSTRACT

BACKGROUND: SINE-VNTR-Alu (SVA) retrotransposons move from one genomic location to another in a 'copy-and-paste' manner. They continue to move actively and cause monogenic diseases through various mechanisms. Currently, disease-causing SVA retrotransposons are classified into human-specific young SVA_E or SVA_F subfamilies. In this study, we identified an evolutionarily old SVA_D retrotransposon as a novel cause of occipital horn syndrome (OHS). OHS is an X-linked, copper metabolism disorder caused by dysfunction of the copper transporter, ATP7A. METHODS: We investigated a 16-year-old boy with OHS whose pathogenic variant could not be detected via routine molecular genetic analyses. RESULTS: A 2.8 kb insertion was detected deep within the intron of the patient's ATP7A gene. This insertion caused aberrant mRNA splicing activated by a new donor splice site located within it. Long-read circular consensus sequencing enabled us to accurately read the entire insertion sequence, which contained highly repetitive and GC-rich segments. Consequently, the insertion was identified as an SVA_D retrotransposon. Antisense oligonucleotides (AOs) targeting the new splice site restored the expression of normal transcripts and functional ATP7A proteins. AO treatment alleviated excessive accumulation of copper in patient fibroblasts in a dose-dependent manner. Pedigree analysis revealed that the retrotransposon had moved into the OHS-causing position two generations ago. CONCLUSION: This is the first report of a human monogenic disease caused by the SVA_D retrotransposon. The fact that the evolutionarily old SVA_D is still actively transposed, leading to increased copy numbers may make a notable impact on rare genetic disease research.


Subject(s)
Copper-Transporting ATPases , Introns , Retroelements , Humans , Copper-Transporting ATPases/genetics , Male , Retroelements/genetics , Adolescent , Introns/genetics , Central Nervous System Cysts/genetics , Central Nervous System Cysts/pathology , Genetic Diseases, X-Linked/genetics , Genetic Diseases, X-Linked/pathology , Alu Elements/genetics , Mutagenesis, Insertional/genetics , Brain Diseases/genetics , Brain Diseases/pathology , RNA Splicing/genetics , Cutis Laxa , Ehlers-Danlos Syndrome
5.
Nature ; 631(8021): 593-600, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38926583

ABSTRACT

The current technologies to place new DNA into specific locations in plant genomes are low frequency and error-prone, and this inefficiency hampers genome-editing approaches to develop improved crops1,2. Often considered to be genome 'parasites', transposable elements (TEs) evolved to insert their DNA seamlessly into genomes3-5. Eukaryotic TEs select their site of insertion based on preferences for chromatin contexts, which differ for each TE type6-9. Here we developed a genome engineering tool that controls the TE insertion site and cargo delivered, taking advantage of the natural ability of the TE to precisely excise and insert into the genome. Inspired by CRISPR-associated transposases that target transposition in a programmable manner in bacteria10-12, we fused the rice Pong transposase protein to the Cas9 or Cas12a programmable nucleases. We demonstrated sequence-specific targeted insertion (guided by the CRISPR gRNA) of enhancer elements, an open reading frame and a gene expression cassette into the genome of the model plant Arabidopsis. We then translated this system into soybean-a major global crop in need of targeted insertion technology. We have engineered a TE 'parasite' into a usable and accessible toolkit that enables the sequence-specific targeting of custom DNA into plant genomes.


Subject(s)
Arabidopsis , DNA Transposable Elements , Genetic Engineering , Genome, Plant , Mutagenesis, Insertional , Plants, Genetically Modified , Transposases , Arabidopsis/genetics , CRISPR-Associated Protein 9/metabolism , CRISPR-Associated Protein 9/genetics , CRISPR-Cas Systems/genetics , DNA Transposable Elements/genetics , Enhancer Elements, Genetic/genetics , Gene Editing/methods , Genetic Engineering/methods , Genome, Plant/genetics , Mutagenesis, Insertional/genetics , Open Reading Frames/genetics , Oryza/enzymology , Oryza/genetics , Plant Proteins/genetics , Plant Proteins/metabolism , Plants, Genetically Modified/genetics , RNA, Guide, CRISPR-Cas Systems/genetics , RNA, Guide, CRISPR-Cas Systems/metabolism , Transposases/metabolism , Transposases/genetics
6.
Nature ; 630(8018): 984-993, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38926615

ABSTRACT

Genomic rearrangements, encompassing mutational changes in the genome such as insertions, deletions or inversions, are essential for genetic diversity. These rearrangements are typically orchestrated by enzymes that are involved in fundamental DNA repair processes, such as homologous recombination, or in the transposition of foreign genetic material by viruses and mobile genetic elements1,2. Here we report that IS110 insertion sequences, a family of minimal and autonomous mobile genetic elements, express a structured non-coding RNA that binds specifically to their encoded recombinase. This bridge RNA contains two internal loops encoding nucleotide stretches that base-pair with the target DNA and the donor DNA, which is the IS110 element itself. We demonstrate that the target-binding and donor-binding loops can be independently reprogrammed to direct sequence-specific recombination between two DNA molecules. This modularity enables the insertion of DNA into genomic target sites, as well as programmable DNA excision and inversion. The IS110 bridge recombination system expands the diversity of nucleic-acid-guided systems beyond CRISPR and RNA interference, offering a unified mechanism for the three fundamental DNA rearrangements-insertion, excision and inversion-that are required for genome design.


Subject(s)
DNA , RNA, Untranslated , Recombination, Genetic , Base Pairing , Base Sequence , DNA/genetics , DNA/metabolism , DNA Transposable Elements/genetics , Mutagenesis, Insertional/genetics , Recombinases/metabolism , Recombinases/genetics , Recombination, Genetic/genetics , RNA, Untranslated/genetics , RNA, Untranslated/metabolism
8.
Pathology ; 56(5): 653-661, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38811262

ABSTRACT

Epidermal growth factor receptor (EGFR) exon 20 insertion mutations (E20ins) are the third most frequent mutations observed in non-small cell lung cancer, accounting for approximately 1-10% of all EGFR mutations. In the era of precision medicine and targeted therapies, consistent naming of genetic alterations is crucial to avoid confusion and errors. However, the annotation of EGFR E20ins mutations has been inconsistent, leading to confusion in the scientific literature and product documentation. In this study, our primary objective was to investigate the usage of different annotation related to EGFR E20ins in independent studies. Additionally, we assessed the distribution of EGFR E20ins mutations and estimated the detection coverage expected from each available EGFR E20ins detection assay. A total of 1,418 EGFR E20ins mutations were collected from six studies (FoundationInsights, Geneseeq Technology Inc, mobocertinib phase I/II trial, poziotinib phase II trial, sunvozertinib phase I trial, and Samsung Medical Center) and reorganised according to Human Genome Variation Society (HGVS) nomenclature. Our analysis revealed that the majority of EGFR E20ins mutations requiring correction were 'insertion' or 'deletion-insertion', which should be appropriately designated as 'duplication'. Additionally, duplicated variants were reported using different annotations in each study, and furthermore, even identical variant sequences were annotated differently within the same study. In all six studies, p.A767_V769dup and p.S768_D770dup were the most frequently observed EGFR E20ins. The Oncomine Dx Target Test showed the highest patient coverage at 77.2%, followed by the Droplex EGFR Mutation Test v2 with a patient coverage of 70.5% for EGFR E20ins patients. To ensure comprehensive coverage in real-world settings, it is essential to standardise the annotations for each variant, for example using the HGVS nomenclature. The accurate classification and analysis of drug responsiveness in EGFR E20ins necessitate consideration of the nomenclature, particularly with respect to the locations where the actual mutations occur.


Subject(s)
Carcinoma, Non-Small-Cell Lung , ErbB Receptors , Exons , Lung Neoplasms , Precision Medicine , Humans , Carcinoma, Non-Small-Cell Lung/genetics , Carcinoma, Non-Small-Cell Lung/drug therapy , Carcinoma, Non-Small-Cell Lung/pathology , ErbB Receptors/genetics , Lung Neoplasms/genetics , Lung Neoplasms/drug therapy , Lung Neoplasms/pathology , Exons/genetics , Mutation , Mutagenesis, Insertional/genetics , Terminology as Topic
9.
Plant Physiol ; 196(1): 432-445, 2024 Sep 02.
Article in English | MEDLINE | ID: mdl-38788771

ABSTRACT

Malic acid is an important flavor determinant in apple (Malus × domestica Borkh.) fruit. One known variation controlling malic acid is the A/G single nucleotide polymorphism in an aluminum-activated malate transporter gene (MdMa1). Nevertheless, there are still differences in malic acid content in apple varieties with the same Ma1 genotype (Ma1/Ma1 homozygous), such as 'Honeycrisp' (high malic acid content) and 'Qinguan' (low malic acid content), indicating that other loci may influence malic acid and fruit acidity. Here, the F1 (Filial 1) hybrid generation of 'Honeycrisp' × 'Qinguan' was used to analyze quantitative trait loci for malic acid content. A major locus (Ma7) was identified on chromosome 13. Within this locus, a malate dehydrogenase gene, MDH1 (MdMa7), was the best candidate for further study. Subcellular localization suggested that MdMa7 encodes a cytosolic protein. Overexpression and RNA interference of MdMa7 in apple fruit increased and decreased malic acid content, respectively. An insertion/deletion (indel) in the MdMa7 promoter was found to affect MdMa7 expression and malic acid content in both hybrids and other cultivated varieties. The insertion and deletion genotypes were designated as MA7 and ma7, respectively. The transcription factor MdbHLH74 was found to stimulate MdMa7 expression in the MA7 genotype but not in the ma7 genotype. Transient transformation of fruit showed that MdbHLH74 affected MdMa7 expression and malic acid content in 'Gala' (MA7/MA7) but not in 'Fuji' (ma7/ma7). Our results indicated that genetic variation in the MdMa7 (MDH1) promoter alters the binding ability of the transcription factor MdbHLH74, which alters MdMa7 (MDH1) transcription and the malic acid content in apple fruit, especially in Ma1/Ma1 homozygous accessions.


Subject(s)
Fruit , Gene Expression Regulation, Plant , Malate Dehydrogenase , Malates , Malus , Plant Proteins , Promoter Regions, Genetic , Malus/genetics , Malus/metabolism , Malates/metabolism , Fruit/genetics , Fruit/metabolism , Malate Dehydrogenase/genetics , Malate Dehydrogenase/metabolism , Promoter Regions, Genetic/genetics , Plant Proteins/genetics , Plant Proteins/metabolism , Quantitative Trait Loci/genetics , Mutagenesis, Insertional/genetics , Plants, Genetically Modified , Genes, Plant
10.
Clin Neurol Neurosurg ; 240: 108261, 2024 05.
Article in English | MEDLINE | ID: mdl-38599043

ABSTRACT

This report presents a case of Alexander disease showing clinical characteristics mimicking progressive supranuclear palsy (PSP). A 67-year-old woman complaining of motor disturbance exhibited severe atrophy of medulla, spinal cord, and midbrain tegmentum, as well as periventricular hyperintensity on cerebral MRI. Genetic analysis identified a novel in-frame deletion/insertion mutation in the exon 3 of the GFAP gene. Interestingly, neurological findings and decreased striatal uptake in dopamine transporter SPECT were suggestive of PSP. A novel GFAP gene mutation found in the present case may cause the unique clinical phenotype, which should be differentiated from PSP.


Subject(s)
Alexander Disease , Glial Fibrillary Acidic Protein , Magnetic Resonance Imaging , Supranuclear Palsy, Progressive , Humans , Alexander Disease/genetics , Alexander Disease/diagnostic imaging , Alexander Disease/diagnosis , Female , Supranuclear Palsy, Progressive/genetics , Supranuclear Palsy, Progressive/diagnostic imaging , Aged , Glial Fibrillary Acidic Protein/genetics , Diagnosis, Differential , Tomography, Emission-Computed, Single-Photon , Brain/diagnostic imaging , Brain/pathology , Mutagenesis, Insertional/genetics
11.
Am J Med Genet A ; 194(8): e63581, 2024 08.
Article in English | MEDLINE | ID: mdl-38600862

ABSTRACT

Alu elements are short, interspersed elements located throughout the genome, playing a role in human diversity, and occasionally causing genetic diseases. Here, we report a novel Alu insertion causing Mowat-Wilson syndrome, a rare neurodevelopmental disorder, in an 8-year-old boy displaying the typical clinical features for Mowat-Wilson syndrome. The variant was not initially detected in genome sequencing data, but through deep phenotyping, which pointed to only one plausible candidate gene, manual inspection of genome sequencing alignment data enabled us to identify a de novo heterozygous Alu insertion in exon 8 of the ZEB2 gene. Nanopore long-read sequencing confirmed the Alu insertion, leading to the formation of a premature stop codon and likely haploinsufficiency of ZEB2. This underscores the importance of deep phenotyping and mobile element insertion analysis in uncovering genetic causes of monogenic disorders as these elements might be overlooked in standard next-generation sequencing protocols.


Subject(s)
Alu Elements , Facies , Hirschsprung Disease , Intellectual Disability , Microcephaly , Zinc Finger E-box Binding Homeobox 2 , Humans , Alu Elements/genetics , Microcephaly/genetics , Microcephaly/pathology , Male , Child , Zinc Finger E-box Binding Homeobox 2/genetics , Hirschsprung Disease/genetics , Hirschsprung Disease/pathology , Intellectual Disability/genetics , Intellectual Disability/pathology , Phenotype , Mutagenesis, Insertional/genetics , High-Throughput Nucleotide Sequencing , Exons/genetics
12.
Hum Mol Genet ; 33(11): 1001-1014, 2024 May 18.
Article in English | MEDLINE | ID: mdl-38483348

ABSTRACT

The CEL gene encodes carboxyl ester lipase, a pancreatic digestive enzyme. CEL is extremely polymorphic due to a variable number tandem repeat (VNTR) located in the last exon. Single-base deletions within this VNTR cause the inherited disorder MODY8, whereas little is known about VNTR single-base insertions in pancreatic disease. We therefore mapped CEL insertion variants (CEL-INS) in 200 Norwegian patients with pancreatic neoplastic disorders. Twenty-eight samples (14.0%) carried CEL-INS alleles. Most common were insertions in repeat 9 (9.5%), which always associated with a VNTR length of 13 repeats. The combined INS allele frequency (0.078) was similar to that observed in a control material of 416 subjects (0.075). We performed functional testing in HEK293T cells of a set of CEL-INS variants, in which the insertion site varied from the first to the 12th VNTR repeat. Lipase activity showed little difference among the variants. However, CEL-INS variants with insertions occurring in the most proximal repeats led to protein aggregation and endoplasmic reticulum stress, which upregulated the unfolded protein response. Moreover, by using a CEL-INS-specific antibody, we observed patchy signals in pancreatic tissue from humans without any CEL-INS variant in the germline. Similar pancreatic staining was seen in knock-in mice expressing the most common human CEL VNTR with 16 repeats. CEL-INS proteins may therefore be constantly produced from somatic events in the normal pancreatic parenchyma. This observation along with the high population frequency of CEL-INS alleles strongly suggests that these variants are benign, with a possible exception for insertions in VNTR repeats 1-4.


Subject(s)
Minisatellite Repeats , Pancreas, Exocrine , Humans , Minisatellite Repeats/genetics , Animals , Mice , Pancreas, Exocrine/metabolism , Pancreas, Exocrine/enzymology , HEK293 Cells , Mutagenesis, Insertional/genetics , Alleles , Pancreatic Neoplasms/genetics , Pancreatic Neoplasms/pathology , Pancreatic Neoplasms/enzymology , Gene Frequency , Male , Female , Lipase/genetics
13.
J Integr Plant Biol ; 66(4): 645-659, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38450982

ABSTRACT

ChinaMu is the largest sequence-indexed Mutator (Mu) transposon insertional library in maize (Zea mays). In this study, we made significant improvements to the size and quality of the ChinaMu library. We developed a new Mu-tag isolation method Mu-Tn5-seq (MuT-seq). Compared to the previous method used by ChinaMu, MuT-seq recovered 1/3 more germinal insertions, while requiring only about 1/14 of the sequencing volume and 1/5 of the experimental time. Using MuT-seq, we identified 113,879 germinal insertions from 3,168 Mu-active F1 families. We also assembled a high-quality genome for the Mu-active line Mu-starter, which harbors the initial active MuDR element and was used as the pollen donor for the mutation population. Using the Mu-starter genome, we recovered 33,662 (15.6%) additional germinal insertions in 3,244 (7.4%) genes in the Mu-starter line. The Mu-starter genome also improved the assignment of 117,689 (54.5%) germinal insertions. The newly upgraded ChinaMu dataset currently contains 215,889 high-quality germinal insertions. These insertions cover 32,224 pan-genes in the Mu-starter and B73Ref5 genomes, including 23,006 (80.4%) core genes shared by the two genomes. As a test model, we investigated Mu insertions in the pentatricopeptide repeat (PPR) superfamily, discovering insertions for 92% (449/487) of PPR genes in ChinaMu, demonstrating the usefulness of ChinaMu as a functional genomics resource for maize.


Subject(s)
Chromosomes , DNA Transposable Elements , Humans , DNA Transposable Elements/genetics , Mutagenesis, Insertional/genetics , Base Sequence , Mutation , Zea mays/genetics
14.
Plant Cell Environ ; 47(6): 2011-2026, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38392921

ABSTRACT

Crispr/CAS9-enabled homologous recombination to insert a tag in frame with an endogenous gene can circumvent difficulties such as context-dependent promoter activity that complicate analysis of gene expression and protein accumulation patterns. However, there have been few reports examining whether such gene targeting/gene tagging (GT) can alter expression of the target gene. The enzyme encoded by Δ1-pyrroline-5-carboxylate synthetase 1 (P5CS1) is key for stress-induced proline synthesis and drought resistance, yet its expression pattern and protein localisation have been difficult to assay. We used GT to insert YFP in frame with the 5' or 3' ends of the endogenous P5CS1 and At14a-Like 1 (AFL1) coding regions. Insertion at the 3' end of either gene generated homozygous lines with expression of the gene-YFP fusion indistinguishable from the wild type allele. However, for P5CS1 this occurred only after selfing and advancement to the T5 generation allowed initial homozygous lethality of the insertion to be overcome. Once this was done, the GT-generated P5CS1-YFP plants revealed new information about P5CS1 localisation and tissue-specific expression. In contrast, insertion of YFP at the 5' end of either gene blocked expression. The results demonstrate that GT can be useful for functional analyses of genes that are problematic to properly express by other means but also show that, in some cases, GT can disrupt expression of the target gene.


Subject(s)
Arabidopsis Proteins , Arabidopsis , Arabidopsis/genetics , Arabidopsis/physiology , Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Plants, Genetically Modified , Gene Expression Regulation, Plant , Stress, Physiological/genetics , Mutagenesis, Insertional/genetics , Luminescent Proteins/genetics , Luminescent Proteins/metabolism , Bacterial Proteins/genetics , Bacterial Proteins/metabolism
15.
Cell Rep ; 43(1): 113517, 2024 01 23.
Article in English | MEDLINE | ID: mdl-38142397

ABSTRACT

Randomly barcoded transposon mutant libraries are powerful tools for studying gene function and organization, assessing gene essentiality and pathways, discovering potential therapeutic targets, and understanding the physiology of gut bacteria and their interactions with the host. However, construction of high-quality libraries with uniform representation can be challenging. In this review, we survey various strategies for barcoded library construction, including transposition systems, methods of transposon delivery, optimal library size, and transconjugant selection schemes. We discuss the advantages and limitations of each approach, as well as factors to consider when selecting a strategy. In addition, we highlight experimental and computational advances in arraying condensed libraries from mutant pools. We focus on examples of successful library construction in gut bacteria and their application to gene function studies and drug discovery. Given the need for understanding gene function and organization in gut bacteria, we provide a comprehensive guide for researchers to construct randomly barcoded transposon mutant libraries.


Subject(s)
DNA Transposable Elements , High-Throughput Nucleotide Sequencing , DNA Transposable Elements/genetics , High-Throughput Nucleotide Sequencing/methods , Cloning, Molecular , Gene Library , Bacteria/genetics , Mutagenesis, Insertional/genetics
16.
Cell Rep ; 43(1): 113519, 2024 01 23.
Article in English | MEDLINE | ID: mdl-38142398

ABSTRACT

The critical role of the intestinal microbiota in human health and disease is well recognized. Nevertheless, there are still large gaps in our understanding of the functions and mechanisms encoded in the genomes of most members of the gut microbiota. Genome-scale libraries of transposon mutants are a powerful tool to help us address this gap. Recent advances in barcoded transposon mutagenesis have dramatically lowered the cost of mutant fitness determination in hundreds of in vitro and in vivo experimental conditions. In an accompanying review, we discuss recent advances and caveats for the construction of pooled and arrayed barcoded transposon mutant libraries in human gut commensals. In this review, we discuss how these libraries can be used across a wide range of applications, the technical aspects involved, and expectations for such screens.


Subject(s)
DNA Transposable Elements , Humans , Mutagenesis, Insertional/genetics , DNA Transposable Elements/genetics , Gene Library
17.
Int J Biol Sci ; 19(6): 1764-1777, 2023.
Article in English | MEDLINE | ID: mdl-37063417

ABSTRACT

Sleeping Beauty (SB) insertional mutagenesis has been widely used for genome-wide functional screening in mouse models of human cancers, however, intertumor heterogeneity can be a major obstacle in identifying common insertion sites (CISs). Although previous algorithms have been successful in defining some CISs, they also miss CISs in certain situations. A major common characteristic of these previous methods is that they do not take tumor heterogeneity into account. However, intertumoral heterogeneity directly influences the sequence read number for different tumor samples and then affects CIS identification. To precisely detect and define cancer driver genes, we developed SB Digestor, a computational algorithm that overcomes biological heterogeneity to identify more potential driver genes. Specifically, we define the relationship between the sequenced read number and putative gene number to deduce the depth cutoff for each tumor, which can reduce tumor complexity and precisely reflect intertumoral heterogeneity. Using this new tool, we re-analyzed our previously published SB-based screening dataset and identified many additional potent drivers involved in Brca1-related tumorigenesis, including Arhgap42, Tcf12, and Fgfr2. SB Digestor not only greatly enhances our ability to identify and prioritize cancer drivers from SB tumors but also substantially deepens our understanding of the intrinsic genetic basis of cancer.


Subject(s)
DNA Transposable Elements , Neoplasms , Animals , Mice , Humans , DNA Transposable Elements/genetics , Neoplasms/genetics , Neoplasms/pathology , Mutagenesis, Insertional/genetics , Oncogenes , Disease Models, Animal , Transposases/genetics
18.
Mol Syst Biol ; 19(6): e11398, 2023 06 12.
Article in English | MEDLINE | ID: mdl-36970845

ABSTRACT

In bacteria, natural transposon mobilization can drive adaptive genomic rearrangements. Here, we build on this capability and develop an inducible, self-propagating transposon platform for continuous genome-wide mutagenesis and the dynamic rewiring of gene networks in bacteria. We first use the platform to study the impact of transposon functionalization on the evolution of parallel Escherichia coli populations toward diverse carbon source utilization and antibiotic resistance phenotypes. We then develop a modular, combinatorial assembly pipeline for the functionalization of transposons with synthetic or endogenous gene regulatory elements (e.g., inducible promoters) as well as DNA barcodes. We compare parallel evolutions across alternating carbon sources and demonstrate the emergence of inducible, multigenic phenotypes and the ease with which barcoded transposons can be tracked longitudinally to identify the causative rewiring of gene networks. This work establishes a synthetic transposon platform that can be used to optimize strains for industrial and therapeutic applications, for example, by rewiring gene networks to improve growth on diverse feedstocks, as well as help address fundamental questions about the dynamic processes that have sculpted extant gene networks.


Subject(s)
DNA Transposable Elements , Genomics , Mutagenesis, Insertional/genetics , DNA Transposable Elements/genetics , Phenotype , Gene Regulatory Networks
19.
J Virol ; 97(3): e0003823, 2023 03 30.
Article in English | MEDLINE | ID: mdl-36779761

ABSTRACT

Coronaviruses infect a wide variety of host species, resulting in a range of diseases in both humans and animals. The coronavirus genome consists of a large positive-sense single-stranded molecule of RNA containing many RNA structures. One structure, denoted s2m and consisting of 41 nucleotides, is located within the 3' untranslated region (3' UTR) and is shared between some coronavirus species, including infectious bronchitis virus (IBV), severe acute respiratory syndrome coronavirus (SARS-CoV), and SARS-CoV-2, as well as other pathogens, including human astrovirus. Using a reverse genetic system to generate recombinant viruses, we investigated the requirement of the s2m structure in the replication of IBV, a globally distributed economically important Gammacoronavirus that infects poultry causing respiratory disease. Deletion of three nucleotides predicted to destabilize the canonical structure of the s2m or the deletion of the nucleotides corresponding to s2m impacted viral replication in vitro. In vitro passaging of the recombinant IBV with the s2m sequence deleted resulted in a 36-nucleotide insertion in place of the deletion, which was identified to be composed of a duplication of flanking sequences. A similar result was observed following serial passage of human astrovirus with a deleted s2m sequence. RNA modeling indicated that deletion of the nucleotides corresponding to the s2m impacted other RNA structures present in the IBV 3' UTR. Our results indicated for both IBV and human astrovirus a preference for nucleotide occupation in the genome location corresponding to the s2m, which is independent of the specific s2m sequence. IMPORTANCE Coronaviruses infect many species, including humans and animals, with substantial effects on livestock, particularly with respect to poultry. The coronavirus RNA genome consists of structural elements involved in viral replication whose roles are poorly understood. We investigated the requirement of the RNA structural element s2m in the replication of the Gammacoronavirus infectious bronchitis virus, an economically important viral pathogen of poultry. Using reverse genetics to generate recombinant IBVs with either a disrupted or deleted s2m, we showed that the s2m is not required for viral replication in cell culture; however, replication is decreased in tracheal tissue, suggesting a role for the s2m in the natural host. Passaging of these viruses as well as human astrovirus lacking the s2m sequence demonstrated a preference for nucleotide occupation, independent of the s2m sequence. RNA modeling suggested deletion of the s2m may negatively impact other essential RNA structures.


Subject(s)
Infectious bronchitis virus , Mamastrovirus , Mutagenesis, Insertional , Animals , Humans , 3' Untranslated Regions/genetics , Chickens/virology , Infectious bronchitis virus/genetics , Mamastrovirus/genetics , Mutagenesis, Insertional/genetics , Poultry Diseases/virology , RNA, Viral/genetics , Virus Replication/genetics , RNA Stability/genetics , Sequence Deletion/genetics
20.
Mol Oncol ; 17(2): 230-237, 2023 02.
Article in English | MEDLINE | ID: mdl-36269676

ABSTRACT

Epidermal growth factor receptor (EGFR) exon 20 insertion mutations (ex20ins) account for ≤ 12% of all EGFR-mutant nonsmall cell lung cancers. We analysed real-world datasets to determine the frequency of ex20ins variants, and the ability of polymerase chain reaction (PCR) and next-generation sequencing (NGS) to identify them. Three real-world United States NGS databases were used: GENIE, FoundationInsights, and GuardantINFORM. Mutation profiles consistent with in-frame EGFR ex20ins were summarized. GENIE, FoundationInsights, and GuardantINFORM datasets identified 180, 627, and 627 patients with EGFR ex20ins respectively. The most frequent insertion region of exon 20 was the near loop (~ 70%), followed by the far loop (~ 30%) and the helical (~ 3-6%) regions. GENIE, FoundationInsights, and GuardantINFORM datasets identified 41, 102, and 96 unique variants respectively. An analysis of variants projected that ~ 50% of EGFR ex20ins identified by NGS would have been missed by PCR-based assays. Given the breadth of EGFR ex20ins identified in the real-world US datasets, the ability of PCR to identify these mutations is limited. NGS platforms are more appropriate to identify patients likely to benefit from EGFR ex20ins-targeted therapies.


Subject(s)
Lung Neoplasms , Humans , Lung Neoplasms/diagnosis , Lung Neoplasms/genetics , Mutagenesis, Insertional/genetics , ErbB Receptors/genetics , Mutation/genetics , Exons/genetics , Genomics , Protein Kinase Inhibitors
SELECTION OF CITATIONS
SEARCH DETAIL