RESUMO
The Cas9 nuclease from Staphylococcus aureus (SaCas9) holds great potential for use in gene therapy, and variants with increased fidelity have been engineered. However, we find that existing variants have not reached the greatest accuracy to discriminate base mismatches and exhibited much reduced activity when their mutations were grafted onto the KKH mutant of SaCas9 for editing an expanded set of DNA targets. We performed structure-guided combinatorial mutagenesis to re-engineer KKH-SaCas9 with enhanced accuracy. We uncover that introducing a Y239H mutation on KKH-SaCas9's REC domain substantially reduces off-target edits while retaining high on-target activity when added to a set of mutations on REC and RuvC domains that lessen its interactions with the target DNA strand. The Y239H mutation is modelled to have removed an interaction from the REC domain with the guide RNA backbone in the guide RNA-DNA heteroduplex structure. We further confirmed the greatly improved genome-wide editing accuracy and single-base mismatch discrimination of our engineered variants, named KKH-SaCas9-SAV1 and SAV2, in human cells. In addition to generating broadly useful KKH-SaCas9 variants with unprecedented accuracy, our findings demonstrate the feasibility for multi-domain combinatorial mutagenesis on SaCas9's DNA- and guide RNA- interacting residues to optimize its editing fidelity.
Assuntos
Proteína 9 Associada à CRISPR/genética , Edição de Genes , Staphylococcus aureus , Sistemas CRISPR-Cas , Humanos , Nuclease do Micrococo/genética , RNA Guia de Cinetoplastídeos , Staphylococcus aureus/genéticaRESUMO
OBJECTIVE: To develop a method for noninvasive prenatal paternity testing based on targeted sequencing of single nucleotide polymorphisms (SNPs). METHOD: SNPs were selected based on population genetics data. Target-SNPs in cell-free DNA extracted from maternal blood (maternal cfDNA) were analyzed by targeted sequencing wherein target enrichment was based on multiplex amplification using QIAseq Targeted DNA Panels with Unique Molecular Identifiers. Fetal SNP genotypes were called using a novel bioinformatics algorithm, and the combined paternity indices (CPIs) and resultant paternity probabilities were calculated. RESULTS: Fetal SNP genotypes obtained from targeted sequencing of maternal cfDNA were 100% concordant with those from amniotic fluid-derived fetal genomic DNA. From an initial panel of 356 target-SNPs, an average of 148 were included in paternity calculations in 15 family trio cases, generating paternity probabilities of greater than 99.9999%. All paternity results were confirmed by short-tandem-repeat analysis. The high specificity of the methodology was validated by successful paternity discrimination between biological fathers and their siblings and by large separations between the CPIs calculated for the biological fathers and those for 60 unrelated men. CONCLUSION: The novel method is highly effective, with substantial improvements over similar approaches in terms of reduced number of target-SNPs, increased accuracy, and reduced costs.
Assuntos
Ácidos Nucleicos Livres/análise , Teste Pré-Natal não Invasivo/métodos , Paternidade , Algoritmos , Biologia Computacional , Feminino , Humanos , Masculino , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , GravidezRESUMO
The genome-editing Cas9 protein uses multiple amino-acid residues to bind the target DNA. Considering only the residues in proximity to the target DNA as potential sites to optimise Cas9's activity, the number of combinatorial variants to screen through is too massive for a wet-lab experiment. Here we generate and cross-validate ten in silico and experimental datasets of multi-domain combinatorial mutagenesis libraries for Cas9 engineering, and demonstrate that a machine learning-coupled engineering approach reduces the experimental screening burden by as high as 95% while enriching top-performing variants by â¼7.5-fold in comparison to the null model. Using this approach and followed by structure-guided engineering, we identify the N888R/A889Q variant conferring increased editing activity on the protospacer adjacent motif-relaxed KKH variant of Cas9 nuclease from Staphylococcus aureus (KKH-SaCas9) and its derived base editor in human cells. Our work validates a readily applicable workflow to enable resource-efficient high-throughput engineering of genome editor's activity.
Assuntos
Proteínas de Bactérias , Sistemas CRISPR-Cas , Proteínas de Bactérias/metabolismo , Sistemas CRISPR-Cas/genética , DNA/metabolismo , Humanos , Aprendizado de Máquina , MutagêneseRESUMO
Viral transcription is an essential step of SARS-CoV-2 infection after invasion into the target cells. Antiviral drugs such as remdesivir, which is used to treat COVID-19 patients, targets the viral RNA synthesis. Understanding the mechanism of viral transcription may help to develop new therapeutic treatment by perturbing virus replication. In this study, we established 28 ddPCR assays and designed specific primers/probe sets to detect the RNA levels of 15 NSP, 9 ORF, and 4 structural genes of SARS-CoV-2. The transcriptional kinetics of these viral genes were determined longitudinally from the beginning of infection to 12 h postinfection in Caco-2 cells. We found that SARS-CoV-2 takes around 6 h to hijack the cells before the initiation of viral transcription process in human cells. Our results may contribute to a deeper understanding of the mechanisms of SARS-CoV-2 infection.
RESUMO
As the final sequencing of the human genome has now been completed, we present the results of the largest examination of the quality of the finished DNA sequence. The completed study covers the major contributing sequencing centres and is based on a rigorous combination of laboratory experiments and computational analysis.
Assuntos
Biologia Computacional/normas , Genoma Humano , Projeto Genoma Humano , Análise de Sequência de DNA/normas , Pareamento de Bases , Biologia Computacional/tendências , Humanos , Controle de Qualidade , Projetos de Pesquisa , Sensibilidade e Especificidade , Análise de Sequência de DNA/tendênciasRESUMO
Human chromosome 16 features one of the highest levels of segmentally duplicated sequence among the human autosomes. We report here the 78,884,754 base pairs of finished chromosome 16 sequence, representing over 99.9% of its euchromatin. Manual annotation revealed 880 protein-coding genes confirmed by 1,670 aligned transcripts, 19 transfer RNA genes, 341 pseudogenes and three RNA pseudogenes. These genes include metallothionein, cadherin and iroquois gene families, as well as the disease genes for polycystic kidney disease and acute myelomonocytic leukaemia. Several large-scale structural polymorphisms spanning hundreds of kilobase pairs were identified and result in gene content differences among humans. Whereas the segmental duplications of chromosome 16 are enriched in the relatively gene-poor pericentromere of the p arm, some are involved in recent gene duplication and conversion events that are likely to have had an impact on the evolution of primates and human disease susceptibility.
Assuntos
Cromossomos Humanos Par 16/genética , Duplicação Gênica , Mapeamento Físico do Cromossomo , Animais , Genes/genética , Genômica , Heterocromatina/genética , Humanos , Dados de Sequência Molecular , Polimorfismo Genético/genética , Análise de Sequência de DNA , Sintenia/genéticaRESUMO
Chromosome 5 is one of the largest human chromosomes and contains numerous intrachromosomal duplications, yet it has one of the lowest gene densities. This is partially explained by numerous gene-poor regions that display a remarkable degree of noncoding conservation with non-mammalian vertebrates, suggesting that they are functionally constrained. In total, we compiled 177.7 million base pairs of highly accurate finished sequence containing 923 manually curated protein-coding genes including the protocadherin and interleukin gene families. We also completely sequenced versions of the large chromosome-5-specific internal duplications. These duplications are very recent evolutionary events and probably have a mechanistic role in human physiological variation, as deletions in these regions are the cause of debilitating disorders including spinal muscular atrophy.
Assuntos
Cromossomos Humanos Par 5/genética , Análise de Sequência de DNA , Animais , Composição de Bases , Caderinas/genética , Sequência Conservada/genética , Duplicação Gênica , Genes/genética , Doenças Genéticas Inatas/genética , Genômica , Humanos , Interleucinas/genética , Dados de Sequência Molecular , Atrofia Muscular Espinal/genética , Pan troglodytes/genética , Mapeamento Físico do Cromossomo , Pseudogenes/genética , Sintenia/genética , Vertebrados/genéticaRESUMO
Chromosome 19 has the highest gene density of all human chromosomes, more than double the genome-wide average. The large clustered gene families, corresponding high G + C content, CpG islands and density of repetitive DNA indicate a chromosome rich in biological and evolutionary significance. Here we describe 55.8 million base pairs of highly accurate finished sequence representing 99.9% of the euchromatin portion of the chromosome. Manual curation of gene loci reveals 1,461 protein-coding genes and 321 pseudogenes. Among these are genes directly implicated in mendelian disorders, including familial hypercholesterolaemia and insulin-resistant diabetes. Nearly one-quarter of these genes belong to tandemly arranged families, encompassing more than 25% of the chromosome. Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu.
Assuntos
Cromossomos Humanos Par 19/genética , Genes/genética , Mapeamento Físico do Cromossomo , Processamento Alternativo/genética , Animais , Composição de Bases , Sequência Conservada/genética , Ilhas de CpG/genética , Evolução Molecular , Duplicação Gênica , Genética Médica , Humanos , Camundongos , Dados de Sequência Molecular , Família Multigênica/genética , Pseudogenes/genética , Análise de Sequência de DNARESUMO
trans-1,2-Diaryloxiranes were conveniently prepared in an one-pot reaction by the direct coupling of benzyl halides in the presence of silver oxide and DMSO under mild conditions.
RESUMO
Cancer-related mortality of solid tumors remains the major cause of death worldwide. Circulating tumor DNA (ctDNA) released from cancer cells harbors specific somatic mutations. Sequencing ctDNA opens opportunities to non-invasive population screening and lays foundations for personalized therapy. In this study, two commercially available platforms, Roche's Avenio ctDNA Expanded panel and QIAgen's QIAseq Human Comprehensive Cancer panel were compared for (1) panel coverage of clinically relevant variants; (2) target enrichment specificity and sequencing performance; (3) the sensitivity; (4) concordance and (5) sequencing coverage using the same human blood sample with ultra-deep next-generation sequencing. Our finding suggests that Avenio detected somatic mutations in common cancers in over 70% of patients while QIAseq covered nearly 90% with a higher average number of variants per patient (Avenio: 3; QIAseq: 8 variants per patient). Both panels demonstrated similar on-target rate and percentage of reads mapped. However, Avenio had more uniform sequencing coverage across regions with different GC content. Avenio had a higher sensitivity and concordance compared with QIAseq at the same sequencing depth. This study identifies a unique niche for the application of each of the panel and allows the scientific community to make an informed decision on the technologies to meet research or application needs.
Assuntos
DNA Tumoral Circulante/sangue , DNA de Neoplasias/sangue , Composição de Bases/genética , Biomarcadores Tumorais/sangue , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , MutaçãoRESUMO
Substituents on the pyridinium ring of N-methylpyridinium derivatives, especially those on the 2- or 4-positions, have a large effect on the (1)H and (13)C NMR chemical shifts of the N-methyl group. Reasonable correlations between the chemical shift changes and the resonance substituent constants are observed. The dual substituent parameter approach provides an excellent correlation when a combination of polar and resonance substituent constants is employed.