Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
1.
Genome Biol Evol ; 2024 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-38973368

RESUMO

This article describes a genome assembly and annotation for Bombus dahlbomii, the giant Patagonian bumble bee. DNA from a single, haploid male collected in Argentina was used for PacBio (HiFi) sequencing and HiC technology was then used to map chromatin contacts. Using Juicer and manual curation, the genome was scaffolded into 18 main pseudomolecules, representing a high quality, near chromosome-level assembly. The sequenced genome size is estimated at 265Mb. The genome was annotated based on RNA-sequencing data of another male from Argentina, and BRAKER3 produced 15,767 annotated genes. The genome and annotation show high completeness, with >95% BUSCO scores for both the genome and annotated genes (based on conserved genes from Hymenoptera). This genome provides a valuable resource for studying the biology of this iconic and endangered species, as well as for understanding the impacts of its decline and designing strategies for its preservation.

2.
G3 (Bethesda) ; 2024 May 28.
Artigo em Inglês | MEDLINE | ID: mdl-38805695

RESUMO

The bivalve subclass Pteriomorphia, which includes the economically important scallops, oysters, mussels, and ark clams, exhibits extreme ecological, morphological, and behavioral diversity. Among this diversity are five morphologically distinct eye types, making Pteriomorphia an excellent setting to explore the molecular basis for the evolution of novel traits. Of pteriomorphian bivalves, Limida is the only order lacking genomic resources, greatly limiting the potential phylogenomic analyses related to eyes and phototransduction. Here, we present a limid genome assembly, the disco clam, Ctenoides ales, which is characterized by invaginated eyes, exceptionally long tentacles, and a flashing light display. This genome assembly was constructed with PacBio long reads and Dovetail Omni-CTM proximity-ligation sequencing. The final assembly is ∼2.3Gb and over 99% of the total length is contained in 18 pseudomolecule scaffolds. We annotated 41,064 protein coding genes and report a BUSCO completeness of 91.9% for metazoa_obd10. Additionally, we report a complete and annotated mitochondrial genome, which also had been lacking from Limida. The ∼20Kb mitogenome has 12 protein coding genes, 22 tRNAs, 2 rRNA genes, and a 1,589 bp duplicated sequence containing the origin of replication. The C. ales nuclear genome size is substantially larger than other pteriomorphian genomes, mainly accounted for by transposable element sequences. We inventoried the genome for opsins, the signaling proteins that initiate phototransduction, and found that, unlike its closest eyed-relatives, the scallops, C. ales lacks duplication of the rhabdomeric Gq-protein coupled opsin that is typically used for invertebrate vision. In fact, C. ales has uncharacteristically few opsins relative to the other pteriomorphian families, all of which have unique expansions of xenopsins, a recently discovered opsin subfamily. This chromosome-level assembly, along with the mitogenome, will be valuable resources for comparative genomics and phylogenetics in bivalves and particularly for the understudied but charismatic limids.

4.
Artigo em Inglês | MEDLINE | ID: mdl-38179990

RESUMO

A fully assembled spirochaete genome was identified as a contaminating scaffold in our red abalone (Haliotis rufescens) genome assembly. In this paper, we describe the analysis of this bacterial genome. The assembled spirochaete genome is 3.25 Mb in size with 48.5 mol% G+C content. The proteomes of 38 species were compared with the spirochaete genome and it was discovered to form an independent branch within the family Spirochaetaceae on the phylogenetic tree. The comparison of 16S rRNA sequences and average nucleotide identity scores between the spirochaete genome with known species of different families in Spirochaetia indicate that it is an unknown species. Further, the percentage of conserved proteins compared to neighbouring taxa confirm that it does not belong to a known genus within Spirochaetaceae. We propose the name Candidatus Haliotispira prima gen. nov., sp. nov. based on its taxonomic placement and origin. We also tested for the presence of this species in different species of abalone and found that it is also present in white abalone (Haliotis sorenseni). In addition, we highlight the need for better classification of taxa within the class Spirochaetia.


Assuntos
Gastrópodes , Spirochaeta , Spirochaetaceae , Humanos , Animais , Spirochaetales , Filogenia , RNA Ribossômico 16S/genética , Composição de Bases , Análise de Sequência de DNA , DNA Bacteriano/genética , Técnicas de Tipagem Bacteriana , Ácidos Graxos/química , Bactérias
5.
Toxicol Sci ; 194(1): 23-37, 2023 06 28.
Artigo em Inglês | MEDLINE | ID: mdl-37208198

RESUMO

Glyphosate (GLY) is an herbicide used for rural and urban weed control. Urinary GLY in women is associated with shortened gestational length yet effects of GLY on offspring due to maternal exposure are unclear. This study tested the hypothesis that maternal chronic pre-conceptional GLY exposure would cause phenotypic and molecular changes in F1 offspring. Female C57BL/6 mice (7-week-old; n = 40) received saline vehicle control (CT; n = 20) or GLY (2 mg/kg; n = 20) daily per os for 10 weeks. At dosing completion, females were housed with unexposed males and divided into Cohort 1 who were euthanized at gestation day 14 (n = 10 per treatment) and Cohort 2 who completed gestation (n = 10 per treatment). F1 female ovarian and liver samples underwent LC-MS/MS and bioinformatic analysis. Maternal exposure did not affect litter (P > .05) sex ratio, or embryonic or neonatal gross phenotypes. In Cohort 2 offspring, no treatment effect on (P > .05) offspring anogenital distance, puberty onset, or ovarian follicular composition was noted. Body weight was increased (P < .05) in male GLY-exposed compared with CT dam offspring. F1 females from GLY-exposed dams had altered (P < .05) abundance of 54 ovarian and 110 hepatic proteins. Pathways altered in the ovary (false discovery rate [FDR] ≤ 0.07) included thermogenesis and phosphatidylinositol-3 kinase-AKT signaling and in liver (FDR ≤ 0.08) included metabolic, glutathione metabolism, oxidative phosphorylation, non-alcoholic fatty liver disease, and thermogenesis. Thus, pre-conceptional GLY exposure affected offspring phenotypic and molecular profiles potentially impacting reproductive health.


Assuntos
Exposição Materna , Efeitos Tardios da Exposição Pré-Natal , Humanos , Animais , Camundongos , Feminino , Masculino , Exposição Materna/efeitos adversos , Ovário , Proteoma , Cromatografia Líquida , Maturidade Sexual , Camundongos Endogâmicos C57BL , Espectrometria de Massas em Tandem , Fígado , Efeitos Tardios da Exposição Pré-Natal/induzido quimicamente , Glifosato
6.
Genome Biol Evol ; 15(3)2023 03 03.
Artigo em Inglês | MEDLINE | ID: mdl-36792366

RESUMO

Long-read sequencing has revolutionized genome assembly, yielding highly contiguous, chromosome-level contigs. However, assemblies from some third generation long read technologies, such as Pacific Biosciences (PacBio) continuous long reads (CLR), have a high error rate. Such errors can be corrected with short reads through a process called polishing. Although best practices for polishing non-model de novo genome assemblies were recently described by the Vertebrate Genome Project (VGP) Assembly community, there is a need for a publicly available, reproducible workflow that can be easily implemented and run on a conventional high performance computing environment. Here, we describe polishCLR (https://github.com/isugifNF/polishCLR), a reproducible Nextflow workflow that implements best practices for polishing assemblies made from CLR data. PolishCLR can be initiated from several input options that extend best practices to suboptimal cases. It also provides re-entry points throughout several key processes, including identifying duplicate haplotypes in purge_dups, allowing a break for scaffolding if data are available, and throughout multiple rounds of polishing and evaluation with Arrow and FreeBayes. PolishCLR is containerized and publicly available for the greater assembly community as a tool to complete assemblies from existing, error-prone long-read data.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Fluxo de Trabalho , Haplótipos
7.
Toxicol Sci ; 190(2): 173-188, 2022 11 23.
Artigo em Inglês | MEDLINE | ID: mdl-36214631

RESUMO

Perfluorooctanoic acid (PFOA) is an environmentally persistent perfluoroalkyl substance that is widely used in consumer products. Exposure to PFOA is associated with reproductive and developmental effects including endocrine disruption, delayed puberty in girls, and decreased fetal growth. In the United States, obesity affects 40% of women and 20% of girls, with higher rates in minority females. Obesity causes infertility, poor oocyte quality, miscarriage, and offspring defects. This study proposed that PFOA exposure would impact estrous cyclicity, ovarian steroid hormones, and the ovarian proteome and further hypothesized that obesity would impact PFOA-induced ovotoxicity. Female wild type (KK.Cg-a/a; lean) or KK.Cg-Ay/J mice (obese) received saline (CT) or PFOA (2.5 mg/kg) per os for 15 days beginning at 7 weeks of age. There were no effects on food intake, body weight, estrous cyclicity, serum progesterone, and heart, spleen, kidney, or uterus weight (p > .05). Ovary weight was decreased (p < .05) by PFOA exposure relative to vehicle control-treated mice in lean but not obese mice. Liquid chromatography-tandem mass spectrometry was performed on isolated ovarian protein and PFOA exposure altered the ovarian abundance of proteins involved in DNA damage sensing and repair pathways and reproduction pathways (p < .05) differentially in lean and obese mice. The data suggest that PFOA exposure alters ovary weight and differentially targets ovarian proteins in lean and obese females in ways that might reduce female fecundity.


Assuntos
Fluorocarbonos , Feminino , Camundongos , Animais , Fluorocarbonos/metabolismo , Ovário , Camundongos Obesos , Reprodução , Obesidade/metabolismo , Dano ao DNA
8.
Toxicol Sci ; 190(2): 204-214, 2022 11 23.
Artigo em Inglês | MEDLINE | ID: mdl-36173347

RESUMO

Exposure to glyphosate (GLY), a commonly used herbicide, is supported by urinary detection and associated with shortened gestation in women. This study tested the hypothesis that chronic low-dose pre-conceptional GLY exposure would affect maternal ovarian function mid- and post-gestation. Mice (C57BL/6; n = 40) were exposed per os to saline vehicle control (CT; n = 20) or GLY (2 mg/kg; n = 20) daily for 10 weeks starting at 7 weeks of age. Post-exposure, females were impregnated and euthanized at gestation day 14 (GD14) or post-weaning (PW). Pregnancy success was reduced from 75% to 55% by GLY exposure. No treatment effect (p > .05) on body weight, maternal serum 17ß-estradiol, or litter size was noted. Ovarian weight was unaffected or reduced (p < .05) by GLY in GD14 and PW dams, respectively. Exposure to GLY decreased (p < .05) PW ovarian secondary follicle number with no other follicle composition impacts. Protein abundance analysis by LC-MS/MS identified that GLY altered (p < .05) 26 ovarian and 41 hepatic proteins in GD14 dams and 39 hepatic proteins in PW dams. In GD14 dams, GLY increased ovarian protein abundance of SEC16A (p < .05; 29-fold) and hepatic RPS27L and GM4952 (p < .05; ∼4-fold). In both GD14 and PW dams, GLY exposure increased (p < .05) hepatic RPS4 and decreased (p < .05) ECHDC3. Pathway analysis using DAVID identified 10 GLY hepatic pathway targets with FDR ≤ 0.07 in GD14 dams.


Assuntos
Herbicidas , Efeitos Tardios da Exposição Pré-Natal , Proteoma , Animais , Feminino , Camundongos , Gravidez , Cromatografia Líquida , Retículo Endoplasmático , Complexo de Golgi , Exposição Materna/efeitos adversos , Camundongos Endogâmicos C57BL , Espectrometria de Massas em Tandem , Proteínas de Transporte Vesicular , Herbicidas/toxicidade , Fígado , Ovário/metabolismo , Glifosato
9.
Cancers (Basel) ; 14(14)2022 Jul 20.
Artigo em Inglês | MEDLINE | ID: mdl-35884586

RESUMO

Lipopolysaccharide (LPS) is associated with chronic intestinal inflammation and promotes intestinal cancer progression in the gut. While the interplay between LPS and intestinal immune cells has been well-characterized, little is known about LPS and the intestinal epithelium interactions. In this study, we explored the differential effects of LPS on proliferation and the transcriptome in 3D enteroids/colonoids obtained from dogs with naturally occurring gastrointestinal (GI) diseases including inflammatory bowel disease (IBD) and intestinal mast cell tumor. The study objective was to analyze the LPS-induced modulation of signaling pathways involving the intestinal epithelia and contributing to colorectal cancer development in the context of an inflammatory (IBD) or a tumor microenvironment. While LPS incubation resulted in a pro-cancer gene expression pattern and stimulated proliferation of IBD enteroids and colonoids, downregulation of several cancer-associated genes such as Gpatch4, SLC7A1, ATP13A2, and TEX45 was also observed in tumor enteroids. Genes participating in porphyrin metabolism (CP), nucleocytoplasmic transport (EEF1A1), arachidonic acid, and glutathione metabolism (GPX1) exhibited a similar pattern of altered expression between IBD enteroids and IBD colonoids following LPS stimulation. In contrast, genes involved in anion transport, transcription and translation, apoptotic processes, and regulation of adaptive immune responses showed the opposite expression patterns between IBD enteroids and colonoids following LPS treatment. In brief, the crosstalk between LPS/TLR4 signal transduction pathway and several metabolic pathways such as primary bile acid biosynthesis and secretion, peroxisome, renin-angiotensin system, glutathione metabolism, and arachidonic acid pathways may be important in driving chronic intestinal inflammation and intestinal carcinogenesis.

10.
Nat Chem Biol ; 18(1): 38-46, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-34711982

RESUMO

Inefficient homology-directed repair (HDR) constrains CRISPR-Cas9 genome editing in organisms that preferentially employ nonhomologous end joining (NHEJ) to fix DNA double-strand breaks (DSBs). Current strategies used to alleviate NHEJ proficiency involve NHEJ disruption. To confer precision editing without NHEJ disruption, we identified the shortcomings of the conventional CRISPR platforms and developed a CRISPR platform-lowered indel nuclease system enabling accurate repair (LINEAR)-which enhanced HDR rates (to 67-100%) compared to those in previous reports using conventional platforms in four NHEJ-proficient yeasts. With NHEJ preserved, we demonstrate its ability to survey genomic landscapes, identifying loci whose spatiotemporal genomic architectures yield favorable expression dynamics for heterologous pathways. We present a case study that deploys LINEAR precision editing and NHEJ-mediated random integration to rapidly engineer and optimize a microbial factory to produce (S)-norcoclaurine. Taken together, this work demonstrates how to leverage an antagonizing pair of DNA DSB repair pathways to expand the current collection of microbial factories.


Assuntos
Sistemas CRISPR-Cas , Engenharia Genética , Saccharomyces cerevisiae/genética , Reparo do DNA por Junção de Extremidades , Fermentação , Genes Fúngicos
11.
Mol Ecol Resour ; 21(7): 2407-2422, 2021 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-34036752

RESUMO

The soybean cyst nematode (Heterodera glycines) is a sedentary plant parasite that exceeds billion USD annually in yield losses. This problem is exacerbated by H. glycines populations overcoming the limited sources of natural resistance in soybean and by the lack of effective and safe alternative treatments. Although there are genetic determinants that render soybeans resistant to nematode genotypes, resistant soybeans are increasingly ineffective because their multiyear usage has selected for virulent H. glycines populations. Successful H. glycines infection relies on the comprehensive re-engineering of soybean root cells into a syncytium, as well as the long-term suppression of host defences to ensure syncytial viability. At the forefront of these complex molecular interactions are effectors, the proteins secreted by H. glycines into host root tissues. The mechanisms that control genomic effector acquisition, diversification, and selection are important insights needed for the development of essential novel control strategies. As a foundation to obtain this understanding, we created a nine-scaffold, 158 Mb pseudomolecule assembly of the H. glycines genome using PacBio, Chicago, and Hi-C sequencing. A Mikado consensus gene prediction produced an annotation of 22,465 genes using short- and long-read expression data. To evaluate assembly and annotation quality, we cross-examined synteny among H. glycines assemblies, and compared BUSCO across related species. To describe the predicted proteins involved in H. glycines' secretory pathway, we contrasted expression between preparasitic and parasitic stages with functional gene information. Here, we present the results from our assembly and annotation of the H. glycines genome and contribute this resource to the scientific community.


Assuntos
Cistos , Tylenchoidea , Animais , Cromossomos , Genoma , Glycine max/genética , Tylenchoidea/genética
12.
PLoS One ; 16(4): e0249899, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33909645

RESUMO

Rocky Mountain elk (Cervus canadensis) populations have significant economic implications to the cattle industry, as they are a major reservoir for Brucella abortus in the Greater Yellowstone area. Vaccination attempts against intracellular bacterial diseases in elk populations have not been successful due to a negligible adaptive cellular immune response. A lack of genomic resources has impeded attempts to better understand why vaccination does not induce protective immunity. To overcome this limitation, PacBio, Illumina, and Hi-C sequencing with a total of 686-fold coverage was used to assemble the elk genome into 35 pseudomolecules. A robust gene annotation was generated resulting in 18,013 gene models and 33,422 mRNAs. The accuracy of the assembly was assessed using synteny to the red deer and cattle genomes identifying several chromosomal rearrangements, fusions and fissions. Because this genome assembly and annotation provide a foundation for genome-enabled exploration of Cervus species, we demonstrate its utility by exploring the conservation of immune system-related genes. We conclude by comparing cattle immune system-related genes to the elk genome, revealing eight putative gene losses in elk.


Assuntos
Cervos/genética , Genoma , Animais , Bovinos , Fusão Gênica , Rearranjo Gênico , Imunidade/genética , Pseudogenes/genética , RNA Mensageiro/metabolismo
13.
Nucleic Acids Res ; 49(7): 4037-4053, 2021 04 19.
Artigo em Inglês | MEDLINE | ID: mdl-33744974

RESUMO

Cas9 is an RNA-guided endonuclease in the bacterial CRISPR-Cas immune system and a popular tool for genome editing. The commonly used Streptococcus pyogenes Cas9 (SpCas9) is relatively non-specific and prone to off-target genome editing. Other Cas9 orthologs and engineered variants of SpCas9 have been reported to be more specific. However, previous studies have focused on specificity of double-strand break (DSB) or indel formation, potentially overlooking alternative cleavage activities of these Cas9 variants. In this study, we employed in vitro cleavage assays of target libraries coupled with high-throughput sequencing to systematically compare cleavage activities and specificities of two natural Cas9 variants (SpCas9 and Staphylococcus aureus Cas9) and three engineered SpCas9 variants (SpCas9 HF1, HypaCas9 and HiFi Cas9). We observed that all Cas9s tested could cleave target sequences with up to five mismatches. However, the rate of cleavage of both on-target and off-target sequences varied based on target sequence and Cas9 variant. In addition, SaCas9 and engineered SpCas9 variants nick targets with multiple mismatches but have a defect in generating a DSB, while SpCas9 creates DSBs at these targets. Overall, these differences in cleavage rates and DSB formation may contribute to varied specificities observed in genome editing studies.


Assuntos
Proteína 9 Associada à CRISPR , Sistemas CRISPR-Cas , Staphylococcus aureus/genética , Proteína 9 Associada à CRISPR/genética , Proteína 9 Associada à CRISPR/metabolismo , Edição de Genes , Especificidade por Substrato
14.
Commun Biol ; 4(1): 253, 2021 02 26.
Artigo em Inglês | MEDLINE | ID: mdl-33637860

RESUMO

While it is well known that the genome can affect social behavior, recent models posit that social lifestyles can, in turn, influence genome evolution. Here, we perform the most phylogenetically comprehensive comparative analysis of 16 bee genomes to date: incorporating two published and four new carpenter bee genomes (Apidae: Xylocopinae) for a first-ever genomic comparison with a monophyletic clade containing solitary through advanced eusocial taxa. We find that eusocial lineages have undergone more gene family expansions, feature more signatures of positive selection, and have higher counts of taxonomically restricted genes than solitary and weakly social lineages. Transcriptomic data reveal that caste-affiliated genes are deeply-conserved; gene regulatory and functional elements are more closely tied to social phenotype than phylogenetic lineage; and regulatory complexity increases steadily with social complexity. Overall, our study provides robust empirical evidence that social evolution can act as a major and surprisingly consistent driver of macroevolutionary genomic change.


Assuntos
Abelhas/genética , Comportamento Animal , Evolução Molecular , Genes de Insetos , Genoma de Inseto , Comportamento Social , Animais , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Interação Gene-Ambiente , Genômica , Filogenia , Especificidade da Espécie , Transcriptoma
15.
Plant Physiol ; 184(2): 960-972, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32737073

RESUMO

Maize (Zea mays) thick aleurone1 (thk1-R) mutants form multiple aleurone layers in the endosperm and have arrested embryogenesis. Prior studies suggest that thk1 functions downstream of defective kernel1 (dek1) in a regulatory pathway that controls aleurone cell fate and other endosperm traits. The original thk1-R mutant contained an ∼2-Mb multigene deletion, which precluded identification of the causal gene. Here, ethyl methanesulfonate mutagenesis produced additional alleles, and RNA sequencing from developing endosperm was used to identify a candidate gene based on differential expression compared with the wild-type progenitor. Gene editing confirmed the gene identity by producing mutant alleles that failed to complement existing thk1 mutants and that produced multiple-aleurone homozygous phenotypes. Thk1 encodes a homolog of NEGATIVE ON TATA-LESS1, a protein that acts as a scaffold for the CARBON CATABOLITE REPRESSION4-NEGATIVE ON TATA-LESS complex. This complex is highly conserved and essential in all eukaryotes for regulating a wide array of gene expression and cellular activities. Maize also harbors a duplicate locus, thick aleurone-like1, which likely accounts for the ability of thk1 mutants to form viable cells. Transcriptomic analysis indicated that THK1 regulates activities involving cell division, signaling, differentiation, and metabolism. Identification of thk1 provides an important new component of the DEK1 regulatory system that patterns cell fate in endosperm.


Assuntos
Diferenciação Celular/genética , Endosperma/citologia , Endosperma/crescimento & desenvolvimento , Endosperma/genética , Zea mays/citologia , Zea mays/crescimento & desenvolvimento , Zea mays/genética , Produtos Agrícolas/citologia , Produtos Agrícolas/genética , Produtos Agrícolas/crescimento & desenvolvimento , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Genes de Plantas , Variação Genética , Genótipo , Mutação , Fenótipo
16.
Bioinformatics ; 36(18): 4699-4705, 2020 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-32579213

RESUMO

MOTIVATION: As the cost of sequencing decreases, the amount of data being deposited into public repositories is increasing rapidly. Public databases rely on the user to provide metadata for each submission that is prone to user error. Unfortunately, most public databases, such as non-redundant (NR), rely on user input and do not have methods for identifying errors in the provided metadata, leading to the potential for error propagation. Previous research on a small subset of the NR database analyzed misclassification based on sequence similarity. To the best of our knowledge, the amount of misclassification in the entire database has not been quantified. We propose a heuristic method to detect potentially misclassified taxonomic assignments in the NR database. We applied a curation technique and quality control to find the most probable taxonomic assignment. Our method incorporates provenance and frequency of each annotation from manually and computationally created databases and clustering information at 95% similarity. RESULTS: We found more than two million potentially taxonomically misclassified proteins in the NR database. Using simulated data, we show a high precision of 97% and a recall of 87% for detecting taxonomically misclassified proteins. The proposed approach and findings could also be applied to other databases. AVAILABILITY AND IMPLEMENTATION: Source code, dataset, documentation, Jupyter notebooks and Docker container are available at https://github.com/boalang/nr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Metadados , Software , Bases de Dados Factuais
17.
J Biol Chem ; 295(17): 5538-5553, 2020 04 24.
Artigo em Inglês | MEDLINE | ID: mdl-32161115

RESUMO

Cas12a (Cpf1) is an RNA-guided endonuclease in the bacterial type V-A CRISPR-Cas anti-phage immune system that can be repurposed for genome editing. Cas12a can bind and cut dsDNA targets with high specificity in vivo, making it an ideal candidate for expanding the arsenal of enzymes used in precise genome editing. However, this reported high specificity contradicts Cas12a's natural role as an immune effector against rapidly evolving phages. Here, we employed high-throughput in vitro cleavage assays to determine and compare the native cleavage specificities and activities of three different natural Cas12a orthologs (FnCas12a, LbCas12a, and AsCas12a). Surprisingly, we observed pervasive sequence-specific nicking of randomized target libraries, with strong nicking of DNA sequences containing up to four mismatches in the Cas12a-targeted DNA-RNA hybrid sequences. We also found that these nicking and cleavage activities depend on mismatch type and position and vary with Cas12a ortholog and CRISPR RNA sequence. Our analysis further revealed robust nonspecific nicking of dsDNA when Cas12a is activated by binding to a target DNA. Together, our findings reveal that Cas12a has multiple nicking activities against dsDNA substrates and that these activities vary among different Cas12a orthologs.


Assuntos
Acidaminococcus/enzimologia , Proteínas de Bactérias/metabolismo , Proteínas Associadas a CRISPR/metabolismo , Sistemas CRISPR-Cas , DNA/genética , Endodesoxirribonucleases/metabolismo , Francisella/enzimologia , Acidaminococcus/genética , Acidaminococcus/metabolismo , Proteínas de Bactérias/genética , Pareamento Incorreto de Bases , Sequência de Bases , Proteínas Associadas a CRISPR/genética , DNA/metabolismo , Clivagem do DNA , Endodesoxirribonucleases/genética , Francisella/genética , Francisella/metabolismo , Edição de Genes/métodos , Expressão Gênica
18.
Database (Oxford) ; 2019(1)2019 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-31680133

RESUMO

Soybean is an important worldwide crop, and farmers continue to experience significant yield loss due to the soybean cyst nematode (SCN), Heterodera glycines. This soil-borne roundworm parasite is rated the most important pathogen problem in soybean production. The infective nematodes enter into complex interactions with their host plant by inducing the development of specialized plant feeding cells that provide the parasites with nourishment. Addressing the SCN problem will require the development of genomic resources and a global collaboration of scientists to analyze and use these resources. SCNBase.org was designed as a collaborative hub for the SCN genome. All data and analyses are downloadable and can be analyzed with three integrated genomic tools: JBrowse, Feature Search and BLAST. At the time of this writing, a number of genomic and transcriptomic data sets are already available, with 43 JBrowse tracks and 21 category pages describing SCN genomic analyses on gene predictions, transcriptome and read alignments, effector-like genes, expansion and contraction of genomic repeats, orthology and synteny with related nematode species, Single Nucleotide Polymorphism (SNPs) from 15 SCN populations and novel splice sites. Standard functional gene annotations were supplemented with orthologous gene annotations using a comparison to nine related plant-parasitic nematodes, thereby enabling functional annotations for 85% of genes. These annotations led to a greater grasp on the SCN effectorome, which include over 3324 putative effector genes. By designing SCNBase as a hub, future research findings and genomic resources can easily be uploaded and made available for use by others with minimal needs for further curation. By providing these resources to nematode research community, scientists will be empowered to develop novel, more effective SCN management tools.


Assuntos
Bases de Dados de Ácidos Nucleicos , Regulação da Expressão Gênica , Genoma Helmíntico , Anotação de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Rabditídios/genética , Animais , Ontologia Genética , Glycine max
19.
BMC Bioinformatics ; 20(1): 436, 2019 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-31438850

RESUMO

BACKGROUND: Creating a scalable computational infrastructure to analyze the wealth of information contained in data repositories is difficult due to significant barriers in organizing, extracting and analyzing relevant data. Shared data science infrastructures like Boag is needed to efficiently process and parse data contained in large data repositories. The main features of Boag are inspired from existing languages for data intensive computing and can easily integrate data from biological data repositories. RESULTS: As a proof of concept, Boa for genomics, Boag, has been implemented to analyze RefSeq's 153,848 annotation (GFF) and assembly (FASTA) file metadata. Boag provides a massive improvement from existing solutions like Python and MongoDB, by utilizing a domain-specific language that uses Hadoop infrastructure for a smaller storage footprint that scales well and requires fewer lines of code. We execute scripts through Boag to answer questions about the genomes in RefSeq. We identify the largest and smallest genomes deposited, explore exon frequencies for assemblies after 2016, identify the most commonly used bacterial genome assembly program, and address how animal genome assemblies have improved since 2016. Boag databases provide a significant reduction in required storage of the raw data and a significant speed up in its ability to query large datasets due to automated parallelization and distribution of Hadoop infrastructure during computations. CONCLUSIONS: In order to keep pace with our ability to produce biological data, innovative methods are required. The Shared Data Science Infrastructure, Boag, provides researchers a greater access to researchers to efficiently explore data in new ways. We demonstrate the potential of a the domain specific language Boag using the RefSeq database to explore how deposited genome assemblies and annotations are changing over time. This is a small example of how Boag could be used with large biological datasets.


Assuntos
Ciência de Dados , Genômica , Disseminação de Informação , Animais , Bases de Dados Factuais , Bases de Dados Genéticas , Éxons/genética , Genoma , Análise de Sequência de DNA , Software
20.
Sci Rep ; 9(1): 1356, 2019 02 04.
Artigo em Inglês | MEDLINE | ID: mdl-30718603

RESUMO

Spliced leader trans-splicing (SLTS) plays a part in the maturation of pre-mRNAs in select species across multiple phyla but is particularly prevalent in Nematoda. The role of spliced leaders (SL) within the cell is unclear and an accurate assessment of SL occurrence within an organism is possible only after extensive sequencing data are available, which is not currently the case for many nematode species. SL discovery is further complicated by an absence of SL sequences from high-throughput sequencing results due to incomplete sequencing of the 5'-ends of transcripts during RNA-seq library preparation, known as 5'-bias. Existing datasets and novel methodology were used to identify both conserved SLs and unique hypervariable SLs within Heterodera glycines, the soybean cyst nematode. In H. glycines, twenty-one distinct SL sequences were found on 2,532 unique H. glycines transcripts. The SL sequences identified on the H. glycines transcripts demonstrated a high level of promiscuity, meaning that some transcripts produced as many as nine different individual SL-transcript combinations. Most uniquely, transcriptome analysis revealed that H. glycines is the first nematode to demonstrate a higher SL trans-splicing rate using a species-specific SL over well-conserved Caenorhabditis elegans SL-like sequences.


Assuntos
Caenorhabditis elegans/genética , Nematoides/genética , Splicing de RNA/genética , RNA Líder para Processamento/genética , Animais , Sequência de Bases , Dosagem de Genes , Ontologia Genética , Genoma , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA Líder para Processamento/química , Especificidade da Espécie , Trans-Splicing/genética , Transcriptoma/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA