Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 48
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
G3 (Bethesda) ; 2024 May 28.
Artículo en Inglés | MEDLINE | ID: mdl-38805695

RESUMEN

The bivalve subclass Pteriomorphia, which includes the economically important scallops, oysters, mussels, and ark clams, exhibits extreme ecological, morphological, and behavioral diversity. Among this diversity are five morphologically distinct eye types, making Pteriomorphia an excellent setting to explore the molecular basis for the evolution of novel traits. Of pteriomorphian bivalves, Limida is the only order lacking genomic resources, greatly limiting the potential phylogenomic analyses related to eyes and phototransduction. Here, we present a limid genome assembly, the disco clam, Ctenoides ales, which is characterized by invaginated eyes, exceptionally long tentacles, and a flashing light display. This genome assembly was constructed with PacBio long reads and Dovetail Omni-CTM proximity-ligation sequencing. The final assembly is ∼2.3Gb and over 99% of the total length is contained in 18 pseudomolecule scaffolds. We annotated 41,064 protein coding genes and report a BUSCO completeness of 91.9% for metazoa_obd10. Additionally, we report a complete and annotated mitochondrial genome, which also had been lacking from Limida. The ∼20Kb mitogenome has 12 protein coding genes, 22 tRNAs, 2 rRNA genes, and a 1,589 bp duplicated sequence containing the origin of replication. The C. ales nuclear genome size is substantially larger than other pteriomorphian genomes, mainly accounted for by transposable element sequences. We inventoried the genome for opsins, the signaling proteins that initiate phototransduction, and found that, unlike its closest eyed-relatives, the scallops, C. ales lacks duplication of the rhabdomeric Gq-protein coupled opsin that is typically used for invertebrate vision. In fact, C. ales has uncharacteristically few opsins relative to the other pteriomorphian families, all of which have unique expansions of xenopsins, a recently discovered opsin subfamily. This chromosome-level assembly, along with the mitogenome, will be valuable resources for comparative genomics and phylogenetics in bivalves and particularly for the understudied but charismatic limids.

3.
Artículo en Inglés | MEDLINE | ID: mdl-38179990

RESUMEN

A fully assembled spirochaete genome was identified as a contaminating scaffold in our red abalone (Haliotis rufescens) genome assembly. In this paper, we describe the analysis of this bacterial genome. The assembled spirochaete genome is 3.25 Mb in size with 48.5 mol% G+C content. The proteomes of 38 species were compared with the spirochaete genome and it was discovered to form an independent branch within the family Spirochaetaceae on the phylogenetic tree. The comparison of 16S rRNA sequences and average nucleotide identity scores between the spirochaete genome with known species of different families in Spirochaetia indicate that it is an unknown species. Further, the percentage of conserved proteins compared to neighbouring taxa confirm that it does not belong to a known genus within Spirochaetaceae. We propose the name Candidatus Haliotispira prima gen. nov., sp. nov. based on its taxonomic placement and origin. We also tested for the presence of this species in different species of abalone and found that it is also present in white abalone (Haliotis sorenseni). In addition, we highlight the need for better classification of taxa within the class Spirochaetia.


Asunto(s)
Gastrópodos , Spirochaeta , Spirochaetaceae , Humanos , Animales , Spirochaetales , Filogenia , ARN Ribosómico 16S/genética , Composición de Base , Análisis de Secuencia de ADN , ADN Bacteriano/genética , Técnicas de Tipificación Bacteriana , Ácidos Grasos/química , Bacterias
4.
Toxicol Sci ; 194(1): 23-37, 2023 06 28.
Artículo en Inglés | MEDLINE | ID: mdl-37208198

RESUMEN

Glyphosate (GLY) is an herbicide used for rural and urban weed control. Urinary GLY in women is associated with shortened gestational length yet effects of GLY on offspring due to maternal exposure are unclear. This study tested the hypothesis that maternal chronic pre-conceptional GLY exposure would cause phenotypic and molecular changes in F1 offspring. Female C57BL/6 mice (7-week-old; n = 40) received saline vehicle control (CT; n = 20) or GLY (2 mg/kg; n = 20) daily per os for 10 weeks. At dosing completion, females were housed with unexposed males and divided into Cohort 1 who were euthanized at gestation day 14 (n = 10 per treatment) and Cohort 2 who completed gestation (n = 10 per treatment). F1 female ovarian and liver samples underwent LC-MS/MS and bioinformatic analysis. Maternal exposure did not affect litter (P > .05) sex ratio, or embryonic or neonatal gross phenotypes. In Cohort 2 offspring, no treatment effect on (P > .05) offspring anogenital distance, puberty onset, or ovarian follicular composition was noted. Body weight was increased (P < .05) in male GLY-exposed compared with CT dam offspring. F1 females from GLY-exposed dams had altered (P < .05) abundance of 54 ovarian and 110 hepatic proteins. Pathways altered in the ovary (false discovery rate [FDR] ≤ 0.07) included thermogenesis and phosphatidylinositol-3 kinase-AKT signaling and in liver (FDR ≤ 0.08) included metabolic, glutathione metabolism, oxidative phosphorylation, non-alcoholic fatty liver disease, and thermogenesis. Thus, pre-conceptional GLY exposure affected offspring phenotypic and molecular profiles potentially impacting reproductive health.


Asunto(s)
Exposición Materna , Efectos Tardíos de la Exposición Prenatal , Humanos , Animales , Ratones , Femenino , Masculino , Exposición Materna/efectos adversos , Ovario , Proteoma , Cromatografía Liquida , Maduración Sexual , Ratones Endogámicos C57BL , Espectrometría de Masas en Tándem , Hígado , Efectos Tardíos de la Exposición Prenatal/inducido químicamente , Glifosato
5.
Genome Biol Evol ; 15(3)2023 03 03.
Artículo en Inglés | MEDLINE | ID: mdl-36792366

RESUMEN

Long-read sequencing has revolutionized genome assembly, yielding highly contiguous, chromosome-level contigs. However, assemblies from some third generation long read technologies, such as Pacific Biosciences (PacBio) continuous long reads (CLR), have a high error rate. Such errors can be corrected with short reads through a process called polishing. Although best practices for polishing non-model de novo genome assemblies were recently described by the Vertebrate Genome Project (VGP) Assembly community, there is a need for a publicly available, reproducible workflow that can be easily implemented and run on a conventional high performance computing environment. Here, we describe polishCLR (https://github.com/isugifNF/polishCLR), a reproducible Nextflow workflow that implements best practices for polishing assemblies made from CLR data. PolishCLR can be initiated from several input options that extend best practices to suboptimal cases. It also provides re-entry points throughout several key processes, including identifying duplicate haplotypes in purge_dups, allowing a break for scaffolding if data are available, and throughout multiple rounds of polishing and evaluation with Arrow and FreeBayes. PolishCLR is containerized and publicly available for the greater assembly community as a tool to complete assemblies from existing, error-prone long-read data.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN , Flujo de Trabajo , Haplotipos
6.
Toxicol Sci ; 190(2): 173-188, 2022 11 23.
Artículo en Inglés | MEDLINE | ID: mdl-36214631

RESUMEN

Perfluorooctanoic acid (PFOA) is an environmentally persistent perfluoroalkyl substance that is widely used in consumer products. Exposure to PFOA is associated with reproductive and developmental effects including endocrine disruption, delayed puberty in girls, and decreased fetal growth. In the United States, obesity affects 40% of women and 20% of girls, with higher rates in minority females. Obesity causes infertility, poor oocyte quality, miscarriage, and offspring defects. This study proposed that PFOA exposure would impact estrous cyclicity, ovarian steroid hormones, and the ovarian proteome and further hypothesized that obesity would impact PFOA-induced ovotoxicity. Female wild type (KK.Cg-a/a; lean) or KK.Cg-Ay/J mice (obese) received saline (CT) or PFOA (2.5 mg/kg) per os for 15 days beginning at 7 weeks of age. There were no effects on food intake, body weight, estrous cyclicity, serum progesterone, and heart, spleen, kidney, or uterus weight (p > .05). Ovary weight was decreased (p < .05) by PFOA exposure relative to vehicle control-treated mice in lean but not obese mice. Liquid chromatography-tandem mass spectrometry was performed on isolated ovarian protein and PFOA exposure altered the ovarian abundance of proteins involved in DNA damage sensing and repair pathways and reproduction pathways (p < .05) differentially in lean and obese mice. The data suggest that PFOA exposure alters ovary weight and differentially targets ovarian proteins in lean and obese females in ways that might reduce female fecundity.


Asunto(s)
Fluorocarburos , Femenino , Ratones , Animales , Fluorocarburos/metabolismo , Ovario , Ratones Obesos , Reproducción , Obesidad/metabolismo , Daño del ADN
7.
Toxicol Sci ; 190(2): 204-214, 2022 11 23.
Artículo en Inglés | MEDLINE | ID: mdl-36173347

RESUMEN

Exposure to glyphosate (GLY), a commonly used herbicide, is supported by urinary detection and associated with shortened gestation in women. This study tested the hypothesis that chronic low-dose pre-conceptional GLY exposure would affect maternal ovarian function mid- and post-gestation. Mice (C57BL/6; n = 40) were exposed per os to saline vehicle control (CT; n = 20) or GLY (2 mg/kg; n = 20) daily for 10 weeks starting at 7 weeks of age. Post-exposure, females were impregnated and euthanized at gestation day 14 (GD14) or post-weaning (PW). Pregnancy success was reduced from 75% to 55% by GLY exposure. No treatment effect (p > .05) on body weight, maternal serum 17ß-estradiol, or litter size was noted. Ovarian weight was unaffected or reduced (p < .05) by GLY in GD14 and PW dams, respectively. Exposure to GLY decreased (p < .05) PW ovarian secondary follicle number with no other follicle composition impacts. Protein abundance analysis by LC-MS/MS identified that GLY altered (p < .05) 26 ovarian and 41 hepatic proteins in GD14 dams and 39 hepatic proteins in PW dams. In GD14 dams, GLY increased ovarian protein abundance of SEC16A (p < .05; 29-fold) and hepatic RPS27L and GM4952 (p < .05; ∼4-fold). In both GD14 and PW dams, GLY exposure increased (p < .05) hepatic RPS4 and decreased (p < .05) ECHDC3. Pathway analysis using DAVID identified 10 GLY hepatic pathway targets with FDR ≤ 0.07 in GD14 dams.


Asunto(s)
Herbicidas , Efectos Tardíos de la Exposición Prenatal , Proteoma , Animales , Femenino , Ratones , Embarazo , Cromatografía Liquida , Retículo Endoplásmico , Aparato de Golgi , Exposición Materna/efectos adversos , Ratones Endogámicos C57BL , Espectrometría de Masas en Tándem , Proteínas de Transporte Vesicular , Herbicidas/toxicidad , Hígado , Ovario/metabolismo , Glifosato
8.
Cancers (Basel) ; 14(14)2022 Jul 20.
Artículo en Inglés | MEDLINE | ID: mdl-35884586

RESUMEN

Lipopolysaccharide (LPS) is associated with chronic intestinal inflammation and promotes intestinal cancer progression in the gut. While the interplay between LPS and intestinal immune cells has been well-characterized, little is known about LPS and the intestinal epithelium interactions. In this study, we explored the differential effects of LPS on proliferation and the transcriptome in 3D enteroids/colonoids obtained from dogs with naturally occurring gastrointestinal (GI) diseases including inflammatory bowel disease (IBD) and intestinal mast cell tumor. The study objective was to analyze the LPS-induced modulation of signaling pathways involving the intestinal epithelia and contributing to colorectal cancer development in the context of an inflammatory (IBD) or a tumor microenvironment. While LPS incubation resulted in a pro-cancer gene expression pattern and stimulated proliferation of IBD enteroids and colonoids, downregulation of several cancer-associated genes such as Gpatch4, SLC7A1, ATP13A2, and TEX45 was also observed in tumor enteroids. Genes participating in porphyrin metabolism (CP), nucleocytoplasmic transport (EEF1A1), arachidonic acid, and glutathione metabolism (GPX1) exhibited a similar pattern of altered expression between IBD enteroids and IBD colonoids following LPS stimulation. In contrast, genes involved in anion transport, transcription and translation, apoptotic processes, and regulation of adaptive immune responses showed the opposite expression patterns between IBD enteroids and colonoids following LPS treatment. In brief, the crosstalk between LPS/TLR4 signal transduction pathway and several metabolic pathways such as primary bile acid biosynthesis and secretion, peroxisome, renin-angiotensin system, glutathione metabolism, and arachidonic acid pathways may be important in driving chronic intestinal inflammation and intestinal carcinogenesis.

9.
Nat Chem Biol ; 18(1): 38-46, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34711982

RESUMEN

Inefficient homology-directed repair (HDR) constrains CRISPR-Cas9 genome editing in organisms that preferentially employ nonhomologous end joining (NHEJ) to fix DNA double-strand breaks (DSBs). Current strategies used to alleviate NHEJ proficiency involve NHEJ disruption. To confer precision editing without NHEJ disruption, we identified the shortcomings of the conventional CRISPR platforms and developed a CRISPR platform-lowered indel nuclease system enabling accurate repair (LINEAR)-which enhanced HDR rates (to 67-100%) compared to those in previous reports using conventional platforms in four NHEJ-proficient yeasts. With NHEJ preserved, we demonstrate its ability to survey genomic landscapes, identifying loci whose spatiotemporal genomic architectures yield favorable expression dynamics for heterologous pathways. We present a case study that deploys LINEAR precision editing and NHEJ-mediated random integration to rapidly engineer and optimize a microbial factory to produce (S)-norcoclaurine. Taken together, this work demonstrates how to leverage an antagonizing pair of DNA DSB repair pathways to expand the current collection of microbial factories.


Asunto(s)
Sistemas CRISPR-Cas , Ingeniería Genética , Saccharomyces cerevisiae/genética , Reparación del ADN por Unión de Extremidades , Fermentación , Genes Fúngicos
10.
Mol Ecol Resour ; 21(7): 2407-2422, 2021 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-34036752

RESUMEN

The soybean cyst nematode (Heterodera glycines) is a sedentary plant parasite that exceeds billion USD annually in yield losses. This problem is exacerbated by H. glycines populations overcoming the limited sources of natural resistance in soybean and by the lack of effective and safe alternative treatments. Although there are genetic determinants that render soybeans resistant to nematode genotypes, resistant soybeans are increasingly ineffective because their multiyear usage has selected for virulent H. glycines populations. Successful H. glycines infection relies on the comprehensive re-engineering of soybean root cells into a syncytium, as well as the long-term suppression of host defences to ensure syncytial viability. At the forefront of these complex molecular interactions are effectors, the proteins secreted by H. glycines into host root tissues. The mechanisms that control genomic effector acquisition, diversification, and selection are important insights needed for the development of essential novel control strategies. As a foundation to obtain this understanding, we created a nine-scaffold, 158 Mb pseudomolecule assembly of the H. glycines genome using PacBio, Chicago, and Hi-C sequencing. A Mikado consensus gene prediction produced an annotation of 22,465 genes using short- and long-read expression data. To evaluate assembly and annotation quality, we cross-examined synteny among H. glycines assemblies, and compared BUSCO across related species. To describe the predicted proteins involved in H. glycines' secretory pathway, we contrasted expression between preparasitic and parasitic stages with functional gene information. Here, we present the results from our assembly and annotation of the H. glycines genome and contribute this resource to the scientific community.


Asunto(s)
Quistes , Tylenchoidea , Animales , Cromosomas , Genoma , Glycine max/genética , Tylenchoidea/genética
11.
PLoS One ; 16(4): e0249899, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33909645

RESUMEN

Rocky Mountain elk (Cervus canadensis) populations have significant economic implications to the cattle industry, as they are a major reservoir for Brucella abortus in the Greater Yellowstone area. Vaccination attempts against intracellular bacterial diseases in elk populations have not been successful due to a negligible adaptive cellular immune response. A lack of genomic resources has impeded attempts to better understand why vaccination does not induce protective immunity. To overcome this limitation, PacBio, Illumina, and Hi-C sequencing with a total of 686-fold coverage was used to assemble the elk genome into 35 pseudomolecules. A robust gene annotation was generated resulting in 18,013 gene models and 33,422 mRNAs. The accuracy of the assembly was assessed using synteny to the red deer and cattle genomes identifying several chromosomal rearrangements, fusions and fissions. Because this genome assembly and annotation provide a foundation for genome-enabled exploration of Cervus species, we demonstrate its utility by exploring the conservation of immune system-related genes. We conclude by comparing cattle immune system-related genes to the elk genome, revealing eight putative gene losses in elk.


Asunto(s)
Ciervos/genética , Genoma , Animales , Bovinos , Fusión Génica , Reordenamiento Génico , Inmunidad/genética , Seudogenes/genética , ARN Mensajero/metabolismo
12.
Nucleic Acids Res ; 49(7): 4037-4053, 2021 04 19.
Artículo en Inglés | MEDLINE | ID: mdl-33744974

RESUMEN

Cas9 is an RNA-guided endonuclease in the bacterial CRISPR-Cas immune system and a popular tool for genome editing. The commonly used Streptococcus pyogenes Cas9 (SpCas9) is relatively non-specific and prone to off-target genome editing. Other Cas9 orthologs and engineered variants of SpCas9 have been reported to be more specific. However, previous studies have focused on specificity of double-strand break (DSB) or indel formation, potentially overlooking alternative cleavage activities of these Cas9 variants. In this study, we employed in vitro cleavage assays of target libraries coupled with high-throughput sequencing to systematically compare cleavage activities and specificities of two natural Cas9 variants (SpCas9 and Staphylococcus aureus Cas9) and three engineered SpCas9 variants (SpCas9 HF1, HypaCas9 and HiFi Cas9). We observed that all Cas9s tested could cleave target sequences with up to five mismatches. However, the rate of cleavage of both on-target and off-target sequences varied based on target sequence and Cas9 variant. In addition, SaCas9 and engineered SpCas9 variants nick targets with multiple mismatches but have a defect in generating a DSB, while SpCas9 creates DSBs at these targets. Overall, these differences in cleavage rates and DSB formation may contribute to varied specificities observed in genome editing studies.


Asunto(s)
Proteína 9 Asociada a CRISPR , Sistemas CRISPR-Cas , Staphylococcus aureus/genética , Proteína 9 Asociada a CRISPR/genética , Proteína 9 Asociada a CRISPR/metabolismo , Edición Génica , Especificidad por Sustrato
13.
Commun Biol ; 4(1): 253, 2021 02 26.
Artículo en Inglés | MEDLINE | ID: mdl-33637860

RESUMEN

While it is well known that the genome can affect social behavior, recent models posit that social lifestyles can, in turn, influence genome evolution. Here, we perform the most phylogenetically comprehensive comparative analysis of 16 bee genomes to date: incorporating two published and four new carpenter bee genomes (Apidae: Xylocopinae) for a first-ever genomic comparison with a monophyletic clade containing solitary through advanced eusocial taxa. We find that eusocial lineages have undergone more gene family expansions, feature more signatures of positive selection, and have higher counts of taxonomically restricted genes than solitary and weakly social lineages. Transcriptomic data reveal that caste-affiliated genes are deeply-conserved; gene regulatory and functional elements are more closely tied to social phenotype than phylogenetic lineage; and regulatory complexity increases steadily with social complexity. Overall, our study provides robust empirical evidence that social evolution can act as a major and surprisingly consistent driver of macroevolutionary genomic change.


Asunto(s)
Abejas/genética , Conducta Animal , Evolución Molecular , Genes de Insecto , Genoma de los Insectos , Conducta Social , Animales , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Interacción Gen-Ambiente , Genómica , Filogenia , Especificidad de la Especie , Transcriptoma
14.
Plant Physiol ; 184(2): 960-972, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-32737073

RESUMEN

Maize (Zea mays) thick aleurone1 (thk1-R) mutants form multiple aleurone layers in the endosperm and have arrested embryogenesis. Prior studies suggest that thk1 functions downstream of defective kernel1 (dek1) in a regulatory pathway that controls aleurone cell fate and other endosperm traits. The original thk1-R mutant contained an ∼2-Mb multigene deletion, which precluded identification of the causal gene. Here, ethyl methanesulfonate mutagenesis produced additional alleles, and RNA sequencing from developing endosperm was used to identify a candidate gene based on differential expression compared with the wild-type progenitor. Gene editing confirmed the gene identity by producing mutant alleles that failed to complement existing thk1 mutants and that produced multiple-aleurone homozygous phenotypes. Thk1 encodes a homolog of NEGATIVE ON TATA-LESS1, a protein that acts as a scaffold for the CARBON CATABOLITE REPRESSION4-NEGATIVE ON TATA-LESS complex. This complex is highly conserved and essential in all eukaryotes for regulating a wide array of gene expression and cellular activities. Maize also harbors a duplicate locus, thick aleurone-like1, which likely accounts for the ability of thk1 mutants to form viable cells. Transcriptomic analysis indicated that THK1 regulates activities involving cell division, signaling, differentiation, and metabolism. Identification of thk1 provides an important new component of the DEK1 regulatory system that patterns cell fate in endosperm.


Asunto(s)
Diferenciación Celular/genética , Endospermo/citología , Endospermo/crecimiento & desarrollo , Endospermo/genética , Zea mays/citología , Zea mays/crecimiento & desarrollo , Zea mays/genética , Productos Agrícolas/citología , Productos Agrícolas/genética , Productos Agrícolas/crecimiento & desarrollo , Perfilación de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Genes de Plantas , Variación Genética , Genotipo , Mutación , Fenotipo
15.
Bioinformatics ; 36(18): 4699-4705, 2020 09 15.
Artículo en Inglés | MEDLINE | ID: mdl-32579213

RESUMEN

MOTIVATION: As the cost of sequencing decreases, the amount of data being deposited into public repositories is increasing rapidly. Public databases rely on the user to provide metadata for each submission that is prone to user error. Unfortunately, most public databases, such as non-redundant (NR), rely on user input and do not have methods for identifying errors in the provided metadata, leading to the potential for error propagation. Previous research on a small subset of the NR database analyzed misclassification based on sequence similarity. To the best of our knowledge, the amount of misclassification in the entire database has not been quantified. We propose a heuristic method to detect potentially misclassified taxonomic assignments in the NR database. We applied a curation technique and quality control to find the most probable taxonomic assignment. Our method incorporates provenance and frequency of each annotation from manually and computationally created databases and clustering information at 95% similarity. RESULTS: We found more than two million potentially taxonomically misclassified proteins in the NR database. Using simulated data, we show a high precision of 97% and a recall of 87% for detecting taxonomically misclassified proteins. The proposed approach and findings could also be applied to other databases. AVAILABILITY AND IMPLEMENTATION: Source code, dataset, documentation, Jupyter notebooks and Docker container are available at https://github.com/boalang/nr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Metadatos , Programas Informáticos , Bases de Datos Factuales
16.
J Biol Chem ; 295(17): 5538-5553, 2020 04 24.
Artículo en Inglés | MEDLINE | ID: mdl-32161115

RESUMEN

Cas12a (Cpf1) is an RNA-guided endonuclease in the bacterial type V-A CRISPR-Cas anti-phage immune system that can be repurposed for genome editing. Cas12a can bind and cut dsDNA targets with high specificity in vivo, making it an ideal candidate for expanding the arsenal of enzymes used in precise genome editing. However, this reported high specificity contradicts Cas12a's natural role as an immune effector against rapidly evolving phages. Here, we employed high-throughput in vitro cleavage assays to determine and compare the native cleavage specificities and activities of three different natural Cas12a orthologs (FnCas12a, LbCas12a, and AsCas12a). Surprisingly, we observed pervasive sequence-specific nicking of randomized target libraries, with strong nicking of DNA sequences containing up to four mismatches in the Cas12a-targeted DNA-RNA hybrid sequences. We also found that these nicking and cleavage activities depend on mismatch type and position and vary with Cas12a ortholog and CRISPR RNA sequence. Our analysis further revealed robust nonspecific nicking of dsDNA when Cas12a is activated by binding to a target DNA. Together, our findings reveal that Cas12a has multiple nicking activities against dsDNA substrates and that these activities vary among different Cas12a orthologs.


Asunto(s)
Acidaminococcus/enzimología , Proteínas Bacterianas/metabolismo , Proteínas Asociadas a CRISPR/metabolismo , Sistemas CRISPR-Cas , ADN/genética , Endodesoxirribonucleasas/metabolismo , Francisella/enzimología , Acidaminococcus/genética , Acidaminococcus/metabolismo , Proteínas Bacterianas/genética , Disparidad de Par Base , Secuencia de Bases , Proteínas Asociadas a CRISPR/genética , ADN/metabolismo , División del ADN , Endodesoxirribonucleasas/genética , Francisella/genética , Francisella/metabolismo , Edición Génica/métodos , Expresión Génica
17.
Database (Oxford) ; 2019(1)2019 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-31680133

RESUMEN

Soybean is an important worldwide crop, and farmers continue to experience significant yield loss due to the soybean cyst nematode (SCN), Heterodera glycines. This soil-borne roundworm parasite is rated the most important pathogen problem in soybean production. The infective nematodes enter into complex interactions with their host plant by inducing the development of specialized plant feeding cells that provide the parasites with nourishment. Addressing the SCN problem will require the development of genomic resources and a global collaboration of scientists to analyze and use these resources. SCNBase.org was designed as a collaborative hub for the SCN genome. All data and analyses are downloadable and can be analyzed with three integrated genomic tools: JBrowse, Feature Search and BLAST. At the time of this writing, a number of genomic and transcriptomic data sets are already available, with 43 JBrowse tracks and 21 category pages describing SCN genomic analyses on gene predictions, transcriptome and read alignments, effector-like genes, expansion and contraction of genomic repeats, orthology and synteny with related nematode species, Single Nucleotide Polymorphism (SNPs) from 15 SCN populations and novel splice sites. Standard functional gene annotations were supplemented with orthologous gene annotations using a comparison to nine related plant-parasitic nematodes, thereby enabling functional annotations for 85% of genes. These annotations led to a greater grasp on the SCN effectorome, which include over 3324 putative effector genes. By designing SCNBase as a hub, future research findings and genomic resources can easily be uploaded and made available for use by others with minimal needs for further curation. By providing these resources to nematode research community, scientists will be empowered to develop novel, more effective SCN management tools.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Regulación de la Expresión Génica , Genoma de los Helmintos , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple , Rabdítidos/genética , Animales , Ontología de Genes , Glycine max
18.
BMC Bioinformatics ; 20(1): 436, 2019 Aug 22.
Artículo en Inglés | MEDLINE | ID: mdl-31438850

RESUMEN

BACKGROUND: Creating a scalable computational infrastructure to analyze the wealth of information contained in data repositories is difficult due to significant barriers in organizing, extracting and analyzing relevant data. Shared data science infrastructures like Boag is needed to efficiently process and parse data contained in large data repositories. The main features of Boag are inspired from existing languages for data intensive computing and can easily integrate data from biological data repositories. RESULTS: As a proof of concept, Boa for genomics, Boag, has been implemented to analyze RefSeq's 153,848 annotation (GFF) and assembly (FASTA) file metadata. Boag provides a massive improvement from existing solutions like Python and MongoDB, by utilizing a domain-specific language that uses Hadoop infrastructure for a smaller storage footprint that scales well and requires fewer lines of code. We execute scripts through Boag to answer questions about the genomes in RefSeq. We identify the largest and smallest genomes deposited, explore exon frequencies for assemblies after 2016, identify the most commonly used bacterial genome assembly program, and address how animal genome assemblies have improved since 2016. Boag databases provide a significant reduction in required storage of the raw data and a significant speed up in its ability to query large datasets due to automated parallelization and distribution of Hadoop infrastructure during computations. CONCLUSIONS: In order to keep pace with our ability to produce biological data, innovative methods are required. The Shared Data Science Infrastructure, Boag, provides researchers a greater access to researchers to efficiently explore data in new ways. We demonstrate the potential of a the domain specific language Boag using the RefSeq database to explore how deposited genome assemblies and annotations are changing over time. This is a small example of how Boag could be used with large biological datasets.


Asunto(s)
Ciencia de los Datos , Genómica , Difusión de la Información , Animales , Bases de Datos Factuales , Bases de Datos Genéticas , Exones/genética , Genoma , Análisis de Secuencia de ADN , Programas Informáticos
19.
Sci Rep ; 9(1): 1356, 2019 02 04.
Artículo en Inglés | MEDLINE | ID: mdl-30718603

RESUMEN

Spliced leader trans-splicing (SLTS) plays a part in the maturation of pre-mRNAs in select species across multiple phyla but is particularly prevalent in Nematoda. The role of spliced leaders (SL) within the cell is unclear and an accurate assessment of SL occurrence within an organism is possible only after extensive sequencing data are available, which is not currently the case for many nematode species. SL discovery is further complicated by an absence of SL sequences from high-throughput sequencing results due to incomplete sequencing of the 5'-ends of transcripts during RNA-seq library preparation, known as 5'-bias. Existing datasets and novel methodology were used to identify both conserved SLs and unique hypervariable SLs within Heterodera glycines, the soybean cyst nematode. In H. glycines, twenty-one distinct SL sequences were found on 2,532 unique H. glycines transcripts. The SL sequences identified on the H. glycines transcripts demonstrated a high level of promiscuity, meaning that some transcripts produced as many as nine different individual SL-transcript combinations. Most uniquely, transcriptome analysis revealed that H. glycines is the first nematode to demonstrate a higher SL trans-splicing rate using a species-specific SL over well-conserved Caenorhabditis elegans SL-like sequences.


Asunto(s)
Caenorhabditis elegans/genética , Nematodos/genética , Empalme del ARN/genética , ARN Lider Empalmado/genética , Animales , Secuencia de Bases , Dosificación de Gen , Ontología de Genes , Genoma , ARN Mensajero/genética , ARN Mensajero/metabolismo , ARN Lider Empalmado/química , Especificidad de la Especie , Trans-Empalme/genética , Transcriptoma/genética
20.
BMC Genomics ; 20(1): 119, 2019 Feb 07.
Artículo en Inglés | MEDLINE | ID: mdl-30732586

RESUMEN

BACKGROUND: Heterodera glycines, commonly referred to as the soybean cyst nematode (SCN), is an obligatory and sedentary plant parasite that causes over a billion-dollar yield loss to soybean production annually. Although there are genetic determinants that render soybean plants resistant to certain nematode genotypes, resistant soybean cultivars are increasingly ineffective because their multi-year usage has selected for virulent H. glycines populations. The parasitic success of H. glycines relies on the comprehensive re-engineering of an infection site into a syncytium, as well as the long-term suppression of host defense to ensure syncytial viability. At the forefront of these complex molecular interactions are effectors, the proteins secreted by H. glycines into host root tissues. The mechanisms of effector acquisition, diversification, and selection need to be understood before effective control strategies can be developed, but the lack of an annotated genome has been a major roadblock. RESULTS: Here, we use PacBio long-read technology to assemble a H. glycines genome of 738 contigs into 123 Mb with annotations for 29,769 genes. The genome contains significant numbers of repeats (34%), tandem duplicates (18.7 Mb), and horizontal gene transfer events (151 genes). A large number of putative effectors (431 genes) were identified in the genome, many of which were found in transposons. CONCLUSIONS: This advance provides a glimpse into the host and parasite interplay by revealing a diversity of mechanisms that give rise to virulence genes in the soybean cyst nematode, including: tandem duplications containing over a fifth of the total gene count, virulence genes hitchhiking in transposons, and 107 horizontal gene transfers not reported in other plant parasitic nematodes thus far. Through extensive characterization of the H. glycines genome, we provide new insights into H. glycines biology and shed light onto the mystery underlying complex host-parasite interactions. This genome sequence is an important prerequisite to enable work towards generating new resistance or control measures against H. glycines.


Asunto(s)
Evolución Molecular , Duplicación de Gen , Genómica , Glycine max/parasitología , Tylenchoidea/genética , Tylenchoidea/fisiología , Animales , Genotipo , Interacciones Huésped-Parásitos , Anotación de Secuencia Molecular , Enfermedades de las Plantas/parasitología , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...