Pesquisa | Portal Regional da BVS

1.

Properties and predicted functions of large genes and proteins of apicomplexan parasites.

Fang, Tiffany; Mohseni, Amir; Lonardi, Stefano; Ben Mamoun, Choukri.

NAR Genom Bioinform ; 6(2): lqae032, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38584870

RESUMO

Evolutionary constraints greatly favor compact genomes that efficiently encode proteins. However, several eukaryotic organisms, including apicomplexan parasites such as Toxoplasma gondii, Plasmodium falciparum and Babesia duncani, the causative agents of toxoplasmosis, malaria and babesiosis, respectively, encode very large proteins, exceeding 20 times their average protein size. Although these large proteins represent <1% of the total protein pool and are generally expressed at low levels, their persistence throughout evolution raises important questions about their functions and possible evolutionary pressures to maintain them. In this study, we examined the trends in gene and protein size, function and expression patterns within seven apicomplexan pathogens. Our analysis revealed that certain large proteins in apicomplexan parasites harbor domains potentially important for functions such as antigenic variation, erythrocyte invasion and immune evasion. However, these domains are not limited to or strictly conserved within large proteins. While some of these proteins are predicted to engage in conventional metabolic pathways within these parasites, others fulfill specialized functions for pathogen-host interactions, nutrient acquisition and overall survival.

2.

Comprehensive assessment of 11 de novo HiFi assemblers on complex eukaryotic genomes and metagenomes.

Yu, Wenjuan; Luo, Haohui; Yang, Jinbao; Zhang, Shengchen; Jiang, Heling; Zhao, Xianjia; Hui, Xingqi; Sun, Da; Li, Liang; Wei, Xiu-Qing; Lonardi, Stefano; Pan, Weihua.

Genome Res ; 34(2): 326-340, 2024 Mar 20.

Artigo em Inglês | MEDLINE | ID: mdl-38428994

RESUMO

Pacific Biosciences (PacBio) HiFi sequencing technology generates long reads (>10 kbp) with very high accuracy (<0.01% sequencing error). Although several de novo assembly tools are available for HiFi reads, there are no comprehensive studies on the evaluation of these assemblers. We evaluated the performance of 11 de novo HiFi assemblers on (1) real data for three eukaryotic genomes; (2) 34 synthetic data sets with different ploidy, sequencing coverage levels, heterozygosity rates, and sequencing error rates; (3) one real metagenomic data set; and (4) five synthetic metagenomic data sets with different composition abundance and heterozygosity rates. The 11 assemblers were evaluated using quality assessment tool (QUAST) and benchmarking universal single-copy ortholog (BUSCO). We also used several additional criteria, namely, completion rate, single-copy completion rate, duplicated completion rate, average proportion of largest category, average distance difference, quality value, run-time, and memory utilization. Results show that hifiasm and hifiasm-meta should be the first choice for assembling eukaryotic genomes and metagenomes with HiFi data. We performed a comprehensive benchmarking study of commonly used assemblers on complex eukaryotic genomes and metagenomes. Our study will help the research community to choose the most appropriate assembler for their data and identify possible improvements in assembly algorithms.

Assuntos

Metagenoma , Software , Análise de Sequência de DNA/métodos , Algoritmos , Metagenômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos

3.

Multiomics analysis reveals B. MO1 as a distinct Babesia species and provides insights into its evolution and virulence.

Singh, Pallavi; Vydyam, Pratap; Fang, Tiffany; Estrada, Karel; Gonzalez, Luis Miguel; Grande, Ricardo; Kumar, Madelyn; Chakravarty, Sakshar; Berry, Vincent; Ranwez, Vincent; Carcy, Bernard; Depoix, Delphine; Sánchez, Sergio; Cornillot, Emmanuel; Abel, Steven; Ciampossin, Loic; Lenz, Todd; Harb, Omar; Sanchez-Flores, Alejandro; Montero, Estrella; Le Roch, Karine G; Lonardi, Stefano; Ben Mamoun, Choukri.

bioRxiv ; 2024 Jan 18.

Artigo em Inglês | MEDLINE | ID: mdl-38293033

RESUMO

Babesiosis, caused by protozoan parasites of the genus Babesia , is an emerging tick-borne disease of significance for both human and animal health. Babesia parasites infect erythrocytes of vertebrate hosts where they develop and multiply rapidly to cause the pathological symptoms associated with the disease. The identification of various Babesia species underscores the ongoing risk of new zoonotic pathogens capable of infecting humans, a concern amplified by anthropogenic activities and environmental shifts impacting the distribution and transmission dynamics of parasites, their vectors, and reservoir hosts. One such species, Babesia MO1, previously implicated in severe cases of human babesiosis in the midwestern United States, was initially considered closely related to B. divergens , the predominant agent of human babesiosis in Europe. Yet, uncertainties persist regarding whether these pathogens represent distinct variants of the same species or are entirely separate species. We show that although both B. MO1 and B. divergens share similar genome sizes, comprising three nuclear chromosomes, one linear mitochondrial chromosome, and one circular apicoplast chromosome, major differences exist in terms of genomic sequence divergence, gene functions, transcription profiles, replication rates and susceptibility to antiparasitic drugs. Furthermore, both pathogens have evolved distinct classes of multigene families, crucial for their pathogenicity and adaptation to specific mammalian hosts. Leveraging genomic information for B. MO1, B. divergens , and other members of the Babesiidae family within Apicomplexa provides valuable insights into the evolution, diversity, and virulence of these parasites. This knowledge serves as a critical tool in preemptively addressing the emergence and rapid transmission of more virulent strains.

4.

A Mid-Density Single-Nucleotide Polymorphism Panel for Molecular Applications in Cowpea (Vigna unguiculata (L.) Walp).

Ongom, Patrick Obia; Fatokun, Christian; Togola, Abou; Garcia-Oliveira, Ana Luisa; Ng, Eng Hwa; Kilian, Andrzej; Lonardi, Stefano; Close, Timothy J; Boukar, Ousmane.

Int J Genomics ; 2024: 9912987, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38235497

RESUMO

Molecular markers are increasingly being deployed to accelerate genetic gain in crop plants. The objective of this study was to assess the potential of a mid-density genotyping panel for molecular applications in cowpea breeding. A core set of 2,602 targeted diversity array technology (DArTag) single-nucleotide polymorphisms (SNPs) was designed from an existing 51,128 Cowpea iSelect Consortium Array. The panel's usefulness was assessed using 376 genotypes from different populations of known genetic backgrounds. The panel was informative, with over 78% of SNPs exceeding a minor allele frequency of 0.20. The panel decoded three stratifications in the constituted population, as was expected. Linkage disequilibrium (LD) decay was correctly depicted as slower in a biparental subset than in other populations. A known flower and seed coat color gene region was located on chromosome Vu07, suggesting that the mid-density panel may be used to hypothesize genomic regions underlying target traits in cowpea. Unexpected heterozygosity was detected in some lines and highly among F1 progenies, divulging the panel's potential application in germplasm purity and hybridity verification. The study unveils the potential of an excellent genomic resource that can be tapped to enhance the development of improved cowpea cultivars.

5.

A view of the pan-genome of domesticated Cowpea (Vigna unguiculata [L.] Walp.).

Liang, Qihua; Muñoz-Amatriaín, María; Shu, Shengqiang; Lo, Sassoum; Wu, Xinyi; Carlson, Joseph W; Davidson, Patrick; Goodstein, David M; Phillips, Jeremy; Janis, Nadia M; Lee, Elaine J; Liang, Chenxi; Morrell, Peter L; Farmer, Andrew D; Xu, Pei; Close, Timothy J; Lonardi, Stefano.

Plant Genome ; 17(1): e20319, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-36946261

RESUMO

Cowpea, Vigna unguiculata L. Walp., is a diploid warm-season legume of critical importance as both food and fodder in sub-Saharan Africa. This species is also grown in Northern Africa, Europe, Latin America, North America, and East to Southeast Asia. To capture the genomic diversity of domesticates of this important legume, de novo genome assemblies were produced for representatives of six subpopulations of cultivated cowpea identified previously from genotyping of several hundred diverse accessions. In the most complete assembly (IT97K-499-35), 26,026 core and 4963 noncore genes were identified, with 35,436 pan genes when considering all seven accessions. GO terms associated with response to stress and defense response were highly enriched among the noncore genes, while core genes were enriched in terms related to transcription factor activity, and transport and metabolic processes. Over 5 million single nucleotide polymorphisms (SNPs) relative to each assembly and over 40 structural variants >1 Mb in size were identified by comparing genomes. Vu10 was the chromosome with the highest frequency of SNPs, and Vu04 had the most structural variants. Noncore genes harbor a larger proportion of potentially disruptive variants than core genes, including missense, stop gain, and frameshift mutations; this suggests that noncore genes substantially contribute to diversity within domesticated cowpea.

Assuntos

Fabaceae , Vigna , Vigna/genética , Genoma de Planta , Genes de Plantas , Fabaceae/genética , Locos de Características Quantitativas

6.

Corrigendum: Seed coat pattern QTL and development in cowpea (Vigna unguiculata [L.] Walp.).

Herniter, Ira A; Lo, Ryan; Muñoz-Amatriaín, María; Lo, Sassoum; Guo, Yi-Ning; Huynh, Bao-Lam; Lucas, Mitchell; Jia, Zhenyu; Roberts, Philip A; Lonardi, Stefano; Close, Timothy J.

Front Plant Sci ; 14: 1299051, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-38023847

RESUMO

[This corrects the article DOI: 10.3389/fpls.2019.01346.].

7.

Babesia BdFE1 esterase is required for the anti-parasitic activity of the ACE inhibitor fosinopril.

Vydyam, Pratap; Choi, Jae-Yeon; Gihaz, Shalev; Chand, Meenal; Gewirtz, Meital; Thekkiniath, Jose; Lonardi, Stefano; Gennaro, Joseph C; Ben Mamoun, Choukri.

J Biol Chem ; 299(11): 105313, 2023 11.

Artigo em Inglês | MEDLINE | ID: mdl-37797695

RESUMO

Effective and safe therapies for the treatment of diseases caused by intraerythrocytic parasites are impeded by the rapid emergence of drug resistance and the lack of novel drug targets. One such disease is human babesiosis, which is a rapidly emerging tick-borne illness caused by Babesia parasites. In this study, we identified fosinopril, a phosphonate-containing, FDA-approved angiotensin converting enzyme (ACE) inhibitor commonly used as a prodrug for hypertension and heart failure, as a potent inhibitor of Babesia duncani parasite development within human erythrocytes. Cell biological and mass spectrometry analyses revealed that the conversion of fosinopril to its active diacid molecule, fosinoprilat, is essential for its antiparasitic activity. We show that this conversion is mediated by a parasite-encoded esterase, BdFE1, which is highly conserved among apicomplexan parasites. Parasites carrying the L238H mutation in the active site of BdFE1 failed to convert the prodrug to its active moiety and became resistant to the drug. Our data set the stage for the development of this class of drugs for the therapy of vector-borne parasitic diseases.

Assuntos

Babesia , Parasitos , Pró-Fármacos , Animais , Humanos , Inibidores da Enzima Conversora de Angiotensina/farmacologia , Fosinopril/farmacologia , Pró-Fármacos/farmacologia , Esterases/metabolismo

8.

acCRISPR: an activity-correction method for improving the accuracy of CRISPR screens.

Ramesh, Adithya; Trivedi, Varun; Lee, Sangcheon; Tafrishi, Aida; Schwartz, Cory; Mohseni, Amirsadra; Li, Mengwan; Lonardi, Stefano; Wheeldon, Ian.

Commun Biol ; 6(1): 617, 2023 06 08.

Artigo em Inglês | MEDLINE | ID: mdl-37291233

RESUMO

High throughput CRISPR screens are revolutionizing the way scientists unravel the genetic underpinnings of engineered and evolved phenotypes. One of the critical challenges in accurately assessing screening outcomes is accounting for the variability in sgRNA cutting efficiency. Poorly active guides targeting genes essential to screening conditions obscure the growth defects that are expected from disrupting them. Here, we develop acCRISPR, an end-to-end pipeline that identifies essential genes in pooled CRISPR screens using sgRNA read counts obtained from next-generation sequencing. acCRISPR uses experimentally determined cutting efficiencies for each guide in the library to provide an activity correction to the screening outcomes via calculation of an optimization metric, thus determining the fitness effect of disrupted genes. CRISPR-Cas9 and -Cas12a screens were carried out in the non-conventional oleaginous yeast Yarrowia lipolytica and acCRISPR was used to determine a high-confidence set of essential genes for growth under glucose, a common carbon source used for the industrial production of oleochemicals. acCRISPR was also used in screens quantifying relative cellular fitness under high salt conditions to identify genes that were related to salt tolerance. Collectively, this work presents an experimental-computational framework for CRISPR-based functional genomics studies that may be expanded to other non-conventional organisms of interest.

Assuntos

Sistemas CRISPR-Cas , Yarrowia , Biblioteca Gênica , Genômica , Genes Essenciais , Yarrowia/genética

9.

On the prediction of non-CG DNA methylation using machine learning.

Sereshki, Saleh; Lee, Nathan; Omirou, Michalis; Fasoula, Dionysia; Lonardi, Stefano.

NAR Genom Bioinform ; 5(2): lqad045, 2023 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-37206627

RESUMO

DNA methylation can be detected and measured using sequencing instruments after sodium bisulfite conversion, but experiments can be expensive for large eukaryotic genomes. Sequencing nonuniformity and mapping biases can leave parts of the genome with low or no coverage, thus hampering the ability of obtaining DNA methylation levels for all cytosines. To address these limitations, several computational methods have been proposed that can predict DNA methylation from the DNA sequence around the cytosine or from the methylation level of nearby cytosines. However, most of these methods are entirely focused on CG methylation in humans and other mammals. In this work, we study, for the first time, the problem of predicting cytosine methylation for CG, CHG and CHH contexts on six plant species, either from the DNA primary sequence around the cytosine or from the methylation levels of neighboring cytosines. In this framework, we also study the cross-species prediction problem and the cross-context prediction problem (within the same species). Finally, we show that providing gene and repeat annotations allows existing classifiers to significantly improve their prediction accuracy. We introduce a new classifier called AMPS (annotation-based methylation prediction from sequence) that takes advantage of genomic annotations to achieve higher accuracy.

10.

Babesia duncani multi-omics identifies virulence factors and drug targets.

Singh, Pallavi; Lonardi, Stefano; Liang, Qihua; Vydyam, Pratap; Khabirova, Eleonora; Fang, Tiffany; Gihaz, Shalev; Thekkiniath, Jose; Munshi, Muhammad; Abel, Steven; Ciampossin, Loic; Batugedara, Gayani; Gupta, Mohit; Lu, Xueqing Maggie; Lenz, Todd; Chakravarty, Sakshar; Cornillot, Emmanuel; Hu, Yangyang; Ma, Wenxiu; Gonzalez, Luis Miguel; Sánchez, Sergio; Estrada, Karel; Sánchez-Flores, Alejandro; Montero, Estrella; Harb, Omar S; Le Roch, Karine G; Mamoun, Choukri Ben.

Nat Microbiol ; 8(5): 845-859, 2023 05.

Artigo em Inglês | MEDLINE | ID: mdl-37055610

RESUMO

Babesiosis is a malaria-like disease in humans and animals that is caused by Babesia species, which are tick-transmitted apicomplexan pathogens. Babesia duncani causes severe to lethal infection in humans, but despite the risk that this parasite poses as an emerging pathogen, little is known about its biology, metabolic requirements or pathogenesis. Unlike other apicomplexan parasites that infect red blood cells, B. duncani can be continuously cultured in vitro in human erythrocytes and can infect mice resulting in fulminant babesiosis and death. We report comprehensive, detailed molecular, genomic, transcriptomic and epigenetic analyses to gain insights into the biology of B. duncani. We completed the assembly, 3D structure and annotation of its nuclear genome, and analysed its transcriptomic and epigenetics profiles during its asexual life cycle stages in human erythrocytes. We used RNA-seq data to produce an atlas of parasite metabolism during its intraerythrocytic life cycle. Characterization of the B. duncani genome, epigenome and transcriptome identified classes of candidate virulence factors, antigens for diagnosis of active infection and several attractive drug targets. Furthermore, metabolic reconstitutions from genome annotation and in vitro efficacy studies identified antifolates, pyrimethamine and WR-99210 as potent inhibitors of B. duncani to establish a pipeline of small molecules that could be developed as effective therapies for the treatment of human babesiosis.

Assuntos

Babesia , Babesiose , Carrapatos , Animais , Humanos , Camundongos , Babesia/genética , Babesiose/tratamento farmacológico , Multiômica , Eritrócitos/parasitologia

11.

Karyotype variation, spontaneous genome rearrangements affecting chemical insensitivity, and expression level polymorphisms in the plant pathogen Phytophthora infestans revealed using its first chromosome-scale assembly.

Matson, Michael E H; Liang, Qihua; Lonardi, Stefano; Judelson, Howard S.

PLoS Pathog ; 18(10): e1010869, 2022 10.

Artigo em Inglês | MEDLINE | ID: mdl-36215336

RESUMO

Natural isolates of the potato and tomato pathogen Phytophthora infestans exhibit substantial variation in virulence, chemical sensitivity, ploidy, and other traits. A chromosome-scale assembly was developed to expand genomic resources for this oomyceteous microbe, and used to explore the basis of variation. Using PacBio and Illumina data, a long-range linking library, and an optical map, an assembly was created and coalesced into 15 pseudochromosomes spanning 219 Mb using SNP-based genetic linkage data. De novo gene prediction combined with transcript evidence identified 19,981 protein-coding genes, plus about eight thousand tRNA genes. The chromosomes were comprised of a mosaic of gene-rich and gene-sparse regions plus very long centromeres. Genes exhibited a biased distribution across chromosomes, especially members of families encoding RXLR and CRN effectors which clustered on certain chromosomes. Strikingly, half of F1 progeny of diploid parents were polyploid or aneuploid. Substantial expression level polymorphisms between strains were identified, much of which could be attributed to differences in chromosome dosage, transposable element insertions, and adjacency to repetitive DNA. QTL analysis identified a locus on the right arm of chromosome 3 governing sensitivity to the crop protection chemical metalaxyl. Strains heterozygous for resistance often experienced megabase-sized deletions of that part of the chromosome when cultured on metalaxyl, increasing resistance due to loss of the sensitive allele. This study sheds light on diverse phenomena affecting variation in P. infestans and relatives, helps explain the prevalence of polyploidy in natural populations, and provides a new foundation for biologic and genetic investigations.

Assuntos

Produtos Biológicos , Phytophthora infestans , Solanum tuberosum , Humanos , Phytophthora infestans/genética , Elementos de DNA Transponíveis , Solanum tuberosum/genética , Cariótipo

12.

Genome-wide functional screens enable the prediction of high activity CRISPR-Cas9 and -Cas12a guides in Yarrowia lipolytica.

Baisya, Dipankar; Ramesh, Adithya; Schwartz, Cory; Lonardi, Stefano; Wheeldon, Ian.

Nat Commun ; 13(1): 922, 2022 02 17.

Artigo em Inglês | MEDLINE | ID: mdl-35177617

RESUMO

Genome-wide functional genetic screens have been successful in discovering genotype-phenotype relationships and in engineering new phenotypes. While broadly applied in mammalian cell lines and in E. coli, use in non-conventional microorganisms has been limited, in part, due to the inability to accurately design high activity CRISPR guides in such species. Here, we develop an experimental-computational approach to sgRNA design that is specific to an organism of choice, in this case the oleaginous yeast Yarrowia lipolytica. A negative selection screen in the absence of non-homologous end-joining, the dominant DNA repair mechanism, was used to generate single guide RNA (sgRNA) activity profiles for both SpCas9 and LbCas12a. This genome-wide data served as input to a deep learning algorithm, DeepGuide, that is able to accurately predict guide activity. DeepGuide uses unsupervised learning to obtain a compressed representation of the genome, followed by supervised learning to map sgRNA sequence, genomic context, and epigenetic features with guide activity. Experimental validation, both genome-wide and with a subset of selected genes, confirms DeepGuide's ability to accurately predict high activity sgRNAs. DeepGuide provides an organism specific predictor of CRISPR guide activity that with retraining could be applied to other fungal species, prokaryotes, and other non-conventional organisms.

Assuntos

Edição de Genes/métodos , Modelos Genéticos , RNA Guia de Cinetoplastídeos/metabolismo , Yarrowia/genética , Proteínas de Bactérias/genética , Proteínas Associadas a CRISPR/genética , Sistemas CRISPR-Cas/genética , Aprendizado Profundo , Endodesoxirribonucleases/genética , Genoma Fúngico , RNA Guia de Cinetoplastídeos/genética

13.

Reference-agnostic representation and visualization of pan-genomes.

Liang, Qihua; Lonardi, Stefano.

BMC Bioinformatics ; 22(1): 502, 2021 Oct 16.

Artigo em Inglês | MEDLINE | ID: mdl-34656081

RESUMO

BACKGROUND: The pan-genome of a species is the union of the genes and non-coding sequences present in all individuals (cultivar, accessions, or strains) within that species. RESULTS: Here we introduce PGV, a reference-agnostic representation of the pan-genome of a species based on the notion of consensus ordering. Our experimental results demonstrate that PGV enables an intuitive, effective and interactive visualization of a pan-genome by providing a genome browser that can elucidate complex structural genomic variations. CONCLUSIONS: The PGV software can be installed via conda or downloaded from https://github.com/ucrbioinfo/PGV . The companion PGV browser at http://pgv.cs.ucr.edu can be tested using example bed tracks available from the GitHub page.

Assuntos

Genoma , Genômica , Biologia Computacional , Humanos , Software

14.

Prediction of histone post-translational modifications using deep learning.

Baisya, Dipankar Ranjan; Lonardi, Stefano.

Bioinformatics ; 36(24): 5610-5617, 2021 Apr 05.

Artigo em Inglês | MEDLINE | ID: mdl-33367499

RESUMO

MOTIVATION: Histone post-translational modifications (PTMs) are involved in a variety of essential regulatory processes in the cell, including transcription control. Recent studies have shown that histone PTMs can be accurately predicted from the knowledge of transcription factor binding or DNase hypersensitivity data. Similarly, it has been shown that one can predict PTMs from the underlying DNA primary sequence. RESULTS: In this study, we introduce a deep learning architecture called DeepPTM for predicting histone PTMs from transcription factor binding data and the primary DNA sequence. Extensive experimental results show that our deep learning model outperforms the prediction accuracy of the model proposed in Benveniste et al. (PNAS 2014) and DeepHistone (BMC Genomics 2019). The competitive advantage of our framework lies in the synergistic use of deep learning combined with an effective pre-processing step. Our classification framework has also enabled the discovery that the knowledge of a small subset of transcription factors (which are histone-PTM and cell-type-specific) can provide almost the same prediction accuracy that can be obtained using all the transcription factors data. AVAILABILITYAND IMPLEMENTATION: https://github.com/dDipankar/DeepPTM. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

15.

DeeplyEssential: a deep neural network for predicting essential genes in microbes.

Hasan, Md Abid; Lonardi, Stefano.

BMC Bioinformatics ; 21(Suppl 14): 367, 2020 Sep 30.

Artigo em Inglês | MEDLINE | ID: mdl-32998698

RESUMO

BACKGROUND: Essential genes are those genes that are critical for the survival of an organism. The prediction of essential genes in bacteria can provide targets for the design of novel antibiotic compounds or antimicrobial strategies. RESULTS: We propose a deep neural network for predicting essential genes in microbes. Our architecture called DEEPLYESSENTIAL makes minimal assumptions about the input data (i.e., it only uses gene primary sequence and the corresponding protein sequence) to carry out the prediction thus maximizing its practical application compared to existing predictors that require structural or topological features which might not be readily available. We also expose and study a hidden performance bias that effected previous classifiers. Extensive results show that DEEPLYESSENTIAL outperform existing classifiers that either employ down-sampling to balance the training set or use clustering to exclude multiple copies of orthologous genes. CONCLUSION: Deep neural network architectures can efficiently predict whether a microbial gene is essential (or not) using only its sequence information.

Assuntos

Bactérias/genética , Genes Essenciais , Redes Neurais de Computação , Área Sob a Curva , Análise por Conglomerados , Códon , Bactérias Gram-Negativas/genética , Bactérias Gram-Positivas/genética , Curva ROC

16.

Mustache: multi-scale detection of chromatin loops from Hi-C and Micro-C maps using scale-space representation.

Roayaei Ardakany, Abbas; Gezer, Halil Tuvan; Lonardi, Stefano; Ay, Ferhat.

Genome Biol ; 21(1): 256, 2020 09 30.

Artigo em Inglês | MEDLINE | ID: mdl-32998764

RESUMO

We present MUSTACHE, a new method for multi-scale detection of chromatin loops from Hi-C and Micro-C contact maps. MUSTACHE employs scale-space theory, a technical advance in computer vision, to detect blob-shaped objects in contact maps. MUSTACHE is scalable to kilobase-resolution maps and reports loops that are highly consistent between replicates and between Hi-C and Micro-C datasets. Compared to other loop callers, such as HiCCUPS and SIP, MUSTACHE recovers a higher number of published ChIA-PET and HiChIP loops as well as loops linking promoters to regulatory elements. Overall, MUSTACHE enables an efficient and comprehensive analysis of chromatin loops. Available at: https://github.com/ay-lab/mustache .

Assuntos

Inteligência Artificial , Cromatina , Genômica/métodos , Software , Conformação de Ácido Nucleico

17.

OMGS: Optical Map-Based Genome Scaffolding.

Pan, Weihua; Jiang, Tao; Lonardi, Stefano.

J Comput Biol ; 27(4): 519-533, 2020 04.

Artigo em Inglês | MEDLINE | ID: mdl-31794680

RESUMO

Due to the current limitations of sequencing technologies, de novo genome assembly is typically carried out in two stages, namely contig (sequence) assembly and scaffolding. While scaffolding is computationally easier than sequence assembly, the scaffolding problem can be challenging due to the high repetitive content of eukaryotic genomes, possible mis-joins in assembled contigs, and inaccuracies in the linkage information. Genome scaffolding tools either use paired-end/mate-pair/linked/Hi-C reads or genome-wide maps (optical, physical, or genetic) as linkage information. Optical maps (in particular Bionano Genomics maps) have been extensively used in many recent large-scale genome assembly projects (e.g., goat, apple, barley, maize, quinoa, sea bass, among others). However, the most commonly used scaffolding tools have a serious limitation: they can only deal with one optical map at a time, forcing users to alternate or iterate over multiple maps. In this article, we introduce a novel scaffolding algorithm called OMGS (Optical Map-based Genome Scaffolding) that for the first time can take advantages of multiple optical maps. OMGS solves several optimization problems to generate scaffolds with optimal contiguity and correctness. Extensive experimental results demonstrate that our tool outperforms existing methods when multiple optical maps are available and produces comparable scaffolds using a single optical map.

Assuntos

Mapeamento Cromossômico/métodos , Genoma/genética , Genômica , Software , Algoritmos , Mapeamento de Sequências Contíguas/métodos , Eucariotos/genética , Análise de Sequência de DNA/métodos

18.

Seed Coat Pattern QTL and Development in Cowpea (Vigna unguiculata [L.] Walp.).

Herniter, Ira A; Lo, Ryan; Muñoz-Amatriaín, María; Lo, Sassoum; Guo, Yi-Ning; Huynh, Bao-Lam; Lucas, Mitchell; Jia, Zhenyu; Roberts, Philip A; Lonardi, Stefano; Close, Timothy J.

Front Plant Sci ; 10: 1346, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-31708953

RESUMO

The appearance of the seed is an important aspect of consumer preference for cowpea (Vigna unguiculata [L.] Walp.). Seed coat pattern in cowpea has been a subject of study for over a century. This study makes use of newly available resources, including mapping populations, a reference genome and additional genome assemblies, and a high-density single nucleotide polymorphism genotyping platform, to map various seed coat pattern traits to three loci, concurrent with the Color Factor (C), Watson (W), and Holstein (H) factors identified previously. Several gene models encoding proteins involved in regulating the later stages of the flavonoid biosynthesis pathway have been identified as candidate genes, including a basic helix-loop-helix gene (Vigun07g110700) for the C locus, a WD-repeat gene (Vigun09g139900) for the W locus and an E3 ubiquitin ligase gene (Vigun10g163900) for the H locus. A model of seed coat development, consisting of six distinct stages, is described to explain some of the observed pattern phenotypes.

19.

Selfish: discovery of differential chromatin interactions via a self-similarity measure.

Ardakany, Abbas Roayaei; Ay, Ferhat; Lonardi, Stefano.

Bioinformatics ; 35(14): i145-i153, 2019 07 15.

Artigo em Inglês | MEDLINE | ID: mdl-31510653

RESUMO

MOTIVATION: High-throughput conformation capture experiments, such as Hi-C provide genome-wide maps of chromatin interactions, enabling life scientists to investigate the role of the three-dimensional structure of genomes in gene regulation and other essential cellular functions. A fundamental problem in the analysis of Hi-C data is how to compare two contact maps derived from Hi-C experiments. Detecting similarities and differences between contact maps are critical in evaluating the reproducibility of replicate experiments and for identifying differential genomic regions with biological significance. Due to the complexity of chromatin conformations and the presence of technology-driven and sequence-specific biases, the comparative analysis of Hi-C data is analytically and computationally challenging. RESULTS: We present a novel method called Selfish for the comparative analysis of Hi-C data that takes advantage of the structural self-similarity in contact maps. We define a novel self-similarity measure to design algorithms for (i) measuring reproducibility for Hi-C replicate experiments and (ii) finding differential chromatin interactions between two contact maps. Extensive experimental results on simulated and real data show that Selfish is more accurate and robust than state-of-the-art methods. AVAILABILITY AND IMPLEMENTATION: https://github.com/ucrbioinfo/Selfish.

Assuntos

Cromatina , Genoma , Mapeamento Cromossômico , Genômica , Reprodutibilidade dos Testes

20.

Validating genome-wide CRISPR-Cas9 function improves screening in the oleaginous yeast Yarrowia lipolytica.

Schwartz, Cory; Cheng, Jan-Fang; Evans, Robert; Schwartz, Christopher A; Wagner, James M; Anglin, Scott; Beitz, Adam; Pan, Weihua; Lonardi, Stefano; Blenner, Mark; Alper, Hal S; Yoshikuni, Yasuo; Wheeldon, Ian.

Metab Eng ; 55: 102-110, 2019 09.

Artigo em Inglês | MEDLINE | ID: mdl-31216436

RESUMO

Genome-wide mutational screens are central to understanding the genetic underpinnings of evolved and engineered phenotypes. The widespread adoption of CRISPR-Cas9 genome editing has enabled such screens in many organisms, but identifying functional sgRNAs still remains a challenge. Here, we developed a methodology to quantify the cutting efficiency of each sgRNA in a genome-scale library, and in doing so improve screens in the biotechnologically important yeast Yarrowia lipolytica. Screening in the presence and absence of native DNA repair enabled high-throughput quantification of sgRNA function leading to the identification of high efficiency sgRNAs that cover 94% of genes. Library validation enhanced the classification of essential genes by identifying inactive guides that create false negatives and mask the effects of successful disruptions. Quantification of guide effectiveness also creates a dataset from which determinants of CRISPR-Cas9 can be identified. Finally, application of the library identified novel mutations for metabolic engineering of high lipid accumulation.

Assuntos

Sistemas CRISPR-Cas , Edição de Genes , Biblioteca Gênica , Genes Fúngicos , Yarrowia/genética

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA