Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 129
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 52(D1): D115-D123, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37823705

RESUMO

Circular RNAs (circRNAs) are RNA molecules with a continuous loop structure characterized by back-splice junctions (BSJs). While analyses of short-read RNA sequencing have identified millions of BSJ events, it is inherently challenging to determine exact full-length sequences and alternatively spliced (AS) isoforms of circRNAs. Recent advances in nanopore long-read sequencing with circRNA enrichment bring an unprecedented opportunity for investigating the issues. Here, we developed FL-circAS (https://cosbi.ee.ncku.edu.tw/FL-circAS/), which collected such long-read sequencing data of 20 cell lines/tissues and thereby identified 884 636 BSJs with 1 853 692 full-length circRNA isoforms in human and 115 173 BSJs with 135 617 full-length circRNA isoforms in mouse. FL-circAS also provides multiple circRNA features. For circRNA expression, FL-circAS calculates expression levels for each circRNA isoform, cell line/tissue specificity at both the BSJ and isoform levels, and AS entropy for each BSJ across samples. For circRNA biogenesis, FL-circAS identifies reverse complementary sequences and RNA binding protein (RBP) binding sites residing in flanking sequences of BSJs. For functional patterns, FL-circAS identifies potential microRNA/RBP binding sites and several types of evidence for circRNA translation on each full-length circRNA isoform. FL-circAS provides user-friendly interfaces for browsing, searching, analyzing, and downloading data, serving as the first resource for discovering full-length circRNAs at the isoform level.


Assuntos
Bases de Dados de Ácidos Nucleicos , RNA Circular , Animais , Humanos , Camundongos , Processamento Alternativo/genética , MicroRNAs/genética , MicroRNAs/metabolismo , Sequenciamento por Nanoporos , RNA Circular/genética , Isoformas de RNA/genética
2.
RNA ; 29(5): 557-569, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-36737102

RESUMO

PIWI-interacting RNAs (piRNAs) protect genome integrity by silencing transposon mRNAs and some endogenous mRNAs in various animals. However, C. elegans piRNAs only trigger gene silencing at select predicted targeting sites, suggesting additional cellular mechanisms regulate piRNA silencing. To gain insight into possible mechanisms, we compared the transcriptome-wide predicted piRNA targeting sites to the in vivo piRNA binding sites. Surprisingly, while sequence-based predicted piRNA targeting sites are enriched in 3' UTRs, we found that C. elegans piRNAs preferentially bind to coding regions (CDS) of target mRNAs, leading to preferential production of secondary silencing small RNAs in the CDS. However, our analyses suggest that this CDS binding preference cannot be explained by the action of antisilencing Argonaute CSR-1. Instead, our analyses imply that CSR-1 protects mRNAs from piRNA silencing through two distinct mechanisms-by inhibiting piRNA binding across the entire CSR-1 targeted transcript, and by inhibiting secondary silencing small RNA production locally at CSR-1 bound sites. Together, our work identifies the CDS as the critical region that is uniquely competent for piRNA binding in C. elegans. We speculate the CDS binding preference may have evolved to allow the piRNA pathway to maintain robust recognition of RNA targets in spite of genetic drift. Together, our analyses revealed that distinct mechanisms are responsible for restricting piRNA binding and silencing to achieve proper transcriptome surveillance.


Assuntos
Proteínas de Caenorhabditis elegans , Caenorhabditis elegans , Animais , Caenorhabditis elegans/genética , Caenorhabditis elegans/metabolismo , RNA de Interação com Piwi , RNA Interferente Pequeno/genética , RNA Interferente Pequeno/metabolismo , Transcriptoma , Proteínas Argonautas/genética , Proteínas Argonautas/metabolismo , RNA de Cadeia Dupla/metabolismo , Sítios de Ligação , Proteínas de Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/metabolismo
3.
PLoS Pathog ; 19(7): e1011233, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37463183

RESUMO

Gram-negative bacteremia is a major cause of global morbidity involving three phases of pathogenesis: initial site infection, dissemination, and survival in the blood and filtering organs. Klebsiella pneumoniae is a leading cause of bacteremia and pneumonia is often the initial infection. In the lung, K. pneumoniae relies on many factors like capsular polysaccharide and branched chain amino acid biosynthesis for virulence and fitness. However, mechanisms directly enabling bloodstream fitness are unclear. Here, we performed transposon insertion sequencing (TnSeq) in a tail-vein injection model of bacteremia and identified 58 K. pneumoniae bloodstream fitness genes. These factors are diverse and represent a variety of cellular processes. In vivo validation revealed tissue-specific mechanisms by which distinct factors support bacteremia. ArnD, involved in Lipid A modification, was required across blood filtering organs and supported resistance to soluble splenic factors. The purine biosynthesis enzyme PurD supported liver fitness in vivo and was required for replication in serum. PdxA, a member of the endogenous vitamin B6 biosynthesis pathway, optimized replication in serum and lung fitness. The stringent response regulator SspA was required for splenic fitness yet was dispensable in the liver. In a bacteremic pneumonia model that incorporates initial site infection and dissemination, splenic fitness defects were enhanced. ArnD, PurD, DsbA, SspA, and PdxA increased fitness across bacteremia phases and each demonstrated unique fitness dynamics within compartments in this model. SspA and PdxA enhanced K. pnuemoniae resistance to oxidative stress. SspA, but not PdxA, specifically resists oxidative stress produced by NADPH oxidase Nox2 in the lung, spleen, and liver, as it was a fitness factor in wild-type but not Nox2-deficient (Cybb-/-) mice. These results identify site-specific fitness factors that act during the progression of Gram-negative bacteremia. Defining K. pneumoniae fitness strategies across bacteremia phases could illuminate therapeutic targets that prevent infection and sepsis.


Assuntos
Bacteriemia , Infecções por Klebsiella , Pneumonia , Camundongos , Animais , Klebsiella pneumoniae/genética , Pulmão , Bacteriemia/genética , Estresse Oxidativo , Infecções por Klebsiella/genética
4.
Blood ; 141(7): 725-742, 2023 02 16.
Artigo em Inglês | MEDLINE | ID: mdl-36493338

RESUMO

Coronavirus-associated coagulopathy (CAC) is a morbid and lethal sequela of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. CAC results from a perturbed balance between coagulation and fibrinolysis and occurs in conjunction with exaggerated activation of monocytes/macrophages (MO/Mφs), and the mechanisms that collectively govern this phenotype seen in CAC remain unclear. Here, using experimental models that use the murine betacoronavirus MHVA59, a well-established model of SARS-CoV-2 infection, we identify that the histone methyltransferase mixed lineage leukemia 1 (MLL1/KMT2A) is an important regulator of MO/Mφ expression of procoagulant and profibrinolytic factors such as tissue factor (F3; TF), urokinase (PLAU), and urokinase receptor (PLAUR) (herein, "coagulopathy-related factors") in noninfected and infected cells. We show that MLL1 concurrently promotes the expression of the proinflammatory cytokines while suppressing the expression of interferon alfa (IFN-α), a well-known inducer of TF and PLAUR. Using in vitro models, we identify MLL1-dependent NF-κB/RelA-mediated transcription of these coagulation-related factors and identify a context-dependent, MLL1-independent role for RelA in the expression of these factors in vivo. As functional correlates for these findings, we demonstrate that the inflammatory, procoagulant, and profibrinolytic phenotypes seen in vivo after coronavirus infection were MLL1-dependent despite blunted Ifna induction in MO/Mφs. Finally, in an analysis of SARS-CoV-2 positive human samples, we identify differential upregulation of MLL1 and coagulopathy-related factor expression and activity in CD14+ MO/Mφs relative to noninfected and healthy controls. We also observed elevated plasma PLAU and TF activity in COVID-positive samples. Collectively, these findings highlight an important role for MO/Mφ MLL1 in promoting CAC and inflammation.


Assuntos
COVID-19 , Histona-Lisina N-Metiltransferase , Animais , Humanos , Camundongos , COVID-19/complicações , Histona-Lisina N-Metiltransferase/genética , Histona-Lisina N-Metiltransferase/metabolismo , Histonas/metabolismo , Inflamação/metabolismo , Monócitos/metabolismo , Proteína de Leucina Linfoide-Mieloide/genética , Proteína de Leucina Linfoide-Mieloide/metabolismo , SARS-CoV-2/metabolismo , Ativador de Plasminogênio Tipo Uroquinase/metabolismo
5.
BMC Plant Biol ; 24(1): 210, 2024 Mar 23.
Artigo em Inglês | MEDLINE | ID: mdl-38519909

RESUMO

BACKGROUND: Different metabolic compounds give pepper leaves and fruits their diverse colors. Anthocyanin accumulation is the main cause of the purple color of pepper leaves. The light environment is a critical factor affecting anthocyanin biosynthesis. It is essential that we understand how to use light to regulate anthocyanin biosynthesis in plants. RESULT: Pepper leaves were significantly blue-purple only in continuous blue light or white light (with a blue light component) irradiation treatments, and the anthocyanin content of pepper leaves increased significantly after continuous blue light irradiation. This green-to-purple phenotype change in pepper leaves was due to the expression of different genes. We found that the anthocyanin synthesis precursor-related genes PAL and 4CL, as well as the structural genes F3H, DFR, ANS, BZ1, and F3'5'H in the anthocyanin synthesis pathway, had high expression under continuous blue light irradiation. Similarly, the expression of transcription factors MYB1R1-like, MYB48, MYB4-like isoform X1, bHLH143-like, and bHLH92-like isoform X3, and circadian rhythm-related genes LHY and COP1, were significantly increased after continuous blue light irradiation. A correlation network analysis revealed that these transcription factors and circadian rhythm-related genes were positively correlated with structural genes in the anthocyanin synthesis pathway. Metabolomic analysis showed that delphinidin-3-O-glucoside and delphinidin-3-O-rutinoside were significantly higher under continuous blue light irradiation relative to other light treatments. We selected 12 genes involved in anthocyanin synthesis in pepper leaves for qRT-PCR analysis, and the accuracy of the RNA-seq results was confirmed. CONCLUSIONS: In this study, we found that blue light and 24-hour irradiation together induced the expression of key genes and the accumulation of metabolites in the anthocyanin synthesis pathway, thus promoting anthocyanin biosynthesis in pepper leaves. These results provide a basis for future study of the mechanisms of light quality and photoperiod in anthocyanin synthesis and metabolism, and our study may serve as a valuable reference for screening light ratios that regulate anthocyanin biosynthesis in plants.


Assuntos
Capsicum , Transcriptoma , Antocianinas/metabolismo , Capsicum/genética , Capsicum/metabolismo , Luz Azul , Metaboloma , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Isoformas de Proteínas/metabolismo , Regulação da Expressão Gênica de Plantas
6.
J Chem Inf Model ; 64(7): 2445-2453, 2024 Apr 08.
Artigo em Inglês | MEDLINE | ID: mdl-37903033

RESUMO

miRNAs (microRNAs) target specific mRNA (messenger RNA) sites to regulate their translation expression. Although miRNA targeting can rely on seed region base pairing, animal miRNAs, including human miRNAs, typically cooperate with several cofactors, leading to various noncanonical pairing rules. Therefore, identifying the binding sites of animal miRNAs remains challenging. Because experiments for mapping miRNA targets are costly, computational methods are preferred for extracting potential miRNA-mRNA fragment binding pairs first. However, existing prediction tools can have significant false positives due to the prevalent noncanonical miRNA binding behaviors and the information-biased training negative sets that were used while constructing these tools. To overcome these obstacles, we first prepared an information-balanced miRNA binding pair ground-truth data set. A miRNA-mRNA interaction-aware model was then designed to help identify miRNA binding events. On the test set, our model (auROC = 94.4%) outperformed existing models by at least 2.8% in auROC. Furthermore, we showed that this model can suggest potential binding patterns for miRNA-mRNA sequence interacting pairs. Finally, we made the prepared data sets and the designed model available at http://cosbi2.ee.ncku.edu.tw/mirna_binding/download.


Assuntos
MicroRNAs , Animais , Humanos , MicroRNAs/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Algoritmos , Biologia Computacional/métodos
7.
Anal Chem ; 95(38): 14279-14287, 2023 09 26.
Artigo em Inglês | MEDLINE | ID: mdl-37713273

RESUMO

The identification of xenobiotic biotransformation products is crucial for delineating toxicity and carcinogenicity that might be caused by xenobiotic exposures and for establishing monitoring systems for public health. However, the lack of available reference standards and spectral data leads to the generation of multiple candidate structures during identification and reduces the confidence in identification. Here, a UHPLC-HRMS-based metabolomics strategy integrated with a metabolite structure elucidation approach, namely, FragAssembler, was proposed to reduce the number of false-positive structure candidates. biotransformation product candidates were filtered by mass defect filtering (MDF) and multiple-group comparison. FragAssembler assembled fragment signatures from the MS/MS spectra and generated the modified moieties corresponding to the identified biotransformation products. The feasibility of this approach was demonstrated by the three biotransformation products of di(2-ethylhexyl)phthalate (DEHP). Comprehensive identification was carried out, and 24 and 13 biotransformation products of two xenobiotics, DEHP and 4'-Methoxy-α-pyrrolidinopentiophenone (4-MeO-α-PVP), were annotated, respectively. The number of 4-MeO-α-PVP biotransformation product candidates in the FragAssembler calculation results was approximately 2.1 times lower than that generated by BioTransformer 3.0. Our study indicates that the proposed approach has great potential for efficiently and reliably identifying xenobiotic biotransformation products, which is attributed to the fact that FragAssembler eliminates false-positive reactions and chemical structures and distinguishes modified moieties on isomeric biotransformation products. The FragAssembler software and associated tutorial are freely available at https://cosbi.ee.ncku.edu.tw/FragAssembler/ and the source code can be found at https://github.com/YuanChihChen/FragAssembler.


Assuntos
Dietilexilftalato , Espectrometria de Massas em Tandem , Xenobióticos , Biotransformação
8.
BMC Plant Biol ; 22(1): 557, 2022 Dec 02.
Artigo em Inglês | MEDLINE | ID: mdl-36456919

RESUMO

Containing the largest number of species, the orchid family provides not only materials for studying plant evolution and environmental adaptation, but economically and culturally important ornamental plants for human society. Previously, we collected genome and transcriptome information of Dendrobium catenatum, Phalaenopsis equestris, and Apostasia shenzhenica which belong to two different subfamilies of Orchidaceae, and developed user-friendly tools to explore the orchid genetic sequences in the OrchidBase 4.0. The OrchidBase 4.0 offers the opportunity for plant science community to compare orchid genomes and transcriptomes and retrieve orchid sequences for further study.In the year 2022, two whole-genome sequences of Orchidoideae species, Platanthera zijinensis and Platanthera guangdongensis, were de novo sequenced, assembled and analyzed. In addition, systemic transcriptomes from these two species were also established. Therefore, we included these datasets to develop the new version of OrchidBase 5.0. In addition, three new functions including synteny, gene order, and miRNA information were also developed for orchid genome comparisons and miRNA characterization.OrchidBase 5.0 extended the genetic information to three orchid subfamilies (including five orchid species) and provided new tools for orchid researchers to analyze orchid genomes and transcriptomes. The online resources can be accessed at https://cosbi.ee.ncku.edu.tw/orchidbase5/.


Assuntos
MicroRNAs , Orchidaceae , Ordem dos Genes , Bases de Conhecimento , MicroRNAs/genética , Orchidaceae/genética , Sintenia
9.
BMC Bioinformatics ; 22(1): 503, 2021 Oct 16.
Artigo em Inglês | MEDLINE | ID: mdl-34656087

RESUMO

BACKGROUND: Piwi-interacting RNAs (piRNAs) are the small non-coding RNAs (ncRNAs) that silence genomic transposable elements. And researchers found out that piRNA also regulates various endogenous transcripts. However, there is no systematic understanding of the piRNA binding patterns and how piRNA targets genes. While various prediction methods have been developed for other similar ncRNAs (e.g., miRNAs), piRNA holds distinctive characteristics and requires its own computational model for binding target prediction. RESULTS: Recently, transcriptome-wide piRNA binding events in C. elegans were probed by PRG-1 CLASH experiments. Based on the probed piRNA-messenger RNAs (mRNAs) binding pairs, in this research, we devised the first deep learning architecture based on multi-head attention to computationally identify piRNA targeting mRNA sites. In the devised deep network, the given piRNA and mRNA segment sequences are first one-hot encoded and undergo a combined operation of convolution and squeezing-extraction to unravel motif patterns. And we incorporate a novel multi-head attention sub-network to extract the hidden piRNA binding rules that can simulate the biological piRNA target recognition process. Finally, the true piRNA-mRNA binding pairs are identified by a deep fully connected sub-network. Our model obtains a supreme discriminatory power of AUC [Formula: see text] 93.3% on an independent test set and successfully extracts the verified binding pattern of a synthetic piRNA. These results demonstrated that the devised model achieves high prediction performance and suggests testable potential biological piRNA binding rules. CONCLUSIONS: In this research, we developed the first deep learning method to identify piRNA targeting sites on C. elegans mRNAs. And the developed deep learning method is demonstrated to be of high accuracy and can provide biological insights into piRNA-mRNA binding patterns. The piRNA binding target identification network can be downloaded from http://cosbi2.ee.ncku.edu.tw/data_download/piRNA_mRNA_binding .


Assuntos
Proteínas de Caenorhabditis elegans , MicroRNAs , Animais , Proteínas Argonautas , Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/genética , Elementos de DNA Transponíveis , RNA Mensageiro/genética , RNA Interferente Pequeno/genética
10.
BMC Bioinformatics ; 22(Suppl 10): 271, 2021 May 25.
Artigo em Inglês | MEDLINE | ID: mdl-34058988

RESUMO

BACKGROUND: Translational regulation is one important aspect of gene expression regulation. Dysregulation of translation results in abnormal cell physiology and leads to diseases. Ribosome profiling (RP), also called ribo-seq, is a powerful experimental technique to study translational regulation. It can capture a snapshot of translation by deep sequencing of ribosome-protected mRNA fragments. Many ribosome profiling data processing tools have been developed. However, almost all tools analyze ribosome profiling data at the gene level. Since different isoforms of a gene may produce different proteins with distinct biological functions, it is advantageous to analyze ribosome profiling data at the isoform level. To meet this need, previously we developed a pipeline to analyze 610 public human ribosome profiling data at the isoform level and constructed HRPDviewer database. RESULTS: To allow other researchers to use our pipeline as well, here we implement our pipeline as an easy-to-use software tool called RPiso. Compared to Ribomap (a widely used tool which provides isoform-level ribosome profiling analyses), our RPiso (1) estimates isoform abundance more accurately, (2) supports analyses on more species, and (3) provides a web-based viewer for interactively visualizing ribosome profiling data on the selected mRNA isoforms. CONCLUSIONS: In this study, we developed RPiso software tool ( http://cosbi7.ee.ncku.edu.tw/RPiso/ ) to provide isoform-level ribosome profiling analyses. RPiso is very easy to install and execute. RPiso also provides a web-based viewer for interactively visualizing ribosome profiling data on the selected mRNA isoforms. We believe that RPiso is a useful tool for researchers to analyze and visualize their own ribosome profiling data at the isoform level.


Assuntos
Biossíntese de Proteínas , Ribossomos , Humanos , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Ribossomos/genética , Ribossomos/metabolismo , Software
11.
Mol Biol Evol ; 37(2): 429-441, 2020 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-31639821

RESUMO

Genomic imprinting leads to mono-allelic expression of genes based on parent of origin. Therian mammals and angiosperms evolved this mechanism in nutritive tissues, the placenta, and endosperm, where maternal and paternal genomes are in conflict with respect to resource allocation. We used RNA-seq to analyze allelic bias in the expression of 91 known imprinted genes in term human placentas from a prospective cohort study in Mali. A large fraction of the imprinted exons (39%) deviated from mono-allelic expression. Loss of imprinting (LOI) occurred in genes with either maternal or paternal expression bias, albeit more frequently in the former. We characterized LOI using binomial generalized linear mixed models. Variation in LOI was predominantly at the gene as opposed to the exon level, consistent with a single promoter driving the expression of most exons in a gene. Some genes were less prone to LOI than others, particularly lncRNA genes were rarely expressed from the repressed allele. Further, some individuals had more LOI than others and, within a person, the expression bias of maternally and paternally imprinted genes was correlated. We hypothesize that trans-acting maternal effect genes mediate correlated LOI and provide the mother with an additional lever to control fetal growth by extending her influence to LOI of the paternally imprinted genes. Limited evidence exists to support associations between LOI and offspring phenotypes. We show that birth length and placental weight were associated with allelic bias, making this the first comprehensive report of an association between LOI and a birth phenotype.


Assuntos
Peso ao Nascer/genética , Estatura/genética , Perfilação da Expressão Gênica/métodos , Impressão Genômica , Placenta/química , Adolescente , Feminino , Humanos , Recém-Nascido , Modelos Lineares , Mali , Herança Materna , Gravidez , Regiões Promotoras Genéticas , Estudos Prospectivos , Análise de Sequência de RNA , Adulto Jovem
12.
Anal Chem ; 93(2): 868-877, 2021 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-33302626

RESUMO

We used yeast proteome microarrays (∼5800 purified proteins) to conduct a high-throughput and systematic screening of PI5P-interacting proteins with PI5P-tagged fluorescent liposomal nanovesicles. Lissamine rhodamine B-dipalmitoyl phosphatidylethanol was incorporated into the liposome bilayer to provide the nanovesicles with fluorescence without any encapsulants, which not only made the liposome fabrication much easier without the need for purification but also improved the chip-probing quality. A special chip assay was washed very gently without the traditional spin-dry step. Forty-five PI5P-interacting proteins were identified in triplicate with this special chip assay. Subsequently, we used flow cytometry to validate these interactions, and a total of 41 PI5P-interacting proteins were confirmed. Enrichment analysis revealed that these proteins have significant functions associated with ribosome biogenesis, rRNA processing, ribosome binding, GTP binding, and hydrolase activity. Their component enrichment is located in the nucleolus. The InterPro domain analysis indicated that PI5P-interacting proteins are enriched in the P-loop containing nucleoside triphosphate hydrolases domain (P-loop). Additionally, using the MEME program, we identified a consensus motif (IVGPAGTGKSTLF) that contains the Walker A sequence, a well-known nucleotide-binding motif. Furthermore, using a quartz crystal microbalance, both the consensus motif and Walker A motif showed strong affinities to PI5P-containing liposomes but not to PI5P-deprived liposomes or PI-containing liposomes. Additionally, the glycine (G6) and lysine (K7) residues of the Walker A motif (-GPAGTG6K7S-) were found to be critical to the PI5P-binding ability. This study not only identified an additional set of PI5P-interacting proteins but also revealed the strong PI5P-binding affinity (Kd = 1.81 × 10-7 M) of the Walker A motif beyond the motif's nucleotide-binding characteristic.


Assuntos
Fosfatos de Fosfatidilinositol/química , Análise Serial de Proteínas , Proteoma/análise , Saccharomyces cerevisiae/isolamento & purificação , Lipossomos/química , Técnicas de Microbalança de Cristal de Quartzo
13.
BMC Plant Biol ; 21(1): 371, 2021 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-34384382

RESUMO

BACKGROUND: The Orchid family is the largest families of the monocotyledons and an economically important ornamental plant worldwide. Given the pivotal role of this plant to humans, botanical researchers and breeding communities should have access to valuable genomic and transcriptomic information of this plant. Previously, we established OrchidBase, which contains expressed sequence tags (ESTs) from different tissues and developmental stages of Phalaenopsis as well as biotic and abiotic stress-treated Phalaenopsis. The database includes floral transcriptomic sequences from 10 orchid species across all the five subfamilies of Orchidaceae. DESCRIPTION: Recently, the whole-genome sequences of Apostasia shenzhenica, Dendrobium catenatum, and Phalaenopsis equestris were de novo assembled and analyzed. These datasets were used to develop OrchidBase 4.0, including genomic and transcriptomic data for these three orchid species. OrchidBase 4.0 offers information for gene annotation, gene expression with fragments per kilobase of transcript per millions mapped reads (FPKM), KEGG pathways and BLAST search. In addition, assembled genome sequences and location of genes and miRNAs could be visualized by the genome browser. The online resources in OrchidBase 4.0 can be accessed by browsing or using BLAST. Users can also download the assembled scaffold sequences and the predicted gene and protein sequences of these three orchid species. CONCLUSIONS: OrchidBase 4.0 is the first database that contain the whole-genome sequences and annotations of multiple orchid species. OrchidBase 4.0 is available at http://orchidbase.itps.ncku.edu.tw/.


Assuntos
Bases de Dados Genéticas , Orchidaceae/genética , Genoma de Planta
14.
PLoS Pathog ; 15(4): e1007653, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-31009518

RESUMO

The Gram-negative bacterium Proteus mirabilis is a common cause of catheter-associated urinary tract infections (CAUTI), which can progress to secondary bacteremia. While numerous studies have investigated experimental infection with P. mirabilis in the urinary tract, little is known about pathogenesis in the bloodstream. This study identifies the genes that are important for survival in the bloodstream using a whole-genome transposon insertion-site sequencing (Tn-Seq) approach. A library of 50,000 transposon mutants was utilized to assess the relative contribution of each non-essential gene in the P. mirabilis HI4320 genome to fitness in the livers and spleens of mice at 24 hours following tail vein inoculation compared to growth in RPMI, heat-inactivated (HI) naïve serum, and HI acute phase serum. 138 genes were identified as ex vivo fitness factors in serum, which were primarily involved in amino acid transport and metabolism, and 143 genes were identified as infection-specific in vivo fitness factors for both spleen and liver colonization. Infection-specific fitness factors included genes involved in twin arginine translocation, ammonia incorporation, and polyamine biosynthesis. Mutants in sixteen genes were constructed to validate both the ex vivo and in vivo results of the transposon screen, and 12/16 (75%) exhibited the predicted phenotype. Our studies indicate a role for the twin arginine translocation (tatAC) system in motility, translocation of potential virulence factors, and fitness within the bloodstream. We also demonstrate the interplay between two nitrogen assimilation pathways in the bloodstream, providing evidence that the GS-GOGAT system may be preferentially utilized. Furthermore, we show that a dual-function arginine decarboxylase (speA) is important for fitness within the bloodstream due to its role in putrescine biosynthesis rather than its contribution to maintenance of membrane potential. This study therefore provides insight into pathways needed for fitness within the bloodstream, which may guide strategies to reduce bacteremia-associated mortality.


Assuntos
Amônia/metabolismo , Arginina/metabolismo , Bacteriemia/microbiologia , Poliaminas/metabolismo , Infecções por Proteus/microbiologia , Proteus mirabilis/crescimento & desenvolvimento , Fatores de Virulência/metabolismo , Animais , Bacteriemia/genética , Bacteriemia/metabolismo , Elementos de DNA Transponíveis , Feminino , Aptidão Genética , Sequenciamento de Nucleotídeos em Larga Escala , Camundongos , Camundongos Endogâmicos CBA , Fenótipo , Infecções por Proteus/genética , Infecções por Proteus/metabolismo , Translocação Genética , Fatores de Virulência/genética
15.
Nucleic Acids Res ; 47(D1): D181-D187, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30357353

RESUMO

PIWI-interacting RNAs (piRNAs) are a class of small noncoding RNAs that guard animal genomes against mutation by silencing transposons. In addition, recent studies have reported that piRNAs silence various endogenous genes. Tens of thousands of distinct piRNAs made in animals do not pair well to transposons and currently the functions and targets of piRNAs are largely unexplored. piRTarBase provides a user-friendly interface to access both predicted and experimentally identified piRNA targeting sites in Caenorhabditis elegans. The user can input genes of interest and retrieve a list of piRNA targeting sites on the input genes. Alternatively, the user can input a piRNA and retrieve a list of its mRNA targets. Additionally, piRTarBase integrates published mRNA and small RNA sequencing data, which will help users identify biologically relevant targeting events. Importantly, our analyses suggest that the piRNA sites found by both predictive and experimental approaches are more likely to exhibit silencing effects on their targets than each method alone. Taken together, piRTarBase offers an integrative platform that will help users to identify functional piRNA target sites by evaluating various information. piRTarBase is freely available for academic use at http://cosbi6.ee.ncku.edu.tw/piRTarBase/.


Assuntos
Sítios de Ligação , Bases de Dados Genéticas , Regulação da Expressão Gênica , Inativação Gênica , Interferência de RNA , RNA Mensageiro/genética , RNA Interferente Pequeno/genética , Software , Navegador , Fluxo de Trabalho
16.
J Immunol ; 200(4): 1496-1503, 2018 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-29311362

RESUMO

The MHC region encodes HLA genes and is the most complex region in the human genome. The extensively polymorphic nature of the HLA hinders accurate localization and functional assessment of disease risk loci within this region. Using targeted capture sequencing and constructing individualized genomes for transcriptome alignment, we identified 908 novel transcripts within the human MHC region. These include 593 novel isoforms of known genes, 137 antisense strand RNAs, 119 novel long intergenic noncoding RNAs, and 5 transcripts of 3 novel putative protein-coding human endogenous retrovirus genes. We revealed allele-dependent expression imbalance involving 88% of all heterozygous transcribed single nucleotide polymorphisms throughout the MHC transcriptome. Among these variants, the genetic variant associated with Behçet's disease in the HLA-B/MICA region, which tags HLA-B*51, is within novel long intergenic noncoding RNA transcripts that are exclusively expressed from the haplotype with the protective but not the disease risk allele. Further, the transcriptome within the MHC region can be defined by 14 distinct coexpression clusters, with evidence of coregulation by unique transcription factors in at least 9 of these clusters. Our data suggest a very complex regulatory map of the human MHC, and can help uncover functional consequences of disease risk loci in this region.


Assuntos
Desequilíbrio Alélico , Complexo Principal de Histocompatibilidade/genética , Humanos
17.
Nature ; 515(7527): 371-375, 2014 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-25409826

RESUMO

To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.


Assuntos
Sequência Conservada/genética , Genoma/genética , Genômica , Sequências Reguladoras de Ácido Nucleico/genética , Fatores de Transcrição/metabolismo , Animais , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Elementos Facilitadores Genéticos/genética , Humanos , Camundongos , Polimorfismo de Nucleotídeo Único/genética
18.
Nature ; 515(7527): 402-5, 2014 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-25409831

RESUMO

Eukaryotic chromosomes replicate in a temporal order known as the replication-timing program. In mammals, replication timing is cell-type-specific with at least half the genome switching replication timing during development, primarily in units of 400-800 kilobases ('replication domains'), whose positions are preserved in different cell types, conserved between species, and appear to confine long-range effects of chromosome rearrangements. Early and late replication correlate, respectively, with open and closed three-dimensional chromatin compartments identified by high-resolution chromosome conformation capture (Hi-C), and, to a lesser extent, late replication correlates with lamina-associated domains (LADs). Recent Hi-C mapping has unveiled substructure within chromatin compartments called topologically associating domains (TADs) that are largely conserved in their positions between cell types and are similar in size to replication domains. However, TADs can be further sub-stratified into smaller domains, challenging the significance of structures at any particular scale. Moreover, attempts to reconcile TADs and LADs to replication-timing data have not revealed a common, underlying domain structure. Here we localize boundaries of replication domains to the early-replicating border of replication-timing transitions and map their positions in 18 human and 13 mouse cell types. We demonstrate that, collectively, replication domain boundaries share a near one-to-one correlation with TAD boundaries, whereas within a cell type, adjacent TADs that replicate at similar times obscure replication domain boundaries, largely accounting for the previously reported lack of alignment. Moreover, cell-type-specific replication timing of TADs partitions the genome into two large-scale sub-nuclear compartments revealing that replication-timing transitions are indistinguishable from late-replicating regions in chromatin composition and lamina association and accounting for the reduced correlation of replication timing to LADs and heterochromatin. Our results reconcile cell-type-specific sub-nuclear compartmentalization and replication timing with developmentally stable structural domains and offer a unified model for large-scale chromosome structure and function.


Assuntos
Cromatina/química , Cromatina/genética , Período de Replicação do DNA , DNA/biossíntese , Animais , Compartimento Celular , Cromatina/metabolismo , Montagem e Desmontagem da Cromatina , DNA/genética , Genoma/genética , Heterocromatina/química , Heterocromatina/genética , Heterocromatina/metabolismo , Humanos , Camundongos , Especificidade de Órgãos , Fatores de Tempo
19.
Nature ; 515(7527): 355-64, 2014 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-25409824

RESUMO

The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.


Assuntos
Genoma/genética , Genômica , Camundongos/genética , Anotação de Sequência Molecular , Animais , Linhagem da Célula/genética , Cromatina/genética , Cromatina/metabolismo , Sequência Conservada/genética , Replicação do DNA/genética , Desoxirribonuclease I/metabolismo , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Estudo de Associação Genômica Ampla , Humanos , RNA/genética , Sequências Reguladoras de Ácido Nucleico/genética , Especificidade da Espécie , Fatores de Transcrição/metabolismo , Transcriptoma/genética
20.
Nucleic Acids Res ; 46(W1): W43-W48, 2018 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-29897582

RESUMO

pirScan is a web-based tool for identifying C. elegans piRNA-targeting sites within a given mRNA or spliced DNA sequence. The purpose of our tool is to allow C. elegans researchers to predict piRNA targeting sites and to avoid the persistent germline silencing of transgenes that has rendered many constructs unusable. pirScan fulfills this purpose by first enumerating the predicted piRNA-targeting sites present in an input sequence. This prediction can be exported in a tabular or graphical format. Subsequently, pirScan suggests silent mutations that can be introduced to the input sequence that would allow the modified transgene to avoid piRNA targeting. The user can customize the piRNA targeting stringency and the silent mutations that he/she wants to introduce into the sequence. The modified sequences can be re-submitted to be certain that any previously present piRNA-targeting sites are now absent and no new piRNA-targeting sites are accidentally generated. This revised sequence can finally be downloaded as a text file and/or visualized in a graphical format. pirScan is freely available for academic use at http://cosbi4.ee.ncku.edu.tw/pirScan/.


Assuntos
Caenorhabditis elegans/genética , Internet , RNA Interferente Pequeno/genética , Software , Animais , Biologia Computacional/tendências , RNA Interferente Pequeno/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA