Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 36
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Brief Bioinform ; 22(3)2021 05 20.
Article in English | MEDLINE | ID: mdl-32698196

ABSTRACT

DNA variants represent an important source of genetic variations among individuals. Next- generation sequencing (NGS) is the most popular technology for genome-wide variant calling. Third-generation sequencing (TGS) has also recently been used in genetic studies. Although many variant callers are available, no single caller can call both types of variants on NGS or TGS data with high sensitivity and specificity. In this study, we systematically evaluated 11 variant callers on 12 NGS and TGS datasets. For germline variant calling, we tested DNAseq and DNAscope modes from Sentieon, HaplotypeCaller mode from GATK and WGS mode from DeepVariant. All the four callers had comparable performance on NGS data and 30× coverage of WGS data was recommended. For germline variant calling on TGS data, we tested DNAseq mode from Sentieon, HaplotypeCaller mode from GATK and PACBIO mode from DeepVariant. All the three callers had similar performance in SNP calling, while DeepVariant outperformed the others in InDel calling. TGS detected more variants than NGS, particularly in complex and repetitive regions. For somatic variant calling on NGS, we tested TNscope and TNseq modes from Sentieon, MuTect2 mode from GATK, NeuSomatic, VarScan2, and Strelka2. TNscope and Mutect2 outperformed the other callers. A higher proportion of tumor sample purity (from 10 to 20%) significantly increased the recall value of calling. Finally, computational costs of the callers were compared and Sentieon required the least computational cost. These results suggest that careful selection of a tool and parameters is needed for accurate SNP or InDel calling under different scenarios.


Subject(s)
Databases, Nucleic Acid , High-Throughput Nucleotide Sequencing , INDEL Mutation , Benchmarking , Cell Line , Computational Biology , Female , Genome, Human , Humans
2.
Genome Res ; 29(7): 1087-1099, 2019 07.
Article in English | MEDLINE | ID: mdl-31175153

ABSTRACT

To initiate X-Chromosome inactivation (XCI), the long noncoding RNA Xist mediates chromosome-wide gene silencing of one X Chromosome in female mammals to equalize gene dosage between the sexes. The efficiency of gene silencing is highly variable across genes, with some genes even escaping XCI in somatic cells. A gene's susceptibility to Xist-mediated silencing appears to be determined by a complex interplay of epigenetic and genomic features; however, the underlying rules remain poorly understood. We have quantified chromosome-wide gene silencing kinetics at the level of the nascent transcriptome using allele-specific Precision nuclear Run-On sequencing (PRO-seq). We have developed a Random Forest machine-learning model that can predict the measured silencing dynamics based on a large set of epigenetic and genomic features and tested its predictive power experimentally. The genomic distance to the Xist locus, followed by gene density and distance to LINE elements, are the prime determinants of the speed of gene silencing. Moreover, we find two distinct gene classes associated with different silencing pathways: a class that requires Xist-repeat A for silencing, which is known to activate the SPEN pathway, and a second class in which genes are premarked by Polycomb complexes and tend to rely on the B repeat in Xist for silencing, known to recruit Polycomb complexes during XCI. Moreover, a series of features associated with active transcriptional elongation and chromatin 3D structure are enriched at rapidly silenced genes. Our machine-learning approach can thus uncover the complex combinatorial rules underlying gene silencing during X inactivation.


Subject(s)
Epigenesis, Genetic , Gene Silencing , Machine Learning , RNA, Long Noncoding/physiology , X Chromosome Inactivation/genetics , Animals , Cell Line , Embryonic Stem Cells , Female , Genes, X-Linked , Genome , Kinetics , Mice , Models, Genetic
3.
Int J Cancer ; 144(2): 281-289, 2019 01 15.
Article in English | MEDLINE | ID: mdl-29752822

ABSTRACT

Multigene panel testing of breast cancer predisposition genes have been extensively conducted in Europe and America, which is relatively rare in Asia however. In this study, we assessed the frequency of germline mutations in 40 cancer predisposition genes, including BRCA1 and BRCA2, among a large cohort of Chinese patients with high hereditary risk of BC. From 2015 to 2016, consecutive BC patients from 26 centers of China with high hereditary risk were recruited (n = 937). Clinical information was collected and next-generation sequencing (NGS) was performed using blood samples of participants to identify germline mutations. In total, we acquired 223 patients with putative germline mutations, including 159 in BRCA1/2, 61 in 15 other BC susceptibility genes and 3 in both BRCA1/2 and non-BRCA1/2 gene. Major mutant non-BRCA1/2 genes were TP53 (n = 18), PALB2 (n = 11), CHEK2 (n = 6), ATM (n = 6) and BARD1 (n = 5). No factors predicted pathologic mutations in non-BRCA1/2 genes when treated as a whole. TP53 mutations were associated with HER-2 positive BC and younger age at diagnosis; and CHEK2 and PALB2 mutations were enriched in patients with luminal BC. Among high hereditary risk Chinese BC patients, 23.8% contained germline mutations, including 6.8% in non-BRCA1/2 genes. TP53 and PALB2 had a relatively high mutation rate (1.9 and 1.2%). Although no factors predicted for detrimental mutations in non-BRCA1/2 genes, some clinical features were associated with mutations of several particular genes.


Subject(s)
Breast Neoplasms/genetics , Genetic Predisposition to Disease/genetics , Adult , Asian People/genetics , Female , Germ-Line Mutation , Humans , Middle Aged
4.
J Hum Genet ; 63(11): 1129-1137, 2018 Nov.
Article in English | MEDLINE | ID: mdl-30131598

ABSTRACT

Noninvasive prenatal testing (NIPT), which involves analysis of circulating cell-free fetal DNA (cffDNA) from maternal plasma, is highly effective for detecting feto-placental chromosome aneuploidy. However, recent studies suggested that coverage-based shallow-depth NIPT cannot accurately detect smaller single or multi-loci genetic variants. To assess the fetal genotype of any locus using maternal plasma, we developed a novel genotyping algorithm named pseudo tetraploid genotyping (PTG). We performed paired-end captured sequencing of the plasma cell-free DNA (cfDNA), in which case a phenotypically healthy woman is suspected to be carrying a fetus with genetic defect. After a series of independent filtering of 111,407 SNPs, we found one variant in COL1A1 graded with high pathogenic potential which might cause osteogenesis imperfecta (OI). Then, we verified this mutation by Sanger sequencing of fetal and parental blood cells. In addition, we evaluated the accuracy and detection rate of the PTG algorithm through direct sequencing of the genomic DNA from maternal and fetal blood cells. Collectively, our study developed an intuitive and cost-effective method for the noninvasive detection of pathogenic mutations, and successfully identified a de novo variant in COL1A1 (c.2596 G > A, p.Gly866Ser) in the fetus implicated in OI.


Subject(s)
Collagen Type I/genetics , Fetus , High-Throughput Nucleotide Sequencing , Osteogenesis Imperfecta/genetics , Polymorphism, Single Nucleotide/genetics , Prenatal Diagnosis , Collagen Type I, alpha 1 Chain , Female , Humans , Male , Osteogenesis Imperfecta/embryology
5.
PLoS Genet ; 9(11): e1003791, 2013 11.
Article in English | MEDLINE | ID: mdl-24244175

ABSTRACT

In most mouse tissues, long-interspersed elements-1 (L1s) are silenced via methylation of their 5'-untranslated regions (5'-UTR). A gradual loss-of-methylation in pre-implantation embryos coincides with L1 retrotransposition in blastocysts, generating potentially harmful mutations. Here, we show that Dicer- and Ago2-dependent RNAi restricts L1 accumulation and retrotransposition in undifferentiated mouse embryonic stem cells (mESCs), derived from blastocysts. RNAi correlates with production of Dicer-dependent 22-nt small RNAs mapping to overlapping sense/antisense transcripts produced from the L1 5'-UTR. However, RNA-surveillance pathways simultaneously degrade these transcripts and, consequently, confound the anti-L1 RNAi response. In Dicer(-/-) mESC complementation experiments involving ectopic Dicer expression, L1 silencing was rescued in cells in which microRNAs remained strongly depleted. Furthermore, these cells proliferated and differentiated normally, unlike their non-complemented counterparts. These results shed new light on L1 biology, uncover defensive, in addition to regulatory roles for RNAi, and raise questions on the differentiation defects of Dicer(-/-) mESCs.


Subject(s)
Argonaute Proteins/genetics , DEAD-box RNA Helicases/genetics , Embryonic Stem Cells/metabolism , Long Interspersed Nucleotide Elements/genetics , RNA Interference , Ribonuclease III/genetics , 5' Untranslated Regions , Animals , Cell Differentiation/genetics , Cell Proliferation , DEAD-box RNA Helicases/metabolism , DNA Methylation/genetics , Gene Expression Regulation, Developmental , Mice , Promoter Regions, Genetic , Retroelements/genetics , Ribonuclease III/metabolism
6.
Biochem Biophys Res Commun ; 444(1): 86-91, 2014 Jan 31.
Article in English | MEDLINE | ID: mdl-24434141

ABSTRACT

Transcriptome studies have revealed that many non-coding RNAs (ncRNAs) are located near the 3' sense terminus of protein-coding genes. However, the transcription and function of these RNAs remain elusive. Here, we identify a 3' sense termini-associated sRNA (TASR) downstream of rpl26 in Schizosaccharomyces pombe (S. pombe). Structure and function assays indicate that the TASR is an H/ACA box snoRNA required for 18S rRNA pseudouridylation at U121 and U305 sites and is therefore a cognate of snR49 from the budding yeast. Transcriptional studies show that pre-snR49 overlaps most of the coding sequence (CDS) of rpl26. Using scanning deletion analysis within promoter region, we show that the rpl26 promoter is required for the 3' TASR transcription. Interestingly, chromosomal synteny of rpl26-snR49 is found in the Schizosaccharomyces groups. Taken together, we have revealed a new transcriptional mechanism for 3' sense TASRs, which are transcribed by the same promoter as their upstream protein genes. These results further suggest that the origin and function of 3' sense ncRNAs are associated with upstream genes in higher eukaryotes.


Subject(s)
RNA, Fungal/genetics , RNA, Untranslated/genetics , Ribosomal Proteins/genetics , Schizosaccharomyces pombe Proteins/genetics , 3' Untranslated Regions , Base Sequence , Genes, Fungal , Molecular Sequence Data , Promoter Regions, Genetic , Schizosaccharomyces/genetics , Schizosaccharomyces/metabolism , Sequence Homology, Nucleic Acid , Synteny , Transcription, Genetic
7.
Methods ; 63(1): 76-84, 2013 Sep 01.
Article in English | MEDLINE | ID: mdl-23684746

ABSTRACT

It has been shown in small RNA sequencing-based studies that some small RNA fragments are specifically processed from known structural non-coding RNAs, either through Dicer-dependent or Dicer-independent pathways. Although these small RNAs are often less abundant compared to microRNAs in normal mammalian tissues, they are always present in all sequenced libraries. In this paper, we use the ncPRO-seq pipeline, to describe different profiles of these small RNA fragments, and to discuss their potential processing pathways and functions. To assess whether more small RNA fragments can be detected in small RNA sequencing datasets, we decided to focus on small nuclear RNAs, abbreviated as snRNAs, which are associated with Sm ribonucleoproteins to form functional RNA-protein complexes. Here, we describe a group of small RNA fragments derived from snRNAs, which are typically highly enriched in regions bound by Sm proteins. Based on this, we propose the existence of a potential novel small RNA family associated with Sm proteins.


Subject(s)
RNA, Small Nuclear/genetics , RNA, Untranslated/isolation & purification , Ribonucleoproteins/genetics , Sequence Analysis, RNA , Animals , DEAD-box RNA Helicases/genetics , Humans , Mammals , RNA Processing, Post-Transcriptional , RNA, Small Nuclear/metabolism , RNA, Untranslated/genetics , Ribonuclease III/genetics , Ribonucleoproteins/chemistry
8.
RNA ; 17(3): 390-400, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21220549

ABSTRACT

Plant genomes have undergone multiple rounds of duplications that contributed massively to the growth of gene families. The structure of resulting families has been studied in depth for protein-coding genes. However, little is known about the impact of duplications on noncoding RNA (ncRNA) genes. Here we perform a systematic analysis of duplicated regions in the rice genome in search of such ncRNA repeats. We observe that, just like their protein counterparts, most ncRNA genes have undergone multiple duplications that left visible sequence conservation footprints. The extent of ncRNA gene duplication in plants is such that these sequence footprints can be exploited for the discovery of novel ncRNA gene families on a large scale. We developed an SVM model that is able to retrieve likely ncRNA candidates among the 100,000+ repeat families in the rice genome, with a reasonably low false-positive discovery rate. Among the nearly 4000 ncRNA families predicted by this means, only 90 correspond to putative snoRNA or miRNA families. About half of the remaining families are classified as structured RNAs. New candidate ncRNAs are particularly enriched in UTR and intronic regions. Interestingly, 89% of the putative ncRNA families do not produce a detectable signal when their sequences are compared to another grass genome such as maize. Our results show that a large fraction of rice ncRNA genes are present in multiple copies and are species-specific or of recent origin. Intragenome comparison is a unique and potent source for the computational annotation of this major class of ncRNA.


Subject(s)
Comparative Genomic Hybridization , Computational Biology , Genes, Plant/genetics , Genome, Plant , Plants/genetics , RNA, Plant/genetics , RNA, Untranslated/genetics , Base Sequence , Molecular Sequence Data , Multigene Family , Nucleic Acid Conformation , Polymerase Chain Reaction , RNA, Untranslated/chemistry , Sequence Homology, Nucleic Acid
9.
Bioinformatics ; 28(21): 2843-4, 2012 Nov 01.
Article in English | MEDLINE | ID: mdl-22923296

ABSTRACT

SUMMARY: The R/Bioconductor package HiTC facilitates the exploration of high-throughput 3C-based data. It allows users to import and export 'C' data, to transform, normalize, annotate and visualize interaction maps. The package operates within the Bioconductor framework and thus offers new opportunities for future development in this field. AVAILABILITY AND IMPLEMENTATION: The R package HiTC is available from the Bioconductor website. A detailed vignette provides additional documentation and help for using the package.


Subject(s)
Chromosome Mapping/methods , Restriction Mapping/methods , Software , Animals , Chromosome Mapping/instrumentation , Chromosomes, Human, Pair 14/chemistry , Chromosomes, Human, Pair 14/genetics , Data Display , Humans , Mice , Molecular Conformation , Restriction Mapping/instrumentation
10.
Bioinformatics ; 28(23): 3147-9, 2012 Dec 01.
Article in English | MEDLINE | ID: mdl-23044543

ABSTRACT

SUMMARY: Non-coding RNA (ncRNA) PROfiling in small RNA (sRNA)-seq (ncPRO-seq) is a stand-alone, comprehensive and flexible ncRNA analysis pipeline. It can interrogate and perform detailed profiling analysis on sRNAs derived from annotated non-coding regions in miRBase, Rfam and RepeatMasker, as well as specific regions defined by users. The ncPRO-seq pipeline performs both gene-based and family-based analyses of sRNAs. It also has a module to identify regions significantly enriched with short reads, which cannot be classified under known ncRNA families, thus enabling the discovery of previously unknown ncRNA- or small interfering RNA (siRNA)-producing regions. The ncPRO-seq pipeline supports input read sequences in fastq, fasta and color space format, as well as alignment results in BAM format, meaning that sRNA raw data from the three current major platforms (Roche-454, Illumina-Solexa and Life technologies-SOLiD) can be analyzed with this pipeline. The ncPRO-seq pipeline can be used to analyze read and alignment data, based on any sequenced genome, including mammals and plants. AVAILABILITY: Source code, annotation files, manual and online version are available at http://ncpro.curie.fr/. CONTACT: bioinfo.ncproseq@curie.fr or cciaudo@ethz.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology/methods , RNA, Small Interfering/genetics , RNA, Untranslated/genetics , Sequence Analysis, RNA/methods , Software , Base Sequence , Internet , Sequence Alignment
12.
RNA Biol ; 8(3): 538-47, 2011.
Article in English | MEDLINE | ID: mdl-21525786

ABSTRACT

Small RNAs constitute a new and unanticipated layer of gene regulation present in the three domains of life. In plants, all organs are ultimately derived from a few pluripotent stem cells localized in specialized structures called apical meristems. The development of meristems involves a coordinated balance between undifferentiated growth and differentiation, a phenomenon requiring a tight regulation of gene expression. We used in vitro cultured embryogenic calli as a model to investigate the roles of meristem-associated small RNAs. Using high throughput sequencing, we sequenced 20 million short reads with size of 18-30 nt from rice undifferentiated and differentiated calli. We confirmed 50 known microRNA families, representing one third of annotated rice microRNAs. Using a specific computational pipeline for plant microRNA identification, we identified 24 novel microRNA families. Among them, 53 microRNA or microRNA* sequences appear to vary in expression between differentiated and undifferentiated calli, suggesting a role in meristem development. Our analysis also revealed a new class of plant small RNAs derived from 5' or 3' ends of mature tRNA analogous to the tRFs in human cancer cell. We independently verified the expression of these small RNAs from 5' end of mature tRNA using qRT-PCR.


Subject(s)
Genome, Plant , MicroRNAs/analysis , Oryza/embryology , Oryza/genetics , RNA, Plant/analysis , RNA, Small Interfering/analysis , Base Sequence , Gene Expression Regulation, Plant , Meristem/metabolism , Molecular Sequence Data , RNA, Transfer/metabolism
13.
Leuk Lymphoma ; 62(10): 2416-2427, 2021 10.
Article in English | MEDLINE | ID: mdl-33913388

ABSTRACT

We have reported the genetic mutation profile in previously untreated acute myeloid leukemia (AML) patients using a targeted NGS screening method. In this study, we evaluated the characteristics and prognostic significance of gene mutations in refractory/relapsed (R/R) AML patients by comparing their gene mutation spectrum to those newly diagnosed. The frequencies of tumor suppressor mutations were increased, while the mutation frequencies of nucleophosmin and spliceosome complex were decreased in relapsed AML. The frequency of FLT3-ITD mutation was increased, while that of CEBPA biallelic mutation decreased in refractory AML. Activated signaling mutations predicted a lower complete remission rate. FLT3-ITD mutation predicted an inferior overall survival after relapse. DNMT3A mutation predicted an inferior relapse-free survival in R/R AML. These findings may shed light on the molecular mechanism study of leukemia refractory or relapse and provide new guidance for the dynamic risk assessment of AML.


Subject(s)
Leukemia, Myeloid, Acute , High-Throughput Nucleotide Sequencing , Humans , Leukemia, Myeloid, Acute/diagnosis , Leukemia, Myeloid, Acute/genetics , Mutation , Prognosis , Remission Induction , fms-Like Tyrosine Kinase 3/genetics
14.
Mol Biol Evol ; 26(2): 327-34, 2009 Feb.
Article in English | MEDLINE | ID: mdl-18984903

ABSTRACT

Of the 1.1 million Alu retroposons in the human genome, about 10,000 are inserted in the 3' untranslated regions (UTR) of protein-coding genes and 1% of these (107 events) are active as polyadenylation sites (PASs). Strikingly, although Alu's in 3' UTR are indifferently inserted in the forward or reverse direction, 99% of polyadenylation-active Alu sequences are forward oriented. Consensus Alu+ sequences contain sites that can give rise to polyadenylation signals and enhancers through a few point mutations. We found that the strand bias of polyadenylation-active Alu's reflects a radical difference in the fitness of sense and antisense Alu's toward cleavage/polyadenylation activity. In contrast to previous beliefs, Alu inserts do not necessarily represent weak or cryptic PASs; instead, they often constitute the major or the unique PAS in a gene, adding to the growing list of Alu exaptations. Finally, some Alu-borne PASs are intronic and produce truncated transcripts that may impact gene function and/or contribute to gene remodeling.


Subject(s)
Alu Elements , Polyadenylation , Animals , Base Sequence , Enhancer Elements, Genetic , Humans , Mice , Molecular Sequence Data , RNA Splice Sites
15.
Nat Commun ; 11(1): 987, 2020 02 20.
Article in English | MEDLINE | ID: mdl-32080190

ABSTRACT

Longevity is influenced by genetic and environmental factors, but the underlying mechanisms remain elusive. Here, we functionally characterise a Drosophila small nucleolar RNA (snoRNA), named jouvence whose loss of function reduces lifespan. The genomic region of jouvence rescues the longevity in mutant, while its overexpression in wild-type increases lifespan. Jouvence is required in enterocytes. In mutant, the epithelium of the gut presents more hyperplasia, while the overexpression of jouvence prevents it. Molecularly, the mutant lack pseudouridylation on 18S and 28S-rRNA, a function rescued by targeted expression of jouvence in the gut. A transcriptomic analysis performed from the gut reveals that several genes are either up- or down-regulated, while restoring the mRNA level of two genes (ninaD or CG6296) rescue the longevity. Since snoRNAs are structurally and functionally well conserved throughout evolution, we identified putative jouvence orthologue in mammals including humans, suggesting that its function in longevity could be conserved.


Subject(s)
Drosophila melanogaster/genetics , Longevity/genetics , RNA, Small Nucleolar/genetics , Animals , Animals, Genetically Modified , Conserved Sequence , Drosophila Proteins/genetics , Drosophila melanogaster/metabolism , Enterocytes/metabolism , Evolution, Molecular , Female , Gene Deletion , Gene Expression Profiling , Genome, Insect , Humans , Intestinal Mucosa/metabolism , Loss of Function Mutation , Male , Mice , Nucleic Acid Conformation , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA, Small Nucleolar/chemistry , RNA, Small Nucleolar/metabolism , Receptors, Immunologic/genetics
16.
Onco Targets Ther ; 13: 4183-4187, 2020.
Article in English | MEDLINE | ID: mdl-32523354

ABSTRACT

Anaplastic lymphoma kinase (ALK) fusion is present in approximately 2-7% of patients with lung adenocarcinoma. ALK fusion-positive patients can benefit from targeted therapy. We herein report a 53-year-old Chinese male patient diagnosed as lung adenocarcinoma with a smoking history. Next-generation sequencing was performed to detect somatic mutations of oncogenic drivers and tumor suppressor genes in plasma-derived circulating tumor DNA using an ultra-deep 160-gene panel. A novel HPCAL1-ALK fusion variant was identified in the patient responding to ALK inhibitor treatments, and the fusion variant was also confirmed by fluorescence in situ hybridization and immunohistochemical. Our study expands the mutational spectrum of ALK fusion variants and provides options for the precise treatment of such patients.

17.
Cancer Med ; 9(22): 8457-8467, 2020 11.
Article in English | MEDLINE | ID: mdl-32970934

ABSTRACT

To explore the characteristics and prognostic significance of genetic mutations in acute myeloid leukemia (AML), we screened the gene mutation profile of 171 previously untreated AML patients using a next-generation sequencing technique targeting 127 genes with potential prognostic significance. A total of 390 genetic alterations were identified in 149 patients with a frequency of 87.1%. Younger age and high sensitivity to induction chemotherapy were associated with a lower number of mutations. NPM1 mutation was closely related to DNMT3A and FLT3-internal tandem duplication (FLT3-ITD) mutations, but mutually exclusive with ASXL1 mutation and CEBPAdouble mutation . In univariate analysis, ASXL1 or TET2 mutation predicted shorter overall survival (OS) or relapse-free survival (RFS), DNMT3A, FLT3-ITD, or RUNX1 mutation predicted a higher likelihood of remission-induction failure, whereas NRAS mutation or CEBPAdouble mutation predicted longer OS. Concurrent DNMT3A, FLT3-ITD, and NPM1 mutations predicted shorter OS. Hypomethylation agents could improve the OS in patients with DNA methylation-related mutations. According to multivariate analysis, TET2 mutation was recognized as an independent prognostic factors for RFS. In summary, our study provided a detailed pattern of gene mutations and their prognostic relevance in Chinese AML patients based on targeted next-generation sequencing screening.


Subject(s)
Biomarkers, Tumor/genetics , DNA Mutational Analysis , High-Throughput Nucleotide Sequencing , Leukemia, Myeloid, Acute/genetics , Mutation , Adult , Aged , Aged, 80 and over , Beijing , Female , Genetic Predisposition to Disease , Humans , Leukemia, Myeloid, Acute/diagnosis , Leukemia, Myeloid, Acute/mortality , Leukemia, Myeloid, Acute/therapy , Male , Middle Aged , Nucleophosmin , Predictive Value of Tests , Recurrence , Remission Induction , Risk Assessment , Risk Factors , Time Factors , Treatment Outcome , Young Adult
18.
Genetics ; 179(1): 21-30, 2008 May.
Article in English | MEDLINE | ID: mdl-18493037

ABSTRACT

Chlamydomonas reinhardtii is a unicellular green alga, the lineage of which diverged from that of land plants >1 billion years ago. Using the powerful small nucleolar RNA (snoRNA) mining platform to screen the C. reinhardtii genome, we identified 322 snoRNA genes grouped into 118 families. The 74 box C/D families can potentially guide methylation at 96 sites of ribosomal RNAs (rRNAs) and snRNAs, and the 44 box H/ACA families can potentially guide pseudouridylation at 62 sites. Remarkably, 242 of the snoRNA genes are arranged into 76 clusters, of which 77% consist of homologous genes produced by small local tandem duplications. At least 70 snoRNA gene clusters are found within introns of protein-coding genes. Although not exhaustive, this analysis reveals that C. reinhardtii has the highest number of intronic snoRNA gene clusters among eukaryotes. The prevalence of intronic snoRNA gene clusters in C. reinhardtii is similar to that of rice but in contrast with the one-snoRNA-per-intron organization of vertebrates and fungi and with that of Arabidopsis thaliana in which only a few intronic snoRNA gene clusters were identified. This analysis of C. reinhardtii snoRNA gene organization shows the functional importance of introns in a single-celled organism and provides evolutionary insight into the origin of intron-encoded RNAs in the plant lineage.


Subject(s)
Chlamydomonas reinhardtii/genetics , Genome, Protozoan/genetics , Introns/genetics , Multigene Family/genetics , RNA, Small Nucleolar/genetics , Animals , Base Sequence , Computational Biology , Evolution, Molecular , Gene Components , Molecular Sequence Data , Sequence Homology
19.
Tree Physiol ; 39(3): 356-371, 2019 03 01.
Article in English | MEDLINE | ID: mdl-30137626

ABSTRACT

Oil palm (Elaeis guineensis Jacq.) is the highest oil-yielding crop in the plant kingdom and accumulates 90% of palm oil in the mesocarp. However, the regulatory mechanisms of lipid and fatty acid (FA) metabolism in oil palm are just beginning to be understood, and more studies are needed, especially in the understanding of small noncoding RNA (ncRNA) and mRNA. Based on the deep sequencing of small noncoding RNAs and the degradome in five developmental mesocarp stages, 452 microRNAs (miRNAs), including 170 conserved known-miRNAs (kn-miRNAs) and 282 novel-miRNA (nov-miRNAs), were identified. After predicting the targets of those miRNAs to 37 FA synthesis-related genes, we found that 22 kn-miRNAs and 14 nov-miRNAs might be involved in FA metabolism pathways. Among them, eg-miR156c, eg-miR397, eg-miR444b and nov-miR129 regulated FA synthesis in plastids and the transport of FA-ACP from plastids to the endoplasmic reticulum by targeting acetyl-CoA carboxylase 1 (ACC1), long-chain acyl-CoA synthetase 9 (LACS9), LACS4 and enoyl-ACP reductase (ENR), respectively. Nov-miR138 and nov-miR59 targeted glycerol-3-phosphate acyltransferase (GPAT), and nov-miR274 targeted phosphatidate phosphatase 1 (PAP1). Both target genes are involved in triacylglycerol synthesis in the endoplasmic reticulum. Eg-miR156e and eg-miR156j played pivotal roles by targeting ß-ketoacyl-CoA synthase 12 (KCS12), and nov-miR201 targets very-long-chain enoyl-CoA reductase (ECR). Several miRNAs were also predicted to indirectly regulate FA synthesis and lipid metabolism through the squamosa promoter-binding protein-like gene (SPL), NAC and MYB transcription factors. As a whole, indications of a complex and extensive miRNA-mRNA regulatory network associated with FA metabolism in the mesocarp of the oil palm is presented. The results help to broaden the knowledge of potential mechanisms that might be regulated by miRNAs through modulation of the expression of FA-related target gene metabolism in the oil palm.


Subject(s)
Arecaceae/genetics , Fatty Acids/metabolism , Gene Expression Regulation, Plant , Gene Regulatory Networks , Lipid Metabolism , Arecaceae/metabolism , Genome-Wide Association Study , RNA, Plant/genetics , RNA, Plant/metabolism , RNA, Small Untranslated/genetics , RNA, Small Untranslated/metabolism , Sequence Analysis, RNA
20.
BMC Med Genomics ; 12(1): 28, 2019 01 28.
Article in English | MEDLINE | ID: mdl-30691450

ABSTRACT

BACKGROUND: Microtia-atresia is characterized by abnormalities of the auricle (microtia) and aplasia or hypoplasia of the external auditory canal, often associated with middle ear abnormalities. To date, no causal genetic mutations or genes have been identified in microtia-atresia patients. METHODS: We designed a panel of 131 genes associated with external/middle or inner ear deformity. Targeted genomic capturing combined with next-generation sequencing (NGS) was utilized to screen for mutations in 40 severe microtia-atresia patients. Mutations detected by NGS were filtered and validated. And then mutations were divided into three categories-rare or novel variants, low-frequency variants and common variants-based on their frequency in the public database. The rare or novel mutations were prioritized by pathogenicity analysis. For the low-frequency variants and common variants, we used association studies to explore risk factors of severe microtia-atresia. RESULTS: Sixty-five rare heterozygous mutations of 42 genes were identified in 27 (67.5%) severe microtia-atresia patients. Association studies to determine genes that were potentially pathogenic found that PLEC, USH2A, FREM2, DCHS1, GLI3, POMT1 and GBA genes were significantly associated with severe microtia-atresia. Of these, DCHS1 was strongly suggested to cause severe microtia-atresia as it was identified by both low-frequency and common variants association studies. A rare mutation (c.481C > T, p.R161C) in DCHS1 identified in one individual may be deleterious and may cause severe microtia-atresia. CONCLUSION: We identified several genes that were significantly associated with severe microtia-atresia. The findings provide new insights into genetic background of external ear deformities.


Subject(s)
Congenital Microtia/genetics , Genetic Variation , High-Throughput Nucleotide Sequencing , Adult , Child , Female , Humans , Male , Young Adult
SELECTION OF CITATIONS
SEARCH DETAIL