Your browser doesn't support javascript.
loading
Predicting Tissue-Specific mRNA and Protein Abundance in Maize: A Machine Learning Approach.
Cho, Kyoung Tak; Sen, Taner Z; Andorf, Carson M.
Afiliação
  • Cho KT; Department of Computer Science, Iowa State University, Ames, IA, United States.
  • Sen TZ; USDA-ARS, Crop Improvement and Genetics Research Unit, Albany, CA, United States.
  • Andorf CM; USDA-ARS, Corn Insects and Crop Genetics Research Unit, Ames, IA, United States.
Front Artif Intell ; 5: 830170, 2022.
Article em En | MEDLINE | ID: mdl-35719692
ABSTRACT
Machine learning and modeling approaches have been used to classify protein sequences for a broad set of tasks including predicting protein function, structure, expression, and localization. Some recent studies have successfully predicted whether a given gene is expressed as mRNA or even translated to proteins potentially, but given that not all genes are expressed in every condition and tissue, the challenge remains to predict condition-specific expression. To address this gap, we developed a machine learning approach to predict tissue-specific gene expression across 23 different tissues in maize, solely based on DNA promoter and protein sequences. For class labels, we defined high and low expression levels for mRNA and protein abundance and optimized classifiers by systematically exploring various methods and combinations of k-mer sequences in a two-phase approach. In the first phase, we developed Markov model classifiers for each tissue and built a feature vector based on the predictions. In the second phase, the feature vector was used as an input to a Bayesian network for final classification. Our results show that these methods can achieve high classification accuracy of up to 95% for predicting gene expression for individual tissues. By relying on sequence alone, our method works in settings where costly experimental data are unavailable and reveals useful insights into the functional, evolutionary, and regulatory characteristics of genes.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article