Extracting the Synthetic Route of Pd-Based Catalysts in Methanol Steam Reforming from the Scientific Literature.
Li, Shuyuan; Zhang, Yunjiang; Fang, Zhaolin; Meng, Kong; Tian, Rui; He, Hong; Sun, Shaorui.
Affiliation
  • Li S; Beijing Key Laboratory for Green Catalysis and Separation, Faculty of Environment and Life, Beijing University of Technology, Beijing 100124, China.
  • Zhang Y; Beijing Key Laboratory for Green Catalysis and Separation, Faculty of Environment and Life, Beijing University of Technology, Beijing 100124, China.
  • Fang Z; Beijing Key Laboratory for Green Catalysis and Separation, Faculty of Environment and Life, Beijing University of Technology, Beijing 100124, China.
  • Meng K; Beijing Key Laboratory for Green Catalysis and Separation, Faculty of Environment and Life, Beijing University of Technology, Beijing 100124, China.
  • Tian R; Beijing Engineering Research Center for IoT Software and Systems, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China.
  • He H; Beijing Key Laboratory for Green Catalysis and Separation, Faculty of Environment and Life, Beijing University of Technology, Beijing 100124, China.
  • Sun S; Beijing Key Laboratory for Green Catalysis and Separation, Faculty of Environment and Life, Beijing University of Technology, Beijing 100124, China.
J Chem Inf Model; 63(20): 6249-6260, 2023 Oct 23.
Article in En | MEDLINE | ID: mdl-37807535
ABSTRACT
Structured material synthesis routes are crucial both for chemists performing experiments and for modern applications such as machine-learning-based material design. With the exponential growth of the chemical literature in recent years, manual extraction from published papers has become time-consuming and labor-intensive. This study develops an automated method for extracting Pd-based catalyst synthesis routes from the chemical literature. First, a paragraph classification model based on regular expressions identifies paragraphs that contain material synthesis processes, and the identified paragraphs are verified using machine learning techniques. Second, natural language processing techniques automatically parse the material synthesis routes from the identified paragraphs, generate regularized flowcharts, and output structured data. Lastly, the structured synthesis-route data are used to train machine learning models that predict the performance of the materials. The extracted material entities include the product, preparation method, precursor, support, loading, synthesis operation, and operation condition. The method avoids extensive manual data annotation and improves the efficiency of acquiring information from the scientific literature. Extraction accuracy exceeds 80% for the 11 material entities and exceeds 90% for the method, support, precursor, drying time, and reduction time.
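The first two steps described in the abstract (regex-based paragraph classification, then extraction of operation conditions) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the cue words, the `min_hits` threshold, the unit list, and the function names `is_synthesis_paragraph` and `extract_conditions` are all assumptions made for illustration, since the abstract does not give the actual patterns.

```python
import re

# Hypothetical cue words for synthesis paragraphs; the paper's real
# patterns are not stated in the abstract.
SYNTHESIS_CUES = re.compile(
    r"\b(impregnat\w*|calcin\w*|dried|drying|reduc\w*|precursor\w*|co-?precipitat\w*)\b",
    re.IGNORECASE,
)

# Hypothetical pattern for operation conditions: a number followed by a
# temperature or time unit that is not part of a longer word.
CONDITION = re.compile(
    r"(?P<value>\d+(?:\.\d+)?)\s*(?P<unit>°C|K|h|min)(?![A-Za-z])"
)


def is_synthesis_paragraph(text, min_hits=2):
    """Flag a paragraph when it contains at least min_hits distinct cue words."""
    hits = {m.group(1).lower() for m in SYNTHESIS_CUES.finditer(text)}
    return len(hits) >= min_hits


def extract_conditions(text):
    """Return (value, unit) pairs for every condition found in the text."""
    return [
        (float(m.group("value")), m.group("unit"))
        for m in CONDITION.finditer(text)
    ]


paragraph = (
    "The Pd/ZnO catalyst was prepared by impregnation, dried at 110 °C "
    "for 12 h, and calcined at 400 °C for 3 h."
)
if is_synthesis_paragraph(paragraph):
    print(extract_conditions(paragraph))
```

A keyword threshold like this is cheap and needs no annotated data, which is in the spirit of the approach the abstract describes; in the study itself the regex hits are additionally verified with machine learning.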
Subjects

Full text: 1 | Database: MEDLINE | Main subject: Steam / Methanol | Language: En | Year of publication: 2023 | Document type: Article