Large language model enhanced corpus of CO2 reduction electrocatalysts and synthesis procedures-Reference-Cited by-同舟云学术

Large language model enhanced corpus of CO2 reduction electrocatalysts and synthesis procedures

Published:2024-04-06 Issue:1 Volume:11 Page:
ISSN:2052-4463
Container-title:Scientific Data
language:en
Short-container-title:Sci Data

Author:

Chen Xueqing^ORCID,Gao Yang^ORCID,Wang Ludi^ORCID,Cui Wenjuan^ORCID,Huang Jiamin,Du Yi^ORCID,Wang Bin^ORCID

Abstract

AbstractCO2 electroreduction has garnered significant attention from both the academic and industrial communities. Extracting crucial information related to catalysts from domain literature can help scientists find new and effective electrocatalysts. Herein, we used various advanced machine learning, natural language processing techniques and large language models (LLMs) approaches to extract relevant information about the CO2 electrocatalytic reduction process from scientific literature. By applying the extraction pipeline, we present an open-source corpus for electrocatalytic CO2 reduction. The database contains two types of corpus: (1) the benchmark corpus, which is a collection of 6,985 records extracted from 1,081 publications by catalysis postgraduates; and (2) the extended corpus, which consists of content extracted from 5,941 documents using traditional NLP techniques and LLMs techniques. The Extended Corpus I and II contain 77,016 and 30,283 records, respectively. Furthermore, several domain literature fine-tuned LLMs were developed. Overall, this work will contribute to the exploration of new and effective electrocatalysts by leveraging information from domain literature using cutting-edge computer techniques.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41597-024-03180-9.pdf

Reference49 articles.

1. Birdja, Y. Y. et al. Advances and challenges in understanding the electrocatalytic conversion of carbon dioxide to fuels. Nat. Energy 4, 732–745 (2019).

2. Zhong, M. et al. Accelerated discovery of CO2 electrocatalysts using active machine learning. Nature 581, 178–183 (2020).

3. Gao, Y., Wang, L., Chen, X., Du, Y. & Wang, B. Revisiting electrocatalyst design by a knowledge graph of Cu-based catalysts for CO2 reduction. ACS Catal. 13, 8525–8534 (2023).

4. Qiao, J., Liu, Y., Hong, F. & Zhang, J. A review of catalysts for the electroreduction of carbon dioxide to produce low-carbon fuels. Chem. Soc. Rev. 43, 631–675 (2014).

5. Zheng, T., Jiang, K. & Wang, H. Recent advances in electrochemical CO2-to-CO conversion on heterogeneous catalysts. Adv. Mater. 30, 1802066 (2018).