Empirical assessment of ChatGPT’s answering capabilities in natural science and engineering-Reference-Cited by-同舟云学术

Empirical assessment of ChatGPT’s answering capabilities in natural science and engineering

Published:2024-02-29 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Schulze Balhorn Lukas,Weber Jana M.,Buijsman Stefan,Hildebrandt Julian R.,Ziefle Martina,Schweidtmann Artur M.

Abstract

AbstractChatGPT is a powerful language model from OpenAI that is arguably able to comprehend and generate text. ChatGPT is expected to greatly impact society, research, and education. An essential step to understand ChatGPT’s expected impact is to study its domain-specific answering capabilities. Here, we perform a systematic empirical assessment of its abilities to answer questions across the natural science and engineering domains. We collected 594 questions on natural science and engineering topics from 198 faculty members across five faculties at Delft University of Technology. After collecting the answers from ChatGPT, the participants assessed the quality of the answers using a systematic scheme. Our results show that the answers from ChatGPT are, on average, perceived as “mostly correct”. Two major trends are that the rating of the ChatGPT answers significantly decreases (i) as the educational level of the question increases and (ii) as we evaluate skills beyond scientific knowledge, e.g., critical attitude.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-54936-7.pdf

Reference44 articles.

1. Smith, M. J. & Geach, J. E. Astronomia ex machina: A history, primer and outlook on neural networks in astronomy. R. Soc. Open Sci. 10(5), 221454 (2023).

2. Agathokleous, E., Saitanis, C. J., Fang, C. & Yu, Z. Use of ChatGPT: What does it mean for biology and environmental science?. Sci. Total Environ. 888, 164154 (2023).

3. Foroumandi, E. et al. ChatGPT in hydrology and earth sciences: Opportunities, prospects, and concerns. Water Resour. Res. 59(10), e2023WR036288 (2023).

4. Liu, Y. et al. Generative artificial intelligence and its applications in materials science: Current situation and future perspectives. J. Materiomics 9(4), 798–816. https://doi.org/10.1016/j.jmat.2023.05.001 (2023).

5. Aluga, M. Application of CHATGPT in civil engineering. East Afr. J. Eng. 6(1), 104–112 (2023).

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. CarD-T: Interpreting Carcinomic Lexicon via Transformers;2024-08-14

2. Exploring large language models for microstructure evolution in materials;Materials Today Communications;2024-08

3. An Investigation into the Utility of Large Language Models in Geotechnical Education and Problem Solving;Geotechnics;2024-05-09

4. The Effect of Race, Gender and Priming on Large Language Models’ Conviction Predication;SSRN Electronic Journal;2024