Affiliation:
1. Cangrade, Inc., Watertown, MA 02472
2. Information School, University of Washington, Seattle, WA 98195
3. Department of Psychology, Harvard University, Cambridge, MA 02138
Abstract
How good a research scientist is ChatGPT? We systematically probed the capabilities of GPT-3.5 and GPT-4 across four central components of the scientific process: as a Research Librarian, Research Ethicist, Data Generator, and Novel Data Predictor, using psychological science as a testing field. In Study 1 (Research Librarian), unlike human researchers, GPT-3.5 and GPT-4 hallucinated, authoritatively generating fictional references 36.0% and 5.4% of the time, respectively, although GPT-4 exhibited an evolving capacity to acknowledge its fictions. In Study 2 (Research Ethicist), GPT-4 (though not GPT-3.5) proved capable of detecting violations like p-hacking in fictional research protocols, correcting 88.6% of blatantly presented issues, and 72.6% of subtly presented issues. In Study 3 (Data Generator), both models consistently replicated patterns of cultural bias previously discovered in large language corpora, indicating that ChatGPT can simulate known results, an antecedent to usefulness for both data generation and skills like hypothesis generation. Contrastingly, in Study 4 (Novel Data Predictor), neither model was successful at predicting new results absent in their training data, and neither appeared to leverage substantially new information when predicting more vs. less novel outcomes. Together, these results suggest that GPT is a flawed but rapidly improving librarian, a decent research ethicist already, capable of data generation in simple domains with known characteristics but poor at predicting novel patterns of empirical data to aid future experimentation.
Funder
US National Institute of Standards and Technology
Publisher
Proceedings of the National Academy of Sciences
Reference51 articles.
1. A. Vaswani , “Attention is all you need” in Advances in Neural Information Processing Systems, I. Guyon , Eds. (Curran Associates Inc., 2017), vol. 30, pp. 5998–6008.
2. X. Zhang Artificial intelligence for science in quantum atomistic and continuum systems. arXiv [Preprint] (2023). https://arxiv.org/pdf/2307.08423.pdf (Accessed 19 January 2024).
3. Highly accurate protein structure prediction with AlphaFold
4. Solving the quantum many-body problem with artificial neural networks
5. Machine learning–accelerated computational fluid dynamics
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献