Chatbot responses suggest that hypothetical biology questions are harder than realistic ones

Published:2023-12-14 Issue:3 Volume:24
ISSN:1935-7877
Container-title:Journal of Microbiology & Biology Education
language:en
Short-container-title:J Microbiol Biol Educ.

Author:

Crowther Gregory J.¹, Sankar Usha², Knight Leena S.³, Myers Deborah L.¹, Patton Kevin T.⁴, Jenkins Lekelia D.⁵, Knight Thomas A.³

Affiliation:

1. Life Sciences Department, Everett Community College, Everett, Washington, USA

2. Department of Biological Sciences, Fordham University, Bronx, New York, USA

3. Biology Department, Whitman College, Walla Walla, Washington, USA

4. Biology Department, St. Charles Community College, Cottleville, Missouri, USA

5. School for the Future of Innovation in Society, Arizona State University, Tempe, Arizona, USA

Abstract

The biology education literature includes compelling assertions that unfamiliar problems are especially useful for revealing students’ true understanding of biology. However, there is only limited evidence that such novel problems have different cognitive requirements than more familiar problems. Here, we sought additional evidence by using chatbots based on large language models as models of biology students. For human physiology and cell biology, we developed sets of realistic and hypothetical problems matched to the same lesson learning objectives (LLOs). Problems were considered hypothetical if (i) known biological entities (molecules and organs) were given atypical or counterfactual properties (redefinition) or (ii) fictitious biological entities were introduced (invention). Several chatbots scored significantly worse on hypothetical problems than on realistic problems, with scores declining by an average of 13%. Among hypothetical questions, redefinition questions appeared especially difficult, with many chatbots scoring as if guessing randomly. These results suggest that, for a given LLO, hypothetical problems may have different cognitive demands than realistic problems and may more accurately reveal students’ ability to apply biology core concepts to diverse contexts. The Test Question Templates (TQT) framework, which explicitly connects LLOs with examples of assessment questions, can help educators generate problems that are challenging (due to their novelty), yet fair (due to their alignment with pre-specified LLOs). Finally, ChatGPT’s rapid improvement toward expert-level answers suggests that future educators cannot reasonably expect to ignore or outwit chatbots but must do what we can to make assessments fair and equitable.

Publisher

American Society for Microbiology

Subject

General Agricultural and Biological Sciences; General Immunology and Microbiology; General Biochemistry, Genetics and Molecular Biology; Education

Link

https://journals.asm.org/doi/pdf/10.1128/jmbe.00153-23

Reference: 38 articles.

1. Scientific Teaching

2. BioSkills Guide: Development and National Validation of a Tool for Interpreting the Vision and Change Core Competencies

3. A Revision of Bloom's Taxonomy: An Overview

4. Biology in Bloom: Implementing Bloom's Taxonomy to Enhance Student Learning in Biology

5. Just the Facts? Introductory Undergraduate Biology Courses Focus on Low-Level Cognitive Skills

Cited by 1 article.

1. Evaluating the Cognitive Levels of Generative AI via Bloom’s Taxonomy: A Cross-sectional Study (Preprint); 2024-05-15