Evaluating the Symbol Binding Ability of Large Language Models for Multiple-Choice Questions in Vietnamese General Education-Reference-Cited by-同舟云学术

Evaluating the Symbol Binding Ability of Large Language Models for Multiple-Choice Questions in Vietnamese General Education

Published:2023-12-07 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 12th International Symposium on Information and Communication Technology
language:
Short-container-title:

Author:

Nguyen Duc-Vu¹^ORCID,Nguyen Quoc-Nam¹^ORCID

Affiliation:

1. University of Information Technology, Vietnam and Vietnam National University Ho Chi Minh City, Vietnam

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3628797.3628837

Reference16 articles.

1. Tom Brown , Benjamin Mann , Nick Ryder , Melanie Subbiah , Jared D Kaplan , Prafulla Dhariwal , Arvind Neelakantan , Pranav Shyam , Girish Sastry , Amanda Askell , 2020. Language models are few-shot learners. Advances in neural information processing systems 33 ( 2020 ), 1877–1901. Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, 2020. Language models are few-shot learners. Advances in neural information processing systems 33 (2020), 1877–1901.

2. Xuan-Quy Dao , Ngoc-Bich Le , Xuan-Dung Phan , and Bac-Bien Ngo . 2023. An Evaluation of ChatGPT’s Proficiency in English Language Testing of The Vietnamese National High School Graduation Examination. Available at SSRN 4473369 ( 2023 ). Xuan-Quy Dao, Ngoc-Bich Le, Xuan-Dung Phan, and Bac-Bien Ngo. 2023. An Evaluation of ChatGPT’s Proficiency in English Language Testing of The Vietnamese National High School Graduation Examination. Available at SSRN 4473369 (2023).

3. Xuan-Quy Dao , Ngoc-Bich Le , The-Duy Vo , Xuan-Dung Phan , Bac-Bien Ngo , Van-Tien Nguyen , Thi- My-Thanh Nguyen , and Hong-Phuoc Nguyen . 2023 . VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models. arxiv:2305.12199 [cs.CL] Xuan-Quy Dao, Ngoc-Bich Le, The-Duy Vo, Xuan-Dung Phan, Bac-Bien Ngo, Van-Tien Nguyen, Thi-My-Thanh Nguyen, and Hong-Phuoc Nguyen. 2023. VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models. arxiv:2305.12199 [cs.CL]

4. Dan Hendrycks , Collin Burns , Steven Basart , Andy Zou , Mantas Mazeika , Dawn Song , and Jacob Steinhardt . 2021 . Measuring Massive Multitask Language Understanding . Proceedings of the International Conference on Learning Representations (ICLR) (2021). Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, and Jacob Steinhardt. 2021. Measuring Massive Multitask Language Understanding. Proceedings of the International Conference on Learning Representations (ICLR) (2021).

5. Dan Hendrycks , Collin Burns , Saurav Kadavath , Akul Arora , Steven Basart , Eric Tang , Dawn Song , and Jacob Steinhardt . 2021. Measuring Mathematical Problem Solving With the MATH Dataset. NeurIPS ( 2021 ). Dan Hendrycks, Collin Burns, Saurav Kadavath, Akul Arora, Steven Basart, Eric Tang, Dawn Song, and Jacob Steinhardt. 2021. Measuring Mathematical Problem Solving With the MATH Dataset. NeurIPS (2021).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Towards an AI Tutor for Undergraduate Geotechnical Engineering: A Comparative Study of Evaluating the Efficiency of Large Language Model Application Programming Interfaces;2024-07-25