Author:
Zhong Haoxi,Xiao Chaojun,Tu Cunchao,Zhang Tianyang,Liu Zhiyuan,Sun Maosong
Abstract
We present JEC-QA, the largest question answering dataset in the legal domain, collected from the National Judicial Examination of China. The examination is a comprehensive evaluation of professional skills for legal practitioners. College students are required to pass the examination to be certified as a lawyer or a judge. The dataset is challenging for existing question answering methods, because both retrieving relevant materials and answering questions require the ability of logic reasoning. Due to the high demand of multiple reasoning abilities to answer legal questions, the state-of-the-art models can only achieve about 28% accuracy on JEC-QA, while skilled humans and unskilled humans can reach 81% and 64% accuracy respectively, which indicates a huge gap between humans and machines on this task. We will release JEC-QA and our baselines to help improve the reasoning ability of machine comprehension models. You can access the dataset from http://jecqa.thunlp.org/.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
38 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdicts;Artificial Intelligence and Law;2024-09-09
2. A Dynamic Retrieval-Augmented Generation Framework for Border Inspection Legal Question Answering;2024 International Conference on Asian Language Processing (IALP);2024-08-04
3. TriviaHG: A Dataset for Automatic Hint Generation from Factoid Questions;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10
4. LogSay: An Efficient Comprehension System for Log Numerical Reasoning;IEEE Transactions on Computers;2024-07
5. AI-Powered Legal Documentation Assistant;2024 4th International Conference on Pervasive Computing and Social Networking (ICPCSN);2024-05-03