🏢 Taobao & Tmall Group of Alibaba
Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models
AI Generated
🤗 Daily Papers
Natural Language Processing
Large Language Models
Chinese SimpleQA is a new benchmark that comprehensively evaluates the factuality of LLMs when answering short questions in Chinese. It is designed to be diverse, high-quality, and easy to evaluate.