Skip to main content

🏢 Taobao & Tmall Group of Alibaba

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models
·2396 words·12 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Taobao & Tmall Group of Alibaba
Chinese SimpleQA, a new benchmark, offers a comprehensive evaluation of the factuality of LLMs answering short questions in Chinese, exhibiting diversity, high quality, and ease of evaluation.