NOIRBETTIK: A reading comprehension based multiple choice question answering dataset in Bangla language

NOIRBETTIK:一个基于阅读理解的孟加拉语多项选择题问答数据集

阅读:2

Abstract

The COVID-19 pandemic has accelerated the adoption of online educational systems, highlighting the need for advanced automation to enhance learning and evaluation processes. Multiple-choice questions (MCQs) are a fundamental assessment tool in these systems. This paper introduces NOIRBETTIK, a novel dataset designed for reading comprehension-based MCQ answering in Bangla, developed to address the shortage of high-quality Bangla datasets for context-based tasks. The dataset is human-made, sourced from authentic Bangla materials such as books, articles, and biographies, offering longer passages and multiple-choice questions with four alternatives per question. This work focuses on providing a comprehensive and real-world dataset, filling a critical gap in Bangla NLP research and educational applications. We describe the dataset's creation and annotation process, comparing it to existing datasets to highlight its uniqueness. The primary contributions include the release of the NOIRBETTIK dataset and a detailed exploration of its structure, enabling future advancements in educational technologies. This dataset holds significant promise for enhancing reading comprehension systems and addressing the educational needs of Bangla-speaking students.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。