KurdABSA: Kurdish aspect-based sentiment analysis dataset curation using few-shot learning

KurdABSA:基于少样本学习的库尔德语方面情感分析数据集构建

阅读:1

Abstract

Aspect-Based Sentiment Analysis (ABSA) extends traditional sentiment analysis by not only identifying the overall sentiment of a text but also associating specific sentiments with deeper and granular insights. The main objective of ABSA is to accurately extract relevant aspects and determine the sentiment polarity associated with each. Although extensive research has been conducted on ABSA across various languages, low-resource languages such as Kurdish remain largely underexplored in this domain. To address this gap, the present study introduces the first publicly available aspect-based sentiment analysis dataset for the Sorani dialect of Kurdish, addressing a critical gap in natural language processing (NLP) research for low-resource languages. The dataset has >4000 quadruplet ABSA in the restaurant review domain, written in the Kurdish language (Sorani dialect) using the Perso-Arabic script. A prompt-based few-shot learning model was employed to automatically annotate the dataset with aspect-opinion-category-sentiment quadruples, guided by a manually annotated support set verified by native Kurdish-language experts. This resource is intended for use in machine learning, deep learning, and cross-lingual model adaptation, making it suitable for training, fine-tuning, and benchmarking.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。