A bi-annotated Malay-English code-switching (Manglish) dataset of X posts for biological gender identification and authorship attribution
一个包含 X 篇帖子的双标注马来语-英语语码转换(Manglish)数据集,用于生物性别识别和作者身份归属。
期刊:Data in Brief
影响因子:1.4
doi:10.1016/j.dib.2024.110034
Maskat, Ruhaila; Azman, Norazmiera Ayunie; Nulizairos, Nur Shaheera Shastera; Zahidin, Nurul Athirah; Mahadi, Adibah Humairah; Norshamsul, Siti Rubaya; Sharif, Mohd Mukhlis Mohd; Mahdin, Hairulnizam