MAHE: a multiscale and hybrid expert-based model for image-text enhanced named entity recognition on social media

Abstract

In cybersecurity, verifying the authenticity of user identities is critical for combating fake accounts, bots, and malicious users. Existing Multimodal Named Entity Recognition (MNER) methods have made some progress in this area, but most extract visual features with an image encoder and feed them directly into a cross-modal attention mechanism. In complex network environments, this approach often fails to align the text accurately with the semantic content of the images. To address this issue and improve both the accuracy and efficiency of identity verification, this paper proposes a novel framework: an MNER model that combines multi-scale Mamba with a hybrid expert mechanism for modality enhancement. The model uses the hybrid expert mechanism to enhance text recognition, and uses the Mamba model's channel attention and local enhancement to generate high-resolution, multi-scale image features. This enables a more comprehensive analysis of user-generated text and images and helps distinguish real users from fake or automated accounts, improving the effectiveness of online identity verification. The model achieves F1 scores of 75.34 and 87.41 on the Twitter-2015 and Twitter-2017 datasets, respectively, which is competitive with state-of-the-art models for online identity verification in cybersecurity tasks.
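
The abstract does not say how the hybrid expert mechanism routes its inputs. Purely as an illustration, the sketch below assumes it behaves like a conventional top-k mixture-of-experts gate: a gating layer scores each expert for a token vector, the k best experts are kept, and their outputs are blended by softmax weights. The function names (`moe_layer`, `linear`), the random weights, and the choice of linear experts are all hypothetical and are not taken from the paper.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, x):
    """Multiply a (rows x cols) weight matrix by the vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def moe_layer(x, gate_weights, expert_weights, top_k=2):
    """Route one token vector through its top_k highest-scoring experts.

    Assumption: each expert is a single linear map; real experts are
    usually small feed-forward networks.
    """
    gate_logits = linear(gate_weights, x)  # one logit per expert
    ranked = sorted(range(len(gate_logits)),
                    key=lambda i: gate_logits[i], reverse=True)
    selected = ranked[:top_k]
    # Renormalize the gate over the selected experts only.
    probs = softmax([gate_logits[i] for i in selected])

    out = [0.0] * len(expert_weights[0])  # output size = rows of an expert
    for p, i in zip(probs, selected):
        y = linear(expert_weights[i], x)
        out = [o + p * yi for o, yi in zip(out, y)]
    return out, dict(zip(selected, probs))


# Toy usage with random weights (hypothetical sizes).
random.seed(0)
dim, n_experts = 4, 3
x = [random.uniform(-1, 1) for _ in range(dim)]
gate_w = [[random.uniform(-1, 1) for _ in range(dim)]
          for _ in range(n_experts)]
expert_w = [[[random.uniform(-1, 1) for _ in range(dim)]
             for _ in range(dim)]
            for _ in range(n_experts)]

output, routing = moe_layer(x, gate_w, expert_w, top_k=2)
print("routing weights:", routing)
print("output vector:", output)
```

Restricting the blend to the top-k experts keeps the per-token cost roughly constant as the number of experts grows, which is the usual motivation for sparse gating; whether MAHE uses sparse or dense gating is not stated in the abstract.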
