A customized image editing framework for diverse prohibited and restricted products in illegal online transactions.

针对非法网络交易中各种违禁和受限商品的定制化图像编辑框架

阅读:5
作者:Liu Wenjin, Zou Jingyu, Zhang Shudong, Luo Ning, Liu Haoming
The circulation of prohibited and restricted goods in online transactions seriously violates consumer rights and threatens public safety. However, the lack of a dataset for prohibited and restricted goods makes it difficult to regulate such illegal online transactions. Therefore, a multimodal dataset for prohibited and restricted goods is proposed, including 38,513 images and 77,026 texts. Nevertheless, because of the diversity and potential adversarial modifications of prohibited and restricted goods, intelligent recognition of such items still faces significant challenges. Thus, an image editing framework for prohibited and restricted goods in online transactions is proposed. This framework integrates three novel components: (1) a PR-adapter that optimizes image prompts through image augmentation and compression representation techniques; (2) a text description generator combining the CLIP model and a multimodal large language model (MobileVLM) to generate more precise textual descriptions of images; and (3) an image generator, including a new loss function designed to fine-tune the stable diffusion model, enabling a better understanding of text semantics and generating images that more closely align with the textual descriptions. Experimental results show that this framework can generate diverse and accurate images of prohibited and restricted goods, effectively enhancing the development of intelligent supervision for online transactions.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。