The visual communication using generative artificial intelligence in the context of new media

在新媒体背景下,利用生成式人工智能进行视觉传达

阅读:2

Abstract

The purpose of this work is to explore methods of visual communication based on generative artificial intelligence in the context of new media. This work proposes an image automatic generation and recognition model that integrates the Conditional Generative Adversarial Network (CGAN) with the Transformer algorithm. The generator component of the model takes noise vectors and conditional variables as inputs. Subsequently, a Transformer module is incorporated, where the multi-head self-attention mechanism enables the model to establish complex relationships among different data points. This is further refined through linear transformations and activation functions to enhance feature representations. Ultimately, the self-attention mechanism captures the long-range dependencies within images, facilitating the generation of high-quality images that meet specific conditions. The model's performance is assessed, and the findings show that the accuracy of the proposed model reaches 95.69%, exceeding the baseline algorithm Generative Adversarial Network by more than 4%. Additionally, the Peak Signal-to-Noise Ratio of the model's algorithm is 33dB, and the Structural Similarity Index is 0.83, indicating higher image generation quality and recognition accuracy. Therefore, the model proposed achieves high recognition and prediction accuracy of generated images, and higher image quality, promising significant application value in visual communication in the new media era.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。