Loading...
Please wait, while we are loading the content...
Similar Documents
Better Understanding: Stylized Image Captioning with Style Attention and Adversarial Training
Content Provider | MDPI |
---|---|
Author | Yang, Zhenyu Liu, Qiao Liu, Guojing |
Copyright Year | 2020 |
Description | Compared with traditional image captioning technology, stylized image captioning has broader application scenarios, such as a better understanding of images. However, stylized image captioning faces many challenges, the most important of which is how to make the model take into account both the image meta information and the style factor of the generated captions. In this paper, we propose a novel end-to-end stylized image captioning framework (ST-BR). Specifically, we first use a style transformer to model the factual information of images, and the style attention module learns style factor form a multi-style corpus, it is a symmetric structure on the whole. At the same time, we use back-reinforcement to evaluate the degree of consistency between the generated stylized captions with the image knowledge and specified style, respectively. These two parts further enhance the learning ability of the model through adversarial learning. Our experiment has achieved effective performance on the benchmark dataset. |
Starting Page | 1978 |
e-ISSN | 20738994 |
DOI | 10.3390/sym12121978 |
Journal | Symmetry |
Issue Number | 12 |
Volume Number | 12 |
Language | English |
Publisher | MDPI |
Publisher Date | 2020-11-30 |
Access Restriction | Open |
Subject Keyword | Symmetry Artificial Intelligence Image Understanding Stylized Image Captioning Symmetric End-to-end Structure Style Attention Back-reinforcement Module |
Content Type | Text |
Resource Type | Article |