Chongqing University Researcher Reports Recent Findings in Pattern Recognition and Artificial Intelligence (Transformer with a Parallel Decoder for Image Captioning)
Research findings on pattern recognition and artificial intelligence are discussed in a new report. According to news reporting originating from Chongqing, People's Republic of China, by NewsRx correspondents, the research stated, "In this paper, a parallel decoder and a word group prediction module are proposed to speed up decoding and improve the effect of captions."

Financial supporters for this research include the National Natural Science Foundation of China (NSFC).

The news journalists obtained a quote from the research from Chongqing University: "The features of the image extracted by the encoder are linearly projected to different word groups, and then a unique relaxed mask matrix is designed to improve the decoding speed and the caption effect. First, since image captioning is composed of many words, sentences can also be broken down into word groups or words according to their syntactic structure, and we achieve this function through constituency parsing. Second, we make full use of the extracted features to predict the size of word groups. Then, a new embedding representing the information of the word is proposed based on word embedding."
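The report does not give the definition of the "relaxed mask matrix", so the following is only a minimal sketch of one plausible reading: a group-level causal mask in which a position may attend to every position in its own word group and in all earlier groups, rather than only to earlier positions. The function names (`group_ids`, `relaxed_mask`, `causal_mask`) and the example group sizes are hypothetical and are not taken from the paper.

```python
from typing import List


def group_ids(group_sizes: List[int]) -> List[int]:
    """Map each token position to the index of its word group."""
    ids: List[int] = []
    for group_index, size in enumerate(group_sizes):
        ids.extend([group_index] * size)
    return ids


def causal_mask(length: int) -> List[List[bool]]:
    """Standard autoregressive mask: position i sees positions j <= i."""
    return [[j <= i for j in range(length)] for i in range(length)]


def relaxed_mask(group_sizes: List[int]) -> List[List[bool]]:
    """Assumed 'relaxed' mask: position i sees every position j whose
    word group is the same as, or earlier than, the group of i."""
    ids = group_ids(group_sizes)
    n = len(ids)
    return [[ids[j] <= ids[i] for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    # Hypothetical parse: "a small dog" | "runs" | "on the grass"
    sizes = [3, 1, 3]
    for row in relaxed_mask(sizes):
        print("".join("1" if allowed else "." for allowed in row))
```

Under this assumption, all words in a group can be predicted in the same decoding step, which is one way a decoder could trade strict left-to-right generation for speed. The actual mask used by the authors may differ.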
Chongqing University, Chongqing, People's Republic of China, Asia, Machine Learning, Pattern Recognition and Artificial Intelligence