Fujian Computer (福建电脑), 2025, Vol. 41, Issue 1: 11-17. DOI: 10.16707/j.cnki.fjpc.2025.01.002

Research on a Multimodal Fusion Method for Generating Chinese Descriptions of Ceramic Images

Hu Zhimeng (胡智猛)¹, Peng Yongkang (彭永康)¹, Zhang Xiujuan (张秀娟)¹

Author information

  • 1. School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen, Jiangxi 333403, China

Abstract

Early methods for generating ceramic image descriptions suffered from insufficient accuracy in both recognition and description. To address these problems, this paper proposes the Res-FL model, which extracts multi-scale image features with a deep residual network and a feature pyramid network, and generates Chinese descriptions with a long short-term memory network equipped with an additive attention mechanism. Experimental results show that Res-FL significantly outperforms traditional neural network methods in description accuracy and detail capture, and has high application value for improving the consistency and precision of ceramic image descriptions.
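
The abstract names an additive attention mechanism but gives no equations. As a rough illustration only, the sketch below shows the standard additive (Bahdanau-style) form for a single decoding step: each image region feature is scored against the decoder hidden state, the scores are normalized with a softmax, and the context vector is the weighted sum of the features. This is assumed to be close to what Res-FL uses and is not the authors' implementation. The function and parameter names (`additive_attention`, `w_feat`, `w_hid`, `v`), the dimensions, and the random parameter values are hypothetical placeholders, not trained weights.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def additive_attention(features, hidden, w_feat, w_hid, v):
    """Return (weights, context) for one decoding step.

    features: list of region feature vectors (e.g. flattened multi-scale maps)
    hidden:   current decoder (LSTM) hidden state
    w_feat, w_hid, v: projection parameters (hypothetical names)
    """
    projected_hidden = matvec(w_hid, hidden)
    scores = []
    for feature in features:
        projected_feature = matvec(w_feat, feature)
        # score_i = v . tanh(W_feat f_i + W_hid h)
        energy = [math.tanh(a + b)
                  for a, b in zip(projected_feature, projected_hidden)]
        scores.append(sum(vi * ei for vi, ei in zip(v, energy)))
    weights = softmax(scores)
    dim = len(features[0])
    # The context vector is the attention-weighted sum of the region features.
    context = [sum(w * f[d] for w, f in zip(weights, features))
               for d in range(dim)]
    return weights, context


def random_matrix(rows, cols, rng):
    """Small random matrix standing in for learned parameters."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


# Toy sizes chosen for illustration only.
rng = random.Random(0)
feat_dim, hid_dim, attn_dim, num_regions = 4, 3, 5, 6
features = random_matrix(num_regions, feat_dim, rng)
hidden = [rng.uniform(-0.5, 0.5) for _ in range(hid_dim)]
w_feat = random_matrix(attn_dim, feat_dim, rng)
w_hid = random_matrix(attn_dim, hid_dim, rng)
v = [rng.uniform(-0.5, 0.5) for _ in range(attn_dim)]

weights, context = additive_attention(features, hidden, w_feat, w_hid, v)
```

In a full captioning model the context vector would typically be concatenated with the previous word embedding and fed to the LSTM at each step. In practice the projections would be learned jointly with the encoder and decoder.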

Key words

Ceramic Images/Image Description/Image Feature Extraction

Publication year

2025
Fujian Computer (福建电脑)
Fujian Computer Society (福建省计算机学会)

Impact factor: 0.207
ISSN: 1673-2782