Multimedia Harmful Information Recognition Method Based on a Two-Stage Algorithm


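Abstract (translated from the Chinese): In Internet security supervision and the crackdown on cybercrime, existing multimedia harmful information recognition methods generally suffer from low computational efficiency, an inability to accurately identify locally sensitive information, and detection that is limited to a single type of cybercrime. To address these problems, the paper proposes a multimedia harmful information recognition model based on a two-stage algorithm. The model handles information filtering and content detection in separate stages, and runs scene recognition and element object detection as parallel tasks. The first stage uses EfficientNet-B2 to build a high-throughput pre-filter module that quickly screens out the 80% of data with normal content. The second stage builds three modules with different network structures, based on MEAL V2, Faster R-CNN, and NetVLAD, to meet the recognition requirements of multi-dimensional scenes and multi-feature elements. The results show that the model runs at 57 FPS on a T4 card, and that both the recognition accuracy and the recall for multimedia harmful information exceed 97%. Compared with traditional models, recognition accuracy improves by up to 3.09% on NPDI and by up to 19.26% on the self-built test set.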

English title (as published): Multimedia Harmful Information Recognition Method Based on Two-stage Algorithm

English abstract (as published, lightly edited): In the application scenarios of Internet content security supervision and of combating and rectifying Internet crimes, existing multimedia harmful information identification methods generally suffer from low computational efficiency, an inability to accurately identify local sensitive information, and identification capabilities limited to a single type of cybercrime. To solve these problems, the paper proposes a multimedia harmful information recognition model based on a two-stage algorithm. The method processes information filtering and content detection in stages, and splits the tasks of scene recognition and element target detection so that they run in parallel. The first stage uses EfficientNet-B2 to build a high-throughput pre-filter model that quickly filters out the 80% of images and short videos with normal content. In the second stage, three modules with different network structures are built on the MEAL V2, Faster R-CNN, and NetVLAD networks to meet the recognition requirements of multi-dimensional scenes and multi-feature elements. The results show that the model's computing efficiency reaches 57 FPS (frames per second) on the T4 card, and that the recognition accuracy and recall rate for multimedia harmful information both exceed 97%. Compared with traditional models, the recognition accuracy on the NPDI dataset and on the self-built test dataset increases by up to 3.09% and 19.26%, respectively.
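The control flow described in the abstract (a cheap pre-filter, followed by several specialised modules run in parallel on whatever survives) can be illustrated with a minimal sketch. This is not the authors' implementation. The function name `two_stage_classify`, the `Scorer` type, both thresholds, and the rule that an item is flagged when any second-stage module fires are assumptions made for illustration, since the abstract does not say how the module outputs are combined. The real networks (EfficientNet-B2, MEAL V2, Faster R-CNN, NetVLAD) are replaced by plain callables that return a score in [0, 1].

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Sequence

# Hypothetical stand-in for a trained network: any callable mapping an
# input item to a "suspicion" score in [0, 1].
Scorer = Callable[[object], float]


def two_stage_classify(
    items: Sequence[object],
    prefilter: Scorer,
    detectors: Dict[str, Scorer],
    filter_threshold: float = 0.5,  # assumed value, not from the paper
    detect_threshold: float = 0.5,  # assumed value, not from the paper
) -> List[dict]:
    """Run a cheap pre-filter, then parallel detectors on surviving items."""
    results: List[dict] = []
    with ThreadPoolExecutor(max_workers=max(1, len(detectors))) as pool:
        for item in items:
            # Stage 1: items the pre-filter considers normal stop here,
            # so the expensive modules never see them.
            if prefilter(item) < filter_threshold:
                results.append(
                    {"item": item, "harmful": False, "stage": 1, "scores": {}}
                )
                continue

            # Stage 2: submit every detector for this item, then collect.
            futures = {name: pool.submit(fn, item) for name, fn in detectors.items()}
            scores = {name: fut.result() for name, fut in futures.items()}

            # Assumed fusion rule: flag the item if any module fires.
            harmful = any(score >= detect_threshold for score in scores.values())
            results.append(
                {"item": item, "harmful": harmful, "stage": 2, "scores": scores}
            )
    return results


if __name__ == "__main__":
    # Toy items carry precomputed scores instead of real images or videos.
    toy_items = [
        {"prefilter": 0.1, "scene": 0.9},
        {"prefilter": 0.8, "scene": 0.2},
        {"prefilter": 0.9, "scene": 0.7},
    ]
    for result in two_stage_classify(
        toy_items,
        prefilter=lambda x: x["prefilter"],
        detectors={"scene": lambda x: x["scene"]},
    ):
        print(result["stage"], result["harmful"])
```

Under these assumptions, if the pre-filter discards about 80% of inputs, as the abstract reports, the second-stage modules run on only about one item in five, which is where a cascade of this kind gets most of its throughput.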

English keywords:

Two-stage algorithm; Multimedia; Harmful information recognition

Authors:

Shi Xiaosu (史晓苏), Li Xin (李欣), Jian Ling (简玲), Ni Huajian (倪华健)


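Author affiliations:

School of Information Network Security, People's Public Security University of China, Beijing 100091

Cyber Security Corps, Shanghai Municipal Public Security Bureau, Shanghai 200025

Shanghai Shanma Intelligent Technology Co., Ltd. (上海闪马智能科技有限公司), Hangzhou 310000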

Keywords:

Two-stage algorithm (两阶段算法); Multimedia (多媒体); Harmful information recognition (有害信息识别)

Funding:

Ministry of Public Security Application Innovation Program (公安部应用创新计划)

Project number:

2020YYCXSHSJ019

Publication year:

2024

DOI:

10.11896/jsjkx.231000052

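Computer Science (计算机科学)

Chongqing Southwest Information Co., Ltd. (formerly the Southwest Information Center of the Ministry of Science and Technology)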

CSTPCD; Peking University Core Journals (北大核心)

Impact factor: 0.944

ISSN: 1002-137X

Year, volume (issue): 2024, 51(z1)

Number of references: 20