目的 细粒度军事目标数据集是实现现代战争目标自动分类的重要支撑数据之一。当前缺乏高质量的细粒度军事目标遥感图像数据集,制约了军事目标自动精准检测的研究。为此,本文收集并标注了一个新的军用车辆细粒度检测遥感图像数据集MVRSD(military vehicle remote sensing dataset),并基于此设计了一种基于YOLOv5s(you only look once)的改进模型来提高军用车辆目标检测性能。方法 该数据集来源于谷歌地球数据,收集了亚洲、北美洲和欧洲范围内40多个军事场景下的3 000幅遥感图像,包含多个国家和地区的军用车辆目标。经高质量人工水平边界框标注,最终形成包含5个类别共计32 626个实例的军用车辆细粒度检测遥感图像数据集。针对遥感图像中军用车辆识别难题,本文提出的基准模型考虑了遥感军用车辆目标较小、形状和外观较为模糊以及类间相似性、类内差异性大的特点,设计了基于目标尺寸的跨尺度检测头和上下文聚合模块,提升细粒度军用车辆目标的检测性能。结果 提出的基准模型在军用车辆细粒度检测遥感图像数据集上的实验表明,对比经典的目标检测模型,新基准模型在平均精度均值(mean average precision,mAP)指标上提高了 1。1%。结论 本文构建的军用车辆细粒度检测遥感图像数据集为军事目标自动分类算法的研究提供了参考与支持,有助于更为全面地研究遥感图像中军用车辆目标的特性。数据集及检测基准模型地址为:https://github。com/baidongls/MVRSD。
Construction and validation of remote sensing image dataset for fine-grained detection of military vehicles
Objective Informational warfare has put forward higher requirements for military reconnaissance,and military target identification,as one of the main tasks of military reconnaissance,needs to be able to deal with fine-grained military targets and provide personnel with more detailed target information.Optical remote sensing image datasets play a crucial role in remote sensing target detection tasks.These datasets provide valuable standard remote sensing data for model train-ing and objective and uniform benchmarks for the comparison of different networks and algorithms.However,the current lack of high-quality fine-grained military target remote sensing image datasets constrains research on the automatic and accurate detection of military targets.As a special remote sensing target,military vehicles have certain characteristics,such as environmental camouflage,shape and structural changes,and movement"color shadows"that make their detection particularly challenging.Fig.1 shows the challenges posed by the fine-grained target characteristics of military vehicles in optical remote sensing images,which can be categorized into the following types according to the source of target character-istics:1)target characterization as affected by satellite remote sensing imaging systems;2)characterization of the vehicle target itself;3)military vehicle target characterization;4)characteristics affected by the combination;and 5)properties of fine-grained classification.To promote the development of deep-learning-based research on the fine-grained accurate detection of military vehicles in high-resolution remote sensing images,we construct a new high-resolution optical remote sensing image dataset called military vehicle remote sensing dataset(MVRSD).Using this model,we design an improved model based on YOLOv5s to improve the target detection performance for military vehicles.Method We construct our data-set using Google Earth data,collected 3 000 remotely sensed images from more than 40 military scenarios within Asia,North America,and Europe,and acquired 32 626 military vehicle targets from these images.These images have a spatial resolution of 0.3 m and size of 640 x 640 pixels.Our dataset consists of remotely sensed images and the corresponding labeled files,and the targets were manually selected and classified by experts through the interpretation of high-resolution optical images.We divide the granular categories in the dataset into the following categories based on vehicle size and mili-tary function:small military vehicles(SMV),large military vehicles(LMV),armored fighting vehicles(AFV),military construction vehicles(MCV),and civilian vehicles(CV).The geographic environments of the samples include cities,plains,mountains,and deserts.To solve the difficulty of recognizing military vehicles in remote sensing images,the pro-posed benchmark model takes into account the characteristics of those military vehicles with small targets and fuzzy shapes and appearances along with interclass similarity and intraclass variability.The number of instances of each category in the dataset and the number of instances of each image depend on their actual distribution in the remote sensing scene,which can reflect the realism and challenges of the dataset.We then design a cross-scale target-size-based detection head and con-text aggregation module based on YOLOv5s to improve the detection performance for fine-grained military vehicle targets.Result We analyze the characteristics of military vehicle targets in remote sensing images and the challenges being faced in the fine-grained detection of these vehicles.To address the poor detection accuracy of the YOLOv5 algorithm for small tar-gets and reduce its risks of omission and misdetection,we design an improved model based on YOLOv5s as our baseline model and select a cross-scale detector head based on the dimensions of the targets in the dataset to efficiently detect the tar-gets at different scales.We insert the attention mechanism module in front of this detector head to inhibit the interference of complex backgrounds on the target.Based on this dataset,we applied five target detection models for our experiments.Results of our experiments show that our proposed benchmark model improves its mean average precision by 1.1%com-pared with the classical target detection model.Moreover,the deep learning model achieves good performance in the fine-grained accurate detection of military vehicles.Conclusion The MVRSD dataset can support researchers in analyzing the features of remote sensing images of military vehicles from different countries and provide training and test data for deep learning methods.The proposed benchmark model can also effectively improve the detection accuracy for remotely sensed military vehicles.The MVRSD dataset is available at https://github.com/baidongls/MVRSD.