首页|Effective near-duplicate image detection using perceptual hashing and deep learning

Effective near-duplicate image detection using perceptual hashing and deep learning

扫码查看
Computer vision has always been concerned with near-duplicate image detection. Previous approaches for detecting near duplicates highlighted the necessity to adequately explore the aspect of image transformations for effectively handling complex images. We proposed a method of finding near duplicate images using the integration of three different techniques: perceptual hashing, Siamese network, and Vision Transformer. Perceptual hashing gives us a quick way to filter out similar-looking pictures, while the Siamese network architecture paired with the Vision transformer helps us identify more complex near duplicate instances. The integrated approach learns a metric space from data, which reflects both visual similarity and perceptual closeness among items in the dataset. The results demonstrate the effectiveness and robustness of our proposed method, achieving an AUROC of 0.99 and a precision of 0.987 on the California-ND dataset, and an AUROC of 0.92 with a precision of 0.884 on the INRIA Holidays dataset, significantly outperforming traditional methods by over 10% in both metrics. This represents a significant step forward in near-duplicate image detection research.

Near-duplicate imagesNeural networkGenerative Adversarial NetworkPerceptual hashingSiamese networkVision Transformer

Yash Jakhar、Malaya Dutta Borah

展开 >

Department of Computer Science and Engineering, National Institute of Technology, SUchar, India

2025

Information processing & management

Information processing & management

ISSN:0306-4573
年,卷(期):2025.62(4)
  • 38