佳木斯大学学报(自然科学版)2024,Vol.42Issue(1) :55-58.

基于人工智能技术的分布式数据库重复记录自动检测系统设计

Design of Automatic Detection System for Distributed Database Duplicate Records Based on Artificial Intelligence Technology

王彩霞 陶健
佳木斯大学学报(自然科学版)2024,Vol.42Issue(1) :55-58.

基于人工智能技术的分布式数据库重复记录自动检测系统设计

Design of Automatic Detection System for Distributed Database Duplicate Records Based on Artificial Intelligence Technology

王彩霞 1陶健1
扫码查看

作者信息

  • 1. 安徽商贸职业技术学院信息与人工智能学院,安徽芜湖 241002
  • 折叠

摘要

以人工智能技术为基础前提的分布式数据库重复记录自动检测的方式,以提高数据库查询时的准确率以及查询效率.设计系统首先对数据信息进行对应的特征提取,而后通过权衡函数对样本信息进行整合,通过自适应分解得到相应的 目标函数并求解,结合灰狼算法以及Shingle完成数据查询.经过算例验证,改进设计方式准确率均超过90%,平均耗时在35 s以内,满足自动查询快速精确的要求.

Abstract

An automatic detection method of repeated records of distributed database based on ar-tificial intelligence technology is proposed to improve the accuracy and query efficiency of database que-ries.The design system first extracts the corresponding features of the data information,then integrates the sample information through the trade-off function,obtains the corresponding objective function through adaptive decomposition and solves,and completes the data query by combining the gray wolf al-gorithm and Shingle.After the example verification,the accuracy rate of the improved design method exceeds 90%,and the average time consumption is less than 35s,which meets the requirements of fast and accurate automatic query.

关键词

自动化查询/灰狼算法/模糊聚类/分布式数据库

Key words

automated queries/Gray Wolf algorithm/fuzzy clustering/distributed database

引用本文复制引用

基金项目

2022年度安徽省科研编制计划项目(2022AH052741)

出版年

2024
佳木斯大学学报(自然科学版)
佳木斯大学

佳木斯大学学报(自然科学版)

影响因子:0.159
ISSN:1008-1402
参考文献量5
段落导航相关论文