信息技术2024,Issue(6) :36-41.DOI:10.13274/j.cnki.hdzj.2024.06.006

基于散列函数的短链码压缩方法

Compression method for short links code method based on Hash function

王耀暖 江有福 吴巧玲
信息技术2024,Issue(6) :36-41.DOI:10.13274/j.cnki.hdzj.2024.06.006

基于散列函数的短链码压缩方法

Compression method for short links code method based on Hash function

王耀暖 1江有福 1吴巧玲1
扫码查看

作者信息

  • 1. 浙江海洋大学信息工程学院,浙江舟山 316022
  • 折叠

摘要

为满足海量数据下用户对URL链接进行缩短与查询性能的需求,降低生成的短链码之间碰撞概率与消减笛卡尔积操作,提出一种在海量数据场景下基于散列函数的字符串压缩算法.该算法满足用户一次输入的信息输出结果相同,以及对不同次的相同输入的信息输出结果不同的需求,采用随机因子与短链码中包含库表位信息来削减短链码生成冲突和海量数据下带来的笛卡尔积操作.实验结果表明,改进后的算法压缩耗时性能略微变慢但碰撞率明显降低,在查询性能方面提升78%~130%,并随着数据量增多变得越明显.

Abstract

In order to meet the user's requirements for shortening URL links and query performance under massive data,and reduce the collision probability between generated short chain codes and reduce the Car-tesian product operation,a Hash function-based string compression algorithm in the massive data scenario is proposed.The algorithm meets the requirements of the same information output results of users'input at one time and different information output results of the same input at different times.It uses random factors and the library epitope information contained in the short chain code to reduce the short chain code genera-tion conflict and the Cartesian product operation caused by massive data.Experiment results show that the time-consuming performance of the improved compression algorithm is slightly slower,while the collision rate is significantly reduced,and the query performance is improved by 78%~130%,which becomes more obvious with the increase of the amount of data.

关键词

散列函数/链接缩短/短网址/短链/短链码

Key words

Hash function/link shortening/short URLs/short links/short links code

引用本文复制引用

基金项目

浙江省教育厅科研项目(Y202044755)

浙江省大学生科技创新项目(2021R411025)

出版年

2024
信息技术
黑龙江省信息技术学会 中国电子信息产业发展研究院 中国信息产业部电子信息中心

信息技术

CSTPCD
影响因子:0.413
ISSN:1009-2552
段落导航相关论文