过滤器数据结构研究综述

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：过滤器数据结构可以近似地判断某个元素是否属于给定集合.典型的过滤器数据结构,如布隆过滤器、布谷鸟过滤器、商过滤器,以牺牲查询准确性为代价换取更低的内存空间消耗和查询时间开销.因此,得益于空间时间高效性,过滤器数据结构现已被广泛应用于计算机网络、物联网、数据库系统、文件系统、生物信息学、机器学习等领域的近似成员资格查询操作中.自20世纪70年代以来,过滤器数据结构受到了广泛的研究,在诸多领域取得了重要的进展,其研究思路也在不断变化.文中整理了近五十年来关于过滤器数据结构的经典研究成果,从过滤器数据结构的原理出发对已有工作进行分类总结,并比较不同工作之间的引证关系和改进思路,最后讨论了过滤器数据结构的未来研究方向.

外文标题：Filter Data Structures:A Survey

外文摘要：Filter data structures can approximately determine whether an element exists in a given set.Typical filter data struc-tures,such as Bloom filters,cuckoo filters,and quotient filters,sacrifice query accuracy for lower memory space consumption and lower query time overhead.Due to their spatial and temporal efficiency,filter data structures are now widely used in approximate membership query operations in computer networks,the Internet of Things,database systems,file systems,bioinformatics,ma-chine learning,and other fields.Since the 1970s,filters have been extensively studied.Their research ideas are constantly chan-ging.This paper compiles the classic studies on filter data structures in the past fifty years,summarizes existing studies based on the mechanism of filter data structures and analyze the relationship between different studies.Finally,future research directions in filter data structures are discussed.

外文关键词：

FilterApproximate membership queryProbabilistic data structureBloom filterCuckoo filterQuotient filter

作者：

王瀚橙、戴海鹏、陈树森、陈志鹏、陈贵海

展开 >

作者单位：

计算机软件新技术国家重点实验室(南京大学) 南京 210023

关键词：

过滤器近似成员资格查询概率数据结构布隆过滤器布谷鸟过滤器商过滤器

基金：

国家自然科学基金

项目编号：

62272223

出版年：

2024

DOI：

10.11896/jsjkx.231000193

计算机科学

重庆西南信息有限公司（原科技部西南信息中心）

计算机科学

CSTPCD北大核心

影响因子：0.944

ISSN：1002-137X

年,卷(期)：2024.51(1)

参考文献量1