HTAP评测基准的评测能力综述
Survey on Benchmarking Ability of HTAP Benchmarks
翁思扬 1俞融 1王清帅 1胡梓锐 1倪葎 1张蓉 1周烜 1周傲英 1徐泉清 2杨传辉 2刘维 3杨攀飞3
作者信息
- 1. 华东师范大学数据科学与工程学院,上海 200062
- 2. 蚂蚁集团OceanBase,北京 100015
- 3. 工业和信息化部电子第五研究所,广东 广州 511300
- 折叠
摘要
对数据库系统即时修改数据的高效实时分析需求推动了数据库系统向同时支持OLTP业务和OLAP业务两种场景的HTAP数据库系统的快速发展.面对众多的HTAP数据库系统,为了推动HTAP数据库系统的公平比较和健康发展,定义和实现相应的评测基准来评估HTAP数据库系统的新特性至关重要.首先,分析HTAP数据库系统的关键特征并抽象总结HTAP数据库系统实现的关键技术.然后,提炼出HTAP数据库系统的设计难点和构建HTAP评测基准的挑战,并基于此提出HTAP评测基准应考虑的设计维度,包括数据生成、负载生成、评价指标和一致性模型支持性.对比现有HTAP评测基准在设计维度和实现技术上的差异,总结评测基准在不同设计维度上的优劣.此外,运行已公开的典型评测基准,展示并分析它们对HTAP数据库系统关键特征的评测能力以及对不同HTAP数据库系统的横向对比的支持能力.最后,总结对HTAP评测基准的能力需求和未来的一些研究方向,指出语义一致的负载控制和新鲜数据访问度量是HTAP数据库系统评测基准定义的关键问题.
Abstract
Requirements for the effective real-time analysis of instant data modification of database systems have driven the rapid development of Hybrid Transactional/Analytical Processing(HTAP)database systems,which support to process both OLTP and OLAP workloads.To realize fair comparisons and healthy development,it is crucial to define and implement new benchmarks to evaluate new features of HTAP database systems.Firstly,this study analyzes the key characteristics of HTAP database systems and summarizes the distinct technologies in their implementations.Secondly,the difficulties of designing HTAP database systems and the challenges of constructing HTAP benchmarks are extracted.Based on these,the design dimensions of HTAP benchmarks are proposed,including data generation,workload generation,evaluation metrics,and consistency model supportability.This study compares differences between existing HTAP benchmarks in terms of design dimensions and implementation technologies and sums up their merits and defects in different dimensions.Additionally,the published benchmarks are demonstrated and their abilities of evaluating key features and supporting horizontal comparisons among HTAP database systems are analyzed.Finally,this study concludes the requirements for HTAP benchmarks and some future research directions,pointing out that semantically consistent workload control and fresh data access metrics are the key issue in defining benchmarks for HTAP database systems.
关键词
HTAP评测基准/HTAP数据库系统/性能分析/新鲜度Key words
HTAP benchmark/HTAP database system/performance analysis/freshness引用本文复制引用
出版年
2025