首页|GaussDB-AISQL: a composable cloud-native SQL system with AI capabilities

GaussDB-AISQL: a composable cloud-native SQL system with AI capabilities

扫码查看
Cloud-native data warehouses have revolutionized data analysis by enabling elasticity, high availability and lower costs. And the increasing popularity of artificial intelligence (AI) drives data warehouses to provide predictive analytics besides the existing descriptive analytics. Consequently, more vendors start to support training and inference of AI models in data warehouses, exploiting the benefits of near-data processing for fast model development and deployment. However, most of the existing solutions are limited by a complex syntax or slow data transportation across engines.In this paper, we present GaussDB-AISQL, a composable SQL system with AI capabilities. GaussDB-AISQL adopts a composable system design that decouples computing, storage, caching, DB engine and AI engine. Our system offers all the functionality needed by end-to-end model training and inference during the model lifecycle. It also enjoys the simplicity and efficiency by providing a SQL-like syntax and removes the burden of manual model management. When training an AI model, GaussDB-AISQL benefits from highly parallel data transportation by concurrent data pulling from the distributed shared memory. The feature selection algorithms in GaussDB-AISQL make the training more data-efficient. When running model inference, GaussDB-AISQL registers the trained model object in the local data warehouse as a user-defined-function, which avoids moving inference data out of the data warehouse to an external AI engine. Experiments show that GaussDB-AISQL is up to 19× faster than baseline approaches.

database systemdata managementOLAPcloud computingAImachine learning

Cheng CHEN、Wenlong MA、Congli GAO、Wenliang ZHANG、Kai ZENG、Tao YE、Yueguo CHEN、Xiaoyong DU

展开 >

School of Information, Renmin University of China, Beijing 100872, China

Huawei Cloud, Beijing 100085, China

Key Laboratory of Data Engineering and Knowledge Engineering (Ministry of Education), Renmin University of China, Beijing 100872, China

database system data management OLAP cloud computing AI machine learning

2025

计算机科学前沿
高等教育出版社

计算机科学前沿

影响因子:0.303
ISSN:2095-2228
年,卷(期):2025.19(9)