基于XGBoost选择迁移条件提升LSTM模型河流水质预测能力

SELECTING TRANSFER CONDITIONS BASED ON XGBOOST TO IMPROVE WATER QUALITY PREDICTION CAPACITY OF THE LSTM MODEL

余镒琦 ¹陈能汪 ²余其彪 ¹李少斌 ¹张东站 ³瞿帆¹

扫码查看

作者信息

1. 厦门大学环境与生态学院福建省海陆界面生态环境重点实验室,福建厦门 361102
2. 厦门大学环境与生态学院福建省海陆界面生态环境重点实验室,福建厦门 361102;厦门大学近海海洋环境科学国家重点实验室,福建厦门 361102
3. 厦门大学信息学院,福建厦门 361005
折叠

摘要

准确预测河流水质变化是流域水环境管理的重要基础.目前常用的基于数据驱动的深度学习模型依赖大量的监测数据训练,然而很多河流数据缺乏,无法满足水质预测精度要求.提出了一种基于极端梯度提升模型(XGBoost)的迁移条件选择方法,利用全国河流自动监测站点的水质参数(水温、pH、溶解氧、总氮)数据集,研究建立长短期记忆神经网络(LSTM)模型库,通过迁移学习条件的优化,提升LSTM模型的预测能力.结果表明:1)采用不同源域和迁移方式训练出的模型,其预测精度有很大差异;2)基于XGBoost模型选择最佳迁移条件,迁移模型的预测误差(RMSE)降低了 9.6％～28.9％,LSTM模型预测精度明显提升;3)选取合适的迁移方式、选用性质接近的源域数据、增加训练数据量均可以提升迁移模型的预测精度.该建模方法可应用于实测数据少的河流水质预测,为流域水环境精细化管理提供技术支持.

Abstract

Accurate prediction of river water quality change is an important basis for watershed water environment management.Currently,training of the commonly used data-driven deep learning model relies on large amounts of monitoring data.However,many rivers lack monitoring data so they can't meet the accuracy requirements of water quality prediction.In this study,we have developed an approach of selecting transfer conditions based on the XGBoost model.Water quality data(temperature,pH,dissolved oxygen,total nitrogen)from automatic monitoring stations across the major river in China are used for the establishment of long and short-term memory neural network(LSTM)models.The prediction ability of the LSTM model was improved by optimizing transfer learning conditions.The results showed that:1)the prediction accuracy of the models trained by different source domains and transfer modes was quite different;2)when the optimal transfer conditions were selected based on the XGBoost model,the prediction error(RMSE)of the transfer model was reduced by 9.6％to 28.9％,indicating that the prediction accuracy of selected LSTM model was significantly improved.3)selecting appropriate transfer mode,using source domain data with similar properties,and increasing the amount of training data can improve the prediction accuracy of the transfer model.The modeling approach proposed in this paper can be directly applied to the prediction of river water quality with little monitoring data,which can support watershed water environment management.

关键词

水质预测/LSTM模型/迁移学习/XGBoost模型

Key words

water quality prediction/LSTM model/transfer learning/XGBoost model

引用本文复制引用

基金项目

国家自然科学基金(51961125203)

出版年

2024

环境工程

中冶建筑研究总院有限公司,中国环境科学学会环境工程分会

环境工程

CSTPCD

影响因子：0.958

ISSN：1000-8942

参考文献量36

段落导航