中国科学:技术科学(英文版)2024,Vol.67Issue(1) :259-270.DOI:10.1007/s11431-023-2537-y

Predicting microbial extracellular electron transfer activity in paddy soils with soil physicochemical properties using machine learning

OU JiaJun LUO XiaoShan LIU JunYang HUANG LinYan ZHOU LiHua YUAN Yong
中国科学:技术科学(英文版)2024,Vol.67Issue(1) :259-270.DOI:10.1007/s11431-023-2537-y

Predicting microbial extracellular electron transfer activity in paddy soils with soil physicochemical properties using machine learning

OU JiaJun 1LUO XiaoShan 2LIU JunYang 2HUANG LinYan 2ZHOU LiHua 3YUAN Yong2
扫码查看

作者信息

  • 1. School of Automation,Guangdong University of Technology,Guangzhou 510006,China
  • 2. Guangdong Key Laboratory of Environmental Catalysis and Health Risk Control,Guangzhou Key Laboratory Environmental Catalysis and Pollution Control,School of Environmental Science and Engineering,Institute of Environmental Health and Pollution Control,Guangdong University of Technology,Guangzhou 510006,China
  • 3. School of Biomedical and Pharmaceutical Sciences,Guangdong University of Technology,Guangzhou 510006,China
  • 折叠

Abstract

Soil extracellular electron transfer(EET)is a pivotal biological process within the realm of soil.Unfortunately,EET suffers from a lack of predictive models.Herein,an intricately crafted machine learning model has been developed for the purpose of predicting soil EET by using the physicochemical properties of soil as independent input variables and the EET capabilities in terms of current density(jmax)and Coulombic charge(Cout)as dependent output variables.An autoencoder ensemble stacking(AES)model was developed to address the aforementioned issue by integrating support vector machine,multilayer perceptron,extreme gradient boosting,and light gradient boosting machine algorithms as the stacking algorithms.With 10-fold cross-validation,the AES model exhibited notable improvements in predicting jmax and Cout,with average test R2 values of 0.83 and 0.84,respectively,surpassing those of single machine learning(ML)models and the basic ensemble model.By utilizing partial correlation plots(PDPs),Shapley Additive explanations(SHAP)values,and SHAP decision plots,we quantitatively explained the impact and contribution of the input molecules on the AES model's predictions of jmax and Cout.In the context of the SHAP method for the AES model,total carbon(TC)was identified as the most correlated descriptor for jmax,while total organic carbon(TOC)stood out as the most relevant descriptor for Cout.In the prediction tasks of jmax and Cout within the AES model,employing a multitask ML approach allowed the model to benefit from the shared information of input variables,thereby enhancing its overall generalizability.This study provides a feasible tool for the prediction of soil EET from soil physiochemical properties and an advanced understanding of the relationship between soil physiochemical properties and EET capability.

Key words

extracellular electron transfer/paddy soil/machine learning/prediction/autoencoder ensemble stacking model

引用本文复制引用

基金项目

Guangdong Basic and Applied Basic Research Foundation(2023B1515040022)

National Natural science Foundation of China(42177270)

National Natural science Foundation of China(42207340)

出版年

2024
中国科学:技术科学(英文版)
中国科学院

中国科学:技术科学(英文版)

CSTPCDEI
影响因子:1.056
ISSN:1674-7321
参考文献量1
段落导航相关论文