PREDICTION OF NITROGEN REMOVAL PERFORMANCE AND IDENTIFICATION OF KEY PARAMETERS OF PARTIAL NITRIFICATION/PARTIAL DENITRIFICATION-ANAMMOX PROCESS BASED ON MACHINE LEARNING
The nitrogen removal performance of the partial nitrification-Anammox(PNA)and partial denitrification-Anammox(PDA)processes are affected by many parameters.Predicting the performance of the two processes and identifying the key parameters based on a comprehensive consideration of various parameters can provide an optimization target for their practical engineering applications.When solving the above problems,experimental methods are usually time-consuming and labor-intensive,while traditional mathematical models are difficult to deal with non-linear relationships.Therefore,in this study,machine learning techniques were used.The constructed Random Forest(RF)machine learning model predicted the effluent nitrogen(TN)concentration of the two processes with high accuracy,and the coefficient of determination(R2)of the PNA and the PDA processes were 0.728 and 0.812,respectively.The SHAP method explained the prediction process of the model well and ranked the importance of each parameter.In the PNA process,the effluent TN concentration was mainly influenced by the influent TN concentration and COD concentration;in the PDA process,the effluent TN concentration was firstly constrained by the influent TN concentration and nitrogen load.On this basis,influent COD concentration is another important factor that affects the effluent TN concentration of the PDA process.The common importance of the influent COD concentration in both processes indicated that both processes should be managed and allocated to the carbon source in the wastewater well in advance of practical application.It is of significant importance to consider the pre-separation and application strategies.The machine learning model used in this study can provide methodological guidance for the prediction of the nitrogen removal performance of the PNA and PDA process.The SHAP-based model interpretation can provide a foundation for the identification and optimization of key parameters for the two processes in practical application.